Mission

Provide access to point based structure inventories with attribution to support evaluation of consequences from natural and man made hazards.

Vision

To support all federal agencies interested in collaborating on structure inventory data,

Goals

  1. Provide access to the data to as many people and agencies as possible.
  2. Improve the quality of the data.
  3. Improve the ability for the U.S. to respond to disasters.
  4. Improve the ability for the U.S. to plan for future disasters.

Purpose

The National Structure Inventory (NSI) is a system of databases containing structure inventories of varying quality and spatial coverage. The purpose of the NSI databases is to facilitate storage and sharing of point based structure inventories used in the assessment and analysis of natural hazards. Flood damage analysis is the primary usage, but sufficient data exists on each structure to compute damages due to other hazard types. The purpose of this document is to describe the NSI data structure and to document the processes utilized to produce the 2019 NSI base data.

Overview of the National Structure Inventory

The National Structure Inventory Base layer is created by the U. S. Army Corps of Engineers (USACE). The USACE Base data layer was created to simplify the GIS pre-processing workflow for the USACE Modeling Mapping and Consequence center. The NSI is a repository of point structure inventories with a structured RESTful API service, and the inventory contains a series of required attributes or fields that describe each point in the inventory. 

NSI Required Fields

The NSI application programming interface (API) requires structure inventory attributes to be consistent across all datasets in the NSI databases. These required attributes exist to meet the computational constraints of the software consuming the NSI. To successfully upload datasets to the NSI, the datasets must contain the required attributes with the fields populated. The analyst is responsible for giving approximate values for each attribute, and documenting the assumptions in providing those attributes. The required attributes for the NSI are:

Field Name (must be exactly as written)

Description

Attribute Type

Limits

FD_ID

A number that should be unique for all structures.

Integer


X

X coordinate of each structure; it is in the Geographic Coordinate System (GCS) WGS84.

Double


Y

Y coordinate for each structure in GCS WGS84.

Double


CB_ID

Census Block that contains the structure.

String

15 Characters

FD_NAME

Name for the structure; each structure within a single uploaded inventory must have a unique name, but it is not critical that the name be unique among all structures within a quality level.

String


ST_DAMCAT

Damage category of the structure. Damage categories are a larger aggregation than occupancy type (e.g., Residential, Commercial, Industrial, or Public).

String


ST_DMGFUN

Damage Function or Occupancy Type of the structure. This field relates the structures depth-damage relationships, number of stories, number of households, and other generic characteristics to the structure location.

String


BASEMENT

Defines if the structure has a basement or not. String entered must be either yes or no.

String

Yes or No

CONSTR_TYPE

Construction type of the structure. This is informative for structural stability criteria. From the HAZUS database this comes out of the MSH.mdb (e.g., Masonry, Wood, Manufactured, or Steel).

String


HT_FOUND

Describes the foundation height of the structure in feet from the ground elevation (e.g., Pier or Slab).

Double


TYPE_FOUND

Describes the type of foundation on the structure.

String


VAL_STRUCT

Value in dollars of the structure. The analyst should document what value is represented (e.g., depreciated replacement cost, replacement cost, market value, etc.).

Double


VAL_CONT

Value in dollars of the contents of the structure.

Double


VAL_CAR

Value in dollars of the cars at the structure.

Double


MED_YR_BLT

Describing a best estimate for the year the building was built.

Integer


POST_FIRM

Value declaring if the year built was after the house entered the National Flood Insurance Program (NFIP). If the value is 1, the structure was built after the NFIP program, if it is 0 the structure was built after the FIPS program.

Integer

1 or 0

POP_2AM_U

Population at night for the structure of people under the age of 65.

Integer


POP_2AM_O

Population at night for the structure of people over the age of 65.

Integer


POP_2PM_U

Population during the day for the structure of people under the age of 65.

Integer


POP_2PM_O

Population during the day for the structure of people over the age of 65.

Integer


NUM_STORY

Number of stories the structure has.

Integer


NUM_CARS

Number of cars at the structure.

Integer


GOUND_ELE

Ground elevation (in meters) at the structure.

Double


USACE-Developed NSI Base Data

This section of the document serves as the metadata for the NSI Base data provided by USACE. The document assumes a familiarity with GIS, and FEMA's HAZUS database. The HAZUS (2010) database provided the bulk of the base data included in the NSI Base layer (website: https://msc.fema.gov/portal/resources/hazus). The team converted Census Block aggregated HAZUS data into disaggregated, discrete points, and modified and improved the attributes of the points to preserve the aggregate values. This Base quality data is not an exact representation of reality, but rather a pseudo inventory with homogeneous assumptions across the United States. Although there are some accuracy issues, the Base dataset functions as a minimum standard for the United States. Appropriate uses include situations where more accurate data is too costly to produce and cannot be created, or when limited by time constraints. Another general use of the NSI Base dataset is for assessments on a national level, where regional assumptions may introduce bias into the analysis.
The current NSI Base data represents the 2010 HAZUS dataset. The following sections describe the processes used to produce the NSI Base data.

2019 Base Quality Level Data Generation

In 2018 and 2019 the NSI team created the data using the following inputs from numerous input data sources. The two main sources of data are CoreLogic parcel files for residential structures and ESRI business layer for non-residential structures. Each data file used contains data on the type of development that exists at a given location. For example, the parcel data often stated whether a structure was Single Family Residential or a multi-family structure; ESRI data reported the NAICS code for each structure. These source data categories were converted to a format consistent with one of 40 different HAZUS Occupancy Type classification. Residential Occupancy types are further revised later in the process based on other structure characteristic assignment, with single family residences’ “RES1” classification being appended with the number of stories and basement status (e.g. “RES1-2SNB”).

Main Data Sources

SourceDatabaseDatasetDescription
HAZUSBndrygbs.mdbhzMeansCountyLocationsProvides county level price adjustments.


hzExposureOccupBInforms estimated dollar per square foot used in structure valuation.


hzCensusBlockProvides the structure building schemes and block type.


flSchemeCoastal, flSchemeRiverine, flSchemeGLakesProvides information on foundation type and height.

MSH.mdbflGenBldgSchemeProvides the construction type distributions and NFIP entry year for structures.
USACENSI 2015Base layerUsed in any Census Block that lacks ESRI or CoreLogic data.
Homeland Infrastructure Foundation-Level DataCoreLogicCounty Level DataParcel polygons and associated data tables; used for initial spatial location and Occupancy Type.
EsriBusiness LayerInfoGroupProvides initial structure location; NAICS code informs occupancy type, number of employee field informs square footage estimate and population weighting.
MicrosoftBuilding FootprintsState level polygonsPaired with parcel polygons to improve structure location and to inform structure aggregation.
U. S. Census BureauAmerican Community SurveyPopulation, DemographicsInforms population growth estimates, disability rates, and age distribution.

Characteristics of New HousingAnnual, VariousProvide structure characteristic data such as number of stories and square feet.

Longitudinal Employer-Household Dynamic DatabasePopulation DataContains worker counts by origin and destination census blocks.
NCESSchools DatabaseSchool DataContains the locations of schools, number of teachers and students per school by census block.
U. S. Geological SurveyNational Elevation Dataset10 Meter Dataset (?)Provides raster ground elevation (in feet) data.

Structure Placement Refinement

The XY location for each structure is initially provided by the source data, such as the centroid of the parcel or the geo-reference of a business’s address. However, the NSI Generator modifies these initial locations by matching the structures to Microsoft buildings footprints within the same parcel polygon. If there are multiple footprints within a parcel polygon, structures are placed in the largest footprints first. If there are multiple structures types within a parcel polygon, then structures are paired with footprints in the following order: schools first, then commercial structures, and finally residential structures. Structures are placed in unpaired footprints until all footprints are paired with structures, at which point multiple structures of the same type may be stacked within the same footprint

Structure Aggregation

If structures are stacked within the same location, then the structures may be partially or completely merged together. Residential units stacked at the same location are assumed to be multi-family structures; the number of units will be used later to update the occupancy type of the structure (for instance, more than 50 units would mean that a residential structure would be identified as a RES3F). However, commercial structures are not completely merged; instead, the NSI generator links the stacked structures so that they share certain characteristics such as number of stories and construction material. Each commercial business within the stack will receive a weighted portion of the square footage which informs the valuation of each structure.

Population Growth and Assignment to Structures

County level population estimates were available for 2017, however the most recent block level residential population estimates are from the 2010 Decennial Census. To account for this difference, the NSI Generator was provided a table that recorded the number of increased persons residing in a county above 2010 population levels (counties that lost population received no adjustment). The NSI estimates block level population growth in an iterative process until the total increased population for the county is depleted. Population is first added to structures that had no housing units in 2010 but now have housing units in the newly generated inventory. Next population is distributed to blocks whose number of housing units is greater in the NSI than it was in the 2010 census. Finally, population is randomly assigned to census blocks until the population growth is fully distributed.

Commercial worker population was derived from the U. S. Census Bureau’s Longitudinal Employer-Household Dynamics (LEHD) database (website: https://lehd.ces.census.gov/). This database contains counts for the number of residents leaving a census block to work and the number of workers arriving in a census block. Departing workers are subtracted from the residential population; as are enrolled students.

Once block level population estimates are made, population is assigned to particular structures within the block. Population is assigned from 8 separate pools, reflecting combinations of Day and Night, Over and Under 65 years of age, and Workers and Residents. Population is assigned from commercial population pools to commercial businesses weighted by number of employees, and from residential population pools to residences weighted by number of housing units. The assignment process also accounts for the relative likelihood of those over 65 years of age to work or stay at home. Schools based on NCES data had student estimates added directly to those structures in addition to the teachers added through the worker assignment process

Structure Valuation

The HAZUS dataset contains dollars per square foot for each Occupancy Type; these values are taken from 2014 RS Means estimates, except for RES1 structures which are taken from 2006 estimates. These values are indexed to 2018 prices levels using the ENR Construction Cost Index. Dollars per square foot estimates are then multiplied by the square footage estimate for each structure to obtain the structure value.

These replacement values for structures are then depreciated in order to obtain depreciated replacement value; each structure is depreciated by 1% per year for the first 20 years, after which it is assumed that routine maintenance would keep structure values at 80% of their replacement values.

Content values are obtained by multiplying structure values against an occupancy type specific structure to content value ratio. It is important to note that RES1 structures assumed content values are equal to structure values; this is because USACE Economic Guidance Memorandum (EGM) depth damage functions implicitly assume such a relationship. If NSI users are not relying on the USACE EGM curves, they should instead assume a 50% relationship unless better data is available to suggest otherwise.

Occupancy Type

Occupancy types are used to help determine structure valuation, population, and to define structure damage criteria (for flooding). The occupancy types are based on the FEMA occupancy type definitions with further classification to meet the criteria for USACE economic guidance memorandums. The table of occupancy type names and their descriptions are below. These are utilized to support the base level data and are not required for other datasets.

Damage CategoryOccupancy Type NameDescription
ResidentialRES1-1SNBSingle Family Residential, 1 story, no basement
ResidentialRES1-1SWBSingle Family Residential, 1 story, with basement
ResidentialRES1-2SNBSingle Family Residential, 2 story, no basement
ResidentialRES1-2SWBSingle Family Residential, 2 story, with basement
ResidentialRES1-3SNBSingle Family Residential, 3 story, no basement
ResidentialRES1-3SWBSingle Family Residential, 3 story, with basement
ResidentialRES1-SLNBSingle Family Residential, split-level, no basement
ResidentialRES1-SLWBSingle Family Residential, split-level, with basement
ResidentialRES2Manufactured Home
ResidentialRES3AMulti-Family housing 2 units
ResidentialRES3BMulti-Family housing 3-4 units
ResidentialRES3CMulti-Family housing 5-10 units
ResidentialRES3DMulti-Family housing 10-19 units
ResidentialRES3EMulti-Family housing 20-50 units
ResidentialRES3FMulti-Family housing 50 plus units
ResidentialRES4Average Hotel
ResidentialRES5Nursing Home
ResidentialRES6Nursing Home
CommercialCOM1Average Retail
CommercialCOM2Average Wholesale
CommercialCOM3Average Personal & Repair Services
CommercialCOM4Average Professional Technical Services
CommercialCOM5Bank
CommercialCOM6Hospital
CommercialCOM7Average Medical Office
CommercialCOM8Average Entertainment/Recreation
CommercialCOM9Average Theater
CommercialCOM10Garage
IndustrialIND1Average Heavy Industrial
IndustrialIND2Average light industrial
IndustrialIND3Average Food/Drug/Chemical
IndustrialIND4Average Metals/Minerals processing
IndustrialIND5Average High Technology
IndustrialIND6Average Construction
CommercialAGR1Average Agricultural
CommercialREL1Church
PublicGOV1Average Government Services
PublicGOV2Average Emergency Response
PublicEDU1Average School
PublicEDU2Average College/University

Construction Type

The hzCensusBlock table contains an attribute for building scheme, and this attribute is related to the flGenBldgScheme tables from the MSH.mdb database. The building scheme attribute is used to define structures as Wood, Masonry, Concrete Block, Manufactured, and Steel using random assignment based on the probabilities indicated in the HAZUS table. Structures that were estimated to be more than 5 stories are assumed to be of steel construction.

Foundation Type and Height

Based on the information in the hzCensusBlock table for building scheme and the tables in the MSH.mdb database that also contain the building scheme attribute, structures are classified into Slab, Pier, Unattached, and Basement using random assignment.

Foundation height (in feet) are calculated and provided based on the foundation type and whether the structures are in blocks that were dated pre- or post-NFIP.

Vehicles

Vehicle values for each structure are based on the number of housing units for residential structures or the number of employees for commercial structures.

Ground Elevations

Ground elevations (feet) are determined using the USGS National Elevation Dataset (NED), based on the structure location (website: https://nationalmap.gov/elevation.html).

Acronyms

API

Application Programming Interface

FEMA

Federal Emergency Management Agency, Department of Homeland Security

FIPS

Federal Information Processing Standard

FIRM

Flood Insurance Rate Maps

GCS

Geographic Coordinate System

GIS

Geospatial Information Systems

HAZUS

FEMA's Hazards of the United States

LEHD

U.S. Census Bureau's Longitudinal Employer-Household Dynamics Database, Department of Commerce

MRLC

Multi-Resolution Land Characteristics Consortium

NED

National Elevation Dataset

NFIP

National Flood Insurance Program

NLCD

National Land Cover Dataset

NSI

National Structure Inventory

USACE

U. S. Army Corps of Engineers, Department of Defense

USGS

U. S. Geological Survey, Department of the Interior