Technical Documentation
Mission
Provide access to point based structure inventories with attribution to support evaluation of consequences from natural and man made hazards.
Vision
To support all federal agencies interested in collaborating on structure inventory data,
Goals
- Provide access to the data to as many people and agencies as possible.
- Improve the quality of the data.
- Improve the ability for the U.S. to respond to disasters.
- Improve the ability for the U.S. to plan for future disasters.
Purpose
The National Structure Inventory (NSI) is a system of databases containing structure inventories of varying quality and spatial coverage. The purpose of the NSI databases is to facilitate storage and sharing of point based structure inventories used in the assessment and analysis of natural hazards. Flood damage analysis is the primary usage, but sufficient data exists on each structure to compute damages due to other hazard types. The purpose of this document is to describe the NSI data structure and to document the processes utilized to produce the 2019 NSI base data.
Overview of the National Structure Inventory
The National Structure Inventory Base layer is created by the U. S. Army Corps of Engineers (USACE). The USACE Base data layer was created to simplify the GIS pre-processing workflow for the USACE Modeling Mapping and Consequence center. The NSI is a repository of point structure inventories with a structured RESTful API service, and the inventory contains a series of required attributes or fields that describe each point in the inventory.
NSI Required Fields
The NSI application programming interface (API) requires structure inventory attributes to be consistent across all datasets in the NSI databases. These required attributes exist to meet the computational constraints of the software consuming the NSI. To successfully upload datasets to the NSI, the datasets must contain the required attributes with the fields populated. The analyst is responsible for giving approximate values for each attribute, and documenting the assumptions in providing those attributes. The required attributes for the NSI are:
Field Name (must be exactly as written) | Description | Attribute Type | Limits |
---|---|---|---|
FD_ID | A number that should be unique for all structures. | Integer | |
X | X coordinate of each structure; it is in the Geographic Coordinate System (GCS) WGS84. | Double | |
Y | Y coordinate for each structure in GCS WGS84. | Double | |
CB_ID | Census Block that contains the structure. | String | 15 Characters |
FD_NAME | Name for the structure; each structure within a single uploaded inventory must have a unique name, but it is not critical that the name be unique among all structures within a quality level. | String | |
ST_DAMCAT | Damage category of the structure. Damage categories are a larger aggregation than occupancy type (e.g., Residential, Commercial, Industrial, or Public). | String | |
ST_DMGFUN | Damage Function or Occupancy Type of the structure. This field relates the structures depth-damage relationships, number of stories, number of households, and other generic characteristics to the structure location. | String | |
BASEMENT | Defines if the structure has a basement or not. String entered must be either yes or no. | String | Yes or No |
CONSTR_TYPE | Construction type of the structure. This is informative for structural stability criteria. From the HAZUS database this comes out of the MSH.mdb (e.g., Masonry, Wood, Manufactured, or Steel). | String | |
HT_FOUND | Describes the foundation height of the structure in feet from the ground elevation (e.g., Pier or Slab). | Double | |
TYPE_FOUND | Describes the type of foundation on the structure. | String | |
VAL_STRUCT | Value in dollars of the structure. The analyst should document what value is represented (e.g., depreciated replacement cost, replacement cost, market value, etc.). | Double | |
VAL_CONT | Value in dollars of the contents of the structure. | Double | |
VAL_CAR | Value in dollars of the cars at the structure. | Double | |
MED_YR_BLT | Describing a best estimate for the year the building was built. | Integer | |
POST_FIRM | Value declaring if the year built was after the house entered the National Flood Insurance Program (NFIP). If the value is 1, the structure was built after the NFIP program, if it is 0 the structure was built after the FIPS program. | Integer | 1 or 0 |
POP_2AM_U | Population at night for the structure of people under the age of 65. | Integer | |
POP_2AM_O | Population at night for the structure of people over the age of 65. | Integer | |
POP_2PM_U | Population during the day for the structure of people under the age of 65. | Integer | |
POP_2PM_O | Population during the day for the structure of people over the age of 65. | Integer | |
NUM_STORY | Number of stories the structure has. | Integer | |
NUM_CARS | Number of cars at the structure. | Integer | |
GOUND_ELE | Ground elevation (in meters) at the structure. | Double |
USACE-Developed NSI Base Data
This section of the document serves as the metadata for the NSI Base data provided by USACE. The document assumes a familiarity with GIS, and FEMA's HAZUS database. The HAZUS (2010) database provided the bulk of the base data included in the NSI Base layer (website: https://msc.fema.gov/portal/resources/hazus). The team converted Census Block aggregated HAZUS data into disaggregated, discrete points, and modified and improved the attributes of the points to preserve the aggregate values. This Base quality data is not an exact representation of reality, but rather a pseudo inventory with homogeneous assumptions across the United States. Although there are some accuracy issues, the Base dataset functions as a minimum standard for the United States. Appropriate uses include situations where more accurate data is too costly to produce and cannot be created, or when limited by time constraints. Another general use of the NSI Base dataset is for assessments on a national level, where regional assumptions may introduce bias into the analysis.
The current NSI Base data represents the 2010 HAZUS dataset. The following sections describe the processes used to produce the NSI Base data.
2019 Base Quality Level Data Generation
In 2018 and 2019 the NSI team created the data using the following inputs from numerous input data sources. The two main sources of data are CoreLogic parcel files for residential structures and ESRI business layer for non-residential structures. Each data file used contains data on the type of development that exists at a given location. For example, the parcel data often stated whether a structure was Single Family Residential or a multi-family structure; ESRI data reported the NAICS code for each structure. These source data categories were converted to a format consistent with one of 40 different HAZUS Occupancy Type classification. Residential Occupancy types are further revised later in the process based on other structure characteristic assignment, with single family residences’ “RES1” classification being appended with the number of stories and basement status (e.g. “RES1-2SNB”).
Main Data Sources
Source | Database | Dataset | Description |
---|---|---|---|
HAZUS | Bndrygbs.mdb | hzMeansCountyLocations | Provides county level price adjustments. |
hzExposureOccupB | Informs estimated dollar per square foot used in structure valuation. | ||
hzCensusBlock | Provides the structure building schemes and block type. | ||
flSchemeCoastal, flSchemeRiverine, flSchemeGLakes | Provides information on foundation type and height. | ||
MSH.mdb | flGenBldgScheme | Provides the construction type distributions and NFIP entry year for structures. | |
USACE | NSI 2015 | Base layer | Used in any Census Block that lacks ESRI or CoreLogic data. |
Homeland Infrastructure Foundation-Level Data | CoreLogic | County Level Data | Parcel polygons and associated data tables; used for initial spatial location and Occupancy Type. |
Esri | Business Layer | InfoGroup | Provides initial structure location; NAICS code informs occupancy type, number of employee field informs square footage estimate and population weighting. |
Microsoft | Building Footprints | State level polygons | Paired with parcel polygons to improve structure location and to inform structure aggregation. |
U. S. Census Bureau | American Community Survey | Population, Demographics | Informs population growth estimates, disability rates, and age distribution. |
Characteristics of New Housing | Annual, Various | Provide structure characteristic data such as number of stories and square feet. | |
Longitudinal Employer-Household Dynamic Database | Population Data | Contains worker counts by origin and destination census blocks. | |
NCES | Schools Database | School Data | Contains the locations of schools, number of teachers and students per school by census block. |
U. S. Geological Survey | National Elevation Dataset | 10 Meter Dataset (?) | Provides raster ground elevation (in feet) data. |
Structure Placement Refinement
The XY location for each structure is initially provided by the source data, such as the centroid of the parcel or the geo-reference of a business’s address. However, the NSI Generator modifies these initial locations by matching the structures to Microsoft buildings footprints within the same parcel polygon. If there are multiple footprints within a parcel polygon, structures are placed in the largest footprints first. If there are multiple structures types within a parcel polygon, then structures are paired with footprints in the following order: schools first, then commercial structures, and finally residential structures. Structures are placed in unpaired footprints until all footprints are paired with structures, at which point multiple structures of the same type may be stacked within the same footprint
Structure Aggregation
If structures are stacked within the same location, then the structures may be partially or completely merged together. Residential units stacked at the same location are assumed to be multi-family structures; the number of units will be used later to update the occupancy type of the structure (for instance, more than 50 units would mean that a residential structure would be identified as a RES3F). However, commercial structures are not completely merged; instead, the NSI generator links the stacked structures so that they share certain characteristics such as number of stories and construction material. Each commercial business within the stack will receive a weighted portion of the square footage which informs the valuation of each structure.
Population Growth and Assignment to Structures
County level population estimates were available for 2017, however the most recent block level residential population estimates are from the 2010 Decennial Census. To account for this difference, the NSI Generator was provided a table that recorded the number of increased persons residing in a county above 2010 population levels (counties that lost population received no adjustment). The NSI estimates block level population growth in an iterative process until the total increased population for the county is depleted. Population is first added to structures that had no housing units in 2010 but now have housing units in the newly generated inventory. Next population is distributed to blocks whose number of housing units is greater in the NSI than it was in the 2010 census. Finally, population is randomly assigned to census blocks until the population growth is fully distributed.
Commercial worker population was derived from the U. S. Census Bureau’s Longitudinal Employer-Household Dynamics (LEHD) database (website: https://lehd.ces.census.gov/). This database contains counts for the number of residents leaving a census block to work and the number of workers arriving in a census block. Departing workers are subtracted from the residential population; as are enrolled students.
Once block level population estimates are made, population is assigned to particular structures within the block. Population is assigned from 8 separate pools, reflecting combinations of Day and Night, Over and Under 65 years of age, and Workers and Residents. Population is assigned from commercial population pools to commercial businesses weighted by number of employees, and from residential population pools to residences weighted by number of housing units. The assignment process also accounts for the relative likelihood of those over 65 years of age to work or stay at home. Schools based on NCES data had student estimates added directly to those structures in addition to the teachers added through the worker assignment process
Structure Valuation
The HAZUS dataset contains dollars per square foot for each Occupancy Type; these values are taken from 2014 RS Means estimates, except for RES1 structures which are taken from 2006 estimates. These values are indexed to 2018 prices levels using the ENR Construction Cost Index. Dollars per square foot estimates are then multiplied by the square footage estimate for each structure to obtain the structure value.
These replacement values for structures are then depreciated in order to obtain depreciated replacement value; each structure is depreciated by 1% per year for the first 20 years, after which it is assumed that routine maintenance would keep structure values at 80% of their replacement values.
Content values are obtained by multiplying structure values against an occupancy type specific structure to content value ratio. It is important to note that RES1 structures assumed content values are equal to structure values; this is because USACE Economic Guidance Memorandum (EGM) depth damage functions implicitly assume such a relationship. If NSI users are not relying on the USACE EGM curves, they should instead assume a 50% relationship unless better data is available to suggest otherwise.
Occupancy Type
Occupancy types are used to help determine structure valuation, population, and to define structure damage criteria (for flooding). The occupancy types are based on the FEMA occupancy type definitions with further classification to meet the criteria for USACE economic guidance memorandums. The table of occupancy type names and their descriptions are below. These are utilized to support the base level data and are not required for other datasets.
Damage Category | Occupancy Type Name | Description |
---|---|---|
Residential | RES1-1SNB | Single Family Residential, 1 story, no basement |
Residential | RES1-1SWB | Single Family Residential, 1 story, with basement |
Residential | RES1-2SNB | Single Family Residential, 2 story, no basement |
Residential | RES1-2SWB | Single Family Residential, 2 story, with basement |
Residential | RES1-3SNB | Single Family Residential, 3 story, no basement |
Residential | RES1-3SWB | Single Family Residential, 3 story, with basement |
Residential | RES1-SLNB | Single Family Residential, split-level, no basement |
Residential | RES1-SLWB | Single Family Residential, split-level, with basement |
Residential | RES2 | Manufactured Home |
Residential | RES3A | Multi-Family housing 2 units |
Residential | RES3B | Multi-Family housing 3-4 units |
Residential | RES3C | Multi-Family housing 5-10 units |
Residential | RES3D | Multi-Family housing 10-19 units |
Residential | RES3E | Multi-Family housing 20-50 units |
Residential | RES3F | Multi-Family housing 50 plus units |
Residential | RES4 | Average Hotel |
Residential | RES5 | Nursing Home |
Residential | RES6 | Nursing Home |
Commercial | COM1 | Average Retail |
Commercial | COM2 | Average Wholesale |
Commercial | COM3 | Average Personal & Repair Services |
Commercial | COM4 | Average Professional Technical Services |
Commercial | COM5 | Bank |
Commercial | COM6 | Hospital |
Commercial | COM7 | Average Medical Office |
Commercial | COM8 | Average Entertainment/Recreation |
Commercial | COM9 | Average Theater |
Commercial | COM10 | Garage |
Industrial | IND1 | Average Heavy Industrial |
Industrial | IND2 | Average light industrial |
Industrial | IND3 | Average Food/Drug/Chemical |
Industrial | IND4 | Average Metals/Minerals processing |
Industrial | IND5 | Average High Technology |
Industrial | IND6 | Average Construction |
Commercial | AGR1 | Average Agricultural |
Commercial | REL1 | Church |
Public | GOV1 | Average Government Services |
Public | GOV2 | Average Emergency Response |
Public | EDU1 | Average School |
Public | EDU2 | Average College/University |
Construction Type
The hzCensusBlock table contains an attribute for building scheme, and this attribute is related to the flGenBldgScheme tables from the MSH.mdb database. The building scheme attribute is used to define structures as Wood, Masonry, Concrete Block, Manufactured, and Steel using random assignment based on the probabilities indicated in the HAZUS table. Structures that were estimated to be more than 5 stories are assumed to be of steel construction.
Foundation Type and Height
Based on the information in the hzCensusBlock table for building scheme and the tables in the MSH.mdb database that also contain the building scheme attribute, structures are classified into Slab, Pier, Unattached, and Basement using random assignment.
Foundation height (in feet) are calculated and provided based on the foundation type and whether the structures are in blocks that were dated pre- or post-NFIP.
Vehicles
Vehicle values for each structure are based on the number of housing units for residential structures or the number of employees for commercial structures.
Ground Elevations
Ground elevations (feet) are determined using the USGS National Elevation Dataset (NED), based on the structure location (website: https://nationalmap.gov/elevation.html).
Acronyms
API | Application Programming Interface |
FEMA | Federal Emergency Management Agency, Department of Homeland Security |
FIPS | Federal Information Processing Standard |
FIRM | Flood Insurance Rate Maps |
GCS | Geographic Coordinate System |
GIS | Geospatial Information Systems |
HAZUS | FEMA's Hazards of the United States |
LEHD | U.S. Census Bureau's Longitudinal Employer-Household Dynamics Database, Department of Commerce |
MRLC | Multi-Resolution Land Characteristics Consortium |
NED | National Elevation Dataset |
NFIP | National Flood Insurance Program |
NLCD | National Land Cover Dataset |
NSI | National Structure Inventory |
USACE | U. S. Army Corps of Engineers, Department of Defense |
USGS | U. S. Geological Survey, Department of the Interior |