Mission

Provide access to consistent, and nationally available point based structure inventories with attribution to support evaluation of consequences from natural and man-made hazards.

Vision

To support all federal agencies interested in collaborating on structure inventory data,

Goals

Provide access to the data to as many people and agencies as possible.
Improve the quality of the data.
Improve the ability for the U.S. to respond to disasters.
Improve the ability for the U.S. to plan for future disasters.

Purpose

The National Structure Inventory (NSI) is a system of databases containing structure inventories of varying quality and spatial coverage. The purpose of the NSI databases is to facilitate storage and sharing of point based structure inventories used in the assessment and analysis of natural hazards. Flood risk is the primary usage, but sufficient data exists on each structure to compute damages and life safety risk due to other hazard types. The purpose of this document is to describe the NSI data structure and to document the processes utilized to produce the 2022 NSI base data.

Overview of the National Structure Inventory

The National Structure Inventory Base layer was created and is maintained by the U. S. Army Corps of Engineers (USACE). The USACE base data layer was created to simplify the GIS pre-processing workflow for the USACE Modeling Mapping and Consequence center, but the data has gone on to see use in a variety of USACE, FEMA, and other agency applications. The NSI is a repository of point structure inventories with a structured RESTful API service, and the inventory contains a series of required attributes or fields that describe each point in the inventory.

Accessing the NSI

The NSI is available in GeoPackage and GeoJSON format. State level GeoPackages can be downloaded using the NSI Download web page. GeeJSONs may be downloaded for individual counties or specified bounding boxes using the NSI API.

NSI Public Fields

The NSI application programming interface (API) requires structure inventory attributes to be consistent across all datasets in the NSI databases. These attributes exist to meet the computational constraints of the software consuming the NSI. To successfully upload datasets to the NSI, the datasets must contain the required attributes. The analyst is responsible for giving approximate values for each attribute, and documenting the assumptions in providing those attributes. Currently, only the NSI development team is capable of adding inventories to the NSI and only the base layer is available, however, upload functionality is expected to be expanded in coming months. The NSI attributes available to the general public are:

Field Name	Description	Attribute Type	Limits
fd_id	A number that should be unique for all structures.	Integer
bid	A building ID. This is the north axis aligned "bounding box" of its footprint represented as the centroid (in grid reference system format) and four cardinal extents.	String
x	X coordinate of each structure; it is in the Geographic Coordinate System (GCS) WGS84.	Double
y	Y coordinate for each structure in GCS WGS84.	Double
cbfips	Census Block that contains the structure. Currently, the NSI refers to 2010 census blocks.	String	15 Characters
st_damcat	Damage category of the structure. Damage categories are a larger aggregation than occupancy type (e.g., Residential, Commercial, Industrial, or Public).	String
occtype	Damage Function or Occupancy Type of the structure. This field relates the structures depth-damage relationships, number of stories, number of households, and other generic characteristics to the structure location.	String
bldgtype	Building type of the structure. Commonly associated with exterior wall and used for structure stability functions (i.e M= Masonry, W = Wood, H = Manufactured, S= Steel).	String
source	The source of the initial iteration of the structure (i.e. P = Parcel, E = ESRI, H = HIFLD Hospital, N = HIFLD Nursing Home, S = National Center for Education Statistics, X = HAZUS/NSI-2015).	String
sqft	The estimated Square footage of the structure. This field is used when estimating the depreciated replacement value.	Double
ftprntid	An identifier of the building footprint record used for estimating fields such as sqft and num_story. Stacked structures will share the same footprint ID.	String
ftprntsrc	The source of the utilized footprint (i.e. B = Bing, O = Oak Ridge National Labs, N = National Geospatial-Intelligence Agency, M = Map Building Layer).	String
found_type	Describes the type of foundation on the structure (C = Crawl, B = Basement, S = Slab, P = Pier, I = Pile, F = Fill, W = Solid Wall).	String
found_ht	Describes the foundation height of the structure in feet from the ground elevation.	Double
num_story	The number of stories of the structure.	Double
val_struct	Value in dollars of the structure. The base NSI estimates depreciated replacement value.	Double
val_cont	Value in dollars of the contents of the structure.	Double
val_vehic	Value in dollars of the cars at the structure.	Double
med_yr_blt	Describing the median year built of structures within the Census Block.	Integer
pop2amu65	Population at night for the structure of people under the age of 65.	Integer
pop2amo65	Population at night for the structure of people over the age of 65.	Integer
pop2pmu65	Population during the day for the structure of people under the age of 65.	Integer
pop2pmo65	Population during the day for the structure of people over the age of 65.	Integer
students	Number of students attending the school as estimated by the source NCES data.	Integer
o65disable	The percent of the county population over the age of 65 that is expected to have an ambulatory disability.	Double
u65siable	The percent of the county population under the age of 65 that is expected to have an ambulatory disability.	Double
firmzone	Estimated 2021 flood zone for the structure.	String
grnd_elv_m	Ground elevation (in meters, NAVD88) at the structure.	Double
ground_elv	Ground elevation (in feet, NAVD88) at the structure.	Double

NSI Private Fields

The NSI restricts access to certain fields to Federal users. While these attributes are generally not critical to the use of common consequence models, they may provide additional context to users attempting to refine the data for their analyses. The available attributes for the NSI are:

Field Name	Description	Attribute Type
cenusregion	The Census region in which the structure is located.	String
basement	Basement type of the structure (i.e. finished, unfinished, none).	String
bldcostcat	The category assumed for replacement cost estimations.	String
naics	The six digit NAICS code assumed for any business located at the structure.	String
empnum	The estimated number of employees for employers located at the structure.	Integer
apn	The source parcel's reported Assessor's Parcel Number.	String
resunits	The estimated number of housing units for at the structure.	Integer
yrbuilt	The reported year of the structure's construction from the parcel data.	Integer
total_room	The reported number of rooms for the structure from the parcel data.	Integer
bedrooms	The reported number of bedrooms for the structure from the parcel data.	Integer
total_bath	The reported number of bath rooms for the structure from the parcel data.	Integer
p_garage	The reported garage type from the parcel data.	String
parkingsp	The reported number of parking spaces at the structure.	Integer
p_extwall	The reported exterior wall type for the structure from the parcel data.	String
p_fndtype	The reported foundation type for the structure from the parcel data.	String
p_bsmnt	The reported basement type for the structure from the parcel data.	String
ftprntsqft	The square footage of the footprint polygon used during NSI generation.	Double
bldheight	The reported height of the building (in meters) from the footprint source.	Double
nursghmpop	The reported bed count for nursing home facilities from the source data.	Integer
othinstpop	Currently, the reported bed count for hospitals from the source data.	Integer
surplus	Currently all values in the base NSI are null for this field.	Integer

USACE-Developed NSI Base Data

This section of the document serves as the metadata for the NSI Base data provided by USACE. The document assumes a familiarity with GIS and terms related to the NSI attributes. The HAZUS (2014) database (website: https://msc.fema.gov/portal/resources/hazus) provided the bulk of the base data included in the NSI-2015 Base layer. The team has shifted to a reliance on multiple input sources in order to create discrete points and attributes to represent structures. This Base quality data is not an exact representation of reality, but rather a pseudo inventory with many homogeneous assumptions. Although there are some accuracy issues, the Base dataset functions as a minimum standard for the United States. Appropriate uses include situations where more accurate data is too costly to produce and cannot be created, or when limited by time constraints. Another general use of the NSI Base dataset is for assessments on a national level, where regional assumptions may introduce bias into the analysis. The following sections describe the processes used to produce the NSI Base data.

2022 Base Quality Level Data Generation

In June 2022 the NSI team created the dataset using the following inputs from numerous input data sources. The two main sources of data are Lightbox parcel files for residential structures and ESRI business layer for non-residential structures. Each data file used contains data on the type of development that exists at a given location. For example, the parcel data often stated whether a structure was single-family residential or a multi-family structure; ESRI data reported the NAICS code for each business. These source data categories were converted to a format consistent with one of 40 different HAZUS Occupancy Type classification. Residential Occupancy types are further revised later in the process based on other structure characteristic assignment, with single family residences’ “RES1” classification being appended with the number of stories and basement status (e.g. “RES1-2SNB”).

Main Data Sources

Source	Database	Dataset	Description
HAZUS	Bndrygbs.mdb	hzCensusBlock	Provides the structure building schemes and block type.
		flSchemeCoastal, flSchemeRiverine, flSchemeGLakes	Provides information on foundation type and height.
	MSH.mdb	flGenBldgScheme	Provides the construction type distributions and NFIP entry year for structures.
USACE	NSI 2015	Base layer	Used in any Census Block that lacks ESRI or CoreLogic data.
Homeland Infrastructure Foundation-Level Data	Lightbox	County Level Data	Parcel polygons and associated data tables; used for initial spatial location and occupancy type, and may influence structure attributes (square feet, foundation type, etc) of single-family structures.
	Nursing Home	Nationwide Data	Point data indicating the presence of a nursing home and its number of beds.
	Hospital	Nationwide Data	Point data indicating the presence of a hospital and its number of beds.
	Mobile Home	Nationwide Data	Point data indicating the presence of a mobile home park and the number of units associated with the park (either exact units, or a range),
	Map Building Layer	Nationwide Data	Nationwide building footprint parcel. Largely restricted to central business districts. Often indicating the height of the building to the nearest meter. Used to improve structure locations, square foot estimates and number of stories estimates.
Esri	Business Layer	InfoGroup	Provides initial structure location; NAICS code informs occupancy type and the number of employee field influences population weighting and square footage estimates.
Microsoft	Building Footprints	State level polygons	Paired with parcel polygons to improve structure location and to inform structure aggregation and square footage estimates.
FEMA Geospatial Resource Center	USA Structures	State level polygons	Includes both ORNL and NGA generated footprint polygons. Paired with parcel polygons to improve structure location and to inform structure aggregation and square footage estimates. NGA based footprints include heights in meters and help inform number of stories estimates.
U. S. Census Bureau	American Community Survey	Population, Demographics	Informs population growth estimates, disability rates, and age distribution.
	Characteristics of New Housing	Annual, Various	Provide structure characteristic data such as number of stories and square feet.
	Longitudinal Employer-Household Dynamic Database	Population Data	Contains worker counts by origin and destination census blocks. Used to decrease residential populations (primarily in the day) and to create a population pool for commercial workers.
NCES	Schools Database	School Data	Contains the locations of schools, number of teachers and students per school.
U. S. Geological Survey	National Elevation Dataset	10 Meter Dataset	Provides raster ground elevation data.

Structure Placement Refinement

The XY location for each structure is initially provided by the source data, such as the centroid of the parcel or the geo-reference of a business’s address. However, the NSI Generator modifies these initial locations by matching the structures to buildings footprints within the same parcel polygon. Commercial structures located outside of a parcel will be moved to an appropriate parcel based on consideration of distance and use code similarity. If there are multiple footprints within a parcel polygon, structures are placed in the largest footprints first. If there are multiple structure types within a parcel polygon, then structures are paired with footprints in the following order: schools first, then commercial structures, and finally residential structures. Structures are placed in unpaired footprints until all footprints are paired with structures, at which point multiple structures may be stacked within the same footprint. Structures that are not matched with parcels (most commonly NSI-2015 based structures in areas that lack parcel data) are moved to an available footprint within their census block instead.

Structure Aggregation

If structures are stacked within the same location, then the structures may be partially or completely merged together. Residential units stacked at the same location are assumed to be multi-family structures; the number of units will be used later to update the occupancy type of the structure (for instance, more than 50 units would mean that a residential structure would be identified as a RES3F). However, commercial structures are not completely merged unless they share an occupancy type; instead, the NSI generator links the stacked structures so that they share certain characteristics such as number of stories and construction material. Each commercial business within the stack will receive a weighted portion of the square footage which informs the valuation of each structure.

Population Growth and Assignment to Structures

Due to the unreliability of census block level population estimates in the 2020 census. the NSI-2022 attempts to estimate 2020 level populations using 2010 block data and 2020 county data. To accomplish this, the NSI Generator was provided a table that recorded the number of increased persons residing in a county above 2010 population levels (counties that lost population received no adjustment). The NSI estimates block level population growth in an iterative process until the total increased population for the county is depleted. Population is first added to structures that had no housing units in 2010 but now have housing units in the newly generated inventory. Next, population is distributed to blocks whose number of housing units is greater in the NSI than it was in the 2010 census. Finally, population is randomly assigned to census blocks until the population growth is fully distributed.

Commercial worker population was derived from the U. S. Census Bureau’s Longitudinal Employer-Household Dynamics (LEHD) database (website: https://lehd.ces.census.gov/). This database contains counts for the number of residents leaving a census block to work and the number of workers arriving in a census block. Departing workers are subtracted from the residential population; as are enrolled students. 2015 era data, last accessed in 2019, was used to support this version of the NSI.

Once block level population estimates are made, population is assigned to particular structures within the block. Population is assigned from 20 separate pools, reflecting combinations of Day and Night, Over and Under 65 years of age, and for five population categories, including Workers and Residents. Population is assigned from commercial population pools to commercial businesses weighted by number of employees, and from residential population pools to residences weighted by number of housing units. The three remaining population pools: institutional population, nursing home population, and student population are used to adjust or assign population to specific subcategories of non-residential structures. The assignment process also accounts for the relative likelihood of those over 65 years of age to work or stay at home.

Structure Valuation

Depreciated replacement values were estimated based on an assumed replacement category and a dollar per square footage estimate for that category; these assignments are informed by an analysis of survey data, parcel use types, and other source inputs. All values are in 2021 prices levels. Dollars per square foot estimates are then multiplied by the square footage estimate for each structure to obtain the structure value.

These replacement values for structures are then depreciated in order to obtain depreciated replacement value; each structure is depreciated by 1% per year for the first 20 years, after which it is assumed that routine maintenance would keep structure values at 80% of their replacement values.

Content values are obtained by multiplying structure values against an occupancy type specific structure to content value ratio; these ratios are shown below in the Occupancy Type table.

Occupancy Type

Occupancy types are used to help determine structure valuation, population, and to define structure damage criteria (for flooding). The occupancy types are based on the FEMA occupancy type definitions with further classification to meet the criteria for USACE economic guidance memorandums. The table of occupancy type names and their descriptions are below. These are utilized to support the base level data and are not required for other datasets.

As previously discussed, Occupancy Types are largely mapped from a parcel use type or available NAICS codes. However, occupancy types are often modified if another data input (specific datasets for Nursing Homes, Hospitals, Mobile Home, or Schools) overwrites the initial use type, or, if the use type is updated based on the estimated number of housing units.

Damage Category	Occupancy Type Name	Description	Content-to-structure value ratio
Residential	RES1-1SNB	Single Family Residential, 1 story, no basement	0.5
Residential	RES1-1SWB	Single Family Residential, 1 story, with basement	0.5
Residential	RES1-2SNB	Single Family Residential, 2 story, no basement	0.5
Residential	RES1-2SWB	Single Family Residential, 2 story, with basement	0.5
Residential	RES1-3SNB	Single Family Residential, 3 story, no basement	0.5
Residential	RES1-3SWB	Single Family Residential, 3 story, with basement	0.5
Residential	RES1-SLNB	Single Family Residential, split-level, no basement	0.5
Residential	RES1-SLWB	Single Family Residential, split-level, with basement	0.5
Residential	RES2	Manufactured Home	0.5
Residential	RES3A	Multi-Family housing 2 units	0.5
Residential	RES3B	Multi-Family housing 3-4 units	0.5
Residential	RES3C	Multi-Family housing 5-10 units	0.5
Residential	RES3D	Multi-Family housing 10-19 units	0.5
Residential	RES3E	Multi-Family housing 20-50 units	0.5
Residential	RES3F	Multi-Family housing 50 plus units	0.5
Residential	RES4	Average Hotel	0.5
Residential	RES5	Institutional Dormitory	0.5
Residential	RES6	Nursing Home	0.5
Commercial	COM1	Average Retail	1
Commercial	COM2	Average Wholesale	1
Commercial	COM3	Average Personal & Repair Services	1
Commercial	COM4	Average Professional Technical Services	1
Commercial	COM5	Bank	1
Commercial	COM6	Hospital	1.5
Commercial	COM7	Average Medical Office	1.5
Commercial	COM8	Average Entertainment/Recreation	1
Commercial	COM9	Average Theater	1
Commercial	COM10	Garage	0.5
Industrial	IND1	Average Heavy Industrial	1.5
Industrial	IND2	Average light industrial	1.5
Industrial	IND3	Average Food/Drug/Chemical	1.5
Industrial	IND4	Average Metals/Minerals processing	1.5
Industrial	IND5	Average High Technology	1.5
Industrial	IND6	Average Construction	1
Commercial	AGR1	Average Agricultural	1
Commercial	REL1	Church	1
Public	GOV1	Average Government Services	1
Public	GOV2	Average Emergency Response	1.5
Public	EDU1	Average School	1
Public	EDU2	Average College/University	1.5

Construction Type

Construction type is mapped from parcel data where available. When this data is not available, a construction type is randomly assigned from HAZUS data. The hzCensusBlock table contains an attribute for building scheme, and this attribute is related to the flGenBldgScheme tables from the MSH.mdb database. The building scheme attribute is used to define structures as Wood, Masonry, Concrete Block, Manufactured, and Steel using random assignment based on the probabilities indicated in the HAZUS table.

Square Footage and Number of Stories

Methods for estimating square footage and number of stories varies between RES1s and other occupancy types. RES1s, the most common occupancy type, generally have more reliable parcel data from which to pull values; there are typically values recorded for square footage and number of stories and there is typically only one structure within each single-family parcel so there is no confusion regarding which structure to apply the recorded values too. Therefore, there is typically little need to estimate these values using other methods.

Nonetheless, certain counties lack this data as do certain structures within counties that generally possess the data. When square footage values are missing for RES1s, they are estimated by taking 86% of the structure's footprint; this attempts to remove unoccupied areas such as garages and porches from the footprint area. This percentage was estimated at a nationwide level using available parcel, survey, and footprint datasets. If no footprint data is available either, then square footage is randomly assigned a value from a distribution that varies by the structure's year built and census region. If number of stories data is not available for RES1s, it is estimated by diving the square footage by the structure's footprint; if the value is greater than 1.25, then a second floor is assumed (or if greater than 2.25, then a third floor is assumed and so on). If not footprint is available, then number of stories is randomly assigned from a distribution that varies by year built and census region.

RES3s and Non-Residential structures generally do not have reliable parcel data from which these values can be assigned. There is often no data reported for these use types or there are multiple structures in each parcel and its not clear which, if any, of the structures are represented by the parcel data.

Instead, square footage is estimated based on an assumed needed square foot per housing unit or employees an the assumed number of housing units or employees. If this estimated demand for square footage is less than the structures footprint, then the footprint's square footage is used and the number of stories is kept at a value of one. If the estimated demand for square footage is greater than the footprint's area then an additional story is added. Number of stories added through this process are capped by the footprint's height field when available.

Foundation Type and Height

Foundation type is mapped from parcel data where available. When this data is not available, a foundation type is randomly assigned using HAZUS data. Based on the information in the hzCensusBlock table for building scheme and the tables in the MSH.mdb database that also contain the building scheme attribute, structures are classified into Slab, Pier, Crawl Space, and Basement. The likelihoods vary with flood zone (e.g. coastal or riverine high risk areas) and the structure's year built (for example, those in coastal flood zones are more likely to have a pile foundation than those in low-risk riverine areas).

Foundation heights (in feet) are mapped to each foundation type (for example, for RES1s slab = 0.5 feet and basement = 2ft). These heights were estimated in a 2021 survey completed by USACE economists with each assumed height closely matching the median value from the survey.

Vehicles

Vehicle values for each structure are based on the number of housing units for residential structures and the number of employees for commercial structures. These estimates do not vary with vehicle ownership rates or income levels throughout the nation.

Ground Elevations

Ground elevations (feet) are determined using the USGS National Elevation Dataset (NED), based on the structure location (website: https://nationalmap.gov/elevation.html).

Acronyms

API	Application Programming Interface
FEMA	Federal Emergency Management Agency, Department of Homeland Security
FIPS	Federal Information Processing Standard
FIRM	Flood Insurance Rate Maps
GCS	Geographic Coordinate System
GIS	Geospatial Information Systems
HAZUS	FEMA's Hazards of the United States
LEHD	U.S. Census Bureau's Longitudinal Employer-Household Dynamics Database, Department of Commerce
MRLC	Multi-Resolution Land Characteristics Consortium
NED	National Elevation Dataset
NFIP	National Flood Insurance Program
NLCD	National Land Cover Dataset
NSI	National Structure Inventory
USACE	U. S. Army Corps of Engineers, Department of Defense
USGS	U. S. Geological Survey, Department of the Interior