Skip to main content
The LEHD Commuter Flows dataset contains ~454K origin-destination pairs derived from Census Bureau LODES v8 (Longitudinal Employer-Household Dynamics). Each pair represents a commute flow between two H3 resolution-8 cells, enriched with income breakdowns, NAICS sector counts, and Codex-computed accessibility indices. Every record inherits the full APRS envelope (record_id, chunk_id, bitemporal fields, confidence_score, provenance) and carries the join keys documented below.

Dataset-specific fields

Flow identifiers

FieldTypeNullableDescription
origin_h3stringnoH3 resolution-8 cell of the workplace (derived from LODES w_geocode).
destination_h3stringnoH3 resolution-8 cell of the residence (derived from LODES h_geocode).
origin_block_fipsstringnoCensus block FIPS code (15-character) for the workplace. Retained as a Census-native identifier.
destination_block_fipsstringnoCensus block FIPS code for the residence.
metro_slugstringyesMetro area identifier (derived from metro FIPS).

Worker counts

FieldTypeNullableDescription
worker_countintegernoTotal number of jobs (LODES S000).
worker_count_lt30kintegernoJobs with earnings < $1,250/month (LODES SE01).
worker_count_30to60kintegernoJobs with earnings 1,2511,251–3,333/month (LODES SE02).
worker_count_gt60kintegernoJobs with earnings > $3,333/month (LODES SE03).
income_bandenumnoCodex-derived dominant income band: low, mid, or high.
job_sector_naicsstringyesNAICS sector code with aggregated counts (LODES SI01SI03).

Codex enrichments

FieldTypeNullableDescription
accessibility_indexfloat [0,1]noCodex-computed accessibility score per origin-destination H3 pair. Higher values indicate better transit and commute options.
distance_kmfloatnoCentroid-to-centroid distance in kilometers between origin and destination H3 cells.
h3_neighbor_rankintegernok-ring distance from origin to destination (0 = same cell, 1 = immediate neighbor, etc.).
lehd_yearintegernoReference year for the LODES data (e.g. 2023).

Income bands

The income_band field is derived from the LODES earnings breakdown:
BandCriterion
lowPlurality of workers earn < $1,250/month
midPlurality of workers earn 1,2511,251–3,333/month
highPlurality of workers earn > $3,333/month

Accessibility index

The accessibility_index is a Codex-computed score that factors in transit coverage, commute distance, and commute volume between the origin and destination cells. It is useful for identifying well-connected corridors versus underserved commute routes.
Census LODES data includes noise infusion to protect respondent privacy. Codex preserves this noise as-is — small counts (under ~10 workers) may not reflect exact flows.

Join keys

KeyPresenceNotes
record_idalwaysAPRS URN
chunk_idalwaysDeterministic from record_id
origin_h3alwaysJoin with Urban Signal Grid, POI Intelligence, or any H3-indexed dataset
destination_h3alwaysSame as above
origin_block_fipsalwaysCensus-native identifier for block-level joins
destination_block_fipsalwaysSame as above
metro_slugoftenMetro area identifier

Example query

Find the highest-volume commute corridors into a downtown cell:
SELECT
  origin_h3,
  destination_h3,
  worker_count,
  income_band,
  accessibility_index,
  distance_km
FROM read_parquet('lehd-commuter-flows-2026-04.parquet')
WHERE destination_h3 = '88283082b9fffff'
  AND worker_count >= 50
ORDER BY worker_count DESC
LIMIT 20;

Known limitations

  • LODES data lags by 2–3 years. The lehd_year field indicates the reference year — it does not reflect current conditions.
  • Census noise infusion means small cell-pair flows (under ~10 workers) carry significant uncertainty.
  • Block-level granularity is collapsed to H3 resolution 8. Multiple Census blocks may map to the same H3 cell.
  • metro_slug is null for flows in rural areas outside defined metro boundaries.