Skip to main content
The POI Intelligence dataset contains ~89K points of interest across U.S. metros sourced from Foursquare, OpenStreetMap, Google Places, and manual verification. Each POI is enriched with industry classification (NAICS and ISIC), walkability and transit scores, and a pioneer-flag indicator for recently opened businesses in growth areas. Every record inherits the full APRS envelope (record_id, chunk_id, bitemporal fields, confidence_score, provenance) and carries the join keys documented below.

Dataset-specific fields

FieldTypeNullableDescription
poi_idstringnoPrimary POI identifier.
namestringnoBusiness or location name.
categorystringnoTop-level category (e.g. restaurant, retail, office).
subcategorystringyesRefined category (e.g. fast_casual, coworking).
naics_codestringyes6-digit NAICS industry code (2022 revision).
isic_codestringyesISIC Rev. 4 international industry code.
latfloatnoWGS84 latitude.
lngfloatnoWGS84 longitude.
h3_indexstringnoH3 resolution-8 cell.
addressstringyesFull postal address.
phonestringyesContact phone number.
websiteURLyesBusiness website.
is_pioneerbooleannotrue when the POI opened in the last 12 months in a growth context. See pioneer classification.
walk_scoreinteger [0,100]yesWalkability score for the location.
transit_scoreinteger [0,100]yesTransit accessibility score.
reviews_sampleJSON arrayyesUp to 5 representative reviews.
photo_countintegeryesNumber of photos available from source feeds.
source_feedenumnoOriginating source: foursquare, osm, google, manual.

Pioneer classification

A POI is classified as a “pioneer” when it meets all three criteria:
  1. Opened within the last 12 months
  2. Located in a cell with a rising Urban Signal Grid composite score
  3. Nearby area shows positive net migration and recent construction permits
Pioneer POIs are early indicators of neighborhood transformation. Use the is_pioneer flag to identify emerging commercial corridors before they appear in traditional market reports.

Industry classification

Each POI carries up to three parallel industry taxonomies:
TaxonomyFieldCoverage
Codex categorycategory / subcategory100% of records
NAICSnaics_code~85% of records
ISICisic_code~80% of records
Use naics_code for cross-joins with OSHA Safety, LEHD Commuter Flows, and other government datasets that use NAICS classification.

Source feeds and licensing

SourceLicenseNotes
OpenStreetMapODbL-1.0Available in all tiers
FoursquareCC-BY-4.0Research and Commercial tiers
Google PlacesRestrictedCommercial tier only
Multi-source identity resolution maps duplicate entries from different feeds to a single poi_id. Source-native identifiers are preserved in the identifier array within metadata.

Join keys

KeyPresenceNotes
record_idalwaysAPRS URN
chunk_idalwaysDeterministic from record_id
poi_idalwaysPrimary POI identifier
h3_indexalwaysH3 resolution-8 spatial key
naics_codeoftenJoin with OSHA Safety, LEHD, and other NAICS-indexed datasets
entity_urnsometimesEntity resolution link (null until resolution pipeline runs)

Example query

Find pioneer restaurants in a target metro area:
SELECT
  poi_id,
  name,
  category,
  subcategory,
  naics_code,
  walk_score,
  transit_score,
  is_pioneer
FROM read_parquet('poi-intelligence-2026-04.parquet')
WHERE h3_index LIKE '8828308%'
  AND is_pioneer = true
  AND category = 'restaurant'
ORDER BY walk_score DESC;

Known limitations

  • reviews_sample contains at most 5 representative reviews per POI. Full review text is not redistributable.
  • walk_score and transit_score are null for POIs in areas without sufficient data coverage.
  • Multi-source resolution means a single real-world business may have been merged from 2–3 source entries. Check metadata.identifier for source-native IDs.
  • Pioneer classification depends on Urban Signal Grid scoring freshness — newly scored cells may take up to one refresh cycle to propagate.