Skip to content

Metadata

Our proximity also comes with its own suite of metadata. We've taken the time to analyze this data across several different large datasets (such as demographics and sales) in order to make the segments more understandable.

Taxonomy Metadata

Head on over to https://taxonomy.spatial.ai/ for a look at the metadata for the Proximity dataset. If you are a Proximity dataset customer, you can find the full metadata in .json and .csv form in your cloud account.

The taxonomy for the Proximity dataset provides a beautiful interface to explore the segments. You can see, for each segment, how they compare across a range of demographic and other attributes including:

  • Social Topics
  • Geography
  • Age
  • Income
  • Ethnicity
  • Education
  • Employment
  • Population Density
  • Home Ownership
  • Voting Preferences
  • Tendencies Over Time
  • Retail & Restaurant Correlations

We'll provide a little bit more information about some these sections below.

Social Topics

At its core, the proximity dataset is all about social topics. This dataset is based 100% on social media, and for each segment we are providing examples of the most common words/topics that drive high scores in the segment. Let's consider the Coffee Connoisseur segment's topics.

Coffee Connoisseur example topics

When lots of people are posting words and phrases like those in the screenshot, the Coffee Connoisseur segment will have a high score.

Demographics

There are many demographic variables included in the metadata. In order to calculate these variable values, we selected the top 5% of US Census block groups for each segment and calculated aggregate demographics for those areas.

Tendencies Over Time

In order to understand tendencies over time for the segments, we randomly chose social media from each month of the year, day of the week, and time of day. We split that social media into groups and plotted changes in segment scores over those time splits. The results are what you see in the displayed chart.

Chart of Coffee Connoisseur scores over time

Retail & Restaurant Correlations

For each segment, we provide the types of businesses that are correlated with this segment. When this segment scores highly, these are the businesses that are most likely to be nearby. These business types have been split into 2 categories-- Drivers and Opportunities.

Drivers are the business types that typically lead to an increase in a certain segments score. For example, coffee shops drive higher scores in the Coffee Connoisseur segment and stadiums drive higher scores in the Organized Sports Segment.

Types of Business that drive Coffee behavior

On the other hand, businesses that tend to show up in areas that are high in Coffee Connoisseur behavior but don't lead people to make coffee-related posts are Opportunities. These are the types of businesses that succeed when coffee-related behaviors are present in an area.

Types of Business that do well near coffee behavior

Other Dataset Metadata

Using other datasets, we've generated some extremely useful breakouts of these segments. If you are a Proximity dataset customer, you can find the following datasets in your cloud account.

Brand Profiles

By combining mobile visitation data with the Proximity dataset, it is possible to understand brand profiles based on their Proximity data. Here's what this data looks like:

Segment United States Postal Service (USPS) Subway Dollar General McDonald's Shell Oil Starbucks ...
EA01 - Bookish 21.3% 25.2% 19.9% 24.5% 24.1% 29.0% ...
EA02 - Engine Enthusiasts 30.7% 30.4% 30.3% 30.5% 30.4% 29.1% ...
EA03 - Green Thumb 27.9% 28.0% 25.8% 27.1% 28.6% 31.0% ...
EA04 - Natural Beauty 35.4% 29.1% 33.8% 28.2% 30.5% 27.7% ...
EA05 - Wanderlust 32.3% 26.6% 30.6% 25.8% 27.7% 25.0% ...
... ... ... ... ... ... ... ...

This data is interpreted as the percentage of visitors to a brands' locations that come from areas scoring high for the given segment. For example, the first value in the table is interpreted as 21.3% of visitors to USPS come from areas that score highly for the Bookish segment. The threshold for a "high" score is 80th percentile.

This kind of data can give useful information for marketing or understanding the types areas that a brand should locate in. For example, let's look at Starbucks... here are the top 10 Proximity segments for Starbucks based on mobile visitation.

Segment Starbucks
EK01 - Organized Sports 38.2%
EE07 - Lighthearted Fun 35.6%
EE02 - Competitive Nature 35.6%
EK02 - Fitness Obsession 35.0%
EK04 - Yoga Advocates 35.0%
EC04 - Connected Motherhood 34.9%
EB07 - Farm Culture 34.7%
EA06 - Handcrafted 34.2%
EA07 - Animal Advocates 33.9%
EI04 - Fitness Fashion 33.7%

Starbucks should locate near areas high in Organized Sports, Lighthearted Fun, Competitive Nature, Fitness Obsession, and Connected Motherhood. The combination of these neighborhoods indicate that Starbucks customers come from well off suburban areas. They could even combine these 5 segments and treat them as in index for choosing new locations (like this).

Retail Sales Relationships

We also provide Proximity segments' relationships to various retail sales categories.

Segment Art Dealers - naics_45392 Children's & Infants' Clothing Stores - naics_44813 Family Clothing Stores - naics_44814 Men's Clothing Stores - naics_44811 Clothing & Accessories Stores - naics_448 ...
Bookish - EA01 266.40 187.22 188.33 196.75 184.73 ...
Engine Enthusiasts - EA02 44.93 149.71 151.03 127.21 139.43 ...
Green Thumb - EA03 84.94 90.04 92.76 113.81 97.71 ...
Natural Beauty - EA04 49.25 29.45 40.59 53.76 44.53 ...
Wanderlust - EA05 157.66 99.62 121.66 134.59 122.59 ...
... ... ... ... ... ... ...

In this data, an index of 100 indicates that sales are equivalent to the national average and an index greater than 100 indicates higher than average sales.


Back to top