Data Story
The Location Package
A stamped binder of footprint vs reality: branches on the cover, locations on the shoot day.
filmimdbtmdblocationscompanieslogistics
Dataset scope
7164
films1914–2024
years11687
branch records3771
with locationsLocation strings are messy. The chart keeps recognizable country tokens and visualizes drift rather than exact distance.
Loading the location package
Hypothesis
The mismatch rate between company footprints and filming locations increases over time as production becomes more distributed.
Question: Do production companies tend to shoot near where they have branches?
Method: Compare branch country sets with filming location country sets and aggregate flows by decade.
Prediction: Mismatch increases in later decades and concentrates on a few hubs.
Test: Trend mismatch rates by decade and inspect top branch→shoot routes.
Narrative Arc
Act I
Company branch stamps fill the binder cover — the official footprint.
Act II
Shoot-day stamps layer over the package; threads bind footprint to reality.
Act III
The drift becomes the story: repeated routes and growing mismatch.
Datasets
- imdb.company_branches
- imdb.film_locations
- imdb.film_companies
- tmdb.movies
- 22_location_package.json
Limitations
- Branch and location coverage is incomplete.
- Location tokens include regions/cities and require normalization.
- Match is set overlap, not geographic distance.
Next
Want another story? Head back to the film data stories index or explore a new concept.
Back to indexarrow_forward