Data Story

The Location Package

A stamped binder of footprint vs reality: branches on the cover, locations on the shoot day.

Tags: film, imdb, tmdb, locations, companies, logistics
Dataset scope
  • Films: 7164
  • Years: 1914–2024
  • Branch records: 11687
  • With locations: 3771
Location strings are messy. The chart keeps recognizable country tokens and visualizes drift rather than exact distance.
Hypothesis

The mismatch rate between company footprints and filming locations increases over time as production becomes more distributed.

Question: Do production companies tend to shoot near where they have branches?

Method: Compare branch country sets with filming location country sets and aggregate flows by decade.

Prediction: Mismatch increases in later decades and concentrates on a few hubs.

Test: Trend mismatch rates by decade and inspect top branch→shoot routes.
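The method and test above can be sketched in a few lines of Python. This is a minimal illustration, not the story's actual pipeline: the record layout (`year`, `branch_countries`, `shoot_countries`), the toy films, and the function names are all hypothetical. It also assumes that a "mismatch" means the two country sets share no member, and that films missing either set are skipped.

```python
from collections import Counter, defaultdict

# Toy records; the field names and values are illustrative assumptions.
films = [
    {"year": 1955, "branch_countries": {"US"}, "shoot_countries": {"US"}},
    {"year": 1958, "branch_countries": {"US"}, "shoot_countries": {"IT"}},
    {"year": 2011, "branch_countries": {"US"}, "shoot_countries": {"CA"}},
    {"year": 2015, "branch_countries": {"US", "GB"}, "shoot_countries": {"CA"}},
    {"year": 2019, "branch_countries": {"FR"}, "shoot_countries": set()},
]

def decade_of(year):
    """Map a year to the first year of its decade, e.g. 1958 -> 1950."""
    return (year // 10) * 10

def is_mismatch(film):
    """Assumed definition: both sets are known and share no country."""
    branches, shoots = film["branch_countries"], film["shoot_countries"]
    return bool(branches) and bool(shoots) and branches.isdisjoint(shoots)

def mismatch_by_decade(films):
    """Share of comparable films per decade whose sets do not overlap."""
    totals = defaultdict(int)
    misses = defaultdict(int)
    for film in films:
        # Skip films where one side is unknown; they cannot be compared.
        if not film["branch_countries"] or not film["shoot_countries"]:
            continue
        decade = decade_of(film["year"])
        totals[decade] += 1
        if is_mismatch(film):
            misses[decade] += 1
    return {decade: misses[decade] / totals[decade] for decade in sorted(totals)}

def top_routes(films, n=3):
    """Count every (branch country, shoot country) pair on mismatched films."""
    routes = Counter()
    for film in films:
        if is_mismatch(film):
            for branch in film["branch_countries"]:
                for shoot in film["shoot_countries"]:
                    routes[(branch, shoot)] += 1
    return routes.most_common(n)

rates = mismatch_by_decade(films)
busiest = top_routes(films, n=1)
```

On the toy data, `rates` covers two decades (1950 and 2010) and `busiest` holds the single most frequent branch→shoot pair. Counting every pair on a mismatched film is one possible choice; weighting each film equally across its pairs would be another.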

Narrative Arc
Act I

Company branch stamps fill the binder cover — the official footprint.

Act II

Shoot-day stamps layer over the package; threads bind footprint to reality.

Act III

The drift becomes the story: repeated routes and growing mismatch.

Datasets
  • imdb.company_branches
  • imdb.film_locations
  • imdb.film_companies
  • tmdb.movies
  • 22_location_package.json
Limitations
  • Branch and location coverage is incomplete.
  • Location tokens include regions/cities and require normalization.
  • Match is set overlap, not geographic distance.
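To make the normalization limitation concrete, here is a minimal sketch of reducing a free-text location string to a country token. The alias table, the function names, and the sample strings are hypothetical; a real table would need far more entries, and treating "England" as GB is an assumption of this sketch. Strings with no recognizable country token are dropped rather than guessed.

```python
# Hypothetical alias table mapping lowercase tokens to country codes.
COUNTRY_ALIASES = {
    "usa": "US",
    "united states": "US",
    "uk": "GB",
    "united kingdom": "GB",
    "england": "GB",
    "italy": "IT",
    "canada": "CA",
    "france": "FR",
}

def country_token(location):
    """Return a country code for a comma-separated location string, or None.

    Tokens are scanned from the end, since the country usually comes last.
    """
    parts = [part.strip().lower() for part in location.split(",") if part.strip()]
    for part in reversed(parts):
        if part in COUNTRY_ALIASES:
            return COUNTRY_ALIASES[part]
    return None

def country_set(locations):
    """Collapse many location strings into a set of recognized country codes."""
    tokens = (country_token(location) for location in locations)
    return {token for token in tokens if token is not None}

sample = [
    "Monument Valley, Utah, USA",
    "Pinewood Studios, Iver Heath, England, UK",
    "Studio backlot",
]
recognized = country_set(sample)
```

Because the result is a set of country codes, it supports only the overlap test described above and says nothing about how far apart two places are.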