Skip to content

Data Sources

Transparency about data sources is fundamental to OpenPlanetData. This page documents all the sources we use, what data they provide, and how we evaluate their reliability.

We evaluate potential data sources based on:

CriterionDescription
AuthorityIs the source the official or authoritative provider of this data?
AccuracyHow accurate is the data based on independent verification?
CurrencyHow frequently is the source updated?
AccessibilityIs the data freely accessible and machine-readable?
LicensingDoes the license allow redistribution?

The official source for country codes:

  • ISO 3166-1 - Country codes (alpha-2, alpha-3, numeric)
  • ISO 3166-2 - Subdivision codes

What we use: Official country codes, country names

Official international organization data:

  • UN Statistics Division - Country and area codes
  • UN Member States - Membership information

What we use: Country status, UN membership, official names

Open geographic database with extensive location data:

  • Countries - Country information and boundaries
  • Administrative Divisions - Regions, states, provinces

What we use: Geographic coordinates, population data, alternate names

SourceData Provided
Rest CountriesSupplementary country data
World BankPopulation, economic data
CIA World FactbookGeneral country information

We monitor source updates and refresh our data accordingly:

Source TypeCheck FrequencyUpdate Trigger
Country DataMonthlyMonthly pipeline run
Static ReferencesQuarterlyManual review

When sources disagree, we apply these rules:

  1. Official sources first - ISO for country codes, UN for designations
  2. Consensus wins - If 3+ sources agree, use that value
  3. Document ambiguity - Flag uncertain data in metadata
  4. Manual review - Critical conflicts are reviewed by maintainers

All sources we use have licenses compatible with our CC BY 4.0 distribution:

SourceLicense
GeoNamesCC BY 4.0
ISO CodesFreely usable
UN DataOpen

Know of a high-quality data source we should consider? Open an issue in the relevant repository with:

  1. Source name and URL
  2. What data it provides
  3. How it could improve our datasets
  4. Licensing information