Open Data DictionaryOpen Data Dictionary

About

Our mission, values, and vision for standardizing data terminology across organizations.

Open Data Dictionary is a community-driven platform for standardizing data terminology across organizations. It provides a shared, open vocabulary so that teams, tools, and systems can speak the same language when working with data.

The Problem

Every organization reinvents data definitions. customer_id means different things in different systems. status could be a boolean, an enum, or a free-text field depending on who built the table. Data teams waste hours reconciling terminology before they can even begin analysis.

There is no shared, open standard for data dictionary terms — until now.

Our Beliefs

  1. Data terminology should be open and shared. Like open-source software, data definitions benefit from community consensus rather than proprietary silos.

  2. Community-driven standards outlast top-down mandates. Voting, submissions, and peer review produce better definitions than a single authority dictating terms.

  3. Interoperability starts with shared language. Before systems can talk to each other, teams need to agree on what terms mean. A universal data dictionary is the foundation.

  4. Machine-readable definitions enable automation. MCP integration, APIs, and structured schemas make definitions actionable — not just documentation sitting in a wiki.

  5. Accessibility over gatekeeping. Free, open, and usable by anyone — from solo developers to enterprise data teams.

How It Works

Word Lifecycle

Submit → Community Vote → Admin Review → Approved
  1. Submit — Anyone can propose a new term with a word name and definition.
  2. Vote — The community votes on submissions to surface the most useful definitions.
  3. Review — Admins approve, reject, or request changes based on community input and quality standards.
  4. Published — Approved terms appear in the public dictionary, searchable by anyone.

What Makes an ODD Entry

Each entry in the Open Data Dictionary includes:

FieldDescription
WordThe canonical term name (e.g., customer_id, created_at)
DefinitionClear, concise explanation of the term's meaning and usage
Data TypeExpected data type (String, Integer, Boolean, Timestamp, etc.)
CategoriesIndustry or domain categories (Finance, Healthcare, E-commerce, etc.)
SynonymsAlternative terms that mean the same thing
StatusLifecycle stage (pending, approved, rejected)

Integration

Getting Involved

  • Submit terms — Propose new data terminology definitions
  • Vote — Help surface the best definitions through community voting
  • Use the API — Integrate standardized definitions into your tools and workflows
  • Connect via MCP — Let your AI assistant query the dictionary directly

On this page