Data Dynamics Open-Sources 'Argus Catalog,' an Integrated AI·Data·API Catalog
Data Dynamics has fully open-sourced its integrated metadata management solution 'Argus Catalog' on GitHub under the Apache License 2.0, declaring that data sovereignty must be secured by enterprises themselves, along with a commitment to community-driven continuous improvement.
"Data sovereignty must be secured by enterprises themselves, not by a specific vendor" … Fully released via GitHub, with a declaration of community-driven continuous improvement
Data Dynamics, Inc., a company specializing in AI and data platforms, announced on the 11th that it is open-sourcing its integrated metadata management solution 'Argus Catalog.' The source code is distributed through a public GitHub repository (DataDynamics-OSS/argus-catalog) under the Apache License 2.0, which places no restrictions on commercial use.
Argus Catalog is an integrated AI·Data·API catalog designed so that enterprises can discover, trust, and govern their scattered data and AI assets in one place. Data Dynamics is fully releasing on GitHub every area — backend, frontend, SDK, AI agent, quality batch, and user manual — except for the metadata ingestion connector modules.
Key Components
- Data Catalog — Along with DataHub-style dataset registration, search, tagging, and a glossary, it provides column-level lineage that traces flows across heterogeneous systems, ER diagrams, data quality profiling, rule validation, automatic quality-score synchronization, and morphological-analysis-based measurement of data standard compliance.
- ML Model Registry — It governs model versions, stages, and metrics through a Unity Catalog OSS–compatible API, MLflow integration, and an OCI manifest–based model hub (a HuggingFace-style browser with air-gapped import).
- AI Governance — It includes LLM-based automatic generation of AI metadata (descriptions, summaries, column descriptions, tag recommendations, PII detection), a tool-use AI assistant that answers by invoking the catalog's real data as a tool, and an AI Agent catalog that manages agents' tools, MCP, evaluation, and metering.
- API Catalog & Semantic Search — It comes built-in with cached metadata / Avro schema APIs for external systems and pgvector-based hybrid (keyword + semantic) search.
In particular, AI metadata generation and the assistant integrate not only with OpenAI and Anthropic but also with local LLMs such as Ollama, enabling full AI governance even in on-premises and air-gapped environments where data never leaves the organization.
"Enterprises Securing Data Sovereignty Themselves"
Data Dynamics emphasized "data sovereignty" as the backdrop for this release.
"Metadata and governance are at the heart of an enterprise's data strategy, yet when this very area becomes locked into a specific vendor, the enterprise can no longer freely examine even the flow of its own data. Releasing Argus Catalog on GitHub under Apache 2.0 is the practice of our belief that any enterprise should be able to control its own data and AI assets on its own infrastructure."
— Byounggon Kim, CEO
He added, "The ingestion connectors will be released sequentially at a later date, and since we have released the entire core engine apart from those, enterprises can verify the code directly, extend it to fit their own environments, and operate it without any external data leakage."
A Roadmap for Community-Driven Continuous Improvement
Data Dynamics also presented a roadmap to continuously evolve the features together with the open-source community — using the GitHub repository as a base — rather than stopping at a one-time release. CEO Byounggon Kim cited the following as key tasks to pursue with the community:
- Expanding the connector ecosystem to diverse data sources and query engines
- Advancing AI agent governance and evaluation frameworks
- Strengthening the level of automation for lineage, quality, and standards
Releasing the user manual (Antora-based), demo seed data, and the local development infrastructure setup together — lowering the barrier to contribution — is part of the same effort. Anyone can take part in improvements through issues and pull requests in the GitHub repository.
"The more enterprises and developers participate, the more refined the catalog becomes, and those benefits flow back to all users. We will grow Argus Catalog together with the community so that it becomes a common foundation for many enterprises to secure data sovereignty — beyond being any single company's product."
— Byounggon Kim, CEO