DataHub Releases
Summary
| Version | Release Date | Links |
|---|---|---|
| v1.5.0.6 | 2026-05-11 | Release Notes, View on GitHub |
| v1.5.0.5 | 2026-05-07 | Release Notes, View on GitHub |
| v1.5.0.4 | 2026-05-06 | Release Notes, View on GitHub |
| v1.5.0.3 | 2026-04-25 | Release Notes, View on GitHub |
| v1.5.0.2 | 2026-04-13 | Release Notes, View on GitHub |
| v1.5.0.1 | 2026-03-25 | Release Notes, View on GitHub |
| v1.5.0 | 2026-03-24 | Release Notes, View on GitHub |
| v1.4.0.3 | 2026-02-19 | Release Notes, View on GitHub |
| v1.4.0.2 | 2026-02-10 | View on GitHub |
| v1.4.0 | 2026-02-06 | View on GitHub |
v1.5.0.6
Released on 2026-05-11 by @david-leifker.
Remove kubectl binary
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.5...v1.5.0.6
v1.5.0.5
Released on 2026-05-07 by @david-leifker.
Security Update
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.4...v1.5.0.5
v1.5.0.4
Released on 2026-05-06 by @david-leifker.
Security Update
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.3...v1.5.0.4
v1.5.0.3
Released on 2026-04-25 by @david-leifker.
Security Patch Release
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.2...v1.5.0.3
v1.5.0.2
Released on 2026-04-13 by @david-leifker.
Security Patch Release
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.1...v1.5.0.2
v1.5.0.1
Released on 2026-03-25 by @david-leifker.
Changes
Include bundled venvs for datahub-gc, datahub-documents
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0...v1.5.0.1
v1.5.0
Released on 2026-03-24 by @david-leifker.
DataHub v1.5.0 Release Notes
Helm Chart Requirement: 0.9.2
Full technical release notes: Updating DataHub
Product Features
- V1 UI officially sunset. All development targets V2 UI going forward. Ensure
THEME_V2_ENABLED=trueandTHEME_V2_DEFAULT=true. - Multiple data products per asset (backend and UI).
- Policy targeting by Glossary Terms and Groups.
- Domain-scoped policies now include child domain assets.
datahub searchCLI with semantic search, query projection, and agent-context integration.
Platform
- Java 17 runtime required. Spark upgraded to 3.3.4, Hadoop to 3.3.6. Spark lineage users must be on Spark 3.3.0+.
- Default token signing key & salt removed. Operators must explicitly set
DATAHUB_TOKEN_SERVICE_SIGNING_KEYandDATAHUB_TOKEN_SERVICE_SALT. Helm users are unaffected. - Retention service disabled \= no version history. Only the current version (v0) is retained when the retention service is not enabled.
- TLS 1.0/1.1 disabled on frontend custom truststores.
- Elasticsearch reindex/index-creation retries for improved upgrade resilience.
- Kubernetes optional scale-down during system-update for blocking upgrades like reindexing. Disabled by default.
- SDK:
emit_mcps()now returnsList[TraceData]instead ofint. Trace IDs exposed forSYNC_PRIMARYandASYNCmodes. - Reproducible ingestion Docker builds via pinned transitive dependencies (
uv.lock,constraints.txt). - Python deps migrating from
setup.pytopyproject.toml(PEP 621);setup.pystill the editing source for now.
Ingestion
New Connectors
- RDF, Snowplow, Apache Doris
Breaking Changes (see migration guide for details)
- PowerBI M-Query lineage rewritten using Microsoft's official parser.
native_query_parsing: falsebehavior changed. - SQL view query IDs now use SHA-256 hashes — old query entities become orphaned. Use stateful ingestion to clean up.
- Oracle multitenant URNs now use PDB name instead of CDB name when connecting via
service_name. - Fabric OneLake workspace containers moved to
fabricplatform (fromfabric-onelake). - Vertex AI pipeline URNs restructured for stable DataFlow entities; ML Metadata extraction enabled by default (requires additional GCP permissions).
- DataHub source now uses URN pattern filtering to exclude secrets, ingestion sources, and execution requests by default.
- Kafka Connect Debezium SQL Server platform changed from
sqlservertomssql.
Enhancements
- dbt: Semantic model and exposures ingestion;
convert_urns_to_lowercaseoption for case-insensitive platforms. - Snowflake: Metadata pattern pushdown, table type filtering, external DMF assertion ingestion.
- Power BI: Column-level lineage enabled by default.
- Kafka Connect: Debezium and Confluent JDBC sink connector support; bundled JVM removes system Java requirement.
- SQL parsing: Major CTE/subquery join resolution performance improvements across all SQL-based connectors.
- Mode: Concurrent API fetching, response caching, SQL parsing optimizations.
- Trino: Column-level lineage on upstream datasets.
- Iceberg: Ingestion-time domain assignment.
- Azure Data Factory: Column lineage for Copy activity.
- Airflow plugin: Multi-statement SQL parsing for lineage.
- Sigma: Workbook filtering.
- BigQuery:
convert_column_urns_to_lowercaseoption. - Kafka source: Option to disable Avro schema name validation.
- Great Expectations & SQLAlchemy profilers brought to feature parity.
- Browse paths: DataFlow/DataJob entities get
browsePathsV2with platform instance when configured. - Vertex AI: Cross-platform lineage, hierarchical UI folders, stateful ingestion for large projects.
- Oracle: Fixed container naming with
service_name. - Configurable report sample sizes and richer failure logging.
Deprecations
- Vertex AI:
region→regions,project_id→project_ids. Old fields still work. - Vertex AI:
normalize_external_dataset_pathswill default totruein the next major version.
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.4.0.3...v1.5.0
v1.4.0.3
Released on 2026-02-19 by @jjoyce0510.
Release Highlights
- This minor release addresses a bug in Semantic Search configuration where the modelId was not respecting the environment variables for OpenAI + Cohere (Non-Bedrock embeddings model). Please use this release if you intend to leverage the Documents & semantic search capabilities in DataHub.
What's Changed
- [semantic-search] semantic search improvements for clarity (#16250) by @nwadams in https://github.com/datahub-project/datahub/pull/16267
- [v1.4.0.3] bump default cli version (#16268) by @nwadams in https://github.com/datahub-project/datahub/pull/16269
- fix(ingest): constrain setuptools<82 to fix pkg_resources runtime error by @kyungsoo-datahub in https://github.com/datahub-project/datahub/pull/16283
- fix(semanticsearchretry): ensure retry for semanticsearch (#16281) by @nwadams in https://github.com/datahub-project/datahub/pull/16286
Full Changelog: https://github.com/datahub-project/datahub/compare/v1.4.0.2...v1.4.0.3
v1.4.0.2
Released on 2026-02-10 by @david-leifker.
View the release notes for v1.4.0.2 on GitHub.
v1.4.0
Released on 2026-02-06 by @jjoyce0510.
View the release notes for v1.4.0 on GitHub.