Releases: datahub-project/datahub

DataHub v0.8.7

03 Aug 19:56
3701931

Pre-release

Release Stability

  • Several bugs reported against this release are fixed in 0.8.8. Users are strongly advised to skip this release.

Release Highlights

  • Dataset Profiling and support for time-series metadata
  • UI for ML Models, Features; support for AWS SageMaker and Feast
  • CLI: support for rollback operations after ingestion
  • Integration fixes for Looker, dbt, and many more.
  • Demos for all these features are available in our July Townhall video

ChangeLog

DataHub v0.8.6

12 Jul 17:47
973c08d

Release Highlights

  • Fix issue when using Elasticsearch as graph database in certain configurations
  • Fix caching issues in React UI
  • Efficiency improvement for schema aspect storage
  • Improvements and fixes to various ingestion sources

Changelog

DataHub v0.8.5

01 Jul 06:26
ad08113

Release Highlights

  • Various stability fixes for v0.8.4
  • Address docker image vulnerabilities
  • New integrations: AWS SageMaker
  • Support for restoring indexes, with a how-to guide
  • Ingestion improvements: mongodb, looker, hive, snowflake

Changelog

DataHub v0.8.4

28 Jun 06:22
942196e

Release Highlights

  • Dataset Popularity, Recent Queries powered by Usage logs (support for Snowflake, BigQuery)
  • Markdown descriptions and editing
  • New Integrations: Glue Jobs, Feast
  • Versioned API for metadata GETs
  • Neo4j is no longer required; Elasticsearch can be used for the graph
  • Docker image hardening
  • Improved logging
  • GCP Deployment Guide

Changelog

DataHub v0.8.3

18 Jun 19:15
38471ac

Release Notes

Bug-fix release that fixes the editable descriptions bug from the previous release.

Previous version release notes: https://github.com/linkedin/datahub/releases/tag/v0.8.2

Changelog

#2718 @topwebtek7 fix(react): update schema description edit behavior

DataHub v0.8.2

18 Jun 13:42
550a9de

Release Notes

Bug-fix release that fixes installation, upgrade, and usability issues with v0.8.1, specifically around product analytics.
See the v0.8.1 release notes for the features introduced in that release.
The full list of improvements follows.

Changelog

DataHub v0.8.1

04 Jun 20:16
a483933

Release Notes

  • Bug-fix release that fixes installation and upgrade issues with v0.8.0.
  • See the v0.8.0 release notes for the features introduced in that release.

Changelog

DataHub v0.8.0

03 Jun 20:38
97e9660

Notable Highlights

  • Product Analytics: Understand how your users interact with DataHub
  • Product Improvements: Auto-complete across types, Task list view under Pipelines
  • Features: Business Glossary (incubating)
  • Integration improvements
    • Looker, dbt, Hive, Redshift, Glue, MongoDB
    • Kafka Connect (incubating)

and finally,

NoCodeMetadata

This release introduces a major refactor that permits extension of DataHub’s metadata model without writing any imperative code.

Highlights:

  • Removed strongly-typed, entity-specific DAOs. Added more generic services.
  • Introduced Elastic settings & mappings generation, dynamic index registration & evolution
  • Decoupled persistence layer from Pegasus + Java by removing fully-qualified class names (aspects, relationships)
  • Introduced declarative, annotation-based mechanisms for defining indexed fields, foreign key fields, entities & aspects
  • In-place upgrade CLI to aid in adopting this upgrade (datahub-upgrade)
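
The idea behind the declarative, annotation-based approach can be illustrated with a minimal sketch. This is not DataHub's implementation (the release notes describe annotations on the metadata model itself, decoupled from Pegasus + Java); it only shows, in plain Python, how index mappings might be derived from field annotations instead of hand-written per-entity code. The names searchable, DatasetProperties, and build_mappings are hypothetical.

```python
from dataclasses import dataclass, field, fields
from typing import Optional


def searchable(field_type: str):
    """Hypothetical annotation: mark a field as indexed with the given type."""
    return field(default=None, metadata={"searchable": field_type})


@dataclass
class DatasetProperties:
    # Hypothetical aspect; annotated fields are indexed, others are not.
    description: Optional[str] = searchable("text")
    origin: Optional[str] = searchable("keyword")
    internal_note: Optional[str] = None  # no annotation, so never indexed


def build_mappings(aspect_cls) -> dict:
    """Derive an Elasticsearch-style mappings dict from the annotations."""
    properties = {}
    for f in fields(aspect_cls):
        field_type = f.metadata.get("searchable")
        if field_type is not None:
            properties[f.name] = {"type": field_type}
    return {"properties": properties}


mappings = build_mappings(DatasetProperties)
print(mappings)
```

Under this scheme, adding a new indexed field means adding one annotated attribute; the mapping generator itself does not change.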

For more information, see

The PR: #2629
Technical Overview
The DataHub Metadata Model
Extending the Metadata Model
No Code Upgrade Guide

ChangeLog

DataHub v0.7.1

23 Apr 07:48
ae4def2

Notable Highlights

  • Lineage Visualization
  • Pipelines and Tasks, Flows and Jobs
  • Airflow Lineage
  • Editable Field Descriptions
  • Nested Schema Viz
  • Search Improvements
  • datahub CLI
  • Official PyPI packages
  • Production-quality Helm scripts
  • New Integrations
    • Officially-supported Sources: Airflow, AWS Glue, dbt, Druid, Superset, MongoDB, Oracle

Changelog

DataHub v0.7.0

19 Mar 03:02
5e91014

Notable Highlights

  • New React application, rewritten from the ground up
  • Support for GraphQL
  • New Metadata Ingestion Framework (Python)
    • Officially-supported Sources: Kafka, MySQL, SQL Server, Hive, Postgres, Snowflake, BigQuery, AWS Athena, Druid, LDAP
  • New Homepage and Hosted Docs redesign at datahubproject.io
  • Product Features: SSO (OIDC), Tags, Themes, Dashboards
  • Metadata Backend Implementations: MLModel ecosystem, DataFlow ecosystem
  • Move to Elasticsearch 7, with a migration guide from 5.x

Changelog
