1 Star 0 Fork 0

lastwizard / datahub

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
features.md 3.29 KB
一键复制 编辑 原始数据 按行查看 历史
Mars Lan 提交于 2020-03-11 05:25 . Update features.md

DataHub Features

DataHub is made up of a generic backend and a Ember-based UI. Original DataHub blog post talks about the design extensively and mentions some of the features of DataHub. Our open sourcing blog post also provides a comparison of some features between LinkedIn production DataHub vs open source DataHub. Below is a list of the latest features that are available in DataHub, as well as ones that will soon become available.

Data Constructs (Entities)

Datasets

  • Search: full-text & advanced search, search ranking
  • Browse: browsing through a configurable hierarchy
  • Schema: table & document schema in tabular and JSON format
  • Coarse grain lineage: support for lineage at the dataset level, tabular & graphical visualization of downstreams/upstreams
  • Ownership: surfacing owners of a dataset, viewing datasets you own
  • Dataset life-cycle management: deprecate/undeprecate, surface removed datasets and tag it with "removed"
  • Institutional knowledge: support for adding free form doc to any dataset
  • Fine grain lineage: support for lineage at the field level [coming soon]
  • Social actions: likes, follows, bookmarks [coming soon]
  • Compliance management: field level tag based compliance editing [coming soon]
  • Top users: frequent users of a dataset [coming soon]

Users

  • Search: full-text & advanced search, search ranking
  • Browse: browsing through a configurable hierarchy [coming soon]
  • Profile editing: LinkedIn style professional profile editing such as summary, skills

Schemas [coming soon]

  • Search: full-text & advanced search, search ranking
  • Browse: browsing through a configurable hierarchy
  • Schema history: view and diff historic versions of schemas
  • GraphQL: visualization of GraphQL schemas

Jos/flows [coming soon]

  • Search: full-text & advanced search, search ranking
  • Browse: browsing through a configurable hierarchy
  • Basic information:
  • Execution history: Executions and their status. Link to external service for viewing full info.

Metrics [coming soon]

  • search: full-text & advanced search, search ranking
  • Browse: browsing through a configurable hierarchy
  • Basic information: ownershp, dimensions, formula, input & output datasets, dashboards
  • Institutional knowledge: support for adding free form doc to any metric

Dashboards [coming soon]

  • search: full-text & advanced search, search ranking
  • Basic information: ownership, location. Link to exzternal service for viewing the dashboard.
  • Institutional knowledge: support for adding free form doc to any dashboards

Metadata Sources

You can integrate any data platform to DataHub easily. As long as you have a way of Extracting metadata from the platform and Transform that into our standard MCE format, you're free to Load/ingest metadata to DataHub from any available platform.

We have provided example ETL ingestion scripts for:

  • Hive
  • Kafka
  • RDBMS (MySQL, Oracle, Postgres etc)
  • LDAP
1
https://gitee.com/lastwizard_admin/datahub.git
git@gitee.com:lastwizard_admin/datahub.git
lastwizard_admin
datahub
datahub
master

搜索帮助