Comments (2)
@teabot : It's great to hear that you're interested in learning more about Marquez! The design doc linked in the README was originally authored internally at WeWork, then shared publicly. If you shoot me an email at [email protected] I'd be happy to get you access to the doc. And given that the project is in the early development phase, it'd be great to get your feedback as well. Note that there are gaps in the design doc and sections that need to be revisited, but updates to the doc will happen more regularly.
We are currently working on building metadata collection as a core requirement into all jobs (streaming or batch) at WeWork. Our immediate focus is to integrate Marquez with Airflow in order to capture the job (= task) runtime arguments, input/output datasets, and state (RUNNING, COMPLETED, etc.). This will help define both the Job API and Dataset API.
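As a rough illustration of the metadata described above (runtime arguments, input/output datasets, and state), here is a minimal Python sketch. Everything in it is an assumption made for illustration: `JobRun`, `RunState`, the field names, and the sample values are hypothetical and do not reflect Marquez's actual Job API or Dataset API, which were still being defined at the time.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class RunState(Enum):
    # Only the two states named in the comment; the "etc." is left open.
    RUNNING = "RUNNING"
    COMPLETED = "COMPLETED"


@dataclass
class JobRun:
    """Hypothetical record of one job (= task) run; not Marquez's real model."""

    job_name: str
    run_args: Dict[str, str] = field(default_factory=dict)
    input_datasets: List[str] = field(default_factory=list)
    output_datasets: List[str] = field(default_factory=list)
    state: RunState = RunState.RUNNING

    def complete(self) -> None:
        # Mark the run as finished.
        self.state = RunState.COMPLETED


# Made-up sample values, for illustration only.
run = JobRun(
    job_name="example_task",
    run_args={"ds": "2019-01-01"},
    input_datasets=["raw.events"],
    output_datasets=["clean.events"],
)
run.complete()
```

Keeping the inputs and outputs on the run record is what would let a lineage graph be derived later; how Marquez actually models this is not stated here.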
We have an internal roadmap and milestones for Marquez at WeWork, but our goal is to be transparent about the project and its direction. I just opened issue #74
from marquez.
@teabot : We have made our design public! See https://marquezproject.github.io/marquez