dataops-tk / tapdance
Tapdance is an orchestration layer for the open source Singer tap platform.
License: MIT License
Passing in a comma-separated list of tables via tap_settings["TABLE_NAME"] should cause tapdance to sync only the tables specified. However, the filter is currently ignored: tapdance falls back to the default value of "*", which syncs all the tables.
I suspect the issue is on line 111 in syncs.py, which currently checks the table_name variable before looking at tap_settings["TABLE_NAME"]. If both are populated (with a truthy value), the OR condition returns the value from table_name.
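The suspected precedence problem can be reproduced in isolation. The following is only a minimal sketch, not the actual code in syncs.py: the function names, the DEFAULT_TABLE_NAME constant, and the sample table names are hypothetical, and the "fixed" version assumes that an explicit tap_settings["TABLE_NAME"] should win whenever table_name still holds the default.

```python
# Hypothetical stand-in for the default described in the issue.
DEFAULT_TABLE_NAME = "*"


def resolve_table_filter_buggy(table_name, tap_settings):
    # `or` returns its first truthy operand, so a table_name of "*"
    # always wins and the TABLE_NAME setting is never consulted.
    return table_name or tap_settings.get("TABLE_NAME")


def resolve_table_filter_fixed(table_name, tap_settings):
    # Assumption: a table_name other than the default was passed on purpose
    # and takes priority; otherwise fall back to the setting, then the default.
    if table_name and table_name != DEFAULT_TABLE_NAME:
        return table_name
    return tap_settings.get("TABLE_NAME") or DEFAULT_TABLE_NAME


settings = {"TABLE_NAME": "orders,customers"}  # made-up table names
buggy_result = resolve_table_filter_buggy("*", settings)
fixed_result = resolve_table_filter_fixed("*", settings)
print(buggy_result)  # the default, filter ignored
print(fixed_result)  # the comma-separated list from tap_settings
```

Whether table_name or the setting should take priority when both are explicitly given is a design decision for the maintainers; the sketch only shows why a truthy default masks the setting.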
Given the rule:
!x*
The expected result is to exclude x$somethingorother but not exclude myextratable. Currently both are getting excluded.
Most likely this bug is caused by the table matching logic not properly checking for left-side boundary when performing the regex check.
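The difference between an unanchored and an anchored match can be sketched as follows. This is an illustration under assumptions, not tapdance's matching code: the helper names are hypothetical, the rule syntax is reduced to a leading "!" plus "*" wildcards, and re.fullmatch (which anchors both ends, not just the left side) is one possible fix.

```python
import re


def rule_to_regex(rule):
    # Strip the leading "!" (exclusion marker) and turn each "*" into ".*",
    # escaping everything else so characters like "$" are matched literally.
    glob = rule[1:] if rule.startswith("!") else rule
    return ".*".join(re.escape(part) for part in glob.split("*"))


def is_excluded_unanchored(table, rule):
    # re.search matches anywhere in the name, so "x*" also hits the
    # "x" in the middle of "myextratable".
    return re.search(rule_to_regex(rule), table) is not None


def is_excluded_anchored(table, rule):
    # re.fullmatch requires the pattern to cover the whole name,
    # so the name must actually start with "x".
    return re.fullmatch(rule_to_regex(rule), table) is not None


for table in ("x$somethingorother", "myextratable"):
    print(table, is_excluded_unanchored(table, "!x*"), is_excluded_anchored(table, "!x*"))
```

With the unanchored check both tables are reported as excluded, which is the behavior described above; with the anchored check only x$somethingorother is.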
The workflow cicd.yml references the action actions/checkout at version v1. However, this reference is missing the commit a6747255bd19d7a757dbdda8c654a9f84db19839, which may contain a fix for a vulnerability.
The fix missing from this version of the action could be related to:
(1) a CVE fix
(2) an upgrade of a vulnerable dependency
(3) a fix for a secret leak, among others.
Please consider updating the reference to the action.
An inquiry came through on Slack asking whether this can be run without any Python wrapper.
This would be good to explore and document.
From the logs, here's the docker run statement from plan when running locally. (The plan statement is Tapdance's replacement for discover.)
docker run -v C:\Files\Source\tapdance\samples\taps:/projects/my-project -w /projects/my-project dataopstk/tapdance:tap-covid-19 tapdance plan covid-19
And here's the sync command:
docker run -v C:\Files\Source\tapdance\samples\taps\covid-19:/projects/my-project -w /projects/my-project dataopstk/tapdance:covid-19-to-csv tapdance sync covid-19 csv
And a screenshot of a sync running natively in ECS (dockerized):
(Note that currently, running the sync command also runs plan.)
Esp.:
Expected default behavior:
TODO