ianozsvald / data_science_delivered Goto Github PK
View Code? Open in Web Editor NEWObservations from Ian on successfully delivering data science products
Observations from Ian on successfully delivering data science products
What sort of plan can be proposed to layout a successful project from idea through to deployment?
How to derisk? Evaluate value, costs and risks. How to stage it. How to get buy-in. How to demonstrate progress. How to deploy to a non-DS team?
Hi Ian,
Maybe add www.yhathq.com and www.sense.io to the list for 'deployment platforms'.
But I think what you have is fine as it is
learning strategies
clustering for EDA
cleaning
process
getting hired:
list of tools I'd like to see
further reading
pipeline building
tools on my radar
review:
Really interesting read!
Don't know if you are looking for listing specific data cleaning tools, but we've built a few that are useful in our own work.
https://github.com/datamade/dedupe
https://github.com/datamade/usaddress
https://github.com/datamade/probablepeople
https://github.com/datamade/parserator
For making reproducible data workflows, we also use Make. https://github.com/datamade/data-making-guidelines Would be interested to hear how you structure your data steps
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.