mydatastack / pipes Goto Github PK
View Code? Open in Web Editor NEWOne-click deployment, open source platform for moving data with radically less overhead and cost
One-click deployment, open source platform for moving data with radically less overhead and cost
Artikel zu AI + technischer Fortschritt
Must-read:
https://www.datasciencecentral.com/profiles/blogs/what-makes-a-successful-ai-company
http://www.bradfordcross.com/blog/2017/6/13/vertical-ai-startups-solving-industry-specific-problems-by-combining-ai-and-subject-matter-expertise
Nice-to-read:
https://www.datasciencecentral.com/profiles/blogs/comparing-the-four-major-ai-strategies
https://www.datasciencecentral.com/profiles/blogs/comparing-ai-strategies-systems-of-intelligence
https://www.datasciencecentral.com/profiles/blogs/comparing-ai-strategies-vertical-vs-horizontal
https://www.datasciencecentral.com/profiles/blogs/ai-strategies-incremental-and-fundamental-improvements
Full stack AI:
https://sifted.eu/articles/machine-learning-full-stack/
Trigger alarm if no incoming data in S3 bucket within e.g. 24hrs. Should be defined in the parameters by customers.
Remove the lambda that does the configuration and change it to the cf configuration see above.
Classify ip to locations either during ETL or during validation / transformation phase.
Checkout performance penalty.
Create a model for deduplication and proper tracking.
Inspiration? * https://help.amplitude.com/hc/en-us/articles/115003135607
/ga/source/event/year=XXXX/month=XX/day=XX/hour=XX/
There is too much focus on building the model, not much focus on how to integrate into existing products.
Hi
Can you write a brief description in readme file about your code? please explain how to execute the make file and their order also, please.
Based on the workshop:
Data sent to the Import API is processed and sent to your destination through Stitch like data from any other integration.
Singer is a powerful way to write data integration jobs, called taps. Singer provides core functionality needed by applications whose goal it is to replicate data from a source to a destination on an incremental, scheduled basis. Common functionality provided by the protocol includes:
Persistent bookmarks for incremental replication
Authentication for common authentication schemes
Support for common data formats
What’s particularly critical, however, is that every Singer tap can be run within the Stitch platform. This is important because 80%+ of the cost associated with a data integration is in the maintenance phase. With your tap deployed on Stitch, you won’t have to worry about:
hosting a server where jobs are run
scheduling jobs
viewing log output of jobs
building notification systems to let you know if there are run failures
Currenly the redeployment happens in manual mode.
This exec tearsheet is our most important dashboard. Each dashboard included the metrics that the teams were responsible for in the exec tearsheet, and also included other supporting metrics. The metrics that we measure exist in relation to our engagement funnels. You can think of three funnels at 500px: 1) visitor -> signup -> daily active -> daily engaged -> paid subscriber 2) visitor -> signup -> photo upload -> photo submit to marketplace -> photo sold on marketplace 3) visitor -> signup -> purchase photo from marketplace
Each team owns different parts of this funnel for different products:1) The marketing teams own (page views) and the top and bottom (revenue) of this funnel. 2)The product teams has less of an emphasis on top of funnel metrics. 3)The development teams (web and mobile), want to see the entire funnel with respect to their own products.
Source: https://medium.com/@samson_hu/building-analytics-at-500px-92e9a7005c83
Parse user_agent to get and spread into a complex data structure for further analysis.
Find partner agencies, companies with a high leverage (labor + capital) that are going to promote the solution to their customers (e.g. pandata, leroi, trakken)
Mentioned by Matthias see case w/ Tobias.
Agencies that have big customers in their portfolio, customers to which you won't have access as a 1-men-show.
The bots should be filtered out during validation or firehose transformation.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.