soumyadeep-github Goto Github PK
Name: Soumyadeep Mukhopadhyay
Type: User
Company: Merck KGaA
Location: Bengaluru
Name: Soumyadeep Mukhopadhyay
Type: User
Company: Merck KGaA
Location: Bengaluru
A simple pipeline to transform data within Azure Data Factory using Azure Databricks. Although it is written in Scala the same can be replicated in Python.
Tutorial like code for how to deploy airflow using docker and how to use the DockerOperator.
Get all product reviews for a product on Amazon.
This contains an attempt towards analyzing the Black Friday data set from Kaggle.
This repository contains files which were used to transform a set of .txt files such that the date column can be shifted to the front of the given files.
This repository contains a .pbix (power bi desktop) and excel file. The excel file contains sales reports of Bellings and Ready Ware, two retails chains running throughout Australia while the power bi file contains some analysis about the same.
The aim of this project was to analyse given data set and find out if there exists any trends. The data is produced by a tool similar to Google analytics and the dataset is about a website which is an online repository for books.
The aim of this project is automate data ingestion from flat files like CSV and compressed files GZIP into a database like Postgres. The entire setup is automated using Docker and is pretty fast too as multiprocessing is being used.
Data Wrangling, Analysis and AB Testing with SQL
Basic setup for Django with Docker
Have you ever needed to go through multiple e-commerce websites simultaneously to check the price of the same product? Prices can differ but having a list side by side would be a bonus, wouldn't it? This Flask App can do exactly that.
The fastai deep learning library, plus lessons and tutorials
Just a barebones program to get into Flipkart scraping.
My ML Learning
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggregates Twitter and US stock market data for user sentiment analysis using open source tools - Apache Kafka for data ingestions, Apache Spark & Spark Streaming for batch & real-time processing, Apache Cassandra f or storage, Flask, Bootstrap and HighCharts f or frontend.
Example for article Running Spark 3 with standalone Hive Metastore 3.0
A simple Java project to fetch top 250 movies from IMDB into a CSV file.
A repository for learning Apache Spark using Scala.
Scraping data from LinkedIn. The aim was to scrape data off of LinkedIn. The scraping project was solely performed as an experiment and has no other intentions.
This repository contains a loans dataset and a Power BI file to visualize this data.
Assignment for Public Finance Researcher position CivicDataLab
The aim of this project is to perform analysis on some (car crash) data using PySpark and make the entire process deployable using Docker.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.