
Welcome to the world of Tanay!

I like to introduce myself as a statistician and data science enthusiast with 6 years of experience in the analytics domain, delivering data-driven solutions that increase the efficiency and accuracy of data processing, backed by strong programming expertise. I am experienced at building regression models, applying predictive modeling, and working through data mining algorithms to deliver insights and implement action-oriented solutions to complex business problems.

I’ve built my career in a variety of roles and industries, mostly around analytics and data science, and have worked with Fortune 500 companies like IBM, Ogilvy (WPP Group), Maersk, and Tesla. I have worked with clients from across the globe and had a wonderful time learning about different cultures.

I am comfortable with R and Python and can use them to solve a range of problems, from data wrangling and modeling to automation. I have built multiple projects on ML libraries, and my work is accessible via GitHub repositories here. I am confident with SQL for fetching data and performing the operations needed to ensure we have clean, filtered data for processing. I also have good experience with data visualization tools like Tableau and Google Data Studio.

I have a decent understanding of big data and have worked on AWS, where I configured and established connections on my own for a multitude of the available cloud services. I have worked with EMR, EC2, S3, Athena, and Kinesis. I am familiar with the Hadoop ecosystem, and I keep building on this knowledge by implementing projects.

Presently, I am quickly getting up to speed with the world of deep learning. Have anything to share or discuss? Please feel free to reach out to me at [email protected]; I would love to collaborate.

In my free time, I like to read or listen to music. Watching sports is my favorite way to relax: be it cricket, tennis, or soccer, I am in for all of them. The last book I read was Crime and Punishment by Fyodor Dostoevsky. If you haven't read it yet, please do at the first chance.

Want to know more? Connect with me on LinkedIn and we can have more discussions over a cup of tea, coffee, or ice cream. :)

We bear, we grow; like seeds, like plants.

Tanay Mukherjee's Projects

a-b-testing-in-r

A/B testing (or split-testing) is a randomized experiment with two variants A and B. It includes application of statistical hypothesis testing (or two-sample hypothesis testing), as used in the field of statistics. A/B testing is a way to compare two versions of a single variable, typically by testing a subject's response to variant A against variant B, and determining which of the two variants is more effective.
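
As a rough illustration of the two-sample hypothesis test described above, here is a minimal Python sketch of a two-proportion z-test. The repository itself is written in R, so this is not its code; the function name `two_proportion_z_test` and its parameters are hypothetical, and the test assumes samples large enough for the normal approximation to hold.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for the difference between two conversion rates.

    conv_a / conv_b are the numbers of successes, n_a / n_b the sample sizes.
    All names here are illustrative assumptions, not part of any library.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both variants convert equally.
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    standard_error = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / standard_error
    # Two-sided p-value from the standard normal distribution:
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value
```

Calling `two_proportion_z_test(100, 1000, 150, 1000)` returns the z statistic and p-value for variant B against variant A; a small p-value suggests the difference in conversion rates is unlikely to be due to chance alone.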

analysing-sentiments-with-tweets

We will use the NLTK package and Bayes' theorem to analyse tweets and predict whether the user shows signs of depression or negative feelings.

analyzing-epidemics

The use of mathematical models to understand infectious disease dynamics has a very rich history in epidemiology. The dynamics of infectious diseases show a wide diversity of patterns. In this exercise, we will look closely at these patterns.
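
To give a flavour of such a model, here is a minimal Python sketch of the classic SIR (susceptible, infected, recovered) compartmental model, integrated with a simple forward Euler step. Choosing SIR is an assumption made for illustration; the repository may use a different model. The function name `simulate_sir` and every parameter value are hypothetical.

```python
def simulate_sir(beta, gamma, s0, i0, r0, days, dt=0.1):
    """Integrate the SIR equations with a fixed-step forward Euler scheme.

    beta is the transmission rate and gamma the recovery rate (both per day).
    Returns one (S, I, R) tuple per day, starting with the initial state.
    """
    s, i, r = float(s0), float(i0), float(r0)
    n = s + i + r  # total population, constant in this model
    steps_per_day = round(1 / dt)
    history = [(s, i, r)]
    for _ in range(days):
        for _ in range(steps_per_day):
            new_infections = beta * s * i / n * dt
            new_recoveries = gamma * i * dt
            s -= new_infections
            i += new_infections - new_recoveries
            r += new_recoveries
        history.append((s, i, r))
    return history
```

For example, `simulate_sir(0.3, 0.1, 990, 10, 0, 160)` traces an outbreak in a population of 1000 in which each infected person infects, on average, three others while infectious. Forward Euler is chosen here for readability; a proper ODE solver would be more accurate.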

animated-charts-in-r

So far we have figured out some really cool graphing techniques, but imagine if you could make those graphs animate like a feature film.

aws-athena

AWS Athena is a serverless interactive query service provided by AWS that is built on top of Presto.

building-your-own-chatbox

We will use NLTK (Natural Language Toolkit) to develop our own simple chatbox that responds to user queries using a defined corpus.

case-study-predicting-bankruptcy

Using the available bank data, identify the most influential variables and predict bankruptcy for the given financial model.

certificates

All the certificates I have completed for different technologies and programming languages.

clickstream-analysis-in-r

This R code is an example of analyzing clickstream data using Markov chains and the SPADE data mining algorithm.
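
The Markov chain part of that idea can be sketched in a few lines of Python. This is only an illustration under assumptions, not the repository's R code: it estimates first-order transition probabilities between pages from a list of sessions, and the function name `transition_probabilities` and the page names in the usage note are hypothetical. SPADE is not covered here.

```python
from collections import Counter, defaultdict


def transition_probabilities(sessions):
    """Estimate first-order Markov transition probabilities from page sequences.

    sessions is a list of page-name lists, one list per visitor session.
    Returns {page: {next_page: probability}} for every page with a follower.
    """
    counts = defaultdict(Counter)
    for pages in sessions:
        # Count each observed (current page -> next page) transition.
        for current_page, next_page in zip(pages, pages[1:]):
            counts[current_page][next_page] += 1
    return {
        page: {
            nxt: count / sum(followers.values())
            for nxt, count in followers.items()
        }
        for page, followers in counts.items()
    }
```

Given sessions such as `[["home", "product", "cart"], ["home", "cart"]]`, the result maps each page to the estimated probability of every page that followed it, which is the transition matrix of the chain in dictionary form.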

data

Data and code behind the articles and graphics at FiveThirtyEight

data-structures

A data structure is a particular way of organizing data in a computer so that it can be used effectively.

data-visualization-in-r-tufte

The idea behind Tufte in R is to use R - the most powerful open-source statistical programming language - to replicate excellent visualization practices developed by Edward Tufte.

deep-learning-with-pytorch

PyTorch is an open source machine learning library based on the Torch library, used for applications such as computer vision and natural language processing, primarily developed by Facebook's AI Research lab. It is free and open-source software released under the Modified BSD license.

dimensionality-reduction

In statistics, machine learning, and information theory, dimensionality reduction or dimension reduction is the process of reducing the number of random variables under consideration by obtaining a set of principal variables. Approaches can be divided into feature selection and feature extraction.

dissecting-yelp-dataset

This dataset is a subset of Yelp's businesses, reviews, and user data. It was originally put together for the Yelp Dataset Challenge which is a chance for students to conduct research or analysis on Yelp's data and share their discoveries. In the dataset you'll find information about businesses across 11 metropolitan areas in four countries.

exploring-sql-with-r

The idea is to apply SQL skills in R by converting data from text files into a relational database and then running SQL queries to filter the data.
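
The same workflow, loading delimited text into a relational table and then filtering it with SQL, can be sketched in Python with the standard-library `sqlite3` and `csv` modules. This is an analogy under assumptions rather than the repository's R code: the function name `load_csv_into_sqlite`, the default table name `records`, and the sample columns in the usage note are all hypothetical.

```python
import csv
import io
import sqlite3


def load_csv_into_sqlite(csv_text, table_name="records"):
    """Load CSV text into an in-memory SQLite table and return the connection.

    The first CSV row is treated as the header; every column is stored as TEXT.
    Table and column names are assumed to come from a trusted source.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    conn = sqlite3.connect(":memory:")
    columns = ", ".join(f'"{name}" TEXT' for name in header)
    conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
    placeholders = ", ".join("?" for _ in header)
    # The remaining CSV rows are inserted with parameter binding.
    conn.executemany(
        f'INSERT INTO "{table_name}" VALUES ({placeholders})', reader
    )
    conn.commit()
    return conn
```

With text such as `"city,sales\nPune,120\nDelhi,80\n"` loaded this way, a query like `SELECT city FROM records WHERE CAST(sales AS INTEGER) > 100` filters the rows in SQL instead of in application code.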
