Code Monkey home page Code Monkey logo

aiengrhaseeb / the-spark-foundation-internship Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 3.0 35.79 MB

This repository contains all the tasks with videos for the Data Science and Analytics Intern at The Sparks Foundation.

License: MIT License

Jupyter Notebook 100.00%
python data-science machine-learning internship-task prediction-model supervised-learning unsupervised-learning business-analytics graduate-rotational-internship exploratory-data-analysis

the-spark-foundation-internship's Introduction

The Sparks Foundation -Graduation Rotational Internship Program

This repository is dedicated to the completion of all my tasks with videos from The Sparks Foundation (Graduate Rotational Internship Program). As of now, I will be updating the tasks from my domain : Data Science and Business Analytics for the May 2021 batch.

Tools/IDE : Python/Google Colab/Jupyter Notebook

Technical : Task 1 - Prediction using Supervised ML (Level - Beginner)

Predict the percentage of an student based on the no. of study hours.

This is supposed to be done with linear regression as we will be using just 2 variables.

  • Dataset for this model can be found at : http://bit.ly/w-data.
  • Code for this model can be found at : Task_1_Code.
  • Video for this model can be found at : Task_1_Video.

    What are we supposed to do with the given dataset?

    We need to predict the score of the student if he/she studies for 9.25 hrs/day.

    Technical : Task 2 - Prediction using Unsupervised ML (Level - Beginner)

    Predict the optimum number of clusters, from the given "iris" dataset and represent it visually.

    I will be implementing this with the help of K-Means Clustering algorithm.

  • Dataset for this model can be found at : https://bit.ly/3kXTdox.
  • Code for this model can be found at : Task_2_Code.
  • Video for this model can be found at : Task_2_Video.

    What are we supposed to do with the given dataset?

    We need to predict the optimum number of clusters and it's visualization.

    Technical : Task 3 - Exploratory Data Analysis - Retail (Level - Beginner)

    Perform ‘Exploratory Data Analysis’ on dataset ‘SampleSuperstore’

    I will be doing this with the help of python libraries i.e. matplotlib, plotly, plotnine and seaborn.

  • Dataset can be found at : https://bit.ly/3i4rbWl.
  • Code for this model can be found at : Task_3_Code.
  • Video for this model can be found at : Task_3_Video.

    What are we supposed to do with the given dataset?

    As a business manager, we will try to find out the weak areas where we can work tomake more profit. Also, what all business problems can be derived by exploring the data.

    Technical : Task 4 - Exploratory Data Analysis - Terrorism (Level - Intermediate)

    Perform ‘Exploratory Data Analysis’ on dataset ‘Global Terrorism’

    I will be doing this with the help of seaborn, plotly and folium libraries in python.

  • Dataset can be found at : https://bit.ly/2TK5Xn5.
  • Code for this model can be found at : Task_4_Code.
  • Video for this model can be found at : Task_4_Video.

    What are we supposed to do with the given dataset?

    As a security/defense analyst, we will try to find out the hot zone of terrorism. Also, what all security issues and insights can be derived by EDA.

  • the-spark-foundation-internship's People

    Contributors

    aiengrhaseeb avatar

    Stargazers

     avatar

    Watchers

     avatar  avatar

    Recommend Projects

    • React photo React

      A declarative, efficient, and flexible JavaScript library for building user interfaces.

    • Vue.js photo Vue.js

      🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

    • Typescript photo Typescript

      TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

    • TensorFlow photo TensorFlow

      An Open Source Machine Learning Framework for Everyone

    • Django photo Django

      The Web framework for perfectionists with deadlines.

    • D3 photo D3

      Bring data to life with SVG, Canvas and HTML. 📊📈🎉

    Recommend Topics

    • javascript

      JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

    • web

      Some thing interesting about web. New door for the world.

    • server

      A server is a program made to process requests and deliver data to clients.

    • Machine learning

      Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

    • Game

      Some thing interesting about game, make everyone happy.

    Recommend Org

    • Facebook photo Facebook

      We are working to build community through open source technology. NB: members must have two-factor auth.

    • Microsoft photo Microsoft

      Open source projects and samples from Microsoft.

    • Google photo Google

      Google ❤️ Open Source for everyone.

    • D3 photo D3

      Data-Driven Documents codes.