This repository contains the tasks that I completed while working as an intern for The Sparks Foundation.
- Internship Category - Data Science and Business Analytics
- Internship Duration - 1 Month ( May-2021 )
- Internship Type - Work from Home
Notebooks in which i have written the code,were taking memory of more than 25MB,so inorder to push my files,i have compressed them and pushed them,one can also view the exact code which i have written in those notebooks,by clicking the Chrome icon on the Right side.The Data used in all the tasks is present in the Data Folder and all the Individual Tasks have been made seperatedly made with their name as a .rar file
Please click on the images on right side to view my solution.
- Predict the percentage of marks of an student based on the number of study hours.
- This is a simple linear regression task as it involves just 2 variables.
- Data can be found at http://bit.ly/w
- You can use R, Python, SAS Enterprise Miner or any other tool.
- What will be predicted score if a student studies for 9.25 hrs/ day?
Please click on the images on right side to view my solution.
- From the given ‘Iris’ dataset, predict the optimum number of clusters and represent it visually.
- Use R or Python or perform this task.
- Data can be found at https://bit.ly/3cGyP8j
Please click on the images on right side to view my solution.
- Perform ‘Exploratory Data Analysis’ on dataset ‘SampleSuperstore’
- As a business manager, try to find out the weak areas where you can work to make more profit.
- Data can be found at https://bit.ly/3kXTdox
Please click on the images on right side to view my solution.
- Perform ‘Exploratory Data Analysis’ on dataset ‘Global Terrorism’
- As a security/defense analyst, try to find out the hot zone of terrorism.
- What all security issues and insights you can derive by EDA?
- Data can be found at https://bit.ly/2TK5Xn5
- You can choose any of the tool of your choice (Python/R/Tableau/PowerBI/Excel/SAP/SAS)
Please click on the images on right side to view my solution (preferably youtube).
- Perform ‘Exploratory Data Analysis’ on dataset ‘Indian Premier League’
- As a sports analysts, find out the most successful teams, players and factors contributing win or loss of a team.
- Suggest teams or players a company should endorse for its products.
- You can choose any of the tool of your choice (Python/R/Tableau/PowerBI/Excel)
- Dataset link :https://bit.ly/34SRn3b
Please click on the images on right side to view my solution (preferably youtube).
- Create the Decision Tree classifier and visualize it graphically.
- The purpose is if we feed any new data to this classifier, it would be able to predict the right class accordingly.
- Dataset link :https://bit.ly/3kXTdox
One can view my work on Youtube,by pressing the button on the right.