cmpe255's Introduction

College Major Analysis based on economical factors

CMPE 255

Professor Carlos Rojas

Authors

Puneet Tokhi: @puneettokhi

Sai Kapadekar: @Sai-kapadekar

Shivang Patel: @Shivang-Patel

Aaryaneil Nimbalkar: @aaryaneil

Questions

1. Project title?

College Major Analysis based on economical factors

2. What data you’ll use and where you’ll get it?

https://github.com/fivethirtyeight/data/blob/master/college-majors/all-ages.csv

https://github.com/fivethirtyeight/data/blob/master/college-majors/grad-students.csv

https://github.com/fivethirtyeight/data/blob/master/college-majors/majors-list.csv

https://github.com/fivethirtyeight/data/blob/master/college-majors/recent-grads.csv

https://github.com/fivethirtyeight/data/blob/master/college-majors/women-stem.csv
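The links above point to GitHub file pages rather than to the raw CSV content. A minimal sketch of one way to load them with pandas, assuming GitHub's usual raw.githubusercontent.com URL layout. The helper names `to_raw_url` and `load_dataset` are hypothetical and not part of the project code.

```python
from urllib.parse import urlparse


def to_raw_url(blob_url: str) -> str:
    """Convert a github.com '/blob/' page URL into the matching raw-file URL."""
    parts = urlparse(blob_url)
    # Expected path layout: /<owner>/<repo>/blob/<branch>/<path to file>
    segments = parts.path.strip("/").split("/")
    if parts.netloc != "github.com" or len(segments) < 5 or segments[2] != "blob":
        raise ValueError(f"not a GitHub blob URL: {blob_url}")
    owner, repo, _, branch, *file_path = segments
    return "https://raw.githubusercontent.com/" + "/".join(
        [owner, repo, branch, *file_path]
    )


def load_dataset(blob_url: str):
    """Download one of the CSV files into a pandas DataFrame (needs network access)."""
    import pandas as pd  # imported here so the URL helper works without pandas

    return pd.read_csv(to_raw_url(blob_url))


# Example (commented out because it downloads the file):
# df = load_dataset(
#     "https://github.com/fivethirtyeight/data/blob/master/college-majors/recent-grads.csv"
# )
# print(df.head())
```

If the files are already saved locally, `pd.read_csv` can be pointed at the local path instead.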

3. Description of the problem you’ll solve or the question you’ll investigate.

After high school, many students either have only a vague idea of their college major or enter college with an undeclared major. Many of them are unaware of which careers pay well and end up settling for jobs that don’t require a college degree. Investing in a college degree has to be both fruitful and financially viable for students, because the economic factor is as important as the desire to pursue a field of interest.

There are multiple factors to consider, such as the employment ratio in a field, the number of job opportunities, and the median pay. Understanding these factors gives students a clearer view of their future goals and helps them make an informed decision about their major.

By analyzing the data on college majors, employment, and gender diversity, our goal is to provide a data model that can help students and parents choose a college major and understand how big a financial difference it makes.

4. Potential methods you will consider applying (these can change as you play with the data).

To solve this problem, we plan to use clustering and decision trees as the primary data mining techniques. Clustering can group majors that fall within defined salary ranges. With classification and decision trees, a major can be labeled as economically viable or not. We are also considering the random forest classifier, a classification algorithm that combines many decision trees. If time permits after the initial implementation, other possible techniques are linear regression, outlier detection, and association.
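As an illustration of grouping majors into salary ranges, here is a minimal sketch using scikit-learn's `KMeans`. The rows are made-up placeholder values, not figures from the datasets. The column names `Major` and `Median` are assumed to match recent-grads.csv, and the function name `cluster_by_salary` and the `salary_tier` column are hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Made-up placeholder rows for illustration only (not real dataset values).
majors = pd.DataFrame(
    {
        "Major": [f"Major {c}" for c in "ABCDEFGHI"],
        "Median": [30000, 32000, 34000, 50000, 52000, 55000, 90000, 95000, 100000],
    }
)


def cluster_by_salary(df: pd.DataFrame, n_clusters: int = 3, seed: int = 0) -> pd.DataFrame:
    """Assign each major to a salary tier, where tier 0 is the lowest-paying cluster."""
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    labels = km.fit_predict(df[["Median"]])
    # KMeans label numbers are arbitrary, so renumber them by cluster center.
    order = np.argsort(km.cluster_centers_.ravel())
    rank = {int(old): new for new, old in enumerate(order)}
    out = df.copy()
    out["salary_tier"] = [rank[int(label)] for label in labels]
    return out


tiered = cluster_by_salary(majors)
print(tiered.sort_values("Median"))
```

The number of clusters is fixed at three here for simplicity. In practice it would be chosen from the data, for example with an elbow plot.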

5. How will you measure success?

For this project, we have decided to implement clustering, regression, and dimensionality reduction. Where a model predicts class labels (for example, a decision tree or random forest classifier), its accuracy can be measured using a confusion matrix, a classification report, or an accuracy score.
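The three measures named above need true labels to compare against, so they fit a classifier. A minimal sketch with scikit-learn follows, using synthetic data. The 50,000 pay threshold used to label a major as "viable" is an assumption made only for this example, not a rule from the project.

```python
import numpy as np
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic features for illustration: median pay and unemployment rate.
rng = np.random.default_rng(0)
n = 200
median_pay = rng.uniform(25_000, 110_000, size=n)
unemployment = rng.uniform(0.01, 0.15, size=n)
X = np.column_stack([median_pay, unemployment])
# Hypothetical label: 1 ("viable") when median pay is at least 50,000.
y = (median_pay >= 50_000).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)
clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)
y_pred = clf.predict(X_test)

acc = accuracy_score(y_test, y_pred)
# Rows are true classes, columns are predicted classes.
cm = confusion_matrix(y_test, y_pred, labels=[0, 1])
report = classification_report(
    y_test, y_pred, labels=[0, 1], target_names=["not viable", "viable"], zero_division=0
)

print("Accuracy:", acc)
print("Confusion matrix:\n", cm)
print(report)
```

The accuracy score equals the sum of the confusion matrix diagonal divided by the number of test samples, while the classification report adds per-class precision, recall, and F1.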

Instructions to run:

You will need Jupyter Notebook or any Python IDE, with Python 3.0 or later installed, to run the code. When loading the data, change the file paths to point to the directory where the datasets are stored on your machine.

Our project also requires the following libraries:

  1. pandas
  2. matplotlib
  3. seaborn
  4. numpy

These libraries should be imported.

However, the following libraries need to be installed with pip:

  1. scipy
  2. sklearn (pip package name: scikit-learn)
  3. kneed
  4. pydot
  5. umap (pip package name: umap-learn)
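Before running the notebook, it can help to check which of the libraries listed above are importable. A minimal sketch using only the standard library follows. The helper name `missing_modules` is hypothetical, and the names in `REQUIRED` are import names, which can differ from the pip package names.

```python
from importlib.util import find_spec

# Import names of the libraries listed above.
REQUIRED = [
    "pandas", "matplotlib", "seaborn", "numpy",
    "scipy", "sklearn", "kneed", "pydot", "umap",
]


def missing_modules(names):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_modules(REQUIRED)
print("Missing libraries:", ", ".join(missing) if missing else "none")
```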
