Code Monkey home page Code Monkey logo

poets_quants_handicap's Introduction

Handicapping EVERYONES MBA Odds

Blog post on motivation

This project is essentially a webscrape of an MBA website where a former admissions officer gives people the odds that they'll get into certain programs across the country.

There is an api deployed at https://jcp.pythonanywhere.com/api/v1 and it processes POST requests with the following format.

The following JSON payload would be sent to the api to receive the chances of admission to University of Chicago Booth MBA for an underrepresented minority male with a 3.1 gpa, a 650 gmat, a STEM major, and a degree from a well recognized school (whatever that means).

{"gpa":"3.1",
 "gmat":"650",
 "major":"1",
 "race":"1",
 "gender":"0",
 "school":"1",
 "university":"booth"}

gpa: your gpa on a 4 point scale

major: 1 for STEM, 0 for non stem

urm: 1 for underrepresented minority, 0 for not

gender: 1 for female, else 0

university: 1 if you went to a school everyone knows, 0 if its less known

school: the school you want to predict your MBA admission chances for. choices are ["stanford","harvard","wharton","booth","columbia","sloan","kellogg"]

noting that these features are not high enough resolution to truly represent an applicant, but read the blog post for more on that


  • The page_parser is used to build a dataset of applicant profiles from Poets and Quants

  • Applicant information will be the features that go into a ML model. probably just linear regression. Not sure yet. Progress on those results can be found here

  • Uses the basic features from the webscrape:

    • GMAT or GMAT equivalent (using this conversion tool)
    • GPA
    • school
    • major
    • gender
    • race

End goal

predict someones1 chances of getting into different2 business schools based on their profile3.


1 someone willing to provide the features listed above

2 so poets & quants or their readers really only seem to be interested in the same 6 or 7 schooles. harvard, stanford, yale, ross, kellogg, booth, anderson, and occassionally a few others. so its not any school.

3Yes, a lot more goes into an admission decision, but this is still fun to do. Since I'm assuming people don't want to read all 250+ profiles looking for one with a similar background/stats to them. I think there's real demand for this because the comments in those articles are overflowing with people posting their profiles and waiting for the website to do a feature on them.


** Peep the latest data scrape progress from a more updated version **

old visuals of the webscraper doin its thingy thing.

demo_gif

Contributing

Its still pretty early but if you have suggestions, thoughts, feedback, criticism, etc feel free to open a PR or submit an Issue.

Thanks in advance ๐Ÿ˜Š


Donating

If ya feeling generous, hollr @ the kid โค๏ธ

https://www.paypal.me/hijodelsol

BTC: 3EbMygEoo8gqgPHxmqa631ZVSwgWaoCj3m

ETH: 0x2F2604AA943dB4E7257636793F38dD3B1808A9e7

LTC: MQVgzNDgw43YzyUg3XmH3jQ7L8ndVswmN3

poets_quants_handicap's People

Contributors

weallwegot avatar

Stargazers

 avatar

Watchers

 avatar

poets_quants_handicap's Issues

School Categorization

This code is simultaneously hilarious and terrible:

	# in progress lol, this is stupid.
	def parse_uni(self,uni_str):
		s = uni_str.upper()
		# need to rework to have this be a list that allows you to update and add more entries
		# this methodology is pretty bad. right now but ok for first pass.
		if ('IVY' in s and not 'NEAR' in s) or ('M.I.T' in s) or ('COLUMBIA' in s) or ('YALE' in s):
			return 'Tier 1'
		elif 'NEAR' in s and 'IVY' in s:
			return 'Tier 2'
		else:
			try:
				logging.warning("Not enough info to parse university: {}".format(uni_str))
			except UnicodeEncodeError:
				pass
			return 'Tier 3'

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.