Code Monkey home page Code Monkey logo

cfbfootball's Introduction

College Football Scraping Setup

A set of R scripts that scrape college football data from cfbdatawarehouse.com and wikipedia.org. All division I-A and I-AA teams are included. The data includes:

  • Game by game records
  • Bowl game records
  • Conference and Division Affiliations
  • Year end polls for all time
  • Yearly coach records
  • Current coaches coaching and playing background
  • Rivals recruiting rankings

Potential Questions

  • Predict whether a team will be bowl eligible for the next season
  • Is recruiting ranking (number of k-star recruits) predictive of success?
  • Is it primarily the school or head coach that predicts the quality of recruits?

Final Data - Feb 20, 2013

Current version of the final data can be found within the directory: Data/Final/cfbData-Feb2013.Rdata. This can be loaded with load("cfbData-Feb2013.Rdata") from within the directory. The data is stored in the object cfbData or you can use the command ls() to see all the objects currently loaded within R.

Games nested within Years nested within School/Teams

Games

  • Win (1/0)
  • Opponent
  • Date
  • Location
  • Score

Years

  • Number of wins
  • Number of points scored
  • Number of points allowed
  • Bowl (1/0)
  • Head Coach
    • Salary
    • Played college football (1/0)
    • Played professional football (1/0)
    • Coached high school
    • Coached college head
    • Coached college assistant
    • Coached professional
  • End of season ranking
  • Conference
  • Division (e.g., I-A, I-AA)
  • Athletic revenue
  • Number of assistant coaches
  • Rivals.com recruit ranking
  • ESPN.com recruit ranking

School/Team

  • Geographic data (long/lat)
  • Sector (e.g., public, private)

Analysis Plan

TBA

Wish List

Pull the rosters from each years teams. These are available in Media Guides http://www.collegefootballdatadvds.com/guides/guides.html in PDF format

Pull the All-Americans, All-Conference, and award winners for each year. Also the coaching awards.

Add weather data to the game-level data.

cfbfootball's People

Contributors

lebebr01 avatar zief0002 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

cfbfootball's Issues

Mismatched names

Names of newly scraped wikipedia data is messy. Needs cleaning to help match names in master file to merge.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.