BTV Statsathon 2016

What

A Kaggle-esque competition for members of the BTV Data Science meetup group. An opportunity to develop your analytical skills, and earn some bragging rights.

The data consists of 250,000 records, each representing a traumatically injured patient who survived long enough to reach a hospital. Each observation has 1,000 binary predictors, prefixed ais, each representing one possible injury (0=injury not present, 1=injury present), and a single binary outcome, died, indicating whether the patient died before discharge from the hospital (0=survived, 1=died).

The training and test data are available as a tar.gz archive here. Once you extract the archive, you'll have a statsathon directory containing the training (AIS_train70.csv) and test (AIS_test30.csv) data.
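As a rough illustration of the layout described above, the following sketch separates the ais predictors from the died outcome using only the Python standard library. The column names ais_1 and ais_2 and the key column ais_data_train70.INC_KEY are made up for the in-memory sample; the real training file may name them differently. Because the key column may itself begin with ais, the sketch assumes it ends in INC_KEY and excludes it from the predictors.

```python
import csv
import io

OUTCOME = "died"        # outcome column named in the data description
KEY_SUFFIX = "INC_KEY"  # assumption: the record key column ends with this


def load_records(handle):
    """Return (predictor_names, X, y) from an open CSV file handle."""
    reader = csv.DictReader(handle)
    # Predictors are the columns prefixed "ais", minus the record key.
    predictors = [
        name for name in reader.fieldnames
        if name.startswith("ais") and not name.endswith(KEY_SUFFIX)
    ]
    X, y = [], []
    for row in reader:
        X.append([int(row[name]) for name in predictors])
        y.append(int(row[OUTCOME]))
    return predictors, X, y


# Tiny in-memory stand-in for the real file (hypothetical column names).
sample = io.StringIO(
    "ais_data_train70.INC_KEY,ais_1,ais_2,died\n"
    "1,0,1,0\n"
    "2,1,1,1\n"
)
predictors, X, y = load_records(sample)
```

With the extracted archive, the same function could be called on a handle from open("statsathon/AIS_train70.csv", newline=""), provided the real columns follow the assumptions above.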

Where

Details of the event are available on the Meetup page.

Feel free to participate by yourself, or form a team.

Rules

Analyze the data using whatever methods you want. To qualify for the Grand Prize of a free BTV Data Science sticker, you must use open-source software (e.g. R, Python) and make your code available, preferably in this GitHub repository.

Entries must be made by 9pm on Tuesday March 22nd. The preferred way to submit is a Pull Request to this repository (details below), with your predictions as a comma-delimited (.csv) file named after your team and placed in the model_evaluation directory. The first column of the csv file should be the ais_data_test30.INC_KEY from the test data, and the second column should be your prediction of died, like this:

ais_data_test30.INC_KEY,pred
13000000,0
13000005,0
13000013,1
13000026,1
13000045,0
....

A complete example file, example1.csv, is in the model_evaluation directory. Alternatively, you can email your results to [[email protected]].
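One possible way to produce a file in this format is sketched below with the standard csv module. The function name write_submission is hypothetical, and the check that every prediction is 0 or 1 is an assumption based on the example rows above, not a stated rule.

```python
import csv
import io

KEY_COLUMN = "ais_data_test30.INC_KEY"  # header of the first column, as specified


def write_submission(handle, keys, predictions):
    """Write key/prediction pairs in the two-column submission format."""
    if len(keys) != len(predictions):
        raise ValueError("keys and predictions must have the same length")
    writer = csv.writer(handle, lineterminator="\n")
    writer.writerow([KEY_COLUMN, "pred"])
    for key, pred in zip(keys, predictions):
        # Assumption: predictions are hard 0/1 labels, as in the example rows.
        if pred not in (0, 1):
            raise ValueError(f"prediction for {key} is not 0 or 1: {pred!r}")
        writer.writerow([key, pred])


# Write to an in-memory buffer here; a real run would open a file such as
# model_evaluation/BTVawesome_pred.csv with newline="" instead.
buffer = io.StringIO()
write_submission(buffer, [13000000, 13000005, 13000013], [0, 0, 1])
submission_text = buffer.getvalue()
```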

All entries will be scored against the test set, and submissions ranked by their accuracy, sensitivity, and specificity.
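The organizers' scoring script is not shown here, so the following is only a sketch of the standard definitions of the three metrics, assuming died=1 is treated as the positive class. It can be used to check a model against a held-out slice of the training data before submitting.

```python
def score(actual, predicted):
    """Compute accuracy, sensitivity, and specificity for 0/1 labels."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # died, predicted died
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # survived, predicted survived
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # survived, predicted died
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # died, predicted survived
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else float("nan"),
        # Sensitivity: share of actual deaths that were predicted.
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Specificity: share of actual survivors that were predicted.
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


result = score(actual=[1, 1, 0, 0, 0], predicted=[1, 0, 0, 0, 1])
```

Because deaths are likely to be the minority class in data like this, a model that always predicts 0 could have high accuracy and specificity but zero sensitivity, which is one reason to look at all three numbers together.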

Please follow this Guide to using Pull Requests to submit your results and contribute your analysis. We're relying on the honor system that you don't look at any code that is submitted early :)

General outline:

  1. Fork this repository to your personal account
  2. Create a new branch with your group name (e.g. BTVawesome)
  3. Create a new directory within the project using your group name (e.g. BTVawesome_analysis)
  4. Analyze data!
  5. Include your scripts for analysis in your directory and commit code to your branch
  6. Place final predictions as a csv (described above) with your team name (e.g. BTVawesome_pred.csv) in the model_evaluation directory
  7. Submit a Pull Request to the BTVDataScience/statsathon-2016 repository

Contributors

johnstantongeddes
