Code Monkey home page Code Monkey logo

ds-sf-24's Introduction

General Assembly Data Science Class

6/14/2016 to 8/18/2016

Instructor: Hamed Hasheminia

Tuesdays Thursdays
6/14: Data Science - Introduction Part I 6/16 Data Science - Introduction Part II
6/21: Linear Regression Lines Part I 6/23: Linear Regression Lines Part II
6/28: Model Selection 6/30: Missing Data and Imputation
7/5: K-Nearest Neighbors 7/7: Logistic Regression Part I
7/12: Logistic Regression Part II 7/14: In Class Project
7/19: Decision Trees Part I 7/21: Decision Trees Part II
7/26: Natural Language Processing 7/28: Time Series Models
8/2: Principal Component Analysis 8/4: Data Visualization
8/9: Naive Bayes 8/11: Course Review
8/16: Final Project Presentations I 8/18: Final Project Presentations II

##Lecture 1 Summary (Data Science - Introduction Part I)

  • Data Science - meaning
  • Continuous, Discrete and Qualitative Data
  • Supervised vs Unsupervised Learning
  • Classification vs Regression
  • Time series vs cross-sectional data
  • Numpy
  • Pandas

Resources

Set up GitHub - Self-study guide

Pre-work for second lecture

Additional Resources

Lecture 2 Summary (Data Science Intorduction - Part II)

  • Measures of central tendency (Mean, Median, Mode, Quartiles, Percentiles)
  • Measures of Variability (IQR, Standard Deviation, Variance)
  • Skewness Coefficient
  • Boxplots, Histograms, Scatterplots
  • Central Limit Theorem
  • Class/Dummy Variables
  • Walkthrough describing and visualizing data in Pandas

Resources

HW 1 is Assigned

  • Please read and follow instructions from readme
  • This homework is due on June 23rd, 2016 at 6:30PM

Additional Resources

  • Here you can find valuable resources for matplotlib
  • A good Video on Centeral Limit Theorem

Lecture 3 Summary (Linear Regression Lines - Part I)

  • Linear Regression lines
  • Single Variable and Multi-Variable Regression Lines
  • Capture non-linearity using Linear Regression lines.
  • Interpretting regression coefficients
  • Dealing with dummy variables in regression lines
  • intro on sklearn and searborn library

Resources

Additional Resources

Lecture 4 Summary (Linear Regression Lines - Part II)

  • Hypothesis test - test of significance on regression coefficients
  • p-values
  • Capture non-linearity using Linear Regression lines.
  • R-squared
  • Interaction Effects

Resources

Additional Resources

HW 2 is Assigned

  • Please read and follow instructions from readme
  • Here you can find iPython notebook of your 2nd assignment.
  • This homework is due on June 30th, 2016 at 6:30PM

ds-sf-24's People

Contributors

dataminingclass avatar timlpayne avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.