Code Monkey home page Code Monkey logo

springboard-datasciencetrack-student's Introduction

alt text

Springboard Data Science Career Track

Hi!

My name is Mikiko Bazeley and this is my repo for the Springboard Data Science Track.

From Oct 2018 to April 2019 I completed a number of projects, including two capstones, as part of the DS track.

All of the documentation, code, and notes can be found here, as well as links to other resources I found helpful for successfully completing the program.

For questions or comments, please feel free to reach out on LinkedIn.

If you find my repo useful, let me know OR ☕ consider buying me a coffee! https://www.buymeacoffee.com/mmbazel ☕.

Regards, Mikiko

alt text


Project List by Unit of Study

For a comprehensve list of the projects and corresponding skills needed, please see the list below.

1. The Python Data Science Stack

Topics covered:

  • Python
  • Matplotlib, Seaborn—visualization tools in Python
  • Writing clear, elegant, readable code in Python using the PEP8 standard

2. Data Wrangling

Topics covered:

  • Deep dive into Pandas for data wrangling
  • Data in files: Work with a variety of file formats from plain text (.txt) to more structured and nested formats files like csv and JSON
  • Data in databases: Get an overview of relational and NoSQL databases and practice data querying with SQL
  • APIs: Collect data from the internet using Application Programming Interfaces (APIs)

Projects:

3. Data Story

4. Statistical Inference

Topics covered:

  • Theory of inferential statistics
  • Statistical significance
  • Parameter estimation
  • Hypothesis testing
  • Correlation and regression
  • Exploratory data analysis
  • A/B testing

5. Machine Learning

Topics covered:

  • Scikit-learn
  • Supervised and unsupervised learning
  • Top machine learning techniques:
    • Linear and logistic regression
    • naive bayes
    • support vector machines
    • decision trees
    • clustering
  • Ensemble learning with random forests and gradient boosting
  • Best practices
  • Evaluating and tuning machine learning systems

6. Capstone Project 1: Building a Data Product

7. The Natural Language Processing (NLP) Track

Topics covered:

  • How to work with text and natural language data
  • NLP in Python, using common libraries such as NLTK and spaCy
  • Basics of Deep Learning in NLP using word2vec and TensorFlow
  • Data Science at Scale using Spark
  • Software Engineering for Data Scientists

8. Second Capstone Project: NLP

springboard-datasciencetrack-student's People

Contributors

mikikobazeley avatar mmbazel avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.