Code Monkey home page Code Monkey logo

titanic-investigation-and-machine-learning-from-disaster's Introduction

Titanic-Investigation-and-Machine-Learning-from-Disaster

TITANIC

  • Goal of this Notebook:

    • Investigation - whether passenger's features correlate with survival?
    • Analysis of Titanic dataset.
    • Finding the best working classifier for Titanic dataset.
  • This Notebook will show basic examples of:

    • Investigation - whether passenger's features correlate with survival?

      • Importing libraries
      • Importing data with Pandas
      • Feature engineering
      • Determine passenger's features to test against survival¶
    • Analysis of Titanic dataset

      • Plot/graph variables and mathematically exploration
    • Machine Learing on Titanic dataset

      • Importing machine learning libraries
      • Data cleaning
      • Feature extraction
      • Classification algorithms
      • Visualization
  • Dependencies

    • Numpy: It provides a high-performance multidimensional array and basic tools to compute with and manipulate these arrays.
    • Pandas: For data manipulation and analysis.
    • SciKit-Learn: Python machine learning library which features various classification, regression and clustering algorithms.
    • Matplotlib: Python Plotting.
    • Seaborn: Python's Statistical Data Visualization Library.
    • Scipy: It provides a large number of functions that operate on numpy arrays and are useful for different types of scientific and engineering applications.
  • Overview of Titanic dataset

    • survival: Survival; 0 = No, 1 = Yes
    • pclass: Ticket class, a proxy for socio-economic status (SES); 1 = 1st (Upper), 2 = 2nd (Middle), 3 = 3rd (Lower)
    • sex: Sex of passenger
    • Age: Age in years
    • sibsp: # of siblings / spouses aboard the Titanic
    • parch: # of parents / children aboard the Titanic
    • ticket: Ticket number
    • fare: Passenger fare
    • cabin: Cabin number
    • embarked: Port of Embarkation; C = Cherbourg, Q = Queenstown, S = Southampton
  • Conclusions

    • Investigation: Which passenger characteristics correlate with survival?

      • Passenger gender correlates with survival: Females were more likely to survive than males.
      • Passenger class correlates with survival: The higher the passenger's class, the more likely that he or she survived.
    • Machine Learning on Titanic dataset

      • GradientBoostingClassifier can be used to find insights on the Titanic dataset.

titanic-investigation-and-machine-learning-from-disaster's People

Contributors

ankitrhode avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.