EDA (Exploratory Data Analysis)

Exploratory data analysis (EDA) is an approach to analyzing datasets that summarizes their main characteristics, often with statistical and visual methods. Its primary purpose is to see what the data can tell us beyond formal modeling or hypothesis testing.

The primary scope of this project is an in-depth study of exploratory data analysis and of the statistical and plotting tools used to represent and understand a dataset before modeling and machine learning are done.

1) Our first objective is to use the iris dataset to classify a given iris flower into one of three categories (setosa, virginica, versicolor). We plot graphs to extract the essential features required before starting further analysis.
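As a rough illustration of what "extracting the essential features" can mean, the sketch below checks whether a single feature separates the three species by comparing per-species value ranges. This is the same thing a scatter plot or pair plot shows visually. The rows in `SAMPLE` are a small illustrative sample, not the full dataset, and the helper names (`feature_ranges`, `ranges_overlap`, `separates_all_classes`) are hypothetical and not taken from the project's notebooks.

```python
# Illustrative sample only: a few (petal_length, sepal_width) rows per species,
# in cm. These are NOT the full iris dataset.
SAMPLE = {
    "setosa": [(1.4, 3.5), (1.4, 3.0), (1.3, 3.2)],
    "versicolor": [(4.7, 3.2), (4.5, 3.2), (4.9, 3.1)],
    "virginica": [(6.0, 3.3), (5.1, 2.7), (5.9, 3.0)],
}
FEATURES = ("petal_length", "sepal_width")


def feature_ranges(sample, index):
    """Return {species: (min, max)} for the feature at position `index`."""
    return {
        species: (min(row[index] for row in rows), max(row[index] for row in rows))
        for species, rows in sample.items()
    }


def ranges_overlap(a, b):
    """True if the closed intervals a and b share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


def separates_all_classes(sample, index):
    """True if no two species have overlapping ranges for this feature."""
    ranges = list(feature_ranges(sample, index).values())
    return not any(
        ranges_overlap(ranges[i], ranges[j])
        for i in range(len(ranges))
        for j in range(i + 1, len(ranges))
    )


for index, name in enumerate(FEATURES):
    print(name, "separates all species:", separates_all_classes(SAMPLE, index))
```

On this small sample, petal length keeps the species apart while sepal width does not. A range check on a few rows is only a hint; the full dataset and the plots are needed to draw a real conclusion.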

2) Our second objective is to use the Haberman dataset, which contains cases from a study conducted between 1958 and 1970 at the University of Chicago's Billings Hospital on the survival of patients who had undergone surgery for breast cancer. Here too, we plot graphs to extract the essential features required before starting further analysis.

The project intends to critically revisit and assess the following topics: data and its types (the iris and Haberman datasets are taken as examples and explained), EDA (what it is and why it is done), the analysis process (statistical measures: measures of central tendency and measures of dispersion), and graphical representation (scatter plots and pair plots). The aim is to show which features among sepal length, sepal width, petal length, and petal width are most useful for identifying the flower types, and which features relate to the survival of patients who had undergone surgery for breast cancer.
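The statistical measures named above can be sketched with Python's standard `statistics` module. The `petal_lengths` values are a small illustrative sample in cm, not the full dataset, and `summarize` is a hypothetical helper, not a function from the project's notebooks.

```python
import statistics

# Illustrative sample only: a handful of petal lengths in cm.
petal_lengths = [1.4, 1.4, 1.3, 1.5, 1.4]


def summarize(values):
    """Compute common measures of central tendency and dispersion."""
    return {
        # Measures of central tendency
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        # Measures of dispersion
        "range": max(values) - min(values),
        "variance": statistics.variance(values),  # sample variance (n - 1)
        "stdev": statistics.stdev(values),        # sample standard deviation
    }


summary = summarize(petal_lengths)
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

`statistics.variance` and `statistics.stdev` use the sample (n - 1) form. If the values are treated as a whole population, `statistics.pvariance` and `statistics.pstdev` are the population counterparts.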

Tools

Anaconda

Anaconda is a free and open-source distribution of the Python and R programming languages for scientific computing that aims to simplify package management and deployment. Package versions are managed by the conda package management system.

Jupyter Notebook (IDE)

The Jupyter Notebook is an incredibly powerful tool for interactively developing and presenting data science projects. A notebook integrates code and its output into a single document that combines visualizations, narrative text, mathematical equations, and other rich media. The intuitive workflow promotes iterative and rapid development, making notebooks an increasingly popular choice at the heart of contemporary data science, analysis, and increasingly science at large.

Technologies

Python

Python is an interpreted, high-level, general-purpose programming language. Created by Guido van Rossum and first released in 1991, Python has a design philosophy that emphasizes code readability, notably through its use of significant whitespace. It provides constructs that enable clear programming on both small and large scales.

Contributors

pranavmeduri
