Code Monkey home page Code Monkey logo

breast-cancer-diagnosis-prediction's Introduction

Exploratory-Data-Analysis-For-Breast-Cancer-Diagnosis-Prediction-using-Logistic-Regression

Description: This repository presents an exploratory data analysis (EDA) and logistic regression modeling for predicting breast cancer diagnosis (benign or malignant) based on features extracted from cell nuclei. The analysis utilizes the Wisconsin Diagnostic Breast Cancer dataset sourced from Kaggle, comprising 569 instances with 32 attributes.

Key Highlights:

  • Dataset Overview: The dataset includes 30 features computed for each cell nucleus, with mean values of area, smoothness, and symmetry chosen as predictor variables. The outcome variable, diagnosis, is converted into a binary factor variable (Benign/Malignant).

  • Data Cleaning and Preparation: The dataset underwent cleaning to handle missing values and select relevant predictor and outcome variables. The diagnosis variable was converted into a binary factor, and no outlier analysis was performed based on dataset characteristics.

  • Data Analysis: Logistic regression was utilized to build a predictive model for breast cancer diagnosis. Assumptions were checked, including multicollinearity, linearity of independent variables, and independence of errors, ensuring the validity of the model.

  • Statistical Analysis: Statistical tests were conducted to assess the significance of predictor variables and confirm model fit. The analysis included checking p-values, deviance comparisons, AIC values, and confidence intervals for predictor variables.

  • Conclusion: The logistic regression model effectively predicts cancer diagnosis based on area, smoothness, and symmetry features of cell nuclei. The analysis concludes that these variables significantly influence diagnosis, with smoothness demonstrating particularly strong predictive power.

Dataset Citation:

  • Kaggle. "Breast Cancer Wisconsin (Diagnostic) Dataset. (2016, September 25)." Available at: Dataset Link

breast-cancer-diagnosis-prediction's People

Contributors

rohanarora03 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.