Code Monkey home page Code Monkey logo

teddyoweh / dimensionality-reduction-pca Goto Github PK

View Code? Open in Web Editor NEW
11.0 1.0 0.0 7 KB

Dimensionality reduction is basically a process of reducing the amount of random features,attributes variables or in this case called dimensions in a dataset and leaving as much variation in the dataset as possible by obtaining a set of only relevant features to increase the effiency of a model.

Jupyter Notebook 100.00%
data-science dataset dimensional-analysis dimensionality-reduction feature-extraction feature-selection machine-learning

dimensionality-reduction-pca's Introduction

Dimensionality Reduction PCA

Dimensionality is refered to the attribute or features of a dataset

Dimensionality reduction is basically a process of reducing the amount of random features,attributes variables or in this case called dimensions in a dataset and leaving as much variation in the dataset as possible by obtaining a set of only relevant features to increase the effiency of a model.

Importance of Dimensionality Reduction

When running models or training models machine Learning not all the dimensions of a datasets are relevant, and to make you a model train in a less time and more effienct dimensionality reduction should be carried out on a dataset to remove the irrelevant datasets. It avoids overfitting of a dataset because, when attributes or dimensions are many, the model tends to become complex and overfit on the training data. Also useful when visualizing data

Two approaches are either keeping the most important features and removing the rest and .. combination of features to reduce the dimensions

Feature Extraction and Feature Selection?

Feature selection removes irrelevant features from a dataset. Feature extraction makes new features from existing ones.

Both dimensionality reduction processes.

Why feature selection is important?

Feature selection is rather important because it is an effiecient method of dimensionality reduction by removing irrelevant features. It also builds an accurate model with better prediction power and overfitting by selecting only relevant features.

Wrapper-based feature selection

In wrapper methods, it is based on a specific machine learning algorithm. It evaluates all the features of a dataset and gives optimal results of the various combinations of features.

Some methods are

Forward Selection

This process starts off with no feature then keeps features which improves the model the best till when a new feature doesn't improve the model

Backward Elimination

This process removes the least significant feature which improves the performance of the model.

Filter-based feature selection

In this process the features Dimensionality reduction is a machine learning (ML) or statistical technique of reducing the amount of random features,attributes variables or in this case called dimensions in a dataset by obtaining a set of only relevant features to increase the effiency of a model. are selected based of their results in statical tests

Embedded-based feature selection

This process combines both the Wrapper and Filter based selection methods. It tests both methods,generate a combination of features and selection the best of the methods that improves the models performance

dimensionality-reduction-pca's People

Contributors

teddyoweh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.