Code Monkey home page Code Monkey logo

geometric-smote's Introduction

Welcome

Hey there, I'm Georgios Douzas! I'm a machine learning researcher and software engineer. If you're interested, feel free to take a look at my CV or personal blog.

Projects

A list of my personal projects:

  • geometric-smote: An implementation of the Geometric-SMOTE algorithm. Compatible with scikit-learn and imbalanced-learn libraries.
  • cluster-over-sampling: A general interface for clustering based over-sampling algorithms. It allows to create oversampling algorithms as combinations of clusterers and oversamplers. Compatible with scikit-learn and imbalanced-learn libraries.
  • sports-betting: A collection of tools that makes it easy to create machine learning models for sports betting and evaluate their performance. Compatible with scikit-learn.
  • sage-physics: A Python package to create and simulate physics models. It aims to provide a unified interface for various physics models. It is based on SageMath.
  • copier-pdm-nox: Copier template for Python projects managed by PDM with the help of Nox.

Publications

  • Geometric SMOTE for regression. Expert Systems with Applications.
  • Improving the quality of predictive models in small data GSDOT: A new algorithm for generating synthetic data. PLOS ONE.
  • G-SOMO: An oversampling approach based on self-organized maps and geometric SMOTE, Expert Systems with Applications.
  • Increasing the effectiveness of active learning: Introducing artificial data generation in active learning for land use/land cover classification. Remote Sensing.
  • Improving imbalanced land cover classification with k-means smote: Detecting and oversampling distinctive minority spectral signatures. Information.
  • Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE. Information Sciences.
  • Imbalanced learning in land cover classification: Improving minority classes’ prediction accuracy using the geometric SMOTE algorithm. Remote Sensing.
  • Effective data generation for imbalanced learning using conditional generative adversarial networks. Expert Systems with Applications.
  • Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Information Sciences.
  • Self-Organizing Map Oversampling (SOMO) for imbalanced data set learning. Expert Systems with Applications
  • Coset space dimensional reduction and Wilson flux breaking of ten-dimensional N=1, E8 gauge theory. The European Physical Journal C.
  • Coset space dimensional reduction and classification of semi-realistic particle physics models. Progress of Physics.

Skils

My Skills

Stats

georgedouzas's Stats

georgedouzas's Streak

georgedouzas's Top Languages

geometric-smote's People

Contributors

georgedouzas avatar joaopfonseca avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

geometric-smote's Issues

use with categorical variables ?

Can I use the geometricsmote with categorical variables ?
Something like SMOTENC from imblearn package ?
I have only one continuous variable in my dataset

Thank you

Update publications

Update the references to publications in README and/or documentation that use G-SMOTE.

Categorical variables

There is an implementation of categorical variables in 0.2.X branch. It should be merged to master and add also the appropriate tests.

Incorrect error message

When k_neighbors>n_samples the error message displayed refers to an inexistent hyperparameter n_neighbors:

ValueError: Expected n_neighbors <= n_samples, but n_samples = 2, n_neighbors = 6

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.