Walter Silima's Projects

hyper-parameter-optimization-for-xgb-boost

This project builds on the regression repository. Its main focus is optimizing the hyper-parameters of the XGBoost regressor to best estimate the photometric redshifts under study. We used 80% of the dataset for training the algorithm and 20% for testing. We used scikit-learn's RandomizedSearchCV in three separate trials, scoring with r2_score, with the median absolute deviation, and with both together, to find the best parameters for our testing data. Scoring with the median absolute deviation gives the best RMS and NMAD for this project.
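The RMS and NMAD figures mentioned above can be computed as in the minimal sketch below. It assumes the definitions commonly used in the photometric-redshift literature, with residuals normalised by (1 + z_spec) and NMAD scaled by 1.4826; the repository may define them differently. The function names and sample values are illustrative only.

```python
import math
import statistics


def normalized_residuals(z_spec, z_phot):
    # Assumed convention: dz = (z_phot - z_spec) / (1 + z_spec).
    return [(zp - zs) / (1.0 + zs) for zs, zp in zip(z_spec, z_phot)]


def rms(z_spec, z_phot):
    # Root mean square of the normalised residuals.
    dz = normalized_residuals(z_spec, z_phot)
    return math.sqrt(sum(d * d for d in dz) / len(dz))


def nmad(z_spec, z_phot):
    # Normalised median absolute deviation:
    # 1.4826 * median(|dz - median(dz)|).
    dz = normalized_residuals(z_spec, z_phot)
    centre = statistics.median(dz)
    return 1.4826 * statistics.median([abs(d - centre) for d in dz])


# Made-up values, for illustration only.
z_spec = [0.0, 1.0, 3.0]
z_phot = [0.1, 1.0, 2.6]
print(rms(z_spec, z_phot), nmad(z_spec, z_phot))
```

NMAD is less sensitive to catastrophic outliers than RMS, which is one reason the two are usually reported together.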

hyper-parameter_optimization_for_random_forest

In this repository we optimize the random forest (RF) hyper-parameters for the dataset: DR16 cross-matched with the WISE catalogue. We trained the algorithm on about 80% of the dataset and used scikit-learn's RandomizedSearchCV to find the parameter settings that best estimate the photometric redshifts. We used "neg_mean_squared_error", "neg_median_absolute_deviation", and both together as scoring metrics. The "neg_median_absolute_deviation" scorer yields the best results for this project.
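A randomized search of this kind might look like the sketch below. It is not the repository's code: the data are synthetic, and the parameter grid, `n_iter`, and `cv` values are placeholders. scikit-learn ships "neg_median_absolute_error" as a built-in scorer string, so the sketch assumes "neg_median_absolute_deviation" is a custom scorer and builds a hypothetical one with `make_scorer`.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import make_scorer
from sklearn.model_selection import RandomizedSearchCV, train_test_split


def median_abs_deviation_score(y_true, y_pred):
    # Hypothetical metric: median absolute deviation of the residuals.
    residuals = np.asarray(y_pred) - np.asarray(y_true)
    return np.median(np.abs(residuals - np.median(residuals)))


# greater_is_better=False makes the scorer return the negated value,
# so that a larger score is better, as RandomizedSearchCV expects.
neg_mad = make_scorer(median_abs_deviation_score, greater_is_better=False)

# Synthetic stand-in for the photometric features and redshifts.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
y = np.abs(X[:, 0]) + 0.1 * rng.normal(size=300)

# Roughly 80% for training, 20% held out.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Placeholder search space, not the values used in the repository.
param_distributions = {
    "n_estimators": [20, 50, 100],
    "max_depth": [None, 5, 10],
    "min_samples_leaf": [1, 2, 5],
}

search = RandomizedSearchCV(
    RandomForestRegressor(random_state=0),
    param_distributions=param_distributions,
    n_iter=5,
    scoring=neg_mad,
    cv=3,
    random_state=0,
)
search.fit(X_train, y_train)

best_params = search.best_params_
test_mad = median_abs_deviation_score(y_test, search.predict(X_test))
print(best_params, test_mad)
```

To score with two metrics at once, `scoring` also accepts a dictionary of scorers, in which case `refit` must name the metric used to pick the final model.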

regression

Photometric redshift estimation is currently the most powerful and efficient way to estimate the distances to extragalactic sources. As the exponential data avalanche continues, it will require low-cost, fast and efficient data-driven methods to analyse the data and make predictions from it. In this study, we present the supervised machine learning algorithms used to obtain the photometric redshifts of the galaxies and quasars found in Sloan Digital Sky Survey Data Release 16 (SDSS DR16). We adopt the K-Nearest Neighbour (KNN) and Random Forest (RF) regressors to estimate the photometric redshifts of 285685 galaxies and 124688 quasars from their photometric measurements.
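As a rough illustration of the approach, the sketch below fits both regressors on synthetic data standing in for the photometric measurements. The five-band feature layout, the 80/20 split, and all hyper-parameter values are assumptions made for the example, not the settings used in the study.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in: five magnitudes per object and a toy "redshift"
# that depends on one colour (difference of two magnitudes).
rng = np.random.default_rng(42)
magnitudes = rng.uniform(15.0, 22.0, size=(500, 5))
colour = magnitudes[:, 0] - magnitudes[:, 1]
redshift = 0.1 * colour**2 + 0.02 * rng.normal(size=500)

X_train, X_test, y_train, y_test = train_test_split(
    magnitudes, redshift, test_size=0.2, random_state=42
)

# Placeholder hyper-parameters for both regressors.
models = {
    "KNN": KNeighborsRegressor(n_neighbors=5),
    "RF": RandomForestRegressor(n_estimators=50, random_state=42),
}

predictions = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    predictions[name] = model.predict(X_test)
    # Median absolute error on the held-out objects.
    print(name, np.median(np.abs(predictions[name] - y_test)))
```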

rucio

Rucio - Scientific Data Management
