Code Monkey home page Code Monkey logo

benchmark_map4's Introduction

benchmark_map4

Benchmarking the MAP4 fingerprint in regression models

benchmark_map4's People

Contributors

patwalters avatar unixjunkie avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

benchmark_map4's Issues

unfolded counted fingerprints

Dear Patrick,

In this benchmark, since it is about regression modeling,
it might be quite interesting to see at least one or two unfolded-counted fingerprints:
e.g. ECFP or atom-pairs.

I might send a PR about this, if I manage to wrap my head
around your piece of code.

Regards,
F.

add/replace baseline regressor by a RFR

xgboost might be good to give you a competition winning model; but it's way too slow (at least to my taste)
to get a baseline regressor.

I might contribute a random forest regressor to replace it.

5xCV instead of 10xCV

That would also make training models two times faster.

Problem might be: you compute mean and stddev over the 10 folds for each model.
Maybe, you will not trust mean and stddev computed over just 5 folds.

This being said, maybe you want to do iterated testing and do those stats over something like
50 repeats (e.g. we train/test on random partitions of 80%/20% and we repeat this 50
times for each model).

install.sh script

Dear Pat,

You might be interested in such a script:

#!/bin/bash                                                                     
                                                                                
pip3 install tmap mhfp xgboost seaborn jupyter                                  
                                                                                
# # extract pure Python code                                                    
# jupyter nbconvert --to python benchmark_map4.ipynb

I did not probably cover everything needed, but at least it allows to run the benchmark on my computer.

Regards,
Francois.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.