Code Monkey home page Code Monkey logo

fastforest's Introduction

fast-forest

A forest that is fast. Works on OS X and linux so far.

Building

Requires cmake. To install and run a test:

conda install pybind11 # use conda not pip
cmake .
make fastforest
python test.py

NB: the makefile is created by cmake, so don't edit it. Instead, edit CMakeLists.txt and then rerun "cmake .".

Building for PyPi publishing

To use the mechanism for publishing to pypi:

conda install pybind11 # use conda not pip
python setup.py build_ext --inplace
python test.py

fastforest's People

Contributors

parrt avatar jph00 avatar

Stargazers

 avatar  avatar Nick avatar  avatar Marcus McCurdy avatar  avatar Sani avatar Serird avatar  avatar Limber Cheng avatar Daniel avatar 爱可可-爱生活 avatar saw avatar Hassan ISMAIL FAWAZ avatar Tom avatar nemo avatar lixinwu avatar Martin Gabdushev avatar Saksham Gupta avatar Tao avatar Adam Erickson avatar artu avatar Mauro Risonho de Paula Assumpção avatar Robin avatar rodrigo figueroa avatar Dimitri Diakopoulos avatar Philip Krejov avatar Edward Atkins avatar STYLIANOS IORDANIS avatar RANJEET SINGH avatar Marc avatar Neil Blake avatar Kevin Du avatar Michael Pieler avatar joelxiangnanchen avatar ppj avatar Urviskumar avatar Vijay Sai Mutyala avatar Kumar Shridhar avatar  avatar  avatar

Watchers

 avatar artu avatar Tom avatar  avatar

fastforest's Issues

support observation weights

It would be nice to support weighted observations so people don't have to rebalance their data sets. Unlike scikit-learn's we could actually specify what those weights mean and the range. haha.

std::shuffle is in the standard library

If I remember correctly we have a shuffle function but there is one in the standard C++ library these days so we could avoid our own implementation that way. I just use this in my isolation forest so thought I would leave a note here.

TODO: define python-facing API

For broader adoption, I think we should make a drop in replacement for scikit-learn's. I.e., hyper parameters go in the constructor, fit(), predict(), predict_proba().

The one place we might want to deviate is the oob_score_ and feature_importances_. Probably should make these functions or at least get rid of the trailing underscore because these are public facing fields.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.