featureranking's People

Contributors

cbmira01

featureranking's Issues

Needs tests

Needs tests for:

  • each OpenCL kernel
  • dataset entropy
  • column entropy
  • comparison of OpenCL and NumPy results

Do this issue before refactoring issues
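
A minimal sketch of the comparison test, assuming both implementations can be called as plain functions that return floats. The names `assert_close`, `reference_fn` and `candidate_fn` are hypothetical, and the default tolerances are guesses that would likely need loosening for single-precision kernels.

```python
import math
import unittest


def assert_close(reference, candidate, rel_tol=1e-5, abs_tol=1e-8):
    """Raise AssertionError if two scalar results differ beyond tolerance."""
    if not math.isclose(reference, candidate, rel_tol=rel_tol, abs_tol=abs_tol):
        raise AssertionError(
            f"results differ: reference={reference!r}, candidate={candidate!r}"
        )


class CompareImplementations(unittest.TestCase):
    # Hypothetical stand-ins: in the project these would be the NumPy
    # version and the OpenCL version of the same calculation.
    @staticmethod
    def reference_fn(values):
        return math.fsum(values)

    @staticmethod
    def candidate_fn(values):
        return float(sum(values))

    def test_sum_matches(self):
        data = [0.1 * i for i in range(1000)]
        assert_close(self.reference_fn(data), self.candidate_fn(data))
```

The same helper could wrap each kernel, the dataset entropy, and the column entropy, so every test states its tolerance explicitly.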

Need this project to work on Linux

What will docs look like?

What will device driver setup look like?

How will project deployment work?

How will support differ across Linux distributions? Or should we target just one (Mint/Ubuntu, CentOS)?

Feature ranking is completely wrong.

Re-read Kantardzic p68: Implement this properly in the trial runner.

Fix the Wiki report: restate properly how features are selected for removal in the round-robin trial.
Also, remove the existing workstation results from the report.

Fix other wording as encountered.
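
The issue does not restate the selection rule, so the sketch below is only one common reading of entropy-based backward ranking: in each round, leave out every remaining feature in turn, and remove the one whose absence changes the dataset entropy the least. Whether this matches Kantardzic's text and the intended trial runner is an assumption to verify against the book. `rank_features` and `toy_measure` are hypothetical names, and `toy_measure` is a placeholder, not the project's entropy.

```python
def rank_features(rows, feature_names, entropy_fn):
    """Return feature names ordered from least to most important.

    Assumed rule: in each round, try leaving out every remaining feature
    in turn and remove the one whose absence changes the dataset entropy
    the least.
    """
    remaining = list(range(len(feature_names)))
    removal_order = []

    def project(columns):
        # Keep only the requested columns of every row.
        return [[row[i] for i in columns] for row in rows]

    while len(remaining) > 1:
        base = entropy_fn(project(remaining))
        best_diff, best_feature = None, None
        for feature in remaining:
            kept = [i for i in remaining if i != feature]
            diff = abs(base - entropy_fn(project(kept)))
            if best_diff is None or diff < best_diff:
                best_diff, best_feature = diff, feature
        remaining.remove(best_feature)
        removal_order.append(feature_names[best_feature])

    if remaining:
        removal_order.append(feature_names[remaining[0]])
    return removal_order


def toy_measure(table):
    # Placeholder only: sum of per-column population variances.
    total = 0.0
    for column in zip(*table):
        mean = sum(column) / len(column)
        total += sum((v - mean) ** 2 for v in column) / len(column)
    return total
```

Passing the entropy function in as a parameter keeps the selection rule testable on its own, independent of the OpenCL and NumPy entropy code.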

OpenCL double-precision option

Look at double-precision floats in kernels (the cl_khr_fp64 extension) and use them if possible.

Look for platforms that DO NOT have this option, and see how that works.
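
A minimal sketch of the fallback, assuming the host code can obtain the device's space-separated extension string (in PyOpenCL this is exposed as the `extensions` attribute of a device). The function names and the `real_t` typedef are hypothetical; the idea is that kernels are written against `real_t` and the preamble picks the widest type the device supports.

```python
FP64_EXTENSION = "cl_khr_fp64"


def supports_fp64(extensions):
    """True if the space-separated extension string lists cl_khr_fp64."""
    return FP64_EXTENSION in extensions.split()


def kernel_preamble(extensions):
    """Build a kernel-source preamble defining real_t (hypothetical name)."""
    if supports_fp64(extensions):
        return (
            "#pragma OPENCL EXTENSION cl_khr_fp64 : enable\n"
            "typedef double real_t;\n"
        )
    # Fallback for devices without double-precision support.
    return "typedef float real_t;\n"
```

The preamble would be prepended to the kernel source before building the program, so the same kernel text serves both kinds of platform.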

Make host code less grindy

Is there any way to make the host-code portion look less ugly?
Or is OpenCL host code just ugly to begin with?

Refactor ranking protocol

'Ranking protocol' code is duplicated in the feature reduction test runners, and needs to be pulled out into its own 'trial runner' module. That's also the place to make a 'trial context' object that knows everything about how to run a test.
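
One possible shape for the 'trial context' object, sketched under the assumption that a trial needs a dataset, its feature names, an entropy function, and a backend choice. Every field and function name here is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence


@dataclass
class TrialContext:
    """Everything a trial needs (all field names are hypothetical)."""
    dataset_name: str
    feature_names: List[str]
    rows: Sequence[Sequence[float]]
    entropy_fn: Callable[[Sequence[Sequence[float]]], float]
    backend: str = "numpy"  # or "opencl"
    results: List[str] = field(default_factory=list)


def run_trial(context, ranking_fn):
    """Run one ranking function against a context and record the outcome."""
    context.results = ranking_fn(
        context.rows, context.feature_names, context.entropy_fn
    )
    return context.results
```

With this split, each feature reduction test runner would build a `TrialContext` and call `run_trial`, instead of carrying its own copy of the ranking protocol.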

Clean up data

  • Put feature names in the CSVs
  • Do a better job of indicating key and target fields
  • Clean up descriptive text
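
Once the CSVs carry a header row, the key and target fields could be named explicitly when loading. A rough sketch using only the standard library; `load_dataset` and the column names in the usage note are hypothetical, and every non-key, non-target column is assumed to be numeric.

```python
import csv
import io


def load_dataset(text, key_field, target_field):
    """Split a headered CSV into feature names, keys, targets, and features."""
    reader = csv.DictReader(io.StringIO(text))
    feature_names = [
        name for name in reader.fieldnames
        if name not in (key_field, target_field)
    ]
    keys, targets, features = [], [], []
    for record in reader:
        keys.append(record[key_field])
        targets.append(record[target_field])
        # Assumption: all remaining columns hold numeric values.
        features.append([float(record[name]) for name in feature_names])
    return feature_names, keys, targets, features
```

For example, `load_dataset(text, key_field="id", target_field="label")` would treat every other column as a feature.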

Work on logging

This project should have had logging early, to support refactoring and optimization.
Log files should be ignored by git.
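
A minimal logging setup, assuming a single project-wide logger that writes to a file under a `logs` directory. The logger name, directory, and file name are assumptions, not existing project conventions.

```python
import logging
import os


def configure_logging(log_dir="logs", level=logging.INFO):
    """Send project log records to <log_dir>/featureranking.log (assumed path)."""
    os.makedirs(log_dir, exist_ok=True)
    logger = logging.getLogger("featureranking")
    logger.setLevel(level)
    if not logger.handlers:  # avoid duplicate handlers on repeated calls
        handler = logging.FileHandler(os.path.join(log_dir, "featureranking.log"))
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

With this layout, a `logs/` line in `.gitignore` would keep the log files out of git.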

Precompute column entropies and value ranges

Pre-compute column entropies and value ranges.
Big performance gain here.
Do the 'trial runner' first, because that's where these pre-computations will go.
Work on logging and OpenCL reduction before pre-computation.
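
The pre-computation could be sketched as a single pass over the columns, done once before any trial runs. The issue does not define the column entropy, so it is passed in as a function; `precompute_column_stats` and the dictionary keys are hypothetical.

```python
def precompute_column_stats(rows, column_entropy_fn):
    """Compute min, max, and entropy once per column, before any trial runs."""
    stats = []
    for column in zip(*rows):  # transpose rows into columns
        stats.append({
            "min": min(column),
            "max": max(column),
            "entropy": column_entropy_fn(column),
        })
    return stats
```

The trial runner would then look these values up by column index instead of recomputing them in every round.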

NaN failure in cardio dataset trial

A NaN is being returned by the OpenCL get_entropy function. Please track this down.
At least guard against it and fail nicely before going deeper.
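
A sketch of the guard on the host side. One frequent source of NaN in entropy code is a term of the form x * log(x) evaluated at x = 0, which in OpenCL C yields 0 * -inf; whether that is the cause here is not established by the issue. Both helper names are hypothetical.

```python
import math


def xlogx(x):
    """x * log(x), using the limit value 0.0 at x == 0."""
    if x == 0.0:
        return 0.0
    return x * math.log(x)


def require_finite(value, label):
    """Fail with a clear message if a computed value is NaN or infinite."""
    if not math.isfinite(value):
        raise ValueError(f"{label} is not finite: {value!r}")
    return value
```

Wrapping the value read back from the kernel, for example `require_finite(result, "get_entropy result")`, would stop the trial at the first bad value rather than letting the NaN propagate into the ranking.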

Fix OpenCL sum reduction

We need OpenCL sum reduction to work properly, because at least two big calculations depend on it.

A fix here may generalize to min/max reductions.
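
A host-side reference for the pairwise (tree) pattern that work-group reductions typically follow, written in plain Python so kernel output can be checked against it. This is a sketch, not the project's kernel: `tree_reduce` is a hypothetical name, and padding with the identity element is one assumed way of handling input lengths that are not a power of two.

```python
import operator


def tree_reduce(values, op=operator.add, identity=0.0):
    """Pairwise (tree) reduction over values using a binary operator."""
    data = list(values)
    if not data:
        return identity
    # Pad to a power of two with the identity so every pass halves cleanly.
    size = 1
    while size < len(data):
        size *= 2
    data.extend([identity] * (size - len(data)))
    stride = size // 2
    while stride > 0:
        # Each pass combines element i with its partner one stride away.
        for i in range(stride):
            data[i] = op(data[i], data[i + stride])
        stride //= 2
    return data[0]
```

The same function covers the min/max case by swapping the operator and identity, for example `tree_reduce(xs, min, float("inf"))` or `tree_reduce(xs, max, float("-inf"))`.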

Strip input

Make sure the menu input is less confusing
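
Reading 'Strip input' as trimming whitespace from what the user types, a minimal sketch could look like this. `parse_menu_choice` is a hypothetical name, and lowercasing the input is an extra assumption about how menu keys are compared.

```python
def parse_menu_choice(raw, valid_choices):
    """Return the normalized choice, or None if the input is not valid."""
    choice = raw.strip().lower()
    return choice if choice in valid_choices else None
```

Returning None for anything unrecognized lets the menu loop re-prompt instead of failing on stray spaces or a trailing newline.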
