Code Monkey home page Code Monkey logo

characteristic-based-time-series-clustering's People

Contributors

njfritter avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

sarmad-m lfywork

characteristic-based-time-series-clustering's Issues

Finish up Exploratory Analysis Items for WESAD

Couple of final items for the exploratory analysis:

  • Add outlier detection method and apply it to data
  • Remove displaying full sets of data (i.e. the full correlation dictionaries)
  • Remove the chest data concatenation that was done twice to see if dropping the indexes would speed up the process

Will update as needed.

Rearrange File Directories

I decided to divide up the directories by category of code (i.e. code I attempted to make from 2006 paper, actual code from 2012, tsfresh, etc.) but this is confusing and would end up having a lot of duplicate scripts and notebooks since there's also exploratory data analysis for each data set.

I will either:

  • Remove this directory structure altogether and just differentiate which code I'm using by the script name
  • Change the directory structure and keep these directories

Will update this issue once started.

Feature Extraction & Clustering of NBA/Other Sports Data

After exploratory analysis of the data, extract features and cluster together. Some interesting use cases:

  • Out of all the stars in the league (Steph Curry, Lebron James, Kawhi Leonard, Kevin Durant, etc.) which are the most similar?
  • What lesser known player is most like these above stars (and vice versa)?
  • Can we train a clustering algorithm to try and predict new players (i.e. rookies) and whose game they are most similar to once they start playing?
  • Etc.

Push Rest of Code Reproduction from 2006 Paper

This project started out with me trying to reproduce the Characteristic Based Time Series Clustering Paper calculations in code. Currently it is kind of a mess, so it needs to be invested in and fixed so it will work with any time series data.

Figure Out Minimum Weighted Biparite Matching in Python

The last part of the whole time series clustering exercise is to validate the quality of the clusterings. In order to properly do this, one must cluster on two separate subsets of our data (drawing from the same distribution) and compare.

However, since clustering is unsupervised the labels from clustering don't have any significant meaning in themselves (i.e. clustering 1 could lead to labels A/B/C and Clustering 2 could lead to labels D/E/F) so we need a way to compare them.

The Hungarian problem (Specifically minimum weighted biparite matching) is how these clustering labels can be compared. This issue is to figure out how to properly code it.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.