njfritter / characteristic-based-time-series-clustering Goto Github PK

View Code? Open in Web Editor NEW

6.0 6.0 2.0 10.02 MB

Time Series Clustering Based on Characteristic Based Feature Extraction inspired by the paper mentioned in the README

Python 0.52% R 0.14% Shell 0.01% Jupyter Notebook 99.33%

characteristic-based-time-series-clustering's People

Contributors

Stargazers

Watchers

Forkers

sarmad-m lfywork

characteristic-based-time-series-clustering's Issues

Start Using tsfresh For Feature Extraction + Analysis

Once the initial pass through of feature extraction + clustering + analysis is done (with the code reproduced from the 2012 R code), start using tsfresh and compare the quality of features being extracted.

Finish up Exploratory Analysis Items for WESAD

Couple of final items for the exploratory analysis:

Add outlier detection method and apply it to data
Remove displaying full sets of data (i.e. the full correlation dictionaries)
Remove the chest data concatenation that was done twice to see if dropping the indexes would speed up the process

Will update as needed.

Rearrange File Directories

I decided to divide up the directories by category of code (i.e. code I attempted to make from 2006 paper, actual code from 2012, tsfresh, etc.) but this is confusing and would end up having a lot of duplicate scripts and notebooks since there's also exploratory data analysis for each data set.

I will either:

Remove this directory structure altogether and just differentiate which code I'm using by the script name
Change the directory structure and keep these directories

Will update this issue once started.

Feature Extraction & Clustering of WESAD Data

After exploratory analysis is finished, begin extracting features and initial stages of clustering data.

Exploratory Analysis of WESAD Data

Already begun here: #4

Feature Extraction & Clustering of NBA/Other Sports Data

After exploratory analysis of the data, extract features and cluster together. Some interesting use cases:

Out of all the stars in the league (Steph Curry, Lebron James, Kawhi Leonard, Kevin Durant, etc.) which are the most similar?
What lesser known player is most like these above stars (and vice versa)?
Can we train a clustering algorithm to try and predict new players (i.e. rookies) and whose game they are most similar to once they start playing?
Etc.

The Hungarian problem (Specifically minimum weighted biparite matching) is how these clustering labels can be compared. This issue is to figure out how to properly code it.

njfritter / characteristic-based-time-series-clustering Goto Github PK

characteristic-based-time-series-clustering's People

Contributors

Stargazers

Watchers

Forkers

characteristic-based-time-series-clustering's Issues

Start Using tsfresh For Feature Extraction + Analysis

Finish up Exploratory Analysis Items for WESAD

Rearrange File Directories

Feature Extraction & Clustering of WESAD Data

Exploratory Analysis of WESAD Data

Feature Extraction & Clustering of NBA/Other Sports Data

Exploratory Analysis of NBA/Other Sports Data

Push Rest of Code Reproduction from 2006 Paper

Figure Out Minimum Weighted Biparite Matching in Python

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent