Comments (4)
Thank you for pointing this out.
There are two cases:
-
the query is z-normed: thus we have to z-norm all the subsequences.
-
query is not z-normed:
a) we can either force the query to be z-normed, which is one call to query.norm() before performing the query or
b) we can not apply z-normalization at all, which would be a check in getDistance.
Typically, we force the query to be z-normed in this case. However, I agree that it is possible to leave out z-norm, too. The proposed pull-request however, breaks the code by removing z-norm for subsequences.
from sfa.
For my understanding, it's better to return the un-normalized distance measure. So a potential fix will be: if the query is normalized -> unnormalize its values, if the timeseries is normalized -> un-normalize the values -> then calculate the euclidean distance between original un-normalized versions of both timeseries data and query :)
Big thanks anyway!
from sfa.
Removing it, is not sooo easy. For example, SFA uses the momentary Fourier transform. The MFT applies z-normalization to each subsequence:
SFA/src/main/java/sfa/transformation/MFT.java
Line 112 in 43cfad8
The same does TimeSeries.getDisjointSequences
and TimeSeries.getSubsequences
So, when removing it from getDistance
, you suddenly try to lower bound un-normalized subsequences with z-normalized words. This should fail.
from sfa.
yes i know, that's why i didn't propose a fix. I saw that in subsequence matching, each subsequence is normalized with different mean and std, so i can't un-normalized. maybe we need 2 separate methods.
from sfa.
Related Issues (20)
- Planning a release? HOT 11
- WEASEL at timeseriesclassification.com HOT 4
- NullPointerException in WEASEL.createBagOfPatterns HOT 2
- TimeSeriesLoader.java exception HOT 1
- Instructions on how to build the project? HOT 3
- WEASEL+MUSE: Datasets Format HOT 9
- SFATrie search by radius HOT 1
- Memory related errors when running UCR tests HOT 1
- Python release? HOT 4
- README Inconsistent HOT 3
- Meaning of output HOT 1
- Data for histograms HOT 1
- Use New Dataset HOT 1
- may you share python code for multivariate time series HOT 3
- Size of training data HOT 1
- can you classify categorical data time series ? HOT 1
- TEASER algorithm - Python version HOT 1
- Running WeaselMUSE on UCR dataset HOT 1
- MTSClassificationTest fails HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sfa.