Comments (4)
I just repeated this behavior for five fiducial tracks, each with about 20000 frames on the Olympus computer. It filled up all 16gb of memory. And didn't complete.
Three tracks worked, filling only 12 GB and took 10 or 20 seconds.
Perhaps I can implement ELKI's version of DBSCAN.
from bstore.
@nberliner recommended trying HDBSCAN as a high performance implementation.
from bstore.
I'm not sure about the memory consumption. It apparently is a bit faster than the sklearn DBSCAN implementation (see here). Interestingly, it appears from that comparison as if the sklearn DBSCAN implementation can cluster 200000 points on a laptop with 8GB.
One advantage of HDBSCAN is that it dynamically selects a suitable density for clustering which can vary for each cluster in the field of view. There is only one parameter, the minimum number of clusters, which must be set by the user. I found the description given on the project page very good (see here).
from bstore.
See this discussion: scikit-learn/scikit-learn#5275
Also note discussion on DBSCAN and memory usage here: http://scikit-learn.org/stable/modules/clustering.html#dbscan
It seems that if the neighborhood radius is made too large, then the memory consumption blows up. I noticed this when I recently tried to cluster a dataset that was in units of pixels instead of nanometers. Setting the neighborhood radius to "50" included nearly every point in the radius and ate up all my memory. Resetting it to 0.5 pixels worked without much memory consumption.
from bstore.
Related Issues (20)
- HDFDatastore.build() is not returning a DataFrame but always returns None on development branch HOT 1
- PositionParser configuration fails in new 1.1.0 build HOT 1
- Add Python 3.6 support and packages to conda
- Add backwards compatibility for old DatasetID's HOT 1
- On the concept of global and local datasets HOT 2
- Change bstore.__version__ to follow Python standard practices
- Fiducial drift correction with `removeFiducials=False` raises error HOT 1
- Include a x-y scatter plot next to the fiducial drift trajectories
- Move MergeFang and MergeFangTS out of B-Store and into a custom plugin
- CSVReader and JSONReader not compatible with Garg HOT 1
- Unit tests failing with new tifffile version 0.12.0 HOT 1
- Upgrade to tifffifle 0.12.0 HOT 2
- Unit tests failing with Pandas 0.20.1 HOT 2
- Astigmatic calibration curves need to be persistent
- The Overlay multiprocessor should not assume that stats argument has cluster_id as the index
- Overlay processor will not display any clusters if annotateCol=False
- Overlay processor makes a mark for the center of the "noise" cluster
- Add ability to save astigmatic calibration curves and wobble curves
- OverlayClusters multiprocessor does not return to intepreter with matplotlib > 2.1 HOT 2
- Documentation builds failing due to infinite recursion in Mock HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bstore.