
Comments (4)

kmdouglass commented on May 26, 2024

I just reproduced this behavior with five fiducial tracks, each with about 20,000 frames, on the Olympus computer. It filled all 16 GB of memory and did not complete.

Three tracks worked, using only 12 GB and taking 10 to 20 seconds.

Perhaps I can implement ELKI's version of DBSCAN.

from bstore.

kmdouglass commented on May 26, 2024

@nberliner recommended trying HDBSCAN as a high-performance implementation.

nberliner commented on May 26, 2024

I'm not sure about HDBSCAN's memory consumption. It is apparently a bit faster than the sklearn DBSCAN implementation (see here). Interestingly, that comparison suggests the sklearn DBSCAN implementation can cluster 200,000 points on a laptop with 8 GB of memory.

One advantage of HDBSCAN is that it dynamically selects a suitable density for clustering, which can vary for each cluster in the field of view. There is only one parameter that the user must set: the minimum cluster size. I found the description given on the project page very good (see here).

kmdouglass commented on May 26, 2024

See this discussion: scikit-learn/scikit-learn#5275

Also note discussion on DBSCAN and memory usage here: http://scikit-learn.org/stable/modules/clustering.html#dbscan

It seems that if the neighborhood radius is made too large, memory consumption blows up. I noticed this when I recently tried to cluster a dataset that was in units of pixels instead of nanometers. Setting the neighborhood radius to "50" put nearly every point inside every neighborhood and ate up all my memory. Resetting it to 0.5 pixels worked without much memory consumption.
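A rough way to see why the radius matters, as a pure-Python sketch rather than a measurement of sklearn itself. As far as I understand the scikit-learn documentation, its DBSCAN computes all neighborhoods up front, so memory grows with the total number of neighbors summed over all points. The helper `total_neighborhood_size`, the 100 x 100 pixel field, and the 400 random points are all hypothetical.

```python
import math
import random


def total_neighborhood_size(points, eps):
    """Sum of neighborhood sizes over all points (each point counts itself)."""
    total = 0
    for x1, y1 in points:
        for x2, y2 in points:
            if math.hypot(x1 - x2, y1 - y2) <= eps:
                total += 1
    return total


rng = random.Random(0)
# Hypothetical data: 400 points spread uniformly over a 100 x 100 pixel field.
points = [(rng.uniform(0, 100), rng.uniform(0, 100)) for _ in range(400)]

for eps in (0.5, 5, 50):
    entries = total_neighborhood_size(points, eps)
    print(f"eps = {eps}: {entries} neighborhood entries")
```

With a small radius the total stays close to the number of points; once the radius is comparable to the size of the field it approaches the square of the number of points, which is what exhausts memory for tens of thousands of localizations.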
