Code Monkey home page Code Monkey logo

Comments (8)

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by Yaniv Lubling (Bitbucket: 557058:11761e0c-f009-41f9-a29c-6f6993e0539c, ).


Hi,

The other parameter that sets the number of processes metacell uses is mc_cores. It is 16 by default, which should be ok on your machine. Did you change it?
you can get it with: n_cores = tgconfig::get_param("mc_cores", "metacell")

As a sanity, let's see if it works when you set both mc_cores and tgs_max.processes to 4.
To set mc_cores, do: tgconfig::set_param("mc_cores", 4, "metacell")

Yaniv

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).


Hi,
I have followed your reply but it runs slowly with 4 threads. I read metacell paper and it says metacell could handle 160k PBMC dataset. Could you upload the R script of 160k dataset processing when it is convenient for you? I want to re run it on my station to check whether the error would produce or not . Thank you.

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by Amos Tanay (Bitbucket: 557058:5a3503bb-91a6-4af0-972c-ab1d0c3bb50f, GitHub: amostanay).


So if it runs with 4 threads, it is hard to understand why it fails with more, unless there are some machine-specific restrictions . The system should indeed run with 160K cells and more...and we usually use something like 16-24 cores on nodes that look very much like what you described (dual CPU 12x2 physical cores, 512G RAM)

Can you try to increase the number of cores and see where it get blocked? make sure your machine is not loaded by other stuff?

(And thanks for reporting these issues)

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).


without changing mc_cores, I used tgs_max.processes 16, 24 ,48 , and 96 but all produce same error. therefore, I set mc_cores to 4 and tgs_max.processes to 16, the error is produced again. Now, I set 4 to both params, it is slow so I will check it tomorrow and report result.

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by Amos Tanay (Bitbucket: 557058:5a3503bb-91a6-4af0-972c-ab1d0c3bb50f, GitHub: amostanay).


BTW - why are you using n_resamp=5000? for strater take it down to 100 or 50 - assuming the resampling is the bottleneck you'll know how things look like 100 time faster..

We do not recommend more than 500 bootstrap - even 200 should typically be enough

Amos

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).


maybe I misunderstood this parameter, I thought it related with meta cell number. in 8k pbmc tutorial, n_resample is 500, another tutorial is 1000, so I select a larger number. maybe it is the reason why the error produced.

BTW, n_resample 2500, 4 for mc_cores and tgsmaxcore produce same error. and I reduced the bootstrap, I will update the result later.

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).


indeed, larger n_resample number leads this error, setting to 1000 will be ok. Thank you~

another question: if there are some small clusters in dataset, does the n_resample parameter affect the metacell precision?

from metacell.

aviezerl avatar aviezerl commented on June 20, 2024

Original comment by Amos Tanay (Bitbucket: 557058:5a3503bb-91a6-4af0-972c-ab1d0c3bb50f, GitHub: amostanay).


You will not gain more sensitivity from running more boostraps - 500 are absolutely enough.

More boostraps can help making metacell cover of large clusters a bit more accurate - if you are worried about small clusters - these should be detected with high sensitivity even without any boostraping.

Paradoxically - it is easier to define small and tight rare cell types than to describe accurately the subbtle gene expression variation within prevalent cell types..

from metacell.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.