Comments (8)
Original comment by Yaniv Lubling (Bitbucket: 557058:11761e0c-f009-41f9-a29c-6f6993e0539c, ).
Hi,
The other parameter that sets the number of processes metacell uses is mc_cores. It is 16 by default, which should be ok on your machine. Did you change it?
you can get it with: n_cores = tgconfig::get_param("mc_cores", "metacell")
As a sanity, let's see if it works when you set both mc_cores and tgs_max.processes to 4.
To set mc_cores, do: tgconfig::set_param("mc_cores", 4, "metacell")
Yaniv
from metacell.
Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).
Hi,
I have followed your reply but it runs slowly with 4 threads. I read metacell paper and it says metacell could handle 160k PBMC dataset. Could you upload the R script of 160k dataset processing when it is convenient for you? I want to re run it on my station to check whether the error would produce or not . Thank you.
from metacell.
Original comment by Amos Tanay (Bitbucket: 557058:5a3503bb-91a6-4af0-972c-ab1d0c3bb50f, GitHub: amostanay).
So if it runs with 4 threads, it is hard to understand why it fails with more, unless there are some machine-specific restrictions . The system should indeed run with 160K cells and more...and we usually use something like 16-24 cores on nodes that look very much like what you described (dual CPU 12x2 physical cores, 512G RAM)
Can you try to increase the number of cores and see where it get blocked? make sure your machine is not loaded by other stuff?
(And thanks for reporting these issues)
from metacell.
Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).
without changing mc_cores, I used tgs_max.processes 16, 24 ,48 , and 96 but all produce same error. therefore, I set mc_cores to 4 and tgs_max.processes to 16, the error is produced again. Now, I set 4 to both params, it is slow so I will check it tomorrow and report result.
from metacell.
Original comment by Amos Tanay (Bitbucket: 557058:5a3503bb-91a6-4af0-972c-ab1d0c3bb50f, GitHub: amostanay).
BTW - why are you using n_resamp=5000? for strater take it down to 100 or 50 - assuming the resampling is the bottleneck you'll know how things look like 100 time faster..
We do not recommend more than 500 bootstrap - even 200 should typically be enough
Amos
from metacell.
Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).
maybe I misunderstood this parameter, I thought it related with meta cell number. in 8k pbmc tutorial, n_resample is 500, another tutorial is 1000, so I select a larger number. maybe it is the reason why the error produced.
BTW, n_resample 2500, 4 for mc_cores and tgsmaxcore produce same error. and I reduced the bootstrap, I will update the result later.
from metacell.
Original comment by JPLau (Bitbucket: 5c2729a816ac1e4f7cbbda0b, ).
indeed, larger n_resample number leads this error, setting to 1000 will be ok. Thank you~
another question: if there are some small clusters in dataset, does the n_resample parameter affect the metacell precision?
from metacell.
Original comment by Amos Tanay (Bitbucket: 557058:5a3503bb-91a6-4af0-972c-ab1d0c3bb50f, GitHub: amostanay).
You will not gain more sensitivity from running more boostraps - 500 are absolutely enough.
More boostraps can help making metacell cover of large clusters a bit more accurate - if you are worried about small clusters - these should be detected with high sensitivity even without any boostraping.
Paradoxically - it is easier to define small and tight rare cell types than to describe accurately the subbtle gene expression variation within prevalent cell types..
from metacell.
Related Issues (20)
- mc2d_force_knn fails if mc wasn't colorized beforehand HOT 1
- Is it possible to get knn index file from Metacell?
- linear pipeline collapse with Error due to undefined variable 'marks_fn' used to coloring mc
- Could we add the proteomic data into metacell
- code error in gstats.r HOT 1
- mcell_mc_export_tab->Source Code may need to be modified HOT 3
- mcell_add_gene_stat HOT 3
- install issue HOT 1
- comp mc graph using the graph RGdb_50_graphs and K 20 Error in confu/csize2 : non-conformable arrays
- Coudn't install metacell on windows HOT 1
- Can't download the tutorial data HOT 2
- Function to extract mat class with mc and mc2d HOT 1
- Question: Assign a metacell HOT 1
- mcell_mc_plot_marks creates incorrect annotation colorbar when using focus_mcs
- Log Transformation
- Batch
- install scdb, download test
- dysfunction and activation score
- mcell_mc_hierarchy while condition HOT 1
- Class of mc number here should be character but not numeric
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from metacell.