Code Monkey home page Code Monkey logo

Comments (7)

huidongchen avatar huidongchen commented on July 4, 2024 1
  1. Yes. Different root branches are generated just for visualization. It doesn't affect the assigned branch id for each cell. (The latent tree structure is still held)

  2. Sorry about it. We tested it in server, in which we have a massive amount of memory. Basically in current pipeline, the dimension reduction step is consuming a lot of memory for large data and somehow the docker image doesn't deal with it well. The worst case is that it quits without finishing. For now, i'm afraid the only workaround is to try further increase the memory. But we are working on it and will release the new version soon, in which this issue will be solved. Sorry about the inconvenience.

from stream.

huidongchen avatar huidongchen commented on July 4, 2024

Hi,

The cell_info.tsv file should contain the information of all cells, including branch id, pseudotime, etc.

In STREAM pipeline, we only filter out genes but keep all the cells. So my guess is that your input matrix might not be appropriate. The input matrix should have cells in columns and genes in rows. You might want to transpose your gene expression matrix before feeding it to STREAM.

Could you please check it? Thanks!

from stream.

anjanbharadwaj avatar anjanbharadwaj commented on July 4, 2024

Hi,

I checked the gene expression data, and the cells are in columns and the genes are in rows, so I don't think I need to transpose it. I also tried the following command on another dataset of mine previously, and it worked on that (generated all values in the cell_info). I think the issue I may have been seeing is not that the cell_info.tsv file is filtering data, but rather that when I run the below command, it is only generating the "Precomputed" folder, and no other files/pdfs/etc.

The command is: docker run -v $PWD:/data -w /data pinellolab/stream STREAM -m mat.tsv -l labels.tsv -s all -o stream_results.

Also, another question (this is a bit of a different), but I see that STREAM generates a set of pseudotimes for each possible root branch. Is there any way that STREAM indicates which branch it predicts should be the starting one?

from stream.

anjanbharadwaj avatar anjanbharadwaj commented on July 4, 2024

Yes, actually the issue seems to be that (what I mentioned in the first paragraph) - when I run the command, it generates the values in "Precomputed", but doesn't create the new cell_info.tsv file. The reason why I thought that the cells were filtered was because I probably first ran it on a small dataset with only 450 cells, and then on the 4000 cell dataset.

Any idea on why this might be happening, though?

from stream.

huidongchen avatar huidongchen commented on July 4, 2024

Hi,

Thanks for checking it!

  1. That's a good point. But the decision of root node is quite a open challenge. In order to achieve it, currently we have to reply on some exterior information like FACS sorting, marker gene or sampling time. That's why in the default output STREAM gives all the possible trajectories. Sorry that there is no automatic way available in STREAM to determine the root branch.

2)Yes, cells were actually not filtered out. It was because the analysis didn't get finished. The issue is caused by the memory efficiency issue in current docker image, especially when it comes to large data. We did notice the issue and are working on the new version. We'll keep you posted. In the meantime, can you try increase the memory size in docker (Docker -> Preferences -> Advanced)?

I hope the above is useful to you.

from stream.

anjanbharadwaj avatar anjanbharadwaj commented on July 4, 2024

Hi,

Thanks for letting me know!

  1. Just to confirm, though; even though all of the root branches are generated, we do know which branch each cell type is on, correct?

  2. Will try this out. Do you think 4GB will be sufficient?

from stream.

anjanbharadwaj avatar anjanbharadwaj commented on July 4, 2024

I tried it out with 6GiB of memory, 2 CPUs, and 1 GiB of Swap, but it still only generated the Precomputed folder.

from stream.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.