Code Monkey home page Code Monkey logo

Comments (5)

r3fang avatar r3fang commented on June 11, 2024
  1. That must be a mistake for the parameters (>1000 reads per cell, 3% consecutive promoter coverage, 20% reads in peak) this will give more cells, i will update the parameter shortly. 3% consecutive promoter is lower than what we have used before.
  2. the reason that you did not see a good seperation of clusters probability because the number of cells. 4,600 cells include many cells of low quality therefore makes the cluster result undesirable. One thing you can try quickly is to change consecutive promoter coverage to filter cells to 1,500 cells and cluster again
    I will update the barcode filtering parameter shortly.

from snatac.

jy634 avatar jy634 commented on June 11, 2024

Thank you for your reply! I really like the way this pipeline is formatted, and I was hoping this would ease my analysis as well...

  1. Would you be able to help me understand what exactly a consecutive promoter coverage means..? Is it simply promoter regions merged&sorted together? I downloaded the mm10.consecutive_promoter.bed file from the server directly, but I wasn't quite sure what region it is actually specifying.
  2. I increased the threshold to 0.09, which resulted in giving 1,465 cells. However I suddenly ran into an error that I've never seen before :
    Error in La.svd(V) : error code 1 from Lapack routine 'dgesdd'
    Is this anything expected along the pipeline?

+3. I've also inspected the actual *.jacard files to check whether it contains the right value (which should be 0 < index < 1, from my understanding). Yet, what I find is a matrix with a value of 1000.00, and 0s on the diagonal. Is this an error or is it just the way jaccard matrix displays in my text editor...? I wasn't able to look at the snATAC_jacard code itself as it seems to be in a non-readable format.

Thank you so much in advance for all your help.

from snatac.

jy634 avatar jy634 commented on June 11, 2024

(update on 3) Hello, I actually just found out that my bmat file didn't contain any 1s in it from the first place. I'm trying to understand why the bmat function haven't been calling in the peaks... Do you think this can be a python version specific issue? (I'm currently using python 2.7)

  • In python version 2.x, while True might need to be while 1.
    After specifying this part the code at least runs through, but I'm still not getting any intersections. Would you be able to specify which version of pybedtools and Python you're using?

from snatac.

r3fang avatar r3fang commented on June 11, 2024
  1. consecutive_promoters are promoters that are accessible across all tissues/cell lines using all published bulk ATAC/DHS sites.
  2. python 2.7 should be fine. i never saw this error ...
  3. because jaccard index is very small, it is jaccard x 1000

from snatac.

jy634 avatar jy634 commented on June 11, 2024

Thanks for clarifying for me!

I think something might not've run through when I chunked the read file up. I managed to get this working now :) thank you so much!

from snatac.

Related Issues (10)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.