Code Monkey home page Code Monkey logo

cellranger-snakemake's People

Contributors

inmanjm avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

Forkers

inmanjm

cellranger-snakemake's Issues

Add FindAllMarkers and create various heatmaps

FindAllMarkers has been added, and code is in place that should create the heatmap for Seurat objects containing merged 'groups':

if ( str_detect( newname, 'group_') )  {
  
    message( 'Running FindAllMarkers')
    DE_markers <- FindAllMarkers(x)
  
    clusters_up   <- setorder( setDT( DE_markers ), -avg_logFC )[ , head( .SD, 100 ), keyby='cluster' ]
    clusters_down <- setorder( setDT( DE_markers ),  avg_logFC )[ , head( .SD, 100 ), keyby='cluster' ]
  
    pdffile = str_replace( seurat_filepath, "seurat\\.Rdata", "heatmap.pdf" )
    pdfs[[pdffile]] <- DoHeatmap( x, features = c( clusters_up$V1, clusters_down$V1 ), raster=F ) +
      scale_fill_gradientn( colors = rev( RColorBrewer::brewer.pal( n = 10, name = "RdBu" )) ) +
      guides( color = FALSE)

  }

... but it is unclear whether this is working or not, as the .pdf files are not being generated.

NOTE: This issue may be a non-issue if the .pdf file-making issues are resolved and they appear to be ok.

Add filtration of various 'noisy' genes

There is a list of house-keeping genes, pseudo-genes, etc. that should be removed from these kinds of analysis. Import that list and remove them from the analysis, probably after the conversion to gene symbols from ensg.genes.

Create tSNE, UMAP plots as pdfs

An attempt has been made at having the pipeline produce pdf files of the following plots:

  • pca
  • tSNE
  • UMAP
  • DE Heatmap

However, having the pipeline create and output the plots in parallel is presenting challenges. As of now, the code tries to store the plots in a list, and outside of the foreach loop print them to disk like so:

for ( pdffile in pdfs ) {
 
  pdf(pdffile)
  pdfs[[pdffile]]
  dev.off()

}

This produces no image files.
Investigate R best-practices for this and implement them.

Create SOP describing setup, pipeline execution, etc.

Documentation needs to be provided that explains the setup and execution of the pipeline, including:

  • Required inputs
  • Pre-requisites for execution (biowulf acct, etc.)
  • Steps for typical execution
  • Description of output.

Ideally, this documentation will exist as perldoc within run_cellranger.pl and in README.md of this repository.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.