ncats / cellranger-snakemake Goto Github PK
View Code? Open in Web Editor NEWSnakemake workflow and supporting files for running typical cellranger commands
Snakemake workflow and supporting files for running typical cellranger commands
This has been a request for a while now.
foreach ( ) %dopar%
is failing sporadically. Determine the cause of these failures.
Remove a bunch of the 'intermediate' files that are of no interest after the pipeline is finished.
Be sure to document which files are removed.
FindAllMarkers has been added, and code is in place that should create the heatmap for Seurat objects containing merged 'groups':
if ( str_detect( newname, 'group_') ) {
message( 'Running FindAllMarkers')
DE_markers <- FindAllMarkers(x)
clusters_up <- setorder( setDT( DE_markers ), -avg_logFC )[ , head( .SD, 100 ), keyby='cluster' ]
clusters_down <- setorder( setDT( DE_markers ), avg_logFC )[ , head( .SD, 100 ), keyby='cluster' ]
pdffile = str_replace( seurat_filepath, "seurat\\.Rdata", "heatmap.pdf" )
pdfs[[pdffile]] <- DoHeatmap( x, features = c( clusters_up$V1, clusters_down$V1 ), raster=F ) +
scale_fill_gradientn( colors = rev( RColorBrewer::brewer.pal( n = 10, name = "RdBu" )) ) +
guides( color = FALSE)
}
... but it is unclear whether this is working or not, as the .pdf files are not being generated.
NOTE: This issue may be a non-issue if the .pdf file-making issues are resolved and they appear to be ok.
There is a list of house-keeping genes, pseudo-genes, etc. that should be removed from these kinds of analysis. Import that list and remove them from the analysis, probably after the conversion to gene symbols from ensg.genes.
An attempt has been made at having the pipeline produce pdf files of the following plots:
However, having the pipeline create and output the plots in parallel is presenting challenges. As of now, the code tries to store the plots in a list, and outside of the foreach loop print them to disk like so:
for ( pdffile in pdfs ) {
pdf(pdffile)
pdfs[[pdffile]]
dev.off()
}
This produces no image files.
Investigate R best-practices for this and implement them.
Documentation needs to be provided that explains the setup and execution of the pipeline, including:
Ideally, this documentation will exist as perldoc within run_cellranger.pl
and in README.md of this repository.
Currently the pipeline is using ensg.genes
if it exists in the working directory. It should default to looking for it in the ./bin dir. There should be a way to have it copy the file into the workdir; and retain the ability to have it newly created, if desired.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.