Code Monkey home page Code Monkey logo

scflow's Introduction

Combiz is a full-stack data scientist based in the UK, specializing in bioinformatics and FinTech. With expertise in analyzing multimodal data, he develops comprehensive analytical solutions, including automated pipelines, machine learning models, statistical models, APIs, and visualizations.

His work with open-source software and various other projects can be found here on GitHub. For a deeper dive into his notes and projects, please visit his website.

Recent Highlights

scFlow, an R package toolkit for the analysis of scRNAseq data with higher levels of abstraction. And a Nextflow pipeline, nf-core-scflow, for automated, cloud scalable, reproducible analyses of full scRNAseq datasets using the scFlow toolkit. See the pre-print here or a lay summary here.

     

 

Standard Toolkit

python R NumPy Pandas Jupyter bash C++ linux sklearn PyTorch Keras dagster FastAPI Flask Django Kubernetes Streamlit GCP AWS PySpark dbt SQL MongoDB docker git react svelte html5 css3 javascript illustrator photoshop postman vscode

scflow's People

Contributors

al-murphy avatar combiz avatar mahdimoradimarjaneh avatar mike-robiology avatar nathanskene avatar nfancy avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

scflow's Issues

Permutation based testing of differential expression

After the main differential expression analysis is done, the values of the main value tested (i.e. Braak stage) should be permuted, and the DE repeated. Are a similar number of genes found to be differentially expressed with the permuted labels?

Can scFlow R package be installed in RStudio server? Does the requirement for libudunits2-dev block this?

Presumably many people would try doing this the first time they try using it.

Mahdi reported: "A few packages need to be installed separately. Still struggling with “units”. It seems the only way is to first installing libudunits2-dev which needs a sudo access. Any clue? Or I will check with the RCS team."

If it cannot be done, the requirement should be clearly stated alongside installation instructions

Two monocle related installation errors: ( libudunits2.so ) and (gdal-config)

Another monocle related error during scFlow installation that should be mentioned in installation documentation

The first gives this error:

configure: error: in `/tmp/RtmpORsFJ4/R.INSTALL3175f370031/units':
configure: error: 
--------------------------------------------------------------------------------
  Configuration failed because libudunits2.so was not found. Try installing:
    * deb: libudunits2-dev (Debian, Ubuntu, ...)
    * rpm: udunits2-devel (Fedora, EPEL, ...)
    * brew: udunits (OSX)
  If udunits2 is already installed in a non-standard location, use:
    --configure-args='--with-udunits2-lib=/usr/local/lib'
  if the library was not found, and/or:
    --configure-args='--with-udunits2-include=/usr/include/udunits2'
  if the header was not found, replacing paths with appropriate values.
  You can alternatively set UDUNITS2_INCLUDE and UDUNITS2_LIBS manually.
--------------------------------------------------------------------------------

See `config.log' for more details
ERROR: configuration failed for package ‘units’
* removing ‘/home/nskene/Projects/scFlowExamples/renv/library/R-3.6/x86_64-pc-linux-gnu/units’
Error: Failed to install 'scflow' from GitHub:
  Failed to install 'monocle3' from GitHub:
  (converted from warning) installation of package ‘units’ had non-zero exit status

I've made a (wiki page)[https://github.com/combiz/scflow/wiki/%23Error:-libudunits2.so-was-not-found] about that error.

The second error is:

* installing *source* package ‘sf’ ...
** package ‘sf’ successfully unpacked and MD5 sums checked
** using staged installation
configure: CC: gcc -std=gnu99
configure: CXX: g++ -std=gnu++11
checking for gdal-config... no
no
configure: error: gdal-config not found or not executable.
ERROR: configuration failed for package ‘sf’
* removing ‘/home/nskene/Projects/scFlowExamples/renv/library/R-3.6/x86_64-pc-linux-gnu/sf’
Error: Failed to install 'scflow' from GitHub:
  Failed to install 'monocle3' from GitHub:
  (converted from warning) installation of package ‘sf’ had non-zero exit status

Dependencies on batchelor and DelayedmatrixStats via monocle3

Got the following error while installing:

Installing 26 packages: batchelor, DelayedMatrixStats, grr, Matrix.utils, pbmcapply, pheatmap, proxy, pscl, rsample, RhpcBLASctl, slam, spdep, speedglm, viridis, furrr, sp, spData, sf, deldir, LearnBayes, coda, expm, gmodels, classInt, units, e1071
Installing packages into ‘/home/nskene/Projects/scFlowExamples/renv/library/R-3.6/x86_64-pc-linux-gnu’
(as ‘lib’ is unspecified)
Error: Failed to install 'scflow' from GitHub:
Failed to install 'monocle3' from GitHub:
(converted from warning) packages ‘batchelor’, ‘DelayedMatrixStats’ are not available (for R version 3.6.1)

Create QC document with summary statistics for cell types

This report would be produced post-merge, post-reddim/cluster/celltype annotation and would report:

  • read depth per cell type
  • total cells per cell type
  • mitochondrial percentage per cell type
  • distribution across individuals per cell type
  • ...

Modify so it can handle MGI IDs

Ideally the pipeline should be able to handle data from any species. Currently it doesn't recognise any genes that have MGI IDs.

#Error: missing required header GL glu.h

The error may generate during installing R package rgl.

Install the following packaged using the command sudo apt install libftgl2 libcgal-dev libglu1-mesa-dev libglu1-mesa-dev .

solution was taken from this link

viewing UMAP3D on web browser

I'm using PuTTY to SSH to a Linux system (GNU/Linux 4.15.0-74-generic x86_64).
After merging and dimensionality reduction, I get the following error when running sce@metadata:

> sce@metadata
$reduced_dim_plots
$reduced_dim_plots$umap3d_plot_ly
$reduced_dim_plots$umap3d
Couldn't find a suitable web browser!
Set the BROWSER environment variable to your desired browser.
Warning: program returned non-zero exit code #1
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: www-browser: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: links2: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: elinks: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: links: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: lynx: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: w3m: not found
xdg-open: no method available for opening '/tmp/RtmpCOkJfQ/viewhtml7b8f2acc62e6/index.html'
> Couldn't find a suitable web browser!
Set the BROWSER environment variable to your desired browser.
Warning: program returned non-zero exit code #1
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: www-browser: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: links2: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: elinks: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: links: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: lynx: not found
/usr/bin/xdg-open: 851: /usr/bin/xdg-open: w3m: not found
xdg-open: no method available for opening '/tmp/RtmpCOkJfQ/viewhtml7b8f7ff070b3/index.html'

sce@metadata tries to open all reduced dimension plots of which umap3d needs a web browser.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.