pa-seq-compendia

Computationally efficient assembly of a Pseudomonas aeruginosa gene expression compendium

This repository contains scripts used to build compendia of publicly available RNAseq data for Pseudomonas aeruginosa.

The construction and composition of these compendia are documented in Doing et al. 2022.

Required input files and the main output files, such as the raw and normalized compendia, are stored on OSF: https://osf.io/s9gyu/

The compendia were originally constructed using a collection of Bash, Python, and R scripts, which are documented in compendia_construction. This pipeline required a pre-downloaded SRA run table, which was used to download FASTQ files of RNAseq data with fasterq-dump. These files were then mapped against P. aeruginosa transcriptomes using salmon, and the counts were post-processed in R and Python.
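The download and mapping steps described above can be sketched roughly as follows. This is an illustrative outline, not the code in compendia_construction: it assumes a comma-separated run table with a `Run` column, single-end reads (so that fasterq-dump writes one `<run>.fastq` file), and a pre-built salmon index. The helper names, the `work` directory layout, and the `pa_index` path are hypothetical. The functions only build the command lines and do not execute them.

```python
import csv
import io
from pathlib import Path


def read_run_accessions(run_table, column="Run"):
    """Return the run accessions listed in a comma-separated SRA run table.

    Assumes the table has a header row containing `column`.
    """
    reader = csv.DictReader(run_table)
    return [
        (row.get(column) or "").strip()
        for row in reader
        if (row.get(column) or "").strip()
    ]


def build_commands(run, index_dir, work_dir="work"):
    """Build (download, quantify) command lists for one run.

    Hypothetical layout: FASTQ files in <work_dir>/fastq and salmon
    output in <work_dir>/quant/<run>. Assumes single-end reads.
    """
    fastq_dir = Path(work_dir) / "fastq"
    quant_dir = Path(work_dir) / "quant" / run
    download = ["fasterq-dump", run, "--outdir", str(fastq_dir)]
    quantify = [
        "salmon", "quant",
        "-i", str(index_dir),                 # pre-built transcriptome index
        "-l", "A",                            # let salmon infer the library type
        "-r", str(fastq_dir / f"{run}.fastq"),
        "-o", str(quant_dir),
    ]
    return download, quantify


if __name__ == "__main__":
    # Placeholder accessions, not samples from the compendia.
    table = io.StringIO("Run,LibraryLayout\nSRR0000001,SINGLE\nSRR0000002,SINGLE\n")
    for run in read_run_accessions(table):
        for command in build_commands(run, "pa_index"):
            print(" ".join(command))
```

Each command list could be passed to `subprocess.run(command, check=True)` once fasterq-dump and salmon are installed. Paired-end runs would need `-1` and `-2` in place of `-r`.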

To facilitate the addition of new RNAseq samples to the compendia, we have provided an automated pipeline that will process new SRA experiment accessions, add the new counts to the original raw compendia, and re-filter and re-normalize the compendia. This pipeline is provided in auto_add_to_compendia. It automates the Bash, R, and Python scripts provided in compendia_construction, with minor modifications to the original code.
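The add, re-filter, and re-normalize flow can be pictured with a small sketch. It is not the logic in auto_add_to_compendia: the data layout (sample name mapped to per-gene counts), the function names, the minimum-total-count filter, and the counts-per-million scaling are all stand-ins chosen for illustration, since this README does not state the actual filtering criteria or normalization method.

```python
def add_samples(raw, new):
    """Return a new compendium containing the samples of both inputs.

    Both arguments map sample name -> {gene: count}.
    """
    overlap = raw.keys() & new.keys()
    if overlap:
        raise ValueError(f"samples already in compendium: {sorted(overlap)}")
    return {**raw, **new}


def filter_samples(compendium, min_total=1):
    """Drop samples whose total count is below `min_total`.

    Stand-in filter: the real criteria are not given in this README.
    """
    return {
        sample: counts
        for sample, counts in compendium.items()
        if sum(counts.values()) >= min_total
    }


def normalize_cpm(compendium):
    """Scale each sample to counts per million.

    Stand-in normalization, used here only to show the re-normalize step.
    """
    normalized = {}
    for sample, counts in compendium.items():
        total = sum(counts.values())
        if total == 0:
            raise ValueError(f"sample {sample} has no counts; filter it first")
        normalized[sample] = {g: c * 1e6 / total for g, c in counts.items()}
    return normalized


if __name__ == "__main__":
    # Hypothetical sample and gene names with made-up counts.
    raw = {"sample_a": {"geneA": 10, "geneB": 30}}
    new = {
        "sample_b": {"geneA": 0, "geneB": 0},
        "sample_c": {"geneA": 1, "geneB": 3},
    }
    updated = normalize_cpm(filter_samples(add_samples(raw, new)))
    print(sorted(updated))
```

Filtering and normalization are re-run on the whole merged compendium rather than on the new samples alone, which mirrors the re-filter and re-normalize wording above.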

Support

Please submit an issue to the issue tracker.

Roadmap

This is a proof-of-principle project that could help set the stage for a versatile, scalable pipeline.

Authors

Georgia Doing (@georgiadoing), Taylor Reiter (@taylorreiter)

License

See the LICENSE file in this repository.

Status

Complete.

Citation

Please cite:

Computationally efficient assembly of a Pseudomonas aeruginosa gene expression compendium. Georgia Doing, Alexandra J. Lee, Samuel L. Neff, Jacob D. Holt, Bruce A. Stanton, Casey S. Greene, Deborah A. Hogan. bioRxiv 2022.01.24.477642; doi: https://doi.org/10.1101/2022.01.24.477642