Code Monkey home page Code Monkey logo

exrna_breast_cancer's Introduction

Cancer prediction with extracellular (exRNA) in serum

Requirements (TODO)

conda installation requirements.txt

Dataset

SILVER-seq on 128 serum samples from healthy and cancer (recurrent and non-recurrent) patients.

Linked publication: Zixu Zhou, et al., Extracellular RNA in a single droplet of human serum reflects physiologic and disease states.ย  PNAS, 2019, 116:19200โ€“19208.

Linked accessions: GEO - GSE131512 SRA - SRP198979 BioProject - PRJNA543872

Download raw fastq

  1. Get accession list (SRR_Acc_List.txt) and run table (SraRunTable.txt) from https://trace.ncbi.nlm.nih.gov/Traces/study/?acc=SRP198979&o=acc_s%3Aa
  2. Pull out non-index fastq accessions SraRunTable using process_metadata.ipynb (SRR_Acc_List_NonIndex.txt)
  3. Downloaded fastqs using download_data.sh
  • requires fastq-dump from sra-toolkit (conda install -c bioconda sra-tools)

Align fastq to GRCh38

  1. Download reference genome from Illumina iGenomes:
  • wget ftp://igenome:[email protected]/Homo_sapiens/NCBI/GRCh38/Homo_sapiens_NCBI_GRCh38.tar.gz
  1. Run bwa.sh to align to GRCh38 and pull out unmapped reads
  • Use [bwa mem] to align (conda install -c bioconda bwa)
  • Use [samtools] to pull out unmapped reads (conda install -c samtools)

Classify unmapped sequences

  1. Install kraken2 (conda install -c bioconda kraken2)
  2. Download MiniKraken2_v1_8GB database (no build required)
  • wget ftp://ftp.ccb.jhu.edu/pub/data/kraken2_dbs/old/minikraken2_v1_8GB_201904.tgz
  1. Run kraken2.sh to classify unmapped sequences

Analyze feature table using Qiime2

  1. Create qiime environment (https://docs.qiime2.org/2020.2/install/native/#install-miniconda)
  2. Step through qiime_analysis.ipynb to move data into qiime
  • Generate sample_metadata.tsv using process_metadata.ipynb

Sample classification

Qiime2

https://docs.qiime2.org/2019.10/plugins/available/sample-classifier/ https://docs.qiime2.org/2020.2/tutorials/sample-classifier/ Run qiime_analysis.ipynb to perform sample classification using Qiime plugin

TODO: Other classification techniques

Other

exrna_breast_cancer's People

Contributors

adamklie avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.