PICsnATAC

The goal of PICsnATAC is to accurately quantify snATAC-seq data. The two key components are:

1. Constructing a cell-by-peak matrix with Paired Insertion Counting (PIC).

Motivation:

There are two common counting approaches for snATAC-seq data: fragment-based counting and insertion-based counting. They generate inconsistent results (see figure below).

Here, we present PIC as a uniform counting choice for various assays, including 10X Genomics snATAC-seq, sci-ATAC-seq, SNARE-seq, dsc-ATAC-seq, s3-ATAC-seq, and scNANO-ATAC.
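To make the difference between the counting schemes concrete, here is a toy sketch in base R for a single peak. It is not the package's implementation. The coordinates are made up, intervals are treated as closed, and the PIC rule used here (a fragment contributes one count if at least one of its two insertions falls inside the peak) is our reading of the method, stated as an assumption.

```r
# Toy fragments on one chromosome. Each row is one fragment, and its two
# ends are treated as the two insertion sites (illustrative values only).
fragments <- data.frame(
  start = c(90, 120, 150, 50, 300, 110),
  end   = c(130, 180, 260, 400, 350, 190)
)

# A single hypothetical peak, treated as a closed interval
peak_start <- 100
peak_end <- 200

in_peak <- function(pos) pos >= peak_start & pos <= peak_end
start_in <- in_peak(fragments$start)
end_in <- in_peak(fragments$end)

# Fragment-based counting: every fragment that overlaps the peak
fragment_count <- sum(fragments$start <= peak_end & fragments$end >= peak_start)

# Insertion-based counting: every insertion site inside the peak
insertion_count <- sum(start_in) + sum(end_in)

# PIC (assumed rule): fragments with at least one insertion inside the peak
pic_count <- sum(start_in | end_in)

print(c(fragment = fragment_count, insertion = insertion_count, pic = pic_count))
```

On this toy input the three schemes give 5, 6, and 4. The fragment spanning the whole peak with both ends outside is counted only by the fragment-based scheme, and fragments with both ends inside the peak are counted twice by the insertion-based scheme.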

2. Identifying Differentially Accessible Regions (DARs) using the PIC matrix.

Motivation:

Two insertion events with correct primer configurations are required for one viable fragment. We derived the probability distribution of the fragment count as a function of the insertion rate and built a model-based insertion rate estimation framework on top of it. We further applied a likelihood ratio test to detect DARs between populations.

Building upon our model-based insertion rate estimation framework, this analysis provides a more sensitive test for DARs between cell groups (cell types).
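For readers unfamiliar with likelihood ratio tests, the sketch below shows the general shape of such a test for one peak in base R. It deliberately swaps in a plain Poisson model for the per-cell counts, which is an assumption made for illustration and is not the insertion-rate model used by PICsnATAC. The counts and the names `counts_a` and `counts_b` are hypothetical.

```r
# Hypothetical per-cell counts for one peak in two cell groups
counts_a <- c(0, 1, 0, 2, 1, 0)
counts_b <- c(3, 2, 4, 1, 3, 2)

# Poisson log-likelihood at a given rate
pois_loglik <- function(x, rate) sum(dpois(x, lambda = rate, log = TRUE))

# Null model: both groups share one rate (MLE is the pooled mean)
loglik_null <- pois_loglik(c(counts_a, counts_b), mean(c(counts_a, counts_b)))

# Alternative model: each group has its own rate (MLE is the group mean)
loglik_alt <- pois_loglik(counts_a, mean(counts_a)) +
  pois_loglik(counts_b, mean(counts_b))

# Likelihood ratio statistic, compared against chi-squared with 1 degree
# of freedom (the alternative has one extra free parameter)
lrt_stat <- 2 * (loglik_alt - loglik_null)
p_value <- pchisq(lrt_stat, df = 1, lower.tail = FALSE)

print(c(statistic = lrt_stat, p_value = p_value))
```

The same structure (fit a shared-parameter model, fit a per-group model, compare twice the log-likelihood difference to a chi-squared distribution) applies when the Poisson likelihood is replaced by a model derived from the insertion process.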

Dependencies

Please install the required packages by running the following code:

install.packages('data.table') ## (please make sure it is newer than 1.8)
if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")
BiocManager::install(c('IRanges', 'GenomicRanges','Rsamtools', 'GenomeInfoDb'))

Installation

You can install the most up-to-date version of PICsnATAC from GitHub with:

# install.packages("devtools")
devtools::install_github("Zhen-Miao/PICsnATAC")

PICsnATAC is a lightweight package. Installation typically takes less than 5 minutes.

Example

This basic example shows how to construct a PIC-based cell-by-peak matrix:

library(PICsnATAC)

pic_matrix <- PIC_counting(cells, fragment_tsv_gz_file_location, peak_sets)
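The call above needs three inputs that are not defined in this README. The sketch below shows one way they might be prepared. It assumes that `cells` is a character vector of cell barcodes and that `peak_sets` is a `GRanges` object; the barcodes, peak coordinates, and file path are all hypothetical placeholders. Please check the vignettes for the exact input requirements.

```r
library(GenomicRanges)  # provides GRanges(); also attaches IRanges

# Hypothetical cell barcodes; in practice, use the barcodes of the cells
# that passed your own quality control
cells <- c("AAACGAAAGACACTTC-1", "AAACGAAAGCATACCT-1")

# Hypothetical peaks, stored as a GRanges object (assumed input type)
peak_sets <- GRanges(
  seqnames = c("chr1", "chr1"),
  ranges = IRanges(start = c(10000, 20000), end = c(10500, 20500))
)

# Placeholder path to a fragments file; replace with your own file
fragment_tsv_gz_file_location <- "path/to/fragments.tsv.gz"

# Only run the counting step when the package and the file are available
if (requireNamespace("PICsnATAC", quietly = TRUE) &&
    file.exists(fragment_tsv_gz_file_location)) {
  pic_matrix <- PICsnATAC::PIC_counting(
    cells, fragment_tsv_gz_file_location, peak_sets
  )
}
```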

Vignettes

The vignettes can be found at our pkgdown page:

https://zhen-miao.github.io/PICsnATAC/

Interactive Notebooks

We provide two Google Colab notebooks so that users can run the code in the cloud (processed example data and links are provided). However, because snATAC-seq data are typically large and computationally intensive, we recommend running large datasets on a computing cluster.

Notebook 1: Run PIC counting on PBMC data.

Notebook 2: Run PIC DAR on SNARE-seq2 brain data.

Citation

If you use PICsnATAC in your research, please cite our paper:

Miao Z and Kim J. Uniform quantification of single-nucleus ATAC-seq data with Paired-Insertion Counting (PIC) and a model-based insertion rate estimator. Nature Methods 2023 (In press)
