brahmanangusassemblyscripts's Introduction

Brahman and Angus Assembly Scripts

This repository contains custom scripts to analyse scaffolds and validate the genome assemblies of Angus and Brahman cattle. The novel contig assembly approach that generated the haplotype-resolved contigs was published by Koren et al. (2018) and is not detailed here. The work described here starts from those contig assemblies and generates haplotype-resolved, chromosome-level scaffolds. The repository also contains scripts used for specific analyses of SNPs, indels, and various types of structural variants (SVs).

Table of contents

All scripts are given in the scripts directory. Scripts and datasets specific to each assembly stage are given or listed in the following directories of this repository:

  • scaffolding_with_optical_map_and_HiC
  • sex_chromosomes_assemblies_and_validation
  • comparison_of_gaps
  • QV_estimation
  • SV_analyses
  • selective_sweep
  • phasing_transcripts

Here is a brief description of the contents of each directory.

scaffolding with optical map and HiC

This folder contains information on how optical map based scaffolds and Hi-C based scaffolds were analysed together with recombination map markers to produce the final validated scaffolds.

sex chromosomes assemblies and validation

Here are the specific scripts and information used to put together the sex chromosomes, which were more challenging to assemble given their higher number of gaps. The assemblies used different sources of linkage and radiation hybrid (RH) markers to guide the order and orientation of the contigs that belong to the Brahman X and Angus Y chromosomes.

comparison of gaps

This folder has the raw datasets of gap lengths and ungapped contig lengths, as well as the R scripts used for the analysis.
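As an illustration of what gap lengths and ungapped contig lengths mean, here is a minimal Python sketch. It is not one of the repository's R scripts. It assumes that gaps are represented as runs of `N` in a scaffold sequence, and the function name `gap_and_contig_lengths` and the toy sequence are hypothetical.

```python
import re


def gap_and_contig_lengths(sequence):
    """Return (gap_lengths, contig_lengths) for one scaffold sequence.

    Assumption: a gap is any run of one or more N (or n) characters,
    and an ungapped contig is any stretch of sequence between gaps.
    """
    # Each run of N characters is one gap.
    gap_lengths = [len(m.group()) for m in re.finditer(r"[Nn]+", sequence)]
    # Splitting on the same runs leaves the ungapped contigs;
    # empty pieces (leading or trailing gaps) are dropped.
    contig_lengths = [len(p) for p in re.split(r"[Nn]+", sequence) if p]
    return gap_lengths, contig_lengths


# Toy scaffold, for illustration only.
gaps, contigs = gap_and_contig_lengths("ACGTNNNNACGTACNNGG")
print("gap lengths:", gaps)
print("ungapped contig lengths:", contigs)
```

Per-scaffold lists like these could then be written out as a table and summarised in R.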

QV estimation

Here are the results by Derek Bickhart on QV estimation of the Angus and Brahman assemblies. The folder contains notes on software requirements and how to run the QV estimation.
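For readers unfamiliar with QV, the short Python sketch below shows the standard Phred-scaled definition, QV = -10 log10(error rate). It does not reproduce the pipeline documented in the folder: how errors are counted there is not described here, so `num_errors` and `num_bases` are hypothetical inputs, as is the function name `phred_qv`.

```python
import math


def phred_qv(num_errors, num_bases):
    """Phred-scaled quality value for an assembly.

    Assumption: num_errors is an estimate of erroneous bases and
    num_bases is the number of bases that were assessed.
    """
    if num_bases <= 0:
        raise ValueError("num_bases must be positive")
    if num_errors < 0:
        raise ValueError("num_errors must not be negative")
    if num_errors == 0:
        # No observed errors: the QV is unbounded.
        return math.inf
    return -10 * math.log10(num_errors / num_bases)


# One error per 10,000 bases corresponds to a QV of about 40.
qv = phred_qv(1, 10_000)
print(round(qv, 2))
```

A higher QV means fewer estimated errors per base; each increase of 10 corresponds to a tenfold lower error rate.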

SV analyses

This folder details the comparison of SVs between the Brahman and Angus assemblies.

selective sweep

This folder details the selective sweep analysis, which was run on SNPs called with GATK and annotated with ANNOVAR.

phasing transcripts

The phasing of transcripts using isoseq3 and IsoPhase is described in this folder.

brahmanangusassemblyscripts's People

Contributors

lloydlow, njdbickhart, cynthialiu
