Code Monkey home page Code Monkey logo

davidson_et_al_2020_kac_prx's Introduction

Davidson_et_al_2020_Kac_PRX

Proteomic Data Analysis for the following publication:

Davidson, M.T., Grimsrud, P.A., Lai, L., Draper, J.A., Fisher-Wellman, K.H., Narowski, T.M., Abraham, D.M., Koves, T.R., Kelly, D.P., and Muoio, D.M. (2020). Extreme Acetylation of the Cardiac Mitochondrial Proteome Does Not Promote Heart Failure. Circulation Research. 2020;127:1094โ€“1108. 10.1161/CIRCRESAHA.120.317293 PMID: 32660330

Raw Data

The raw data for the proteomics experiments was submitted to the Proteome Xchange Consortium via the PRIDE partner Repository with the dataset identifier PXD013935

Directory Structure

The directory contains the analyses for 3 individual proteomic experiments. The structure of these folders is summarized below.

The 4th folder (Comparisons between Experiments) uses data processed from experiments 2 and 3 for additional analysis.

Structure for Experiments Folders

Exp#_..._Analysis.ipynb

A jupyter notebook containing the python code used to analyze the data produced from Proteome Discoverer 2.2 for the specific experiment

pd22_exports

Contains the tab-delimited data exported from Proteome Discoverer 2.2. One file contains the peptide information, the other contains protein information.

limma

Contains the R code used to analyze the data with the limma library, along with the input for and the output from this analysis. The limma analysis is an intermediate step in processes detailed/performed in the Exp#_..._Analysis.ipynb file.

processed_files

Contains the analysis results from Exp#_..._Analysis.ipynb in both Excel files and Comma Separated Value format. See below for more information

Information about files within processed_files

There are 2 Excel files within the directory:

  1. The File ending with plex contains the processed data for the acetylpeptides corrected for the "relative occupancy" (abb. "RO"), acetylpeptides only corrected for loading content, and the proteins

  2. The file ending with Top25 contains processed data derived from the data in file #1 that shows the top 25 most differentially abundant acetylpeptides (corrected for the relative occupancy) along with basic information about the acetylated residues involved and the protein information

All of the sheets within the Excel files are also stored in ".csv" formatted files.


Within the first Excel file (and corresponding .csv files) for each experiment, a basic nomenclature is utilized.

There are 2 sheets for the acetylpeptide data. One contains the "relative occupancy" data which is subsequently analyzed within the limma software. The other doesn't contain the "relative occupancy data" and only analyzes the load-normalized data. In the paper, for consistency all numbers and graphs utilize the "relative occupancy" data.

Nomenclature in the Acetylpeptide sheets:

  • columns that end in _Acetyl are the "raw" quantification values exported from Proteome Discoverer 2.2
  • columns that end in _norm are the quantification values corrected for differences in loading between the individual channels
  • columns that end in _norm_log2 are simply the log2 transformation of the load-normalized columns
  • columns that end in _norm_log2_ro have used the log2 transformation data of the protein (not peptide) to correct for any protein expression changes that may account for differences in the acetyl-peptide abundance. The ro stands for "relative occupancy". The relative occupancy of the post-translational modification is calculated by subtracting changes in the protein abundance from the acetyl-peptide abundance. For example, if a PTM site is found to be increasing by 2 fold, but the protein abundance is also increasing by 2 fold, then the relative occupancy would be 0.
  • The results from the limma analysis start after the last _ro column until the end of each table

The Proteins Sheet follows a similar nomenclature. The difference is that the columns ending in _Protein are the "raw" quantification values exported from Proteome Discoverer 2.2

In the limma analysis, columns ending in _significant indicate the statistical significance of the specified comparison after multiple hypothesis correction with the Benjamini-Hochberg method and an FDR of <0.05.

  • 0 indicates not significantly different
  • 1 indicates significantly different, increased in the listed comparison
  • -1 (negative one) indicates significantly different, decreased in the listed comparison

Abbreviations:

  • DKO Dual Knock-out. (Crat and Sirt3 ablation via cre-recombinase induced under the muscle creatine kinase promoter)
  • DFC Dual Flox control. (Floxed controls for the DKO animals)
  • S3KO Sirt3 Knock-out. (Sirt3 ablation via cre-recombinase induced under the muscle creatine kinase promoter)
  • S3FL Sirt3 Flox control. (Floxed controls for the S3KO animals)
  • TACMI Trans-aortic constriction + small apical myocardial infarction. (Surgical procedure to induce heart failure)
  • TAC Trans-aortic constriction (Surgical procedure to induce heart failure)
  • FC Fold Change
  • Log2FC Fold Change in the Log2 space
  • Kac Peptide containing an acetylated lysine residue (K=abbreviation for lysine)
  • RO relative occupancy

davidson_et_al_2020_kac_prx's People

Contributors

mtd91429 avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.