Code Monkey home page Code Monkey logo

c_striatum_gems's Introduction

Memote Report

License: MIT standard-GEM ZENODO DOI Frontiers DOI

C_striatum_GEMs

This repository contains all changes made to the strain specific models of Corynebacterium striatum.

Note: creating the memote report written to GEM_status.html will take around 10 minutes. You will need to pull afterwards for the updated file.

To access older versions of the model and the corresponding report browse the commits and choose the corresponding commit.

Please link specific additions to the respective issues to track the work you did (#1: model changes, #2: analysis / simulation, #3: novel additions).

More information on what to expect in the different folder is given in separate READMEs located in the folders.

Corynebacterium striatum

Corynebacterium striatum, a gram-positive and non-sporulating rod, has recently been discovered for its pathogenic properties. Even though it has been known since the early 20th century, C.striatum was often disregarded as a pathogen since it is part of the typical human skin microbiota1. Nevertheless, it was found that, especially in immunocompromised patients, C.striatum can be the source for diseases such as Chronic Obstructive Pulmonary Disease, also known as COPD or pneumonia2. Not only is C. striatum active within the respiratory tract, but it was also attributed to long-standing open wound infections3 and prolonged hospitalizations4.

How to cite

Famke Bäuerle, Gwendolyn O. Döbel, Laura Camus, Simon Heilbronner, and Andreas Dräger. Genome-scale metabolic models consistently predict in vitro characteristics of Corynebacterium striatum. Front. Bioinform., oct 2023. doi:10.3389/fbinf.2023.1214074.

Tricks

Revert only model files:

  1. search for commit hash in commit history
  2. git checkout [commit ID] -- path/to/file
  3. commit and push

Software

Name Version More Information
diamond 2.0.11 github wiki
CPLEX 20.1.0 product website
CarveMe 1.5.1 docs
libSBML 5.19.0 github
COBRApy 0.25.0 docs
refineGEMs 0.0.1 repository
MassChargeCuration 0.1 repository
ModelPolisher 2.0.1 repository

c_striatum_gems's People

Contributors

famosab avatar

Stargazers

 avatar

Watchers

 avatar

c_striatum_gems's Issues

[all models] How to build GEMs checklist

This checklist was developed by Alina Renz during her PhD thesis.

  • 1. Decide for an organism of interest. Denote name and strain.
    • Check whether the strain is available on the DSMZ for laboratory use.
  • 2. Database search: Search in different databases for already existing models of your organism.

Draft reconstruction

  • 3. Generate a first draft of your GEM with CarveMe. Write down CarveMe, Diamond and CPLEX version in a table.
  • 4. Initial Analysis of the model
    • Number of Reactions, Metabolites, Genes
    • SBML-Version
    • MEMOTE score
    • How many genes does the organism have?
    • How many genes code for hypothetical proteins?
    • How many genes are still missing in the model (excluding hypothetical)?

Add annotations

  • 5. Annotations: Add annotations to the model. Either extract them from the ‘notes’ field (you can use refineGEMs for this) or use ModelPolisher. When you use the ModelPolisher, keep in mind to re-check all bounds, the objective function, and the genes! Write down the ModelPolisher version.
    • Bounds
    • Objective function
    • Genes
  • 6. KEGG Gene annotations: If your organism also occurs in the KEGG database, add the KEGG Gene ID (Locus tag) as gene annotations
  • 7. SBO Terms: The Systems Biology Ontology (SBO) contains a set of controlled vocabularies that are commonly used in systems biology. Every item in our GEMs can be labeled with an SBO-Term, including all genes, metabolites, and reactions. SBO-terms can be added using libSBML. You can use refineGEMs for this, write down the used version.
    • Genes
    • Metabolites
    • Reactions
  • 8. Add ECO terms: The Evidence and Conclusion Ontology (ECO) terms provide information about the curator's confidence about a reaction's inclusion into the model. The use of ECO terms is advised over the use of confidence scores, as confidence scores are not uniquely defined in the literature. ECO terms may be added based on the genes' evidence in a reaction's gene-protein-reaction association (GPR). If no GPR is associated with a reaction, the reaction obtains the ECO term with the lowest confidence score. Based on the evidence level of the genes in the GPR, ECO terms are added to the reactions. A helpful resource is the UniProt database, where the protein existence column gives information about the evidence level of a particular gene (inferred from homology, evidence at protein level, etc.).
  • 9. Add KEGG Pathways: If your organism occurs in the KEGG database, extract the KEGG reaction ID from the annotations of your reactions and identify, in which KEGG pathways this reaction occurs. Add all KEGG pathways for a reaction then as annotations with the biological qualifier ‘OCCURS_IN’ to the respective reaction. You can use refineGEMs for this, write down the used version.

Corrections and Improvements

  • 10. Check the chemical formulas: CarveMe adds the chemical formula of a metabolite to the ‘notes’-field. Transfer the chemical formula from the ‘notes’-field to the species description using the libSBML fbc-package. If you used the --fbc2 option during the model generation with CarveMe, check, whether the chemical formulas were correctly transferred from the ‘notes’-field.
  • 11. Check the charges: Charges are missing in the description of the model’s metabolites. The BiGG database, however, can be used to identify the charge for a given metabolite.
    Keep in mind that sometimes, more than one charge is given in the database.
  • 12. Correct Mass and Charge Imbalances: Mass and charge imbalanced reactions can be reported by COBRApy and also memote returns a list of mass and charge imbalanced reactions in the report. Evaluate the mass and charge imbalanced reactions and try to fix them. You can use the MCC tool, write down the used version.
  • 13. Reduce Orphan and Dead-end metabolites: Get a list of dead-end and orphan metabolites and try to identify reactions that can be added to connect the metabolites further to the network.
  • 14. Check for energy-generating cycles (EGC): Models may contain thermodynamically infeasible energy-generating cycles. These models can produce energy without consuming nutrients. Fritzemeier et al. (DOI: 10.1371/journal.pcbi.1005494) suggested a pipeline to identify the 14 different energy metabolites ATP, CTP, GTP, UTP, ITP, NADH, NADPH, FAD, FADH, ubiquinol-8, menaquinol-8, 2-demethylmenaquinol 8, acetyl-CoA, and L-glutamate. Add a dissipation reaction for each metabolite, constrain all uptake rates to zero and subsequently maximize each dissipation reaction. If any optimization returns a result unequal zero, you have identified an energy-generating cycle that needs to be eliminated. You can use refineGEMs for this.
  • 15. Improve the biomass objective function (BOF): As CaveMe provides a universal biomass equation, you can further improve and specify the BOF of your organism. One possibility is to use the Python package BOFdat (DOI: 10.1371/journal.pcbi.1006971). The stoichiometric coefficients for (i) major macromolecules, (ii) inorganic ions and coenzymes, and (iii) the remaining species-specific metabolic biomass precursors are calculated and incorporated into the BOF. If genomics, transcriptomics, proteomics, lipidomics, or other experimental data is available, you can use BOFdat to refine your organism's BOF.

Model extension

  • 16. Add further reactions: If your organism also occurs in the KEGG database, search for genes that are so far not included in the model. Identify the reactions associated with those genes and add these reactions to the model. Keep track of the model properties: Number of Reactions, Metabolites, Genes and Memote Score.

Model validation

  • 17. Compare model predictions to experimental data: To validate your GEM, search the literature for experimental results from your organism, or even strain, of interest. These results could include additional growth media (e.g., M9 or LB), auxotrophies, gene essentialities, or other physiological properties. Simulate the conditions described in the laboratory experiment and compare the in silico and in vitro results.
    • Do your in silico results agree with the in vitro results?
    • If your results do not agree with the experimental results, identify where the discrepancy comes from. Check whether you
    • need to add reactions to the model
    • need to eliminate reactions from the model
    • need to change the directionality of reactions
    • have identified a knowledge gap
  • 18. Growth on SNM3: We are highly interested in organisms that grow in the human nose. In Tübingen, a medium that mimics the nasal environment was experimentally developed: the Synthetic Nasal Medium (SNM3) (https://doi.org/10.1371/journal.ppat.1003862).
    • Use the defined medium to simulate the growth of your organism in the human nose. To do so, change the uptake rate (lower bound) of the respective metabolite to 10.0 mmolgDW-1 h- 1. Oxygen should get an uptake rate of 20.0 mmolgDW-1 h-1. All other exchange reactions for metabolites that are not listed as components of the SNM3 should have an uptake rate of 0 mmolgDW-1 h-1. You can use refineGEMs for this.
    • If your organism does not grow on SNM3, identify which additional compounds are required to enable growth. One approach is to compare the originally active exchange reactions to the exchange reactions that are defined by the SNM3.
    • If you have identified metabolites that are required for growth but not defined in SNM3, do some literature research to figure out whether the specific metabolite can internally be produced in your organism. If this is the case, add the missing reactions, metabolites, and genes to the model and test the growth on SNM3 again.
  • 19. Behavior with and without oxygen: Check whether your organism is anerobic, aerobic or a facultative anaerobe. This should be reflected in the GEMs abilitz to grow with and without oxygen.

Model publication

  • 20. Make the model available: Make your model available in a modeling database to meet the FAIR data principle (findable, accessible, interoperable, and reusable). Before uploading the model.
    • check one last time whether your model is a valid SBML file.
    • Additionally, check whether your model complies with the minimum standardized content for a metabolic network reconstruction (DOI: 10.15252/msb.20199235, Box 2).
    • Write down the final model properties
    • Upload the valid SBML file to, e.g., BioModels. You can use a COMBINE archive and the OMEX file format (DOI: 10.1186/s12859-014-0369-z) to share all information of your model, including, e.g., media definitions. For BioModels, you can gain reviewers' access there. Then, the model is initially only visible with the corresponding log-in credentials. After publishing your model in, e.g., a scientific journal, don't forget to make your model publicly available to everyone.

Update FROG reports to later version of cobrapy

During the curation we found a small problem with the FROG reports that were submitted for Biomodels MODEL2304270001 to MODEL2304270005 (all 5 related to the Corynebacterium striatum model publication).

In essence, the reports were executed with an older CobraPy and Cameo (before we unified the format of 02_fva.tsv). The model seems reproducible but all comparison tools will give some errors unnecessarily, because the older Cobrapy version uses the "flux" column for the "flux through objective" instead for "flux through the reaction".

If you had a bit of time, could you please submit FROG reports generated with the latest Cobrapy version? This is probably not required to have the curation successful (everything is perfectly OK so the fluxes must be OK as well, and thus the model is fully reproducible), but it would be great to have this fixed eventually for completeness.

The new reports will be created with https://fbc-model-tests.lcsb.uni.lu/.

@GwennyGit You should be able to submit them afterwards.

CGXII comparison shows that some metabolites cannot be used because of missing exchanges

EX_ni2_e, EX_na1_e, EX_34dhbz_e and EX_btn_e are not present in any of the strains. More research into whether these metabolites should be consumed by the model is necessary since we add all of those to the medium mixture in the wetlab. While ni2, 34dhbz and btn can be removed in the wetlab, na1 is not removable since NaOH is being used to adjust the pH of the medium. This also relates to issue #5 in C_striatum_wetlab.

  • determine whether cobalt2, nmn, ala_B, pnto__R and cgly can be put into wetlab stock solutions or replaced by related metabolites
  • check for na1 uptake / transport reactions
  • research ni2 usage
  • research 34dhbz usage -> seems to be carbon source for C. glutamicum
  • research btn usage
  • compare fluxes of 'good' additives to other used metabolites for hints on concentration

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.