Code Monkey home page Code Monkey logo

cdno's Introduction

Build Status DOI

Compositional Dietary Nutrition Ontology

The CDNO provides structured terminologies to describe nutritional and other attributes of material entities that contribute to human diet. These terms are intended primarily to be associated with datasets that quantify concentration of dietary chemical components derived from samples taken from any stage in the production of food raw materials (including from crops, livestock, fisheries) and through processing and supply chains. Additional knowledge associated with these dietary sources may be represented by terms that describe functional, physical and other attributes.

Whilst recognising that dietary components within food substrates may be present as complex and dynamic physical and chemical structures or mixtures, CDNO focuses on components typically quantified in an analytical chemistry laboratory. The primary CDNO class ‘dietary chemical component’ contains hierarchical sets of terms organised to reflect commonly used classifications of chemical food composition. This class does not represent an exhaustive classification of chemical components, but focuses on structuring terms according to widely accepted categories. This class is independent of, but may be used in conjunction with, classes that describe ‘analytical methods’ for quantification, ‘nutritional functional attribute’, ‘dietary material physical attribute’ or ‘dietary role’. Quantification data may be used and reported in research literature, to inform food composition tables and labelling, or for supply chain quality assurance and control.

More specifically, terms within the ‘chemical component concentration’ class may be used to represent quantification of components described in the ‘dietary chemical component’ class. Concentration data are intended to be described in conjunction with post-composed metadata concepts, such as represented by FoodOn ‘Food product by organism’, which derives from some food or anatomical entity and a NCBI organismal classification ontology (NCBITaxon) entity. In addition, one or more CDNO analytical method term may be used to describe the process used to acquire concentration data, where these terms may be inherited from the Ontology for Biomedical Investigations (OBI), the Chemical Methods ontology (CHMO) or the Physico-chemical Methods and Properties ontology (FIX). In combination these formalise the language used to describe sample source and status in the crop production/food supply chain.

Additional knowledge relating to interactions of food components with human physiological function is embodied within the CNDO dietary function class, where a nutritional component may be associated with one or more functional categories.

The common vocabulary and relationships defined within CDNO should facilitate description, communication and exchange of material entity-derived nutritional composition datasets typically generated by analytical laboratories. The organisation of the vocabulary is structured to reflect common categories variously used by those involved in crop, livestock or other organismal production, associated R&D and breeding, as well as the food processing and supply sector, and nutritionists, inlcuding compilers and users of food composition databases. The CDNO therefore supports characterisation of genetic diversity and management of biodiversity collections, as well as sharing of knowledge relating to dietary composition between a wider set of researchers, breeders, farmers, processors and other stakeholders. Further development of the functional class should also assist in understanding how interactions between organismal genetic and environmental variation contribute to human diet and health in the farm to fork continuum.

A more descriptive guide for the Crop and now Compositional Dietary Nutrition Ontology (CDNO) was published in Crop Science in an article by Andres-Hernandez et al. 2020 doi:10.1002/csc2.20092.

More information can be found at http://obofoundry.org/ontology/cdno

Versions

Stable release versions

The latest version of the ontology can always be found at:

http://purl.obolibrary.org/obo/cdno.owl

(note this will not show up until the request has been approved by obofoundry.org)

Editors' version

Editors of this ontology should use the edit version, src/ontology/cdno-edit.owl

Citing the CDNO

Please use this GitHub repository's Issue tracker to request new terms/classes or report errors or specific concerns related to the ontology.

If you use the ontology, please cite its IRI: http://obofoundry.org/ontology/cdno

If you mention CDNO in a paper, please cite one of the following: Andrés‐Hernández, L., Baten, A., Azman Halimi, R., Walls, R., & King, G. J. (2020). Knowledge representation and data sharing to unlock crop variation for nutritional food security. Crop Science, 60(2), 516-529. doi.org/10.1002/csc2.20092

Contributing to CDNO

Please use the issue tracker for comments, requests for new terms, and suggesting changes to existing terms.

cdno

Contact

CDNO curator Liliana Andres Hernandez Email: [email protected] Linkedin: https://www.linkedin.com/in/liliana-andres/

cdno's People

Contributors

kaiiam avatar lilyandres avatar matentzn avatar nutrientinstitute avatar

Stargazers

 avatar Ege Güneş avatar  avatar Chris Cargile avatar  avatar Charles Tapley Hoyt avatar  avatar  avatar Michael Schmitt avatar Graham King avatar Lucia Segovia de la Revilla avatar

Watchers

James Cloos avatar Ramona Walls avatar Michael Schmitt avatar

cdno's Issues

concentration of 5-formyltetrahydrofolic acid in plant structure

This concentration points to CHEBI:15640 - 5-formyltetrahydrofolic acid, but it seems like the hasDbXrefs are coming in at too broad a level. Can the INFOODs and USDA_NDB tags be associated with concentrations at a more granular level? Is it really the case that multiple ones are correct for 5-formyltetrahydrofolic acid? Ideally INFOODs and USDA_NDB are only 1-1 DB Xrefs.

image

Remove material entity direct subclasses

Following form the work in #52 we now have the issue that some classes are asserted as subclass to material entity when we only want them to show up in cdno-edit.owl as subclass to the appropriate terms from the dietary nutritional component hierarchy.

image

We'll need to remove the subclass axioms that are subclass to material entitiy, for example

image

NTR: soluble dietary fibre

For new term requests, please provide the following information:

Preferred term label

soluble dietary fibre

Textual definition

Water- soluble/well fermented fibres

Anita, FP. 1973.

Suggested parent term

dietary fibre

NTR: resistant starch

Preferred term label

resistant starch

Textual definition

Proceedings for the 2nd plenary meeting of EURESTA: European FLAIR Concerted Action

The total amount of starch, and the products of starch degradation that resists digestion in the small intestine of healthy people. The resistance of starch to digestion is influenced by the nature of the association between starch polymers, with higher amylose levels in the starch being associated with slower digestibility rates

Suggested parent term

soluble dietary fibre

Create new release

Following up on some discussion with @LilyAndres we have an opportunity for some potential interest and maybe adoption of CDNO by some of our collaborators. In our next release we'll need to make sure that we clarify that we've changed the axioms on original set of terms away form plant and to material to be more general. Normally this would merit deprecation and replacement, however, since our initial release 2021-02-08 was experimental and we did not yet have a user-base, we opted to simply change the axioms within the chemical concentration's while keeping the IDs the same. In our next release we'll need to make sure that's clear.

Measures of concentrations of chemicals in animals

CDNO is currently focused on 'concentration of' chemicals in 'plant structure'. The FoodOn ontology curation team is looking at concentrations of chemicals in animal and plant materials - including mercury in fish for example. One move would be to generalize all the CDNO concentration expressions to apply to 'organism' instead of just 'plant structure'? What are your thoughts on that?

Assuming CDNO could take on more general organism related concentrations, we'd then be in a position to contribute them. For example, there is a variation on your mercury concentration (probably not a subclass though) which is concentration of 'methylmercury compound' in organism, which I guess is the form commonly quantified in food.

We have a google sheet where we are working on the precomposed terms and patterns for high concentration of X in Y, and as well food classes having high (or low) concentrations.

Cheers,

Damion

Add mappings robot template.

In order to map the CDNO chemical concentration terms to existing labels or other information from the consortia such as the USDA or INFOODs, we'll need a new mapping robot template.

NTR: plant secondary metabolite

Preferred term label

plant secondary metabolite

Textual definition

A group of naturally occurring compound classes biosynthesized by differing biochemical pathways whose plant content and regulation is strongly susceptible to environmental influences and to potential herbal predators.

Pavarini D. et al., 2012

Suggested parent term

Parent CDNO 'dietary nutritional component'

NTR: Dry matter

Preferred term label

Dry matter

Textual definition

Difference between the total weight and the moisture content.

doi.org/10.1046/j.0028-646X.2001.00320.x

Suggested parent term

Parent CDNO 'dietary nutritional component'

We'll make it be subclass to 'plant matter' [ENVO:01001121]. With an axiom linking to 'dry' [PATO:0001824]

Reason cdno.owl

I was expecting that when we ran make prepare release that it would call a robot reasoning routine on the cdno.owl file and have a clean concentration hierarchy, akin to running elk locally in the ontology. However, it did not. looking more into it other ontologies such as ENVO have the same workflow and don't seem to have ELK run to make a nicer hierarchy. There is probably a way to fix this using robot's reason command.

example of the concentration hierarchy when reasoned:

image

Vs the un-reasoned version we currently have in cdno.owl (after running make prepare release)

image

Creation of design pattern for harvested materials

As per ongoing discussions with @CropStoreDb @ramonawalls and @LilyAndres we will need to create terms for harvested materials which are the cross of NCBITaxon ids and PO plant parts. @LilyAndres has created a spreadsheet of relevant crops and their anatomical parts relevant to use-cases within the scope of CDNO here: CDNO_PO_harvested_materials. A preview of the sheet is as follows:

food product organismal source (food source) relationship taxon_id cultivated_species cultivated_source common name of harvested_product PO term harvested food material synonyms
crop_plant   NCBI:txid3750 Malus domestica Borkh apple fruit PO:0030110 pome fruit  

As we formulate an appropriate DOSDP or Robot Template, we'll want to capture the derives from relationship (or similar) between the PO and NCBITaxon terms respectively. These terms are intended to fill the part of some Y in the CDNO concentration of terms, of which the nutritional framework terms will inhere in.

concentration of and inheres in some (
    'Nutritional Framework Term' and
    part of some `Harvested Material Term`)

Add NCF conversion script

I wrote a python script to convert the NCF into the working Robot templates. I'll add that work to CDNO in a util directory in cdno/src/ontology.

NTR: free sugar

We are going to create some nutritional framework terms in CDNO, the term is "free sugar" which does not exist in any other OBO ontology as far as I am aware

Preferred term label

free sugar

Textual definition

Based on Razlin's definition from Nutrition schema_framework_002:

Mono-, di-, or oligosaccharides that naturally occur freely, not bound or part of larger polysaccharides

Suggested parent term

digestible carbohydrate

NTR: concentration of methylmercury compound in material entity

For new term requests, please provide the following information:

Preferred term label

concentration of methylmercury compound in material entity

Synonyms

Textual definition

The concentration of methylmercury compound (http://purl.obolibrary.org/obo/CHEBI_25322) in some material entity.

Suggested parent term

concentration of trace element in material entity (http://purl.obolibrary.org/obo/CDNO_0200146)

(A FoodON nutritionist say this is the group of mercury compounds measured for mercury content in foodstuffs.
I think this is probably not a subclass of "concentration of mercury in plant structure". )

Attribution

https://orcid.org/0000-0002-8844-9165

Add LICENSE

We'll need to choose and add an appropriate license.

Definition of vitamins and vitamers in CHEBI and CDNO

This issue has been created to discuss the relationships of vitamins and their vitamers in CDNO and CHEBI.

1. We have ask CHEBI team, specially @amalik01 for clarification about the relationships described in CHEBI. One issue raised is the different relationships assigned to fat and water soluble vitamins. For example:

  • The ‘B vitamin’ [CHEBI:75769] and ‘Vitamin C’ [CHEBI:21241] have the ‘is_a’ relationship with ‘water-soluble vitamin’ [CHEBI:27314].

  • In contrast ‘Vitamin A’ [CHEBI:12777], ‘Vitamin D’ [CHEBI:27300], ‘vitamin E’ [CHEBI:33234] and ‘Vitamin K’ [CHEBI:28384] have the ‘has_role’ relationship with ‘fat-soluble vitamin’ [CHEBI:24020].

The term vitamer is not defined as such in CHEBI. However, some vitamer molecules have been assigned to specific vitamins with:

  • has_role’ = eg some vitamers from ‘B vitamin’ [CHEBI:75769] and ‘vitamin C’ [CHEBI:21241]

and the rest with

  • is_a

See the following link:

The proposal from CHEBI is to

  • assign 'is a' vitamin to both fat and water soluble vitamin

  • assign the 'has role' relationship to all vitamers (in their case vitamin) molecules.

2. Within CDNO we propose to refine/extend the relationships, re-using where possible the CHEBI definitions.
Thus: (reading the relationships right to left)

image

3. We propose establishing definitions/relationships for vitamin roles within the ‘dietary_function’ class of CDNO.

A separate thread/issue will be generated to discuss this class.

What do you think?

Definition of labels and reuse of CHEBI terms in the CDNO mineral nutrient class

This issue has been created to discuss the labels and reuse of CHEBI terms in the CDNO mineral nutrient class.

Having checked some CHEBI labels that were incorrectly re-labeled in CDNO (see issue #64), e.g. re-labeled as 'calcium', we identified some terms where the representation of mineral nutrients could be improved in the dietary nutrition domain:

1. We propose the reuse of other CHEBI terms. For example the ionic form (atom with more or less electrons) of an element is often measured when quantifying e.g. the 'concentration of calcium' in milk, typically using a method (eg ICP-MS) that estimates the concentration of calcium (2+) ions. However, the quantity being calculated is typically expressed as atomic Ca, even though it may be present in multiple bound, ionised or other molecular forms (eg salts). In some situations, there may be an assay that allows the estimation of concentration (via stoichiometry) of a specific salt.

Thus, within the 'dietary nutritional component' and 'dietary nutritional component concentration' classes, we would include eg. 'calcium(2+)' [CHEBI:29108] in addition to 'calcium atom' [CHEBI:22984].

2. To ensure there is vocabulary more tailored to dietary nutrition, we propose the adoption of the terms ‘dietary calcium’, ‘dietary magnesium’ etc, and under these have options for hierarchical sets of terms including: ‘calcium atom’, ‘calcium ion’, ‘calcium salt’.

We welcome feedback to know whether this provides a way forward to help nutritionists navigate to appropriate ChEBI terms and distinguish in what form an element may be present, quantified/estimated or reported depending on assay.

3. The hierarchical classification would look something like this, using 'calcium' as an example:
image

CDNO dietary role class

Preferred term label

dietary role

Textual definition

A biological role that may be assigned to a dietary nutritional component based on evidence that supports a function defined at the levels of molecular interaction, cellular process or physiological role.

Suggested parent term

'role' [BFO:0000023]

Attribution

https://orcid.org/0000-0002-5975-6051

NTR: Water soluble protein

We are going to create some nutritional framework terms in CDNO the term is "Water soluble protein" which does not exist in any other OBO ontology as far as I am aware

Preferred term label

Water soluble protein

Textual definition

Based on Razlin's definition from Nutrition schema_framework_002:

Major category of storage proteins that are soluble in proteins.

Suggested parent term

CHEBI:36080 - protein

NTR: Free sugar

Preferred term label

Free sugar

Textual definition

The World Health Organization defines free sugars as:

All monosaccharides and disaccharides which are part of foods or food additives.

Suggested parent term

Available carbohydrate in CDNO

Suggestion for classification of the compositional constituants

As the coverage of CDNO is now "Compositional Dietary Nutrition Ontology" I would suggest to clarify the current hierarchy which is "dietary nutritional component"

Actually, they are a mismatch between compositional constituents and "real" nutritional components.

I mean we can enlarge the coverage of this branch by renaming it "compositional constituent" and then distinguish between nutrient subclasses and other subclasses (see my suggestion below)

This is the direction I am taking in my own work. I would be happy if we could agree on this.

composition-branch

NTR: Non digestible carbohydrate

We are going to create some nutritional framework terms in CDNO the first term is "Non digestible carbohydrate" which does not exist in any other OBO ontology as far as I am aware

Preferred term label

Non digestible carbohydrate

Textual definition

Based on Razlin's definition from Nutrition schema_framework_002:

Carbohydrate components which are not easily digestible, and form major part of dietary fibre (TDF)

Suggested parent term

CHEBI:16646 - carbohydrate

NTR: Digestible carbohydrate

We are going to create some nutritional framework terms in CDNO the first term is "Digestible carbohydrate" which does not exist in any other OBO ontology as far as I am aware

Preferred term label

Digestible carbohydrate

Textual definition

Based on Razlin's definition from Nutrition schema_framework_002:

Also known as available carbohydrate corresponds to fractions of total carbohydrate that is digestible to provide energy.

Suggested parent term

CHEBI:16646 - carbohydrate

Mapping mismatch.

@LilyAndres noticed the following terms are miss-matched between the modules/mappings.owl and modules/chemical_concentration.owl files.

CDNO:0200181	5-formyltetrahydrofolic acid
CDNO:0200182	vitamin B

It should be an easy fix to just switch them in the templates/mappings.csv to be in the same order as they are in the templates/chemical_concentration.csv file.

Maybe @LilyAndres can double check to see if there are any others that got switched. I suspect this happened because there was some confusion about the vit b term, but so maybe its the only one but if were not sure it's good to check.

Fix grammer in definitions of concentration terms

Note about the definitions in the concentration terms, the grammer is a little off it's

image

The concentration of X when measured in plant structure.

It should be something like a plant structure. or some plant structure. @CropStoreDb and @LilyAndres any preferences? I think some works. I'll add this to the DOSDP file before we run all the new terms.

@LilyAndres what's the plan are you going to run the whole workflow soon? If so I we should quickly address this.

NTR: seed storage protein

Preferred term label

seed storage protein

Textual definition

A protein that accumulates in the developing seed with the intent to act as a storage reserve for nitrogen, carbon, and sulfur.

Suggested parent term

protein CHEBI:36080

Some confusion in FoodOn regarding CHEBI vitamins

I am in the process of deprecating FoodOn nutrient terms in favour of the CDNO ones. On the vitamin branch I've managed to eliminate FoodOn duplicates for the most part, but there are some vitamin B & C underlings that I'm not sure about. See the bolded FoodOn vitamin branch (which uses NCIT Vitamin term http://purl.obolibrary.org/obo/NCIT_C944) which will be obsoleted soon:

image

I wasn't sure if the underlings given on the FoodOn side should be added to the CHEBI side? Or are they erroneous?

NTR: Salt soluble protein

We are going to create some nutritional framework terms in CDNO the term is "Salt soluble protein" which does not exist in any other OBO ontology as far as I am aware

Preferred term label

Salt soluble protein

Textual definition

Based on Razlin's definition from Nutrition schema_framework_002:

A family of globular proteins that have higher molecular weights than albumin, and makeup the majority of legume seed storage protein. Examples of globulin: legumin and vicillin (soybean). These proteins can cause allergic reactions if they bind with human IgE antibodies.

Suggested parent term

CHEBI:36080 - protein

INFOOD & USDA Mapping annotation property

@LilyAndres @CropStoreDb @ddooley @ramonawalls

Regarding the CDNO mappings, I realize it might not quite show up how you’d want/expect. In the robot template it makes sense as USDA and INFOODS are in the column headers.

image

However, those won’t get compiled so what will end up happening is we’ll get

image

Perhaps this is desired and we can stop there, however, if not I propose two possible solutions:

  1. Change the annotation property from http://www.geneontology.org/formats/oboInOwl#hasDbXref to a newly invented one in CDNO perhaps something like http://www.geneontology.org/formats/oboInOwl#USDADbXref

  2. Add something like USDA_ to the beginning of all the rows in the mappings csv and keep the hasDbXref annotation property.

image

That way it'll show up looking something like this:

image

I'd advocate for solution 2 but would like to hear everyone's thoughts/ suggestions.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.