Code Monkey home page Code Monkey logo

Comments (3)

jsevereyn avatar jsevereyn commented on September 26, 2024 2

I figured, but maybe this could inspire others to make a substantial contribution by improving those needed ones

Thanks for the reply @meren

from anvio.org.

jsevereyn avatar jsevereyn commented on September 26, 2024

I added some (probably super basic) concepts, which are the ones I get asked the most. Im giving some basic lessons of genomics for geology students (they had very little background in biological sciences).

+### Indel
+
+Stands for insertion - deletion, of some bases in a DNA sequence. Those are considered short polymorphisms and the differ from point mutations, as the latter are substitutions.
+
+### Protein Structure
+
+Is the three-dimensional arrangement, shaping and folding of atoms in an amino acid-chain molecule (at different levels) to form a functional protein, which can be a monomeric and aggregated into homo- or hetero-polymers.
+
+### Shotgun sequencing
+
+Is the sequencing of the whole nucleotide sequences present in given sample, after being randomly fragmented into short pieces and ligated into known fragments during the sequencing library construction. Its called Shotgun from the concept that a large sequence is essentially broken up in to many, many smaller pieces, similar to how a shotgun shell breaks apart when fired.
+
+### Amplicon Sequencing
+
+(AKA: 16S, ITS, metabarcoding, metataxonomic). Is the sequencing of PCR targeted fragments of interest. It is based on the use of specific primers (degenerated in general) of some marker of interest, commonly resolutive for taxonomic classification (as it should have a variable region flanked by conserved ones).
+
+### Sequence Alignment
+
+Is a technique used to identify (at quantifiable levels) regions of similarity between sequences, which could be nucleotidic (DNA, RNA) or aminoacidic (Proteins). This simmilarities may indicate functional, structural and/or evolutionary relationships between those biological sequences. Aligned sequences are commonly represented as rows within a matrix. Sequence alignments can be used to calculate distances c
+
+### Mapping
+
+Read mapping is the process to align the reads on a reference genomes and/or sequences. Mapper tools takes reference as input and a set of reads to align one by one, allowing some degree of mismatches, indels and clipping of some short fragments on one or the two ends of the reads. This technique maps the positions of reads that are easily recognisable and only occur once in the reference.
+
+### COGs
+
+The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on phylogenetic classification of the proteins encoded in given set of data to help simplify functional categorization using controlled vocabulary. Each COGs includes proteins that are inferred to be orthologs (direct evolutionary counterparts), maximising their usefulness for functional and evolutionary studies. 
+
+### Annotation
+
+Is the process of identifying functional elements along genome sequences, thus giving meaning to it by by the identification of known elements or by comparison with databases through different analysis, comparison, estimation, precision, and other mining techniques deriving the structural and functional information of a protein or gene. This is a essential step as DNA sequencing generates sequence information without its functional role. 

from anvio.org.

meren avatar meren commented on September 26, 2024

Dear @jsevereyn, thank you for these suggestions. Probably because they're coming from teaching material, some of them are partially overlapping with existing terms in the dictionary (i.e., mapping or shotgun sequencing), and others are very much needed, but needs more 'encyclopedia' like descriptions (i.e., indels). So these are not yet quite at a level of copy-paste convenience :)

from anvio.org.

Related Issues (6)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.