Code Monkey home page Code Monkey logo

gramene's People

Contributors

gingi avatar kyclark avatar

Watchers

 avatar  avatar  avatar

Forkers

gingi narciso64

gramene's Issues

Rice multi-gene track caption is misleading

In summary, the track is made up of multiple gene sets and the name of the track only shows one of the gene sets. I initially proposing deleting the obsolete gene sets, but eg proposed keeping them, only make them non-displayable by setting the displayable flag in analysis_description off. This was clever, as late on Pankaj arguing keeping the old low quality genes. Finally I found we can separate these gene sets to different tracks by changing the key in their analysis_description web_data field.

In this hash data structure, the 'key' => ''ensembl' dictates which analysis goes to the same gene track on the browser. Various analyses sharing the same key will be put in the same track.

The 'caption' => 'IRGSP gene': dictate the track name on left side of the track.

The 'name' => 'IRGSP Genes': dictate the configuration pop up window for you to select which gene track to load to this page.

{'multi_name' => 'IRGSP Genes','colour_key' => '[biotype]','caption' => 'IRGSP gene','name' => 'IRGSP Genes','label_key' => '[biotype]','default' => {'MultiTop' => 'gene_label','contigviewbottom' => 'transcript_label','MultiBottom' => 'collapsed_label','contigviewtop' => 'gene_label','cytoview' => 'gene_label','alignsliceviewbottom' => 'as_collapsed_label'},'key' => 'ensembl'}

For core database:

update analysis_description set web_data= replace(web_data, 'ensembl', 'ena') where analysis_id=355;

update analysis_description set web_data= replace(web_data, 'ensembl', 'ena_ncrna') where analysis_id=358;

update analysis_description set web_data= replace(web_data, 'ensembl', 'rfam') where analysis_id=467;

update analysis_description set web_data= replace(web_data, 'ensembl', 'trnascan') where analysis_id=468;

update analysis_description set web_data=replace(web_data, '\'name\' => \'Genes\'', '\'name\' => \'ncRNA Genes\'') where analysis_id=358;
update analysis_description set web_data=replace(web_data, '\'name\' => \'Genes\'', '\'name\' => \'Rfam RNA Genes\'') where analysis_id=467
update analysis_description set web_data=replace(web_data, '\'name\' => \'Genes\'', '\'name\' => \'tRNA Genes\'') where analysis_id=468

For otherfeatures:

update analysis_description set web_data= replace(web_data, 'ensembl', 'rappred') where analysis_id=3;

update analysis_description set web_data= replace(web_data, 'ensembl', 'rapnc') where analysis_id=4;

This is the email to EG
For rice Japonica, the issue was the track name rotates
among the three analysis Caption names. Sometimes it is 'MSU gene',
some times it is 'IRGSP gene prediction', sometimes it is 'IRGSP
nonCoding gene prediction’. See the following screenshot

I examined the three analyses, they all come from otherfeatures
database. And except for MSU gene, the other two are obsolete 2017
gene predictions from RAP-DB. I suggest we drop these two gene sets
for good.

*** oryza_sativa_otherfeatures_61_96_7

  1. msu_gene: 55,799 protein coding genes (keep)
  2. irgspv1.0-20170804-predicted-genes: 8,115 ab-initio predicted
    genes (delete, analysis_id=3)
  3. irgspv1.0-20170804-noncoding-genes: 2,387 predicted noncoding
    genes (delete, analysis_id=4)

In the core database, we also have 4 gene sets.
The RAP-DB genes was just updated in this release from RAP-DB 2018-11
release. The other three are nonCoding genes, except for tRNA gene,
the other two RNA gene sets were also said to be not very reliable,
can be dropped too.

** oryza_sativa_core_61_96_7

  1. gff3_genes: 37,849 protein coding genes from RAP-DB 2018-11
    release. (analysis_id=531)
  2. trnascan_gene: 191 tRNAscan genes (analysis_id=468)
  3. rfam_12.2_gene: 758 Rfam RNA genes (delete, analysis_id=467)
  4. ena_rna: 68 ncRNA gene from ENA (delete, analysis_id=358)

In summary, I am proposing deleting the following gene sets:

oryza_sativa_otherfeatures:

  • analysis_id=3, irgspv1.0-20170804-predicted-genes
  • analysis_id=4, irgspv1.0-20170804-noncoding-genes

oryza_sativa_core:

  • analysis_id=467, rfam_12.2_gene
  • analysis_id=358, ena_rna:

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.