Comments (7)
I think a vdb_date_added
field would definitely be helpful and rather easy to implement. This would help in trying to find viruses that were all uploaded at the same time. I'm not sure about thegenbank_date_added
field, most viruses should have a collection date associated with them which is what we would end up using. Although in the case of KU740184, where collection date is just 2015, could we use genbank_date_added
as an estimate of collection date?
from fauna.
In the case of KU740184, we can see that it was uploaded to Genbank on 22-FEB-2016
. This can be easily retrieved, but I'm not sure if it's useful. The upload date to Genbank is obviously different from the collection date. I think collection date is the primary date field of interested and can stay as date
.
from fauna.
Knowing when things were added to the database and to GenBank are useful for troubleshooting later on. Have you thought about adding sequence versioning too?
from fauna.
I don't know if sequence versioning is a bridge too far or completely appropriate.
from fauna.
You're thinking constant data curation so that the database is up to speed at all times?
from fauna.
Looks like some nice progress in c58c137. Should this issue be closed?
from fauna.
Yes I think so
from fauna.
Related Issues (20)
- Geographic error? HOT 2
- Switch out `xlrd` HOT 1
- fauna downloads fail with Python 3.10
- PhantomJS not found on PATH - installation via npm install HOT 2
- Set `serum_id` to `lot_number` for CDC titer imports HOT 4
- feat: BV-BRC support HOT 1
- serum_passage_category should be set to "egg" instead of "cell" for CDC human pool data like "L21/22 H3-EGG HUMAN POOL" HOT 7
- Assign correct host to titers from non-ferret hosts (e.g., human and mouse)
- Fix dengue author info
- What should the environment variables RETHINK_HOST and RETHINK_AUTH_KEY be set to? HOT 1
- Higher resolution sampling date available for Zika strain HN16 HOT 1
- implement caching for geo lookups HOT 1
- chateau submodule error HOT 2
- Suggest using direct clinical sample sequence for MEX_CIENI551 Zika genome
- Annotate titer TSVs with source and passage
- fauna uploads fail in python 3 unicode error HOT 1
- argument parser in upload.py HOT 3
- Migrate to pandas 0.17 HOT 6
- Fauna installation fails for some users who don't run `npm install` inside of `/chateau` HOT 3
- fauna doesn't work with rethinkdb 2.4 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fauna.