poseidon-framework / poseidon-framework.github.io Goto Github PK
View Code? Open in Web Editor NEWMain landing page for getting information about the Poseidon project
Home Page: https://www.poseidon-adna.org
Main landing page for getting information about the Poseidon project
Home Page: https://www.poseidon-adna.org
Now that we have a complete workflow using trident
, we can aim at having a nice Jupyter/Binder tutorial, complete from downloading Poseidon packages to exploring and analysing them.
For Bash, @nevrome suggested this workflow:
trident list --remote --packages
trident fetch -d . -f "*2020_Immel_Moldova*,*2020_Wang_subSaharanAfrica*"
trident list -d . --packages
trident summarise -d .
trident survey -d .
trident validate -d .
trident forge -d . -f "<HYR002>,<Gordinesti>" -o test -n Testpackage
We could go further than that and even run a quick PCA and plotting in R (using a separate R-backend notebook)
For binder, we'd need to define an environment that includes trident
as well as smartPCA
. The latter is available via a conda package, so that's easy. Trident isn't on conda yet.
We should add a table that shows clearly which versions of the Poseidon schema are compatible with which trident version.
We could add a details page for the Poseidon.yml file, which could also explain the versioning rules.
We could possibly then also link to that page from within the trident docs.
See also poseidon-framework/poseidon-schema#69
The current public repo list is neither especially useful nor pleasant on the eyes. We should redesign that.
We should add a page to document the changes from one Poseidon version to the next and how one could update a package accordingly.
We would like to be able to use per-package URLs to refer to specific package views in the Archive Explorer. Currently, all per-package views are under the same URL https://www.poseidon-adna.org/#/archive_explorer.
We need to statically serve the file terms.rdf.json
(which gets automatically copied here from Poseidon-schema (see Issue #3 in that repo). I think just https://poseidon-framework.github.io/terms.rdf.json
as URL would be good.
We can set up the redirect from w3id.org/poseidon
(yet to be registered)
There will be more files to come. The schema files need to be also copied and statically served, so perhaps we should have a sub-dir called static_schema_files
or something prettier?
Consequence of: poseidon-framework/poseidon-schema#35
snpSet
field.snpSet
will be handled by trident forge
and trident init
.Genetic_Source_Accession_IDs
.Data_Preparation_Pipeline_URL
.Publication_Status
to reflect that it now allows multiple values.We decided today to bundle the existing Getting Started and Tutorials into one place called "Tutorials". Clemens will make a first step, and Stephan can add more sessions (for example the Comp-Book F-Stats session).
Maybe linking to this blogpost with the excellent figure can help to make the dating columns more clear.
Currently, in standard.md email addresses in domains institute.org
, etc are used. There's a couple of special domains, designed explicitly for this purpose. That's not too much of an issue here, but one may still inadvertently expose actual people to getting spammed.
This is only relevant from a developers perspective, but maybe the homepage is still the right place for it.
I imagine a table or a graph that documents which of our software tools potentially have to be updated if certain parts of the Poseidon schema are modified. For example: A change to the .ssf file specification is very important for trident and poseidon-eager, but not for the janno R package. It's not trivial to think this through, but I think it will help us to estimate the cost of certain operations and not forget about critical updates to our infrastructure.
The website should acknowledge the contributions of multiple people:
I think it would be great to have some simple "Getting started" page, which guides you through
I just published the new archive explorer. Here are some ToDos for the future, ideally for @93Boy. Generally each of them deserves an own Pull Request, but maybe the map-related ones can be solved in one go.
Currently possible changes for the map:
Currently possible changes beyond the map:
Changes that will be possible in the future with a more beefed-up version of the Web API:
@stschiff Do you agree with this? Feel free to add or change any of these suggestions.
We decided to provide PDF versions of the documentation for all major software tools, and provide them - including old versions - for download on the website. This would then also make the tabs redundant. Ideally we will have an automatic action that creates new PDFs if documentation gets updated. This concerns existing version-documentations. For new versions, we will still have to make new PDFs and add the old ones to the link list manually.
Hi!
I just used the forge
command as in the documentation with -f *package1*, *package2*
and got this error
option -f: (line 1, column 27):
unexpected end of input
expecting white space, "*" or "<"
So I think the documentation needs to be adjusted to reflect that you cannot have spaces between the -f
inputs. Thanks!
I was recently approached by @DivyaratanPopli on how to prepare a simple Western Eurasian PCA for the projection of ancient samples with the tools and data provided by and for Poseidon. This is a very common application and I suggest we add a tutorial for that on the website.
Maybe we elevate such tutorials, including the Getting started section, to an own top level category of the website.
For linked data, we need to have a persistent URL to host our terms.rdf
and various schema definitions. For this reason I have provisionally assumed in all schema definitions so far that we'll eventually have such a persistent URL at https://w3id.org/poseidon
. This seems to be easy to set up, following the instructions on https://w3id.org
, with a redirect on our GitHub page. Later, if we decide to move away from github, perhaps to an MPI-hosted website, we can just change the redirect with w3id.org and don't have to change any of the linked data RDF terms.
So we need to grab https://w3id.org/poseidon
and set up that redirect.
I made this repo private, because the website was seriously outdated but still the most visible thing that came up for people coming from google. We have to update this package and make it public again.
We could change the URL for our website to
Described here:
https://docs.github.com/en/pages/configuring-a-custom-domain-for-your-github-pages-site
What is unclear to me is whether the old URL will then still work but automatically redirect. That would be good. If it stops working it would be bad.
To be as inclusive as possible I added a solid standard template for a Code of Conduct to the website already a while ago: https://poseidon-framework.github.io/#/conduct
I was now made aware that this includes naming a contact address to report concerns:
Instances of abusive, harassing, or otherwise unacceptable behavior may be reported to
the community leaders responsible for enforcement at [INSERT CONTACT METHOD].
All complaints will be reviewed and investigated promptly and fairly.
What would be the best way to organize this?
We would link the schema in PDF format to the website.
We need a new help page for xerxes
first
On the webpage, the description of the forge-command use is missing an explanation on what the --onlyGeno flag would do.
Concerning https://github.com/poseidon-framework/poseidon2-schema/blob/master/janno_columns.tsv
1. Publication Status
Publication_Status | bibtex key (e.g. "@AuthorJournalYear") or "unpublished" |
---|
It seems that if the Publication status actually starts with an @
, either the bibtex validator doesn't accept the key containing the character, or the poseidon validator prints the following error:
! The .bib file does not contain the literature in the janno file or the bibtex keys are different
! This seems to be a valid package, but some things are fishy.
Removing the @
from the .janno
file fixed the issue, but it seems like an update is needed, either to the validator (strip leading @
) or to the content explanation for the field.
2. mtContamination error
mtContam_stderr | Standard error of ContamMix/Schmutzi estimate |
---|
ContamMix doesn't actually return a stderr, but a 95% confidence interval instead, making the error around the mode asymmetric. In my own package I have reported the largest difference between MAP and the edges of the 95% confidence interval, but that can be somewhat misleading. It would be good to either allow people to specify mtDNA contamination error as an interval or two fields with min and max of the CI (which can be done for both stderr and 95%CI), or give clear instructions on how one should report a 95%CI here.
Someone who knows Jekyll better than myself could structure that page nicer. I would like to have three menu-linked pages leading to the three building blocks (format, tools, repo).
I think embedding the existing READMEs for trident, the standard, and so forth, was very clever, and it certainly has the advantage that we can keep the documentations for trident, standard and PoseidonR in their respective packages. At least for trident and the standard, however, I somehow feel it also hampers flexibility to maintain and develop this webpage further. For example, it could make sense to weave the individual docs more together to create a comprehensive overview or getting started page, or to document the server API together with trident, or such things.
I would suggest to move the individual documentations here directly into this website-repository, and replace the original READMEs with some basic info and then a link to this webpage. This would allow us to freely design this webpage to be maximally user-friendly and engaging, without the danger to compartmentalise the docs for the individual parts.
(Note that copying the READMEs over here with a gh-action won't help, I think, as it still would mean that we are forced to keep the documentations as separate chapters.)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.