Comments (7)
Hm, `Genus (Subgenus)` should work according to these tests:
https://github.com/gnames/gnparser/blob/master/testdata/test_data.md#combination-of-two-uninomials
Name: Aaleniella (Danocythere)
Canonical: Aaleniella subgen. Danocythere
Name: Cordia (Adans.) Kuntze sect. Salimori
Canonical: Cordia sect. Salimori
Name: Calathus (Lindrothius) KURNAKOV 1961
Canonical: Calathus subgen. Lindrothius
Can you add examples that show your cases?
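To make the expected behaviour in the tests above concrete, here is a minimal Python sketch of the `Genus (Subgenus)` to `Genus subgen. Subgenus` rewrite. It is not how gnparser implements this (gnparser uses a full grammar); the regular expression and the function name `canonical_subgenus` are assumptions made only for illustration.

```python
import re
from typing import Optional

# Hypothetical pattern: a capitalized genus followed by a capitalized word
# in parentheses, e.g. "Aaleniella (Danocythere)". This is an illustration,
# not gnparser's grammar.
GENUS_SUBGENUS = re.compile(r"^([A-Z][a-z]+) \(([A-Z][a-z]+)\)")


def canonical_subgenus(name: str) -> Optional[str]:
    """Return 'Genus subgen. Subgenus' if the name starts with that form."""
    match = GENUS_SUBGENUS.match(name)
    if match is None:
        return None
    genus, subgenus = match.groups()
    return f"{genus} subgen. {subgenus}"


print(canonical_subgenus("Aaleniella (Danocythere)"))
print(canonical_subgenus("Calathus (Lindrothius) KURNAKOV 1961"))
# The abbreviated author "(Adans.)" ends with a period, so it is not
# mistaken for a subgenus by this pattern and the function returns None.
print(canonical_subgenus("Cordia (Adans.) Kuntze sect. Salimori"))
```

A pattern this simple cannot distinguish a subgenus from an unabbreviated author name in parentheses, which is one reason a real parser needs more context than a single regular expression.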
from gnparser.
Can you please show examples of your concerns about "Author in Author, Year"?
Looks like I need to add "dem" as an author word: `Von dem Busch`. I'll check if `dem` ever happens as a specific epithet.
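One rough way to run the check mentioned above is to scan a list of name strings for a given word in the second position, where a specific epithet normally sits. The sketch below is only an assumption about how such a check could look; the function `names_with_epithet` and the second sample name are hypothetical, and a real check would run against a large names dataset.

```python
from typing import Iterable, List


def names_with_epithet(names: Iterable[str], word: str) -> List[str]:
    """Return the names whose second token equals `word`.

    The second whitespace-separated token is used as a crude stand-in for
    the specific epithet position; this ignores subgenera, hybrids, etc.
    """
    hits = []
    for name in names:
        tokens = name.split()
        if len(tokens) >= 2 and tokens[1] == word:
            hits.append(name)
    return hits


sample = [
    "Melania testudinaria Von dem Busch, 1842",  # `dem` is part of the author
    "Aus dem Smith, 1900",  # made-up name with `dem` as the epithet
]
print(names_with_epithet(sample, "dem"))
```

If such a scan over real data returns no hits, treating `dem` as an author word is unlikely to break existing names.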
@dimus, sorry, I wasn't paying attention to this issue. The "Genus (Subgenus)" and "Author in Author, Year" cases I was thinking of can be found in https://github.com/gnames/gnames/files/12587991/regex_OK_gnparser_no.txt. Both forms produce a quality rating of 2.
Please also note that in "Eutrochatella babei (Arango y Molina, 1876)", the "y" is part of the author's surname, so the quality 2 indicator "Spanish 'y' is used instead of '&'" does not apply.
Thank you @Mesibov for the explanation. I do think that `y` should decrease the quality, because there are many other languages that people can use for the `and` word, and allowing them all would create a mess. So I decided to limit `and` words to `and` and `&`. I personally would prefer `et` though :)
I am not sure what to do if `y` is part of the author's name. I guess I do need to add exceptions and hardcode such authors into gnparser.
Added #251
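The exception idea described in this comment could look roughly like the following Python sketch. It is an assumption about one possible approach, not gnparser's code: the set `SURNAMES_WITH_Y` and the function `y_lowers_quality` are hypothetical names, and the only surname taken from this thread is "Arango y Molina".

```python
import re

# Hypothetical hardcoded exceptions: surnames in which "y" is part of the name.
SURNAMES_WITH_Y = {"Arango y Molina"}

# A standalone lowercase "y" between word boundaries.
STANDALONE_Y = re.compile(r"\by\b")


def y_lowers_quality(authorship: str) -> bool:
    """Return True if a standalone 'y' remains after removing known surnames.

    A remaining 'y' is assumed to act as an author separator, which is the
    case that would lower the parse quality.
    """
    remainder = authorship
    for surname in SURNAMES_WITH_Y:
        remainder = remainder.replace(surname, "")
    return STANDALONE_Y.search(remainder) is not None


print(y_lowers_quality("(Arango y Molina, 1876)"))  # surname exception
print(y_lowers_quality("Smith y Jones, 1900"))      # made-up authorship
print(y_lowers_quality("Smith & Jones, 1900"))      # made-up authorship
```

The drawback, as noted in the comment, is that every such surname has to be listed by hand.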
In the case of `Genus (Subgenus)` and `Author in Author`, the quality is decreased after discussion with Paddy Patterson about these two issues. For botanical names 'Author in Author' is actually valid, so I am on the fence about it. For `Genus (Subgenus)` I can double check with ICZN folks.
I did try to address most of the problems in v1.7.5.
Related Issues (20)
- TSV output should not add commas if they exist in the input.
- Why `’` is converted to `'`?
- Parser should stop at 'cf/CF', if they are at the end of the string
- Parser should stop at `ms/MS` if it is at the end of the string
- Documentation in README incorrect
- Take in account that some authors with Spanish names have `y` as part of their names.
- Normalized output for `B.D’Orbigny` should not add a space after `D'`
- `dem` is parsed as specific epithet for names like `Melania testudinaria Von dem Busch, 1842`
- Treat 'fa' as 'forma'
- As a user I want to have an easier matching for species groups.
- Second year in name replaces first
- `Porphyromonas sp. PAGU1776` should generate cardinality `1`, not `0`
- Some last names contain two hyphens
- `Crisia romanica Zágoršek` does not parse correctly
- make a field for `Candidatus` names
- As a User I want to be able to get a pool of GNparser instances
- Cardinality for `Anthurium 'Ace of Spades'` cultivar should be 1, not 2
- More names with problematic epithets: complex, spec, do
- More names problematic epithets
- Authors are interpreted as subgenera