Comments (4)
@mbjones regarding the DataONE API documentation detailing the SO:Dataset indexing, I followed the example of current documentation such as https://releases.dataone.org/online/api-documentation-v2.0/design/SearchMetadata_eml.html, which shows XPaths and references to the KNB EML documentation.
- For the 'Path' column, for SO:Dataset, instead of XPaths I put SPARQL queries that were used to extract properties from the JSON-LD documents. While most readers won't require this level of technical detail, it does accurately show how the info is extracted.
- also for the 'Reference' column I referred to the appropriate section in the ESIP SOSO guide, instead of referring to https://schema.org, which doesn't provide context for the fields used, i.e. for Solr
author
, the property extracted ishttps://schema.org/name
, but in the context of SO:Dataset > SO:creator
Is this approach to documenting the SO:Dataset indexing valid, or should a different one be used?
from d1_cn_index_processor.
Sounds like a good approach to me @gothub . Several groups have asked for our crosswalks recently, so that is good to keep documented. I have also pointed them at the config files for the indexer which actually has all of the crosswalks, whereas in sphinx I think we are missing some. But this sounds good.
Where can I see this document? Did you add it to https://github.com/DataONEorg/api-documentation?
from d1_cn_index_processor.
The document will be added to that repo this morning, when I've successfully built the documentation locally. It's almost ready, I just have to finish configuring the python tools required for the doc build.
from d1_cn_index_processor.
This issue has been superseded by DataONEorg/api-documentation#14
from d1_cn_index_processor.
Related Issues (20)
- Verify schema.org indexing compatibility with SOSO v1.2.0 HOT 8
- Handle all valid delimeters for SO:box
- Incorrect geohash calculated for bbox crossing IDL HOT 2
- Support EML references in indexing code
- schema.org indexing appends type to 'abstract' field HOT 1
- schema.org indexing doesn't process creator without context declaration HOT 3
- schema.org indexing recognizes 'https://schema.org' and not 'http://schema.org' HOT 11
- Updates for portal/collection schema v1.1.0
- Indexing fails for objects from member node RW HOT 1
- Support for schema.org/Dataset with multiple `description` entries HOT 1
- Add `schema.org/Dataset` `distribution` info as serviceEndpoint in index HOT 1
- geohashes, text fields not being indexed for SO documents HOT 5
- Update EML Semantic Annotation indexing to include and expand property URIs HOT 1
- Add MOSAIC, ARCRC, SENSO, ADCAD, SALMON ontologies to list of built-in ontologies and reindex content HOT 3
- Json-ld subprocess can't process legitimate schema.org objects HOT 2
- Add formatId for JSON-LD documents HOT 3
- Re-apply previously overwritten changes to XPath for ISOTC211 origin field HOT 7
- Resolve build errors
- For Solr date fields, is just the date sufficient?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from d1_cn_index_processor.