Code Monkey home page Code Monkey logo

wikidata-lexeme-graph-builder's Introduction

Wikidata Lexeme Graph Builder

A simple alternative to the Wikidata Graph Builder that supports lexemes. This tool was originally meant to become obsolete once querying for lexicographical data was available, but reportedly people still find it to be useful these days.

Examples:

License

The content of this repository is released under the AGPL v3 as provided in the LICENSE file that accompanied this code.

By submitting a “pull request” or otherwise contributing to this repository, you agree to license your contribution under the license mentioned above.

wikidata-lexeme-graph-builder's People

Contributors

lucaswerkmeister avatar maxlath avatar waldyrious avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

wikidata-lexeme-graph-builder's Issues

Don’t throw away entity data

Currently, each time you re-submit the form, we completely throw away the entities data. Instead, we should keep it. We need to figure out some other way to find out which entities to process in the next iteration (currently somewhere in getTriples and getNewEntityIds).

Handle somevalue and novalue

Right now the script will probably blow up when encountering somevalue or novalue. I don’t think we need to display those in the graph, but it shouldn’t stop traversal of other statements and entities.

Show lemma as well as sense gloss when using properties that link senses

When using properties that are applied at the lexeme level (e.g. P5191 "derived from"), the label of the graph nodes is the lexeme's lemma; but when a property that connects senses is used instead, what is shown is currently just the sense's gloss. I'd expect something like <lemma>: <gloss> (or even just the lemma) to be a more intuitive label for the graph nodes.

This was originally mentioned in the commit message of 00f3153. Adding an issue for visibility and tracking purposes :)

Limited to 11 links?

When I look at https://lucaswerkmeister.github.io/wikidata-lexeme-graph-builder/?subjects=L8160&predicates=P5191 I've got only 11 links from the L8160 but there is more (18 according to https://www.wikidata.org/w/index.php?title=Special%3AWhatLinksHere&target=Lexeme%3AL8160&namespace=146 )

Same if I change the Qid as entrey, z.B. https://lucaswerkmeister.github.io/wikidata-lexeme-graph-builder/?subjects=L8793&predicates=P5191

I guess it's to avoid too big graph but maybe we could have the limit as an input? (maybe also add the iteration as an input, like on https://angryloki.github.io/wikidata-graph-builder/ ;) )

Handle non-entity ID values

If someone accidentally uses the wrong property (data type quantity, or string, or etc.), the website probably blows up. I suppose it shouldn’t.

Support more than one property ID

Just like we support starting from a set of entities, allow following a set of properties. Shouldn’t be too hard to implement.

Suggested by VIGNERON on-wiki.

Don’t add label/lemma texts multiple times

We add duplicate label/lemma texts on each restart, which makes the texts look ugly in the SVG. Fix that. (Then we can also label entities that we haven’t loaded yet with their entity ID instead of the empty string.)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.