Comments (7)
Hi, it seems that the original file has been removed.
Please write a mail to [email protected], to get a copy.
Regards,
Philipp
from convex.
Hi Philipp,
We have unzipped the Wikidata dump and gotten two files: wikidata2018_09_11.hdt.index.v1-1 and wikidata2018_09_11.hdt.
But we cannot read the content with different text editors. How do you read them? Thank you for your help.
Best regards
Sirui
from convex.
Hi Sirui,
HDT has an efficient representation of the KB, and the stored data is not meant to be directly readable (see https://www.rdfhdt.org).
In case you do not want to mess around with huge knowledge bases, and just want to conveniently access relevant facts for a specific entity or question, I can recommend also taking a look at our latest project CLOCQ which will be published as full paper at WSDM2022, for which we will soon have an open API available: https://clocq.mpi-inf.mpg.de.
If you want to look at the plain Wikidata KB-dump, you can download the latest dump (https://dumps.wikimedia.org/wikidatawiki/entities/). We also have a code-base for extracting a QA-related subset from Wikidata (https://github.com/PhilippChr/wikidata-core-for-QA).
from convex.
Hi Philipp,
Thanks for the reply. As for the Wikidata dump that you sent through email (wikidata2018_09_11.hdt.index.v1-1 and wikidata2018_09_11.hdt), is it already extracted to a QA-related subset?
Best regards
Sirui
from convex.
Hi Sirui,
No, this is the full Wikidata dump from that timestamp.
Regards,
Philipp
from convex.
Hi Philipp,
Thanks a lot. Could you please explain how to construct the context graph for turn 0? For example, given the question "When did The Carpenters sign with A&M Records?", seed entity "The Carpenters" and the answer "1969", I can identify the answer triple is
http://www.wikidata.org/entity/Q223495
http://www.wikidata.org/prop/direct/P571
"1969-01-01T00:00:00Z"^^<http://www.w3.org/2001/XMLSchema#dateTime>
Does the context graph only contain this triple or also contain triples (The Carpenters, ?, A&M Records)? According to your methodology, my understanding is that the context graph contains only one triple; however, according to Figure 1, my understanding is that the context graph also contains triples (The Carpenters, ?, A&M Records).
Sincerely appreciate your help.
Best regards
Sirui
from convex.
Hi Sirui,
The qualifiers are missing in this case.
The fact "Carpenters, record label, A&M records" is there in Wikidata
(https://www.wikidata.org/wiki/Q223495),
but misses the qualifier "point in time, 1969".
In Figure 1, these qualifiers are there.
Regards,
Philipp
from convex.
Related Issues (20)
- A question about ConvQuestion HOT 1
- About settings.json
- Tagme and Entity recognition HOT 1
- Node Embedding HOT 3
- Where to download the dataset HOT 3
- Count Questions HOT 4
- JSON files in data folder HOT 1
- About requirements HOT 8
- KG link does not work
- Predicate index HOT 2
- where is the knowledge graph HOT 1
- requests.get('https://tagme.d4science.org/tagme/tag‘)error HOT 3
- May I know what data is stored in these files? HOT 3
- what is 'qualifiers'? HOT 2
- Results not changing HOT 10
- pip install hdt does not work in python 3.7 macOS HOT 6
- Running problem HOT 3
- Custom knowledge graph HOT 1
- The existentials question HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from convex.