wikidata-triplets.py - 0 entity found #9

bastiennes · 2023-05-13T22:01:17Z

Hello,

First of all, I want to thank you for sharing this script with the community.
I'm trying to regenerate Rebel dataset.
By using python -m wikiextractor.wikiextractor.WikiExtractor data/$1/$1wiki-latest-pages-articles-multistream.xml.bz2 --links --language $1 --output text/$1 --templates data/$1/templates.txt, I get page articles.
The wikidata entities are described by theirs links (--links), but wikidata-triplets.py use the wikidata ID.

How did you turn the links into IDs?

The text was updated successfully, but these errors were encountered:

LittlePea13 · 2023-06-20T15:12:12Z

Hi there, sorry for the late reply, I didn't see the issue until now. See the Readme:

For ./wikiextractor we use a submodule which is a fork of the original wikiextractor that implements wikimapper to extract the Wikidata entities. You can find the fork here, and clone it to the corresponding folder.

LittlePea13 closed this as completed Jun 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wikidata-triplets.py - 0 entity found #9

wikidata-triplets.py - 0 entity found #9

bastiennes commented May 13, 2023 •

edited

LittlePea13 commented Jun 20, 2023

wikidata-triplets.py - 0 entity found #9

wikidata-triplets.py - 0 entity found #9

Comments

bastiennes commented May 13, 2023 • edited

LittlePea13 commented Jun 20, 2023

bastiennes commented May 13, 2023 •

edited