Skip to content

Question: Populating site data #91

Answered by m-i-l
squalx asked this question in Q&A
Jan 24, 2023 · 1 comments · 3 replies
Discussion options

You must be logged in to vote

Great question.

First a quick clarification - the src/indexing/bulkimport/wikipedia/import.sh you ran is for the bulk load of wikipedia. It hasn't been maintained since wikipedia indexing was stopped, and now fails pretty early on. The idea was that src/indexing/bulkimport/ would contain scripts to bulk load content into the search engine directly, while src/db/bulkimport/ would contain scripts to bulk load site details into the database for the indexer to pick up and index as normal. Apologies for the confusion - I've updated the README accordingly. Fortunately the wikipedia import now fails before downloading the whole of wikipedia:-)

Regarding the indexing of test sites on local dev - …

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@squalx
Comment options

@m-i-l
Comment options

@squalx
Comment options

Answer selected by m-i-l
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #90 on January 28, 2023 12:14.