Corpus creator for Chinese Wikipedia
Downloads and imports Wikipedia page histories to a git repository
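For illustration, a minimal sketch of the underlying idea (not this project's actual code): write each revision to a file and commit it with the revision's timestamp, so `git log` replays the page history. The `revisions` iterable and paths are hypothetical, and a configured git identity is assumed.

```python
import subprocess
from pathlib import Path

def import_history(repo_dir, page_title, revisions):
    """revisions: iterable of (iso_timestamp, wikitext), oldest first."""
    repo = Path(repo_dir)
    repo.mkdir(parents=True, exist_ok=True)
    subprocess.run(["git", "init", "-q"], cwd=repo, check=True)
    article = repo / f"{page_title}.wiki"
    for timestamp, text in revisions:
        article.write_text(text, encoding="utf-8")
        subprocess.run(["git", "add", article.name], cwd=repo, check=True)
        # --date sets the author date so history mirrors the on-wiki timeline
        subprocess.run(
            ["git", "commit", "-q", "-m", f"{page_title} @ {timestamp}",
             "--date", timestamp],
            cwd=repo, check=True,
        )
```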
Extracting useful metadata from Wikipedia dumps in any language.
Python package for working with MediaWiki XML content dumps
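As a hedged sketch of what working with such dumps involves (standard library only; any real package's API will differ), a pages-articles dump can be stream-parsed so multi-gigabyte files never sit in memory. The namespace URI below matches one schema version and varies between dumps:

```python
import bz2
import xml.etree.ElementTree as ET

NS = "{http://www.mediawiki.org/xml/export-0.10/}"  # check your dump's <mediawiki> tag

def iter_pages(dump_path):
    """Yield (title, wikitext) pairs from a pages-articles .xml.bz2 dump."""
    with bz2.open(dump_path, "rb") as f:
        for _, elem in ET.iterparse(f):
            if elem.tag == NS + "page":
                title = elem.findtext(NS + "title")
                text = elem.findtext(NS + "revision/" + NS + "text") or ""
                yield title, text
                elem.clear()  # discard processed elements to limit memory use
```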
Collects a multimodal dataset of Wikipedia articles and their images
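A toy version of that collection step, assuming you already have each article's wikitext (the regex and the localized prefixes are illustrative, not this project's code):

```python
import re

# File links look like [[File:Example.jpg|thumb|...]]; zhwiki also uses localized prefixes
IMAGE_LINK = re.compile(r"\[\[(?:File|Image|文件|檔案):([^|\]]+)", re.IGNORECASE)

def article_record(title, wikitext):
    """Pair an article's text with the image files it embeds."""
    return {"title": title, "text": wikitext,
            "images": [name.strip() for name in IMAGE_LINK.findall(wikitext)]}
```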
A Python toolkit to generate a tokenized dump of Wikipedia for NLP
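In the same spirit, a minimal sketch (not the toolkit's API): strip the most common markup, then emit regex tokens. Note that `\w+` lumps CJK runs together, so a real pipeline would plug in a proper segmenter:

```python
import re

MARKUP = re.compile(r"\[\[(?:[^|\]]*\|)?([^\]]*)\]\]|'{2,}|<[^>]+>")
TOKEN = re.compile(r"\w+|[^\w\s]")

def tokenize(wikitext):
    plain = MARKUP.sub(r"\1", wikitext)  # keep link text, drop the brackets
    return TOKEN.findall(plain)
```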
A library that assists in traversing and downloading from Wikimedia Data Dumps and their mirrors.
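The underlying task looks roughly like this (the mirror list is illustrative; the `<wiki>/latest/<file>` URL layout is the standard one on dumps.wikimedia.org):

```python
import urllib.request

MIRRORS = [
    "https://dumps.wikimedia.org",
    # ...plus any mirror that carries the same directory layout
]

def download_dump(wiki, filename, dest):
    """Try each mirror in turn, e.g. download_dump('zhwiki', 'zhwiki-latest-pages-articles.xml.bz2', ...)."""
    last_error = None
    for base in MIRRORS:
        url = f"{base}/{wiki}/latest/{filename}"
        try:
            urllib.request.urlretrieve(url, dest)
            return url
        except OSError as err:  # URLError subclasses OSError; try the next mirror
            last_error = err
    raise last_error
```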
Research for a master's degree: operation projizz-I/O
Contains code to build a search engine by creating an index and performing search over Wikipedia data.
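The core of any such engine is an inverted index; a toy version (hypothetical helper names, regex tokenization only):

```python
import re
from collections import defaultdict

def build_index(docs):
    """docs: {doc_id: text}. Returns {term: set of doc_ids}."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in re.findall(r"\w+", text.lower()):
            index[term].add(doc_id)
    return index

def search(index, query):
    """AND-query: documents containing every query term."""
    postings = [index.get(t, set()) for t in re.findall(r"\w+", query.lower())]
    return set.intersection(*postings) if postings else set()
```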
Wikicompiler is a fully extensible Python library that compiles and evaluates text from Wikipedia dumps. You can extract text, do text analysis, or even evaluate the AST (Abstract Syntax Tree) yourself.
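To make the compile-then-evaluate idea concrete, here is a toy AST pipeline (deliberately tiny, and not Wikicompiler's real grammar): bold runs become nodes, and evaluation walks the tree:

```python
import re

TOKENS = re.compile(r"'''(.+?)'''|([^']+|')", re.S)

def parse(wikitext):
    """Build a flat AST of ('bold', text) and ('text', text) nodes."""
    return [("bold", b) if b else ("text", p) for b, p in TOKENS.findall(wikitext)]

def evaluate(ast):
    """One possible evaluator: upper-case whatever was bold."""
    return "".join(v.upper() if kind == "bold" else v for kind, v in ast)

print(evaluate(parse("plain and '''bold''' text")))  # plain and BOLD text
```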
WikiBank is a new partially annotated resource for the multilingual frame-semantic parsing task.
A search system based on the Wikipedia dump dataset.
Converts dumped Chinese Wikipedia XML to human-readable documents in Markdown and txt.
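A few of those conversions can be shown with plain regexes (illustrative only; a real converter needs many more rules):

```python
import re

def wikitext_to_markdown(text):
    # Headings: longest '=' fences first so the '==' rule doesn't eat '===='.
    text = re.sub(r"^====(.+?)====\s*$", r"#### \1", text, flags=re.M)
    text = re.sub(r"^===(.+?)===\s*$", r"### \1", text, flags=re.M)
    text = re.sub(r"^==(.+?)==\s*$", r"## \1", text, flags=re.M)
    text = re.sub(r"'''(.+?)'''", r"**\1**", text)            # bold
    text = re.sub(r"''(.+?)''", r"*\1*", text)                # italics
    text = re.sub(r"\[\[[^|\]]+\|([^\]]+)\]\]", r"\1", text)  # piped link -> label
    text = re.sub(r"\[\[([^\]]+)\]\]", r"\1", text)           # plain link -> title
    return text
```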
Framework for the extraction of features from Wikipedia XML dumps.
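Features in this setting are typically simple per-page statistics; a hypothetical extractor in that spirit:

```python
import re

def extract_features(wikitext):
    """A few cheap per-page signals computed straight from raw wikitext."""
    return {
        "n_links": len(re.findall(r"\[\[", wikitext)),
        "n_templates": len(re.findall(r"\{\{", wikitext)),
        "n_headings": len(re.findall(r"^==+", wikitext, flags=re.M)),
        "n_chars": len(wikitext),
    }
```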
Chat with local Wikipedia embeddings 📚
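The retrieval half of such a chat loop reduces to nearest-neighbour search over precomputed vectors; a minimal sketch, assuming an `(n, d)` embedding matrix and a query vector from whatever model you use (both assumptions, not this repo's API):

```python
import numpy as np

def top_k_passages(query_vec, passage_matrix, passages, k=3):
    """Rank passages by cosine similarity to the query embedding."""
    q = query_vec / np.linalg.norm(query_vec)
    m = passage_matrix / np.linalg.norm(passage_matrix, axis=1, keepdims=True)
    scores = m @ q                       # cosine similarity per passage
    best = np.argsort(scores)[::-1][:k]
    return [(passages[i], float(scores[i])) for i in best]
```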
Identifies acronyms in a text file and disambiguates possible expansions
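One common heuristic for this (not necessarily the tool's own) is to match an all-caps token in parentheses against the initials of the words just before it:

```python
import re

def find_acronyms(text):
    """Map acronyms like '(NLP)' to a preceding phrase whose initials spell them."""
    expansions = {}
    for match in re.finditer(r"\(([A-Z]{2,})\)", text):
        acronym = match.group(1)
        words = text[:match.start()].split()[-len(acronym):]
        if [w[0].upper() for w in words] == list(acronym):
            expansions[acronym] = " ".join(words)
    return expansions

print(find_acronyms("We study natural language processing (NLP)."))
# {'NLP': 'natural language processing'}
```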