Skip to content

A Solr demo in which I download the Wikipedia and search inside

Notifications You must be signed in to change notification settings

ntrrgc/localpedia

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Localpedia

I gave a talk (in Spanish) about deploying a search engine built on Solr, an open source search platform based on Lucene.

In it I described how it is set up, how documents can structured for Solr, the mechanisms it provides to convert raw textual information in indexable word stems, how indexes are created, how documents are feeded to the system and how search is performed.

As a demonstration application I create Localpedia: I download a dump of all the abstracts of Spanish wikipedia — they are publicly available on dumps.wikimedia.org, parse them with Python and feed them to Solr. Then I build simple Flask web application that serves as frontend, allowing the user to look for articles.

This repository contains:

  • A document explaining Solr basics.

  • The source of Localpedia.

  • The configuration files used to set up Solr to index Localpedia.

About

A Solr demo in which I download the Wikipedia and search inside

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published