PwaSpellchecker

arquivo edited this page Jul 28, 2015 · 4 revisions

Introduction

Misspellings lead IR systems to provide bad results without users even realizing their mistakes. Users assume that the system lacks quality, which decreases their satisfaction and likelihood of returning to the system. We detected the misspelling problem in the Portuguese Web Archive. Hence, we analyzed existing solutions for spelling suggestion and integrated the solution that provided the best results in our user interface.

The following technical report Query Suggestion for Web Archive Search describes the adopted methodology, the obtained results and the chosen solution.

The datasets (in Portuguese) used can be downloaded from http://www.linguateca.pt/Repositorio/CorrOrtog/.

Compile

Clone pwa-technologies:

  • git clone https://github.com/arquivo/pwa-technologies.git

Install PwaSpellchecker:

  • cd pwa-technologies/PwaSpellchecker
  • mvn install

The WAR file is available in:

  • pwa-technologies/PwaSpellchecker/target/pwaspellchecker-1.0.0.war

Install

Step-by-step:

(where xxx is the query to test)

You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.
Press h to open a hovercard with more details.