CaioDaoud/Waves

Waves

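#Format of the document collection

Each line of the collection holds one document: the document name, a space, a hashtag, a tab, and then the document's words separated by spaces.

2121 # 	 palavra1 palavra2 
<docName><space><hashtag><tab><word1><space><word2>...

See the examples in Try/GovTxtTeste/00.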
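The line format above can be read back with a few lines of Python. This is only a minimal sketch, not code from the repository: the function name `parse_document_line` is hypothetical, and it assumes that the document name never contains a `#` and that words are separated by arbitrary whitespace.

```python
from typing import List, Tuple


def parse_document_line(line: str) -> Tuple[str, List[str]]:
    """Split one collection line into (doc_name, words).

    Hypothetical helper: assumes the first '#' on the line is the separator
    between the document name and the word list.
    """
    name, sep, rest = line.partition("#")
    if not sep:
        raise ValueError("missing '#' separator")
    name = name.strip()
    if not name:
        raise ValueError("missing document name")
    # split() with no arguments also absorbs the tab that follows '#'.
    return name, rest.split()


if __name__ == "__main__":
    print(parse_document_line("2121 #\tpalavra1 palavra2"))
```

Splitting on whitespace rather than on an exact tab keeps the sketch tolerant of stray spaces around the separator.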

#Indexer Build:

  1. cd Indexer
  2. mkdir build
  3. cd build
  4. cmake ..
  5. make

After that you will have the following runnable files:

./geraParcial
./mergeParcial
./geraIndiceFrequencia
./geraNorma
./geraSkipListBlockMaxFix
./GeraDistribuicao (To check the threshold value related to a percentage.)
./SplitIndex (To split the index)

#QueryProcessor Build:

  1. cd processador
  2. mkdir build
  3. cd build
  4. cmake ..
  5. make

After that you will have the following runnable file:

./queryProcessor
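After both builds, it can be useful to confirm that every runnable file listed above was produced. The following is a hedged sketch rather than part of the repository: the helper `missing_binaries` is hypothetical, and it assumes the script is run from the repository root so that the build directories are `Indexer/build` and `processador/build`.

```python
from pathlib import Path
from typing import Iterable, List

# Names taken from the build sections of this README.
INDEXER_BINARIES = [
    "geraParcial",
    "mergeParcial",
    "geraIndiceFrequencia",
    "geraNorma",
    "geraSkipListBlockMaxFix",
    "GeraDistribuicao",
    "SplitIndex",
]
QUERY_BINARIES = ["queryProcessor"]


def missing_binaries(build_dir: str, names: Iterable[str]) -> List[str]:
    """Return the names that do not exist as regular files in build_dir."""
    build = Path(build_dir)
    return [name for name in names if not (build / name).is_file()]


if __name__ == "__main__":
    # Assumed layout: run from the repository root.
    for build_dir, names in (
        ("Indexer/build", INDEXER_BINARIES),
        ("processador/build", QUERY_BINARIES),
    ):
        missing = missing_binaries(build_dir, names)
        print(build_dir, "missing:", ", ".join(missing) if missing else "nothing")
```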

Try the Query Processor

  1. Build the Indexer and the QueryProcessor.
  2. Copy all runnable files to ./Try/

[queryProcessor,
geraParcial,
mergeParcial,
geraIndiceFrequencia,
geraNorma,
geraSkipListBlockMaxFix,
GeraDistribuicao,
SplitIndex]
  3. Index the documents and split the index.

    I. $: cd Try/

    II. Index the documents in one tier, GovInd/GovTxt.

     [python GeraIndice.py <collectionPath> <basePath> <indexPath> <indexName>]
     
     $: python GeraIndice.py GovTxtTeste/ GovTxtBase/ GovInd/ GovTxt.
    

    III. Split the index into two tiers (GovInd/fir_25. and GovInd/sec_25.), with 25% in the first tier.

      [python Split.py <indexPath> <indexName.> <% (1 - 99)>]
    
      $: python Split.py GovInd/ GovTxt. 25
    
  4. Process queries. (To view the generated ranking, open the <file.result>.)

     ./queryProcessor configFile
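The indexing, splitting, and query steps above can be assembled programmatically, which makes it easy to validate the split percentage before anything runs. This is a minimal sketch under stated assumptions: the function names are hypothetical, the argument order simply mirrors the usage lines in this README, and the commands are only printed, not executed.

```python
from typing import List


def index_command(collection_path: str, base_path: str,
                  index_path: str, index_name: str) -> List[str]:
    """Argument list for the one-tier indexing step (GeraIndice.py)."""
    return ["python", "GeraIndice.py", collection_path, base_path,
            index_path, index_name]


def split_command(index_path: str, index_name: str, percent: int) -> List[str]:
    """Argument list for the two-tier split step (Split.py)."""
    # The README gives the valid range as 1 - 99.
    if not 1 <= percent <= 99:
        raise ValueError("percent must be between 1 and 99")
    return ["python", "Split.py", index_path, index_name, str(percent)]


def query_command(config_file: str) -> List[str]:
    """Argument list for the query processor."""
    return ["./queryProcessor", config_file]


if __name__ == "__main__":
    steps = [
        index_command("GovTxtTeste/", "GovTxtBase/", "GovInd/", "GovTxt."),
        split_command("GovInd/", "GovTxt.", 25),
        query_command("configFile"),
    ]
    for argv in steps:
        # Printed only; pass each list to subprocess.run(argv, cwd="Try")
        # to actually execute it from the Try/ directory.
        print(" ".join(argv))
```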
    
