Skip to content

marcosfpr/atri

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


Logo

atri

An UI Information Retrieval Tool
Explore the docs »

Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Architecture
  3. Pictures
  4. Roadmap
  5. Contributing
  6. License
  7. Contact

About The Project

In general, ranking algorithms are responsible for deciding, through a query, which documents are relevant or not to it. In this context, in order to guarantee the effectiveness of the returned results, an appropriate modeling of the considered documents and queries is necessary, aiming to adequately produce a ranking function that assigns similarity scores between a query and documents from a collection. To define ranking functions, several Information Retrieval (IR) models were proposed using boolean, vectorial and probabilistic formulations.Each IR model has its own operating assumptions that lead to the rank of documents from a given corpus through desired queries. Thus, this work has, as main objective, the proposal, development, and validation of an experimental IR environment, called ATRI, which includes different IR models to calculate the similarity between queries and documents in a collection through a friendly interface, and could have applications in different scenarios. For this purpose, the following models were considered for similarity calculation: Boolean, Vector Space, Probabilistic, BM25, Belief Network, Extended Boolean, Generalized Vector Space, DFRee, and PL2. In addition, ATRI allows the creation of a benchmarking environment for evaluation of the effectiveness and performance on IR through automatic creation of ensembles, visualization of effectiveness metrics, and support to scientific collections.

Built With

  • Whooshy (My modified fork of Whoosh):
  • Python 3.9
  • FastAPI

Architecture

Currently in development. Bellow, we have the high-level architecture written in PT-BR:

Logo

And the "Motor de Busca" module can be blown up as:

Logo

Pictures

Currently, that is the interface of Atri-UI:

Main page

Logo

Collections Management page

Logo

Search Results page

Logo

Contributing

Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Marcos Pontes - mfprezende@gmail.com

Project Link: https://github.com/search-labs/atri

About

scientific search engine

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages