Skip to content

Implement Indexer and Retriever functionalities for given environment clearance data #2

@KaranrajM

Description

@KaranrajM

Description

Strategize a suitable chunking technique to index the given environment clearance data, where each file contains a list of projects and their details. Additionally, implement a retriever that can perform the following actions:

  • Search and look up a project with given location details (latitude, longitude, or State and City)
  • Query for specific details only within the project

Goal

To develop an information retrieval system specific to environment clearance data.

Expected Outcome

  • Efficient indexer and retriever services that enable users to:
    • Lookup and search for a list of projects in a particular location
    • Query specific details within a project

Acceptance Criteria

An information retrieval system specific to environment clearance data with high accuracy.

Implementation Details

  1. Implement a suitable and efficient chunking technique for the given dataset.
  2. Build a retriever that can:
    • Search and look up a project using location details (latitude, longitude, or State and City).
    • Search for specific details only within the project.
  3. Sample parsed and cleaned EC data can be found here. Their respective bare data can be found here.

Mockups/Wireframes

NOT APPLICABLE

Product Name

Jugalbandi

Organisation Name

OpenNyAI

Domain

Legal

Tech Skills Needed

  • Python

Requisites

  • OpenAI API key for Indexing and RAG

Complexity

Medium

Category

Backend

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions