JetBrains Internship Project

Intelligent Chunking Methods for Code Documentation RAG

Results

More detailed results and the implementation approach are described in the file report.pdf. All the computed results from various corpora are located in the results folder.

How to Reproduce

At first, install all required dependencies from the requirements.txt file. The folder data contains all the required corpora, and the corresponding questions are located in the questions_df.csv file. The evaluation can be run using the command:

python3 src/retrieval_evaluation_pipeline.py

Before running, you may want to modify the parameters of the RetrievalEvaluationPipeline in retrieval_evaluation_pipeline.py file, such as the embedding function, corpus, and labels for the questions:

rep = RetrievalEvaluationPipeline(
    embedding_function,
    corpus_file='data/chatlogs.md',
    questions_label='chatlogs'
)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JetBrains Internship Project

Intelligent Chunking Methods for Code Documentation RAG

Results

How to Reproduce

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
results		results
src		src
.gitignore		.gitignore
README.md		README.md
report.pdf		report.pdf
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

JetBrains Internship Project

Intelligent Chunking Methods for Code Documentation RAG

Results

How to Reproduce

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages