Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.
-
Updated
Mar 22, 2024 - TeX
Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.
My tex master thesis, touching on: Machine Learning, Natural Language Processing, Information Extraction, Knowledge Graphs.
[Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fine-grained geo-entities, such as streets, stops and routes, as well as standard named entity types (organization, date, number, etc).
A curated list (and summaries) of awesome research publications on topic of data extraction from photos of receipts.
Literature Survey of Information Extraction, especially Relation Extraction, Event Extraction, and Slot Filling.
Add a description, image, and links to the information-extraction topic page so that developers can more easily learn about it.
To associate your repository with the information-extraction topic, visit your repo's landing page and select "manage topics."