Full-Text Level Knowledge Updating System

✍️ Online Demo • 🤗 HF Repo • 📃 Paper • 📎 Presentation • 🗒️ Master's Thesis

Overview

Event Triggered Article Updating System is a long article updating application for knowledge update.

Spearheaded the development of an Event Triggered Article Updating System, a cutting-edge application designed for updating long articles with new knowledge. This project showcases a significant advancement in handling full-text knowledge updating triggered by news events, leveraging the capabilities of large language models (LLMs).

The whole system is trained on NetKu dataset for knowledge updating in full-text triggered by a News Event.

Demo

A live demonstation of the model can be accessed at Live Demo with GPU support, and HF Space with CPU support.

Key Features

Long texts input support: Overcame the limitations of existing LLMs by enabling the system to understand and process long context inputs. Developed a unique approach allowing for unlimited full-article input lengths, with each paragraph handling up to 4,096 tokens.
Instruction-Tuned Models: Implemented multiple baseline models, including those fine-tuned on LLaMA, Alpaca, Vicuna, and GPT-based models, demonstrating versatility and adaptability in model training.
Innovative Model Architecture: Proposed and developed a new Encoder-Decoder based model architecture. Conducted comprehensive evaluations to prove the effectiveness of this novel approach in the context of knowledge updating.

Citations

If you use our code, data, or models in your research, please cite this repository. You can use the following BibTeX entry:

@inproceedings{lee2022multi,
  title={A Multi-grained Dataset for News Event Triggered Knowledge Update},
  author={Lee, Yu-Ting and Tang, Ying-Jhe and Cheng, Yu-Chung and Chen, Pai-Lin and Li, Tsai-Yen and Huang, Hen-Hsen},
  booktitle={Proceedings of the 31st ACM International Conference on Information \& Knowledge Management},
  pages={4158--4162},
  year={2022}
}

License

The code in this project is licensed under the Apache 2.0 License - see the LICENSE file for details.

OpenAI Data Acknowledgment

The text generation included in this project were generated using OpenAI's models and are subject to OpenAI's Terms of Use. Please review OpenAI's Terms of Use for details on usage and limitations.

Acknowledgements

This work is supported by

National Science and Technology Council, Taiwan, under grants 109-2222-E-001-004-MY3 and 109-2628-H-004-001-MY4.
Institute of Information Science, Academia Sinica, Taiwan.
National Chengchi University, Taiwan.
We thank Meta LLaMA team, Vicuna team, Lightning AI and ISI-NLP for their contributions.

Name	Name	Last commit message	Last commit date
Latest commit theQuert Update main model: add api key inputs Feb 1, 2024 5104b3e · Feb 1, 2024 History 373 Commits
dataset	dataset	Add images	Sep 21, 2023
docs	docs	Add presentation	Sep 24, 2023
examples	examples	Update files structure	Aug 16, 2023
generation	generation	Update results from gpt-4	Jul 12, 2023
images	images	Add images	Sep 21, 2023
nb	nb	Update preprocessing scripts	Oct 3, 2023
primer	primer	Add preprocessing actions and replace \n\n with \c\c to meet the embe…	Nov 22, 2022
sample	sample	Add inputs format for finetuning vicuna	May 26, 2023
spark	spark	initial	Sep 28, 2022
util	util	Sync with server	Aug 16, 2023
.gitignore	.gitignore	Add *.json to git-lfs	Jun 8, 2023
LICENSE	LICENSE	Create LICENSE	Aug 18, 2023
README.md	README.md	Update readme	Jan 11, 2024
app.py	app.py	Update main model: add api key inputs	Feb 1, 2024
chatgpt_prompts.txt	chatgpt_prompts.txt	Add article splitting tip and delete useless files	Aug 10, 2023
gcp_config.sh	gcp_config.sh	Update gcp config file > update in KGE repo	Nov 21, 2023
requirements_hf_space.txt	requirements_hf_space.txt	Update requirments for HF Space	Aug 11, 2023
requirements_inference_vicuna.txt	requirements_inference_vicuna.txt	Update requirements for training and inference stage	Jul 26, 2023
requirements_train.txt	requirements_train.txt	Update requirements for training and inference stage	Jul 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Full-Text Level Knowledge Updating System

Overview

Demo

Key Features

Citations

License

OpenAI Data Acknowledgment

Acknowledgements

About

Packages

Languages

License

theQuert/Knowledge-Updating-System

Folders and files

Latest commit

History

Repository files navigation

Full-Text Level Knowledge Updating System

Overview

Demo

Key Features

Citations

License

OpenAI Data Acknowledgment

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Packages 0

Languages

Packages