Based on the fine-tuned Stanford Alpaca, further tuned on the Hong Kong 2023 Consumption Voucher Scheme Frequently Asked Questions dataset

This repo builds on top of https://github.com/tloen/alpaca-lora, which reproduces the Stanford Alpaca results using low-rank adaptation (LoRA). That project provides an Instruct model of similar quality to text-davinci-003 that can run on a Raspberry Pi (for research), and its code can be easily extended to the 13b, 30b, and 65b models.

Here's a Colab notebook that shows how the model's answers differ before and after fine-tuning on the HK CVS FAQ dataset.

Setup

pip install -r requirements.txt

Prepare your Dataset (`prepare_your_dataset.ipynb`)

The downstream task is to enable the model to answer frequently asked questions about the Hong Kong 2023 Consumption Voucher Scheme. You may change the source of the dataset to fit your own needs.
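As a rough illustration of what the preparation step might produce, the sketch below converts question/answer pairs into Alpaca-style records. The `instruction`/`input`/`output` field names follow the Stanford Alpaca convention and are an assumption about this repo's dataset format; `to_alpaca_records`, `save_records`, and `faq_pairs` are hypothetical names, and the sample answer is a placeholder rather than real scheme data.

```python
import json


def to_alpaca_records(faq_pairs):
    """Convert (question, answer) pairs into Alpaca-style instruction records."""
    records = []
    for question, answer in faq_pairs:
        question, answer = question.strip(), answer.strip()
        if not question or not answer:
            continue  # skip incomplete pairs
        # Assumed Alpaca-style schema: FAQ questions need no extra "input" context
        records.append({"instruction": question, "input": "", "output": answer})
    return records


def save_records(records, path):
    """Write the records as UTF-8 JSON so they can be loaded for fine-tuning."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)


# Hypothetical placeholder pairs; replace them with your own FAQ source
faq_pairs = [
    ("  What is the Consumption Voucher Scheme? ", "<answer text from the official FAQ>"),
    ("A question with no answer yet", ""),
]
records = to_alpaca_records(faq_pairs)
print(json.dumps(records, indent=2))
```

Calling `save_records(records, "my_faq_dataset.json")` (a hypothetical file name) would then write the records to disk.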

Fine Tuning (`finetune.ipynb`)

We start from the fine-tuned version of Stanford Alpaca reproduced by alpaca-lora and fine-tune it further, using Hugging Face's PEFT to do so cheaply and efficiently.
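To give a sense of why LoRA fine-tuning is cheap, the sketch below counts the weights a LoRA adapter adds compared with one full weight matrix. The numbers are assumptions, not values read from `finetune.ipynb`: a hidden size of 4096 and 32 layers for LLaMA-7B, and rank 8 applied to the `q_proj` and `v_proj` matrices as in the alpaca-lora defaults. `lora_trainable_params` is a hypothetical helper.

```python
def lora_trainable_params(d_in, d_out, rank):
    """Weights added by one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


# Assumed values (LLaMA-7B shape, alpaca-lora default LoRA settings)
hidden_size = 4096
num_layers = 32
rank = 8
adapted_per_layer = 2  # q_proj and v_proj

full_per_matrix = hidden_size * hidden_size
lora_per_matrix = lora_trainable_params(hidden_size, hidden_size, rank)
total_lora = lora_per_matrix * adapted_per_layer * num_layers

print(f"one full projection matrix: {full_per_matrix:,} weights")
print(f"one LoRA adapter:           {lora_per_matrix:,} weights")
print(f"all LoRA adapters:          {total_lora:,} trainable weights")
```

Under these assumptions only about 4.2 million weights are trained, while the base model's weights stay frozen.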

Inference

We pushed our LoRA adapter to the Hugging Face Hub and use it to wrap the decapoda-research/llama-7b-hf base model.

Run the code

from transformers import GenerationConfig, LlamaForCausalLM, LlamaTokenizer
from peft import PeftModel

# Let the library place the model layers on the available devices
device_map = "auto"

tokenizer = LlamaTokenizer.from_pretrained("decapoda-research/llama-7b-hf")
tokenizer.pad_token_id = 0  # pad with token id 0

# Load the base LLaMA-7B weights in 8-bit (requires bitsandbytes)
model = LlamaForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf",
    load_in_8bit=True,
    device_map=device_map,
)

# Wrap the base model with the LoRA adapter fine-tuned on our dataset
model = PeftModel.from_pretrained(model, "Nelsonlin0321/alpaca-lora-7b-tuned-on-hk-cvs-fqa")


def generate_prompt_eval(instruction):
    template =  f"""Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
{instruction}
### Response:"""
    return template

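# With num_beams=4 and sampling left disabled (the default), generation uses beam search;
# temperature and top_p only take effect when do_sample=True.
eval_generation_config = GenerationConfig(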
    temperature=0.1,
    top_p=0.75,
    num_beams=4,
)


def generate_answer(instruction):
    prompt = generate_prompt_eval(instruction)
    inputs = tokenizer(prompt, return_tensors="pt")
    # Move the inputs to the model's device instead of hard-coding CUDA
    input_ids = inputs["input_ids"].to(model.device)
    generation_output = model.generate(
        input_ids=input_ids,
        generation_config=eval_generation_config,
        return_dict_in_generate=True,
        output_scores=True,
        max_new_tokens=256,
    )
    for s in generation_output.sequences:
        # Decode the full sequence, then keep only the text after the response marker
        output = tokenizer.decode(s, skip_special_tokens=True)
        print("Response:", output.split("### Response:")[1].strip())


question = "Who are eligible to be disbursed with the first-instalment voucher of $1,500 on 16 April?"

generate_answer(question)
>> Response: All eligible people who have successfully registered under 2022 CVS and met the relevant eligibility criteria will be disbursed with the first-instalment voucher of $1,500 on 16 April.
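The inference code above indexes the result of `split("### Response:")`, which raises an `IndexError` if the marker is missing from the decoded text. A slightly more defensive way to pull out the answer could look like the sketch below; `extract_response` and `RESPONSE_MARKER` are hypothetical names, not part of this repo.

```python
RESPONSE_MARKER = "### Response:"


def extract_response(decoded_text, marker=RESPONSE_MARKER):
    """Return the text after the first response marker, or the whole text if it is absent."""
    _, found, tail = decoded_text.partition(marker)
    if not found:
        return decoded_text.strip()
    # Drop LLaMA's end-of-sequence token in case the tokenizer left it in
    return tail.replace("</s>", "").strip()


decoded = "### Instruction:\nWho is eligible?\n### Response: Registered residents.</s>"
print(extract_response(decoded))
```

Inside `generate_answer`, the final `print` could then use `extract_response(output)` in place of the `split` expression.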
