
Conversation

@Satrat Satrat commented Mar 8, 2024

Asana ticket: https://app.asana.com/0/1206109050183159/1206788571042422/f

We were missing support for passing an instantiated teacher model to the finetuning code; this PR adds it.

Testing

The following script fails on main; with this PR, training starts as intended:

from sparseml.transformers import compress, SparseAutoModelForCausalLM, SparseAutoTokenizer, load_dataset

# load the student model, the instantiated teacher, and the tokenizer from SparseZoo stubs
model = SparseAutoModelForCausalLM.from_pretrained("zoo:llama2-7b-ultrachat200k_llama2_pretrain-base", device_map="auto")
teacher = SparseAutoModelForCausalLM.from_pretrained("zoo:llama2-7b-open_platypus_orca_llama2_pretrain-base", device_map="auto")
tokenizer = SparseAutoTokenizer.from_pretrained("zoo:llama2-7b-ultrachat200k_llama2_pretrain-base")
recipe = "zoo:llama2-7b-ultrachat200k_llama2_pretrain-pruned40"
dataset = load_dataset("garage-bAInd/Open-Platypus")

# concatenate each instruction and output into the single text field used for training
def format_data(data):
    data["text"] = data["instruction"] + data["output"]
    return data

dataset = dataset.map(format_data)

# run finetuning with distillation, passing the teacher as a model instance
compress(
    model=model,
    tokenizer=tokenizer,
    distill_teacher=teacher,
    dataset=dataset,
    recipe=recipe,
    num_train_epochs=1,
    output_dir="./output",
)
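For comparison, a minimal sketch of the previously supported path, assuming distill_teacher also accepts a SparseZoo stub string (as implied by this PR adding support for instantiated teachers on top of it); it reuses the model, tokenizer, dataset, and recipe from the script above:

# hypothetical sketch: pass the teacher by stub instead of as an instance,
# assuming distill_teacher accepts a SparseZoo stub string
compress(
    model=model,
    tokenizer=tokenizer,
    distill_teacher="zoo:llama2-7b-open_platypus_orca_llama2_pretrain-base",
    dataset=dataset,
    recipe=recipe,
    num_train_epochs=1,
    output_dir="./output",
)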

@Satrat Satrat requested review from bfineran, dbogunowicz, horheynm and rahul-tuli and removed request for dbogunowicz and horheynm March 8, 2024 19:41
@Satrat Satrat merged commit 2bd0bd8 into main Mar 8, 2024
@Satrat Satrat deleted the fix_teacher_py branch March 8, 2024 21:50