-
Notifications
You must be signed in to change notification settings - Fork 17
detail usage #2
Copy link
Copy link
Closed
Description
I was wondering how to do it for one example. I'm guessing you use the get_response code, but in detail
item_new['original_harm_behavior'] = item['original_harm_behavior']
# item_new['rewritten_prompt'] = item['rewritten_prompt']
item_new['nested_prompt'] = item['nested_prompt']
item_new['baseline'] = args.baseline
item_new['test_model'] = args.test_model
item_new['model_output'] = model_output
It would be nice to know what to put in each.
Thanks.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels