main points 15/03 meeting #51
rbroc started this conversation in Meeting Notes
Mina has done some really nice work to finalize data generation! Here is a record of the main points discussed during our 15/03 meeting, and the next steps.
Prompts finalized
Prompts kept simple and general. The final list is here: https://github.com/rbroc/echo/blob/main/src/generate/prompts.py#L52-56
As mentioned in #47, we are running with no system prompt for LLaMa. For stories, one of the two prompts will be selected (as far as I remember, the first of the two -- but @MinaAlmasi, correct me if I am wrong).
Decoding parameters finalized
Mina has generated data with temperature = 1.0, 1.5, and 2.0. Much of the 2.0 data makes no sense, so we are not planning to use it for the rest of the project. We will work with 1.0 and 1.5, and later decide which dataset we sample from for the human experiment.
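A quick intuition for why the temperature = 2.0 output degenerates: temperature divides the logits before the softmax, so higher values flatten the next-token distribution and low-probability (often nonsensical) tokens get sampled far more often. A minimal, self-contained sketch with toy logits (not the actual model):

```python
import math

def softmax_with_temperature(logits, temperature):
    """Scale logits by 1/temperature before softmax.
    Higher temperature flattens the distribution, so unlikely
    tokens get a larger share of the probability mass."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Toy logits for three candidate tokens: one clearly preferred.
logits = [5.0, 2.0, 0.0]
for t in (1.0, 1.5, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

At 1.0 the top token dominates; by 2.0 a noticeable chunk of mass has shifted onto the weakest candidate, which is exactly the "makes no sense" regime.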
Data cleaning
There are a few cases where models don't follow instructions or start generating nonsensical output. These are probably not many, and they occur more with some models (e.g., LLaMa) and some tasks (stories) than others. Filtering this data in a rule-based way seems pretty much impossible. A possible approach to removing this kind of data would be few-shot learning with SetFit on a set of manual "quality" annotations. On the other hand, these cases are rare, and we can also treat these failure scenarios as "signals" that classifiers and humans can use to detect whether text is AI- or human-generated. Therefore, we lean towards not excluding any data ATM, and possibly reconsidering our decision later on; we could still exclude clear failure cases from the human experiment, though. Note, however, that at some point it could be nice to annotate the data for whether completions are "good" (grammatical and instruction-following) or not anyway. These annotations could be relevant for several purposes.
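If we do annotate for quality later, even a small set of manual labels would let us quantify where failures cluster by model and task. A minimal bookkeeping sketch; the annotation schema and the non-LLaMa model name are hypothetical, not our actual data format:

```python
from collections import defaultdict

# Hypothetical manual quality annotations: each completion flagged
# as "ok" (grammatical & instruction-following) or "fail".
annotations = [
    {"model": "llama",   "task": "stories", "label": "fail"},
    {"model": "llama",   "task": "stories", "label": "ok"},
    {"model": "llama",   "task": "news",    "label": "ok"},
    {"model": "model-b", "task": "stories", "label": "ok"},
]

def failure_rates(rows):
    """Fraction of 'fail' labels per (model, task) pair."""
    counts = defaultdict(lambda: [0, 0])  # key -> [fails, total]
    for r in rows:
        key = (r["model"], r["task"])
        counts[key][1] += 1
        if r["label"] == "fail":
            counts[key][0] += 1
    return {k: fails / total for k, (fails, total) in counts.items()}

print(failure_rates(annotations))
```

A table like this would also tell us whether excluding failures from the human experiment removes data unevenly across conditions.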
Next steps
Classify!
Mina will rerun generation for one of the datasets where the max tokens was specified incorrectly, and then we're ready to train a classifier. :)
Raw features vs PCA
For now, we will work with raw TextDescriptives features and probably with XGBoost/RandomForest. However, features are likely to be highly correlated and we may therefore consider PCA-ing our way out of multicollinearity. Multicollinearity won't be a problem for predictive performance, but it might for the interpretation of feature importances. Though perhaps less so if we are conservative in terms of tree complexity (low feature bagging params + low tree depth).
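The PCA idea in a nutshell: rotating centered features onto their principal axes makes the resulting scores uncorrelated by construction, which is what would help when interpreting feature importances. A small numpy sketch on simulated correlated features (not the actual TextDescriptives output):

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate two highly correlated "features" (as TextDescriptives
# features often are): noisy copies of the same latent signal.
latent = rng.normal(size=(200, 1))
X = np.hstack([latent + 0.1 * rng.normal(size=(200, 1)),
               latent + 0.1 * rng.normal(size=(200, 1))])

# PCA via SVD on centered data: project onto principal axes.
Xc = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
scores = Xc @ Vt.T  # PC scores are uncorrelated by construction

corr_before = np.corrcoef(X, rowvar=False)[0, 1]
corr_after = np.corrcoef(scores, rowvar=False)[0, 1]
print(round(corr_before, 3), round(corr_after, 3))
```

The trade-off is that PC importances are harder to read than raw-feature importances, so this is only worth it if the multicollinearity actually muddies the rankings.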
How many models?
I'd probably lean towards training separate classifiers for each task (and temperature), but the same classifier for multiple LLMs.
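That split could be as simple as pooling rows by (task, temperature) while keeping all LLMs together in each pool. A hypothetical sketch; the field names and metadata values are assumptions, not our actual data format:

```python
from collections import defaultdict

# Hypothetical generated examples with metadata.
rows = [
    {"task": "stories", "temperature": 1.0, "model": "llama",   "text": "..."},
    {"task": "stories", "temperature": 1.0, "model": "model-b", "text": "..."},
    {"task": "stories", "temperature": 1.5, "model": "llama",   "text": "..."},
    {"task": "news",    "temperature": 1.0, "model": "llama",   "text": "..."},
]

def split_for_training(rows):
    """One training pool per (task, temperature); all LLMs stay
    pooled together, so a single classifier covers every model."""
    pools = defaultdict(list)
    for r in rows:
        pools[(r["task"], r["temperature"])].append(r)
    return dict(pools)

pools = split_for_training(rows)
print(sorted(pools))  # one key per (task, temperature) combination
```

Pooling LLMs keeps per-classifier training sets larger, at the cost of not modelling per-LLM quirks separately.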
Open questions
@rdkm89 and @MinaAlmasi, feel free to add / edit if needed :)
Since we're starting to dive into the really interesting stuff, let's try to have a meeting in the coming weeks with the whole group.