ShareGPT appending

How is the ShareGPT format handled with this workflow? I'm currently developing a dataset that could be greatly benefited from this technique. However, I hate training on "User" and "Assistant" tokens. It goes against my intentions when working with language models. With Axolotl, there's a way to change the header IDs for sharegpt datasets. I was wondering if there was something similar I could do here, or perhaps I could just do some data processing to change the format...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ShareGPT appending #4

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

ShareGPT appending #4

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions