Human prompts for dataset generation. #22

Open
winglian opened this issue Jul 5, 2023 · 3 comments

winglian commented Jul 5, 2023

> We instruct the user model with carefully designed prompts to mimic human user behavior

@ningding97 could you provide these prompts you used? Thanks!

@ningding97
Member

> We instruct the user model with carefully designed prompts to mimic human user behavior
>
> @ningding97 could you provide these prompts you used? Thanks!

Hi, thanks for your interest in our work. Below are example prompts, one for each of the three sectors, that we use to instruct the model to generate a new turn as a human user. In our experience, the key point is to avoid "role exchange", so the prompt is appended to the dialogue history. We also sample different prompts to mimic different kinds of users (for example, the user can be very professional or casual). Hope it helps!

  • Sector 1
Above is a conversation between a user and an intelligent assistant. Now suppose you are the user, say something to continue the conversation based on given context. Make the response short and the language casual.
  • Sector 2
Above is a conversation between a user and an intelligent assistant. Now suppose you are the user, generate response according to the generated material to continue the conversation. Bear in mind your major request is to ask the assistant to generate some material. So you can ask the assistant either to make it more detailed, add more related information, or any other request to improve the generated material. Be creative and diverse in your request. Make the response short and the language casual.
  • Sector 3
Above is a conversation between a user and an intelligent assistant. Now suppose you are the user, say something to continue the conversation based on given context. Bear in mind your major request is to ask the assistant to do something based on a given text material. So it is better you ask question or give instruction that is accomplishable with only the given text available. Make the response short and the language casual.

To provide more details about the data generation process, we will take Sector 1 as an example and provide all the code, prompts, and meta-information.
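
As a rough illustration of "the prompt is appended to the dialogue history", here is a minimal sketch that picks the instruction for a sector and places it after the conversation text. The prompt strings are copied from the list above; the names `SECTOR_PROMPTS` and `build_user_turn_prompt` are hypothetical and are not taken from the project's code.

```python
# Hypothetical helper names; only the prompt texts come from this thread.
SECTOR_PROMPTS = {
    1: (
        "Above is a conversation between a user and an intelligent assistant. "
        "Now suppose you are the user, say something to continue the "
        "conversation based on given context. "
        "Make the response short and the language casual."
    ),
    2: (
        "Above is a conversation between a user and an intelligent assistant. "
        "Now suppose you are the user, generate response according to the "
        "generated material to continue the conversation. "
        "Bear in mind your major request is to ask the assistant to generate "
        "some material. So you can ask the assistant either to make it more "
        "detailed, add more related information, or any other request to "
        "improve the generated material. "
        "Be creative and diverse in your request. "
        "Make the response short and the language casual."
    ),
    3: (
        "Above is a conversation between a user and an intelligent assistant. "
        "Now suppose you are the user, say something to continue the "
        "conversation based on given context. "
        "Bear in mind your major request is to ask the assistant to do "
        "something based on a given text material. So it is better you ask "
        "question or give instruction that is accomplishable with only the "
        "given text available. "
        "Make the response short and the language casual."
    ),
}


def build_user_turn_prompt(history: str, sector: int) -> str:
    """Return the dialogue history followed by the sector's instruction."""
    if sector not in SECTOR_PROMPTS:
        raise ValueError(f"unknown sector: {sector}")
    # The instruction goes AFTER the history, so "Above is a conversation"
    # refers to the text that precedes it.
    return history.rstrip("\n") + "\n" + SECTOR_PROMPTS[sector] + "\n"


history = "User: user utterance 1\nAssistant: assistant response 1"
print(build_user_turn_prompt(history, 1))
```

Sampling among several instruction variants per sector (for example, a professional versus a casual user) could be layered on top by storing a list of strings per sector; those variants are not given in this thread, so they are left out here.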

renatz commented Sep 2, 2023

Hi, I tried to reproduce this procedure, but the simulated human often evaluates the assistant's answers instead of raising questions.
What does "the prompt is appended to the dialogue history" mean?
Do you append the system prompt to the content of every request?

@yulinchen99
Collaborator

Yes, we put everything under the "user" role for the OpenAI ChatGPT API. For example, suppose you already have the first round of conversation:

User: user utterance 1
Assistant: assistant response 1

To generate a new user utterance, your input message is:

{
   "role": "user", 
   "content": """User: user utterance 1
Assistant: assistant response 1
Above is a conversation between a user and an intelligent assistant. Now suppose you are the user, say something to continue the conversation based on given context. Make the response short and the language casual.
"""
}
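
The message above can also be assembled programmatically. The following is a minimal sketch, assuming the history is kept as a list of `(speaker, text)` pairs; `format_history` and `build_message` are hypothetical names used only for illustration, and the sketch stops at building the message rather than calling any API.

```python
from typing import Dict, List, Tuple

# Sector 1 instruction, copied from this thread.
INSTRUCTION = (
    "Above is a conversation between a user and an intelligent assistant. "
    "Now suppose you are the user, say something to continue the "
    "conversation based on given context. "
    "Make the response short and the language casual."
)


def format_history(turns: List[Tuple[str, str]]) -> str:
    """Flatten (speaker, text) pairs into 'Speaker: text' lines."""
    return "\n".join(f"{speaker}: {text}" for speaker, text in turns)


def build_message(
    turns: List[Tuple[str, str]], instruction: str = INSTRUCTION
) -> Dict[str, str]:
    """Put the whole history plus the instruction into ONE user-role message."""
    content = f"{format_history(turns)}\n{instruction}\n"
    return {"role": "user", "content": content}


turns = [
    ("User", "user utterance 1"),
    ("Assistant", "assistant response 1"),
]
message = build_message(turns)
print(message["content"])
```

Because the earlier turns are flattened into plain text inside one message, the model does not see them as its own previous assistant turns, which matches the single-message layout shown above. After each new user utterance and assistant reply, the pair list can be extended and the message rebuilt for the next round.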
