Hello there human π, please help me grade Replier!
Basically, replier is a natural language generation project by myself trying to do psycological counseling with automated transformer sequence-to-sequence training with help from @exr0n and @klintkanopka.
And... Like most seq2seq tasks, human evaluation is need.
So, if you would like to be a human evaulator
- please install python and create a venv
- install everything in
requirements.txt
(pip install -r requirements.txt
) - edit the
REVIEWER
variable atop oftalkback.py
to your name - run
talkback.py
and follow any instructions
Please do as many of these as you like. Once you are done, please send final_result.csv
or whatever you set your RESULT
variable to be to me over Discord or email (hliu@shabang.cf
). Thanks much.
Please note: these dataset are not 50/50 bot/human. One of the question is a traditional Turing test, where you are tasked with judging if the response is bot/human. If one of your response is "human", the other is not necesarily bot.