Spoken language understanding (SLU) task includes intent identification and slot filling. There are datasets for SLU in three types:
- single-turn query
- only tagging, without intent
- multi-turn dialogue
dataset:
Datasets | Domain | Vocab | #Train | #Test | #Slot | #Intent | language | paper | detail | |
single-turn | antis | 1 | 722 | 4478 | 893 | 120 | 21 | English | ||
snips | >1 | 11241 | 13084 | 700 | 72 | 7 | English | |||
only-tagging | MIT rest. | 1 | 4166 | 7660 | 1521 | 17 | - | English | ||
MIT eng. | 1 | 7481 | 9775 | 2443 | 25 | - | English | |||
MIT trivia10k13. | 1 | 12145 | 7816 | 1953 | 25 | - | English | |||
multi-turn dialogue | DSTC 2 | 1 | - | 11677 | 9890 | w/o BIO-format | - | English | ||
incarslu | 1 | 1145 | 10571 | 4882 | 11 (w/o BIO-format) | 17 | English | |||
nlpcc | 3 | 5443 | 4705/21352 | 1177/5350 | 11 | 11 | Chinese |