How to measure the effectiveness of tag "controls" (e.g. sentiment)? #33
Labels
documentation
Improvements or additions to documentation
enhancement
New feature or request
question
Further information is requested
I was thinking that simple methodology would be to generate sequences for sentiment spans and measure accuracy based on some overlap measure, e.g. Jaccard, or rougue.
While imperfect, this would be an initial approach to communicating the effectiveness of utilizing text generation controls, derived from pre-trained supervised models.
Tweets would be a good start.
Potential dataset
https://www.kaggle.com/c/tweet-sentiment-extraction/data
The text was updated successfully, but these errors were encountered: