Multitask Learning for Emotionally Analyzing Sexual Abuse Disclosures

This repository contains the code for the paper Multitask Learning for Emotionally Analyzing Sexual Abuse Disclosures, accepted as a long paper in the proceedings of NAACL-HLT 2021. In this first-of-its-kind work, we jointly model tasks related to identifying the linguistic behaviours of the #MeToo movement along with the emotional attributes of the text. We also propose a flexible cross-stitched parameter sharing architecture that takes advantage of both hard parameter sharing and soft parameter sharing for task-specific settings. This allows both models to have their own set of parameters while also encouraging knowledge transfer via the shared encoder weights.
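
One common way to realize soft sharing between two task networks is a cross-stitch unit, which linearly mixes the activations of the two tasks with a small learned matrix. The sketch below only illustrates that idea in plain Python. The function name `cross_stitch`, the hand-picked 2x2 `alpha` values, and the use of plain lists are assumptions made for this example; they are not taken from the notebook in this repository, where such weights would be learned during training.

```python
from typing import List, Tuple

Vector = List[float]


def cross_stitch(x_a: Vector, x_b: Vector,
                 alpha: List[List[float]]) -> Tuple[Vector, Vector]:
    """Mix two task-specific activation vectors with a 2x2 matrix `alpha`.

    out_a = alpha[0][0] * x_a + alpha[0][1] * x_b
    out_b = alpha[1][0] * x_a + alpha[1][1] * x_b
    """
    if len(x_a) != len(x_b):
        raise ValueError("both task activations must have the same length")
    out_a = [alpha[0][0] * a + alpha[0][1] * b for a, b in zip(x_a, x_b)]
    out_b = [alpha[1][0] * a + alpha[1][1] * b for a, b in zip(x_a, x_b)]
    return out_a, out_b


# Hypothetical activations for the two tasks (illustrative values only).
task_a = [1.0, 2.0]
task_b = [3.0, 4.0]

# Identity mixing: no information is exchanged between the tasks.
no_sharing = cross_stitch(task_a, task_b, [[1.0, 0.0], [0.0, 1.0]])

# Uniform mixing: both tasks receive the same averaged representation.
full_sharing = cross_stitch(task_a, task_b, [[0.5, 0.5], [0.5, 0.5]])
```

Values of `alpha` between these two extremes let each task keep mostly its own features while borrowing some from the other.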

Please follow these links for more information,

Datasets

In our work, we use two datasets which have been mined from the same source -- Twitter:

MeTooMA, consisting of tweets pertaining to the MeToo movement on social media platforms. The tweets have been annotated for various linguistic behaviours (directed-hate, generalized-hate, sarcasm, justification, refutation, support, oppose, relevance). The dataset details, collection methodology, pre-processing information, and access instructions are present here.
SemEval 2018, consisting of tweets representing the mental state of their authors in the form of emotions distributed across 11 distinct labels. The details of this dataset are present here.
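
As a rough illustration of how 11 emotion labels can be turned into a training target, the sketch below encodes a tweet's emotions as a multi-hot vector. It assumes the label set of the SemEval-2018 Task 1 emotion classification subtask and that a tweet may carry several emotions at once; the names `EMOTIONS` and `to_multi_hot` are hypothetical and do not come from this repository.

```python
from typing import Iterable, List

# Assumed label set (SemEval-2018 Task 1, emotion classification subtask).
EMOTIONS = [
    "anger", "anticipation", "disgust", "fear", "joy", "love",
    "optimism", "pessimism", "sadness", "surprise", "trust",
]


def to_multi_hot(labels: Iterable[str]) -> List[int]:
    """Return a 0/1 vector with one position per emotion in EMOTIONS."""
    present = set(labels)
    unknown = present - set(EMOTIONS)
    if unknown:
        raise ValueError(f"unknown emotion labels: {sorted(unknown)}")
    return [1 if emotion in present else 0 for emotion in EMOTIONS]


# A tweet tagged with two emotions gets two active positions.
target = to_multi_hot(["joy", "love"])
```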

Text Embeddings

We use BERTweet embeddings in our work and provide them for both datasets in the Embeddings folder. The key component of transformer-based models is token-level self-attention, which enables them to generate dynamic, contextualized embeddings as opposed to the static embeddings of GloVe. Please follow the paper link for more details.
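
To make the contrast with static embeddings concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is a deliberate simplification and not BERTweet itself: the input vectors are used directly as queries, keys, and values (no learned projections, single head), and the two-dimensional toy vectors are invented for illustration.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens: List[Vector]) -> List[Vector]:
    """Scaled dot-product self-attention with Q = K = V = tokens."""
    dim = len(tokens[0])
    outputs = []
    for query in tokens:
        # Similarity of this token to every token in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of all token vectors.
        outputs.append([
            sum(w * value[j] for w, value in zip(weights, tokens))
            for j in range(dim)
        ])
    return outputs


# The same static vector [1.0, 0.0] placed in two different contexts.
in_context_a = self_attention([[1.0, 0.0], [0.0, 1.0]])[0]
in_context_b = self_attention([[1.0, 0.0], [1.0, 0.0]])[0]
```

The first token starts from the same static vector in both sequences, yet `in_context_a` and `in_context_b` differ because each output depends on the neighbouring tokens.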

Experiments, Results and Discussions

The code and its explanation for all the experiments are present in the Jupyter Notebook. We document interesting findings, results, discussions, and qualitative analysis in the manuscript.

Terms of Use

This work can be used freely for research purposes.
The papers listed below provide details of this work. If you use the experiments, then please cite them.
If interested in commercial use of the corpus, send an email to midas@iiitd.ac.in.
If you use the corpus in a product or application, then please credit the authors and Multimodal Digital Media Analysis Lab - Indraprastha Institute of Information Technology, New Delhi appropriately. Also, if you send us an email, we will be thrilled to know about how you have used the corpus.
Multimodal Digital Media Analysis Lab - Indraprastha Institute of Information Technology, New Delhi, India disclaims any responsibility for the use of the corpus and does not provide technical support. However, the contact listed above will be happy to respond to queries and clarifications.
Rather than redistributing the corpus, please direct interested parties to this page.
To provide feedback or report any issues in the code, please drop an email here.

Please feel free to send us an email:

with feedback regarding the corpus.
with information on how you have used the corpus.
if interested in having us analyze your social media data.
if interested in a collaborative research project.

References

Please cite the following papers if the code and/or dataset are useful for your work.

@inproceedings{sawhney-etal-2021-multitask,
    title = "Multitask Learning for Emotionally Analyzing Sexual Abuse Disclosures",
    author = "Sawhney, Ramit  and
      Mathur, Puneet  and
      Jain, Taru  and
      Gautam, Akash Kumar  and
      Shah, Rajiv Ratn",
    booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2021.naacl-main.387",
    pages = "4881--4892",
    abstract = "The {\#}MeToo movement on social media platforms initiated discussions over several facets of sexual harassment in our society. Prior work by the NLP community for automated identification of the narratives related to sexual abuse disclosures barely explored this social phenomenon as an independent task. However, emotional attributes associated with textual conversations related to the {\#}MeToo social movement are complexly intertwined with such narratives. We formulate the task of identifying narratives related to the sexual abuse disclosures in online posts as a joint modeling task that leverages their emotional attributes through multitask learning. Our results demonstrate that positive knowledge transfer via context-specific shared representations of a flexible cross-stitched parameter sharing model helps establish the inherent benefit of jointly modeling tasks related to sexual abuse disclosures with emotion classification from the text in homogeneous and heterogeneous settings. We show how for more domain-specific tasks related to sexual abuse disclosures such as sarcasm identification and dialogue act (refutation, justification, allegation) classification, homogeneous multitask learning is helpful, whereas for more general tasks such as stance and hate speech detection, heterogeneous multitask learning with emotion classification works better.",
}

@inproceedings{gautam2020metooma,
  title={\# metooma: Multi-aspect annotations of tweets related to the metoo movement},
  author={Gautam, Akash and Mathur, Puneet and Gosangi, Rakesh and Mahata, Debanjan and Sawhney, Ramit and Shah, Rajiv Ratn},
  booktitle={Proceedings of the International AAAI Conference on Web and Social Media},
  volume={14},
  pages={209--216},
  year={2020}
}
