Skip to content

lji0126/Persuasion-Comments-Evaluation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Persuasion-Comments-Evaluation

1.Introduction

    Tensorflow Implementation of "Incorporating Argument-Level Interactions for Persuasion Comments Evaluation 
using Co-attention Model". The dataset used in our paper can be download from the repository. 

2.Data Format

2.1 Dataset for persuasion comments evaluation(CMV)

    The whole dataset consists of 3,456 training instances and 807 testing instances. This dataset contains 
pairs of argumentative threads made in reply to the same original post, one successful and one not. Both files 
have the same format and store data for training and heldout testing respectively. Each line is a json object 
for a pair. A pair has the following fields:
    op_author, op_text, op_name, op_title, positive, negative.
"positive" is a list of replies in a rooted path-unit that won a delta from OP, while "negative" is a matching 
rooted path-unit that did not win a delta. "op_author", "op_text", "op_name" and "op_title" give information 
for the original post.  

2.2 Dataset for persuasion comments evaluation(pre_data)

    This is our preprocessing dataset. In preprocessing, we use NLTK for tokenization and lowercase conversion.
We also filter out stop words and low frequency words. The constructed word vocabulary contains 15,767 distinct words. 

2.3 Annotated dataset for interactive argument pair extraction

    We sample 50 triples in the form of (original post, positive reply, negative reply) from the training set 
and split these into 100 original post-reply pairs in the form of (original post, positive reply) and 
(original post, negative reply). Two annotators are hired to annotate the dataset independently and a third 
annotator is asked to solve the conflict between the two annotators. With the final decision from the third 
annotator, we obtain 365 pairs in total. In detail, 234 interactive argument pairs come from positive replies, 
and the other 131 pairs are generated by negative replies. Here's an example of such a file:

1
========Reply1========

He cited this definition by Merriam-Webster: existing in nature and not made or caused by people: coming from 
nature (URL) as his basis for the distinction for natural vs. unnatural. <<<===>>>Look at the definition you 
provided, if we remove the exclusion of things which humans create: existing in nature and not made or caused 
by people ~so essentially, by this definition, natural things are things that exist, which is frankly rather 
meaningless.

========Reply2========

He cited this definition by Merriam-Webster: existing in nature and not made or caused by people: coming from 
nature (URL) as his basis for the distinction for natural vs. unnatural. <<<===>>> You're using natural to mean 
definition 8 the universe, with all its phenomena. The more common definition is definition 1 the material world, 
especially as surrounding humankind and existing independently of human activities. 

He cited this definition by Merriam-Webster: existing in nature and not made or caused by people: coming from 
nature (URL) as his basis for the distinction for natural vs. unnatural. <<<===>>> So by definition we are not 
part of nature, as nature is more commonly used, and is in this sense used, to refer to things that exist 
independently of human activities. 

Notes:
    (1) Reply 1 and 2 indicate the positive reply and negative reply, respectively. 
    (2) The argument on the left of "<<<===>>>" is from original post and the argument on the right is from reply.
    (3) <<<===>>> indicates the interactive relationship.

3.Citation

If you find the datasets useful, please cite the following paper: 

@article{Ji2018Incorporating, title={Incorporating Argument-Level Interactions for Persuasion Comments Evaluation
using Co-attention Model}, author={Lu Ji, Zhongyu Wei, Xiangkun Hu, Yang Liu, Qi Zhang and Xuanjing Huang}, 
year={2018}, publisher={COLING} }.

The detailed information about the paper will be released later.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages