Skip to content

This is the dataset used for our research project: Trust Evaluation on Twitter. The proposed Co-Rank method and detailed experimental results can be found in our paper: "CoRank: A Coupled Dual Networks Approach to Trust Evaluation on Twitter", which is published on WISE 2018.

TrustEval/Twitter_Tas_dataset

Repository files navigation

CoRank A Coupled Dual Networks Approach to Trust Evaluation on Twitter

Twitter_Tas_dataset

This is the dataset used for our research project: Trust Evaluation on Twitter. Anyone who are working on data minning, social network analysis, or any related topics can download this dataset for research purposes.

The proposed Co-Rank method and detailed experimental results can be found in our paper:

"CoRank A Coupled Dual Networks Approach to Trust Evaluation on Twitter" (WISE 2018).

More detailed work has been published in the journal article:

"Let’s CoRank: trust of users and tweets on social networks" (World Wide Web).

The data set is designed for research purpose only. There are four files storing the Twitter data crawled from Twitter using its official REST API.

'tas_users'

This file stores all the users in the Tas dataset. We have given each user a user index and the UserID is their original user ID collected from Twitter. These users are crawled based on the 'location' attribute in their profile. They are considered all from the Tasmania Twitter Community.

For Example:

User_index UserID
"7232" "1076277643"
"791" "1000773896"

'twitter_tweets_all_user_posted'

This file stores all the tweets posted by the users in the Tas dataset. These tweets are crawled from Twitter platform using RestAPI. Please note that, as restricted by Twitter, only a maximum of 3,200 most recent tweets of each user are allowed to be downloaded.

for example:

TweetID Tweet_index User_index
"100000183710007296" "0" "12423"

Because this file is too big, you can download it from here (you don't need to create a dropbox account to download the file.)

'twitter_follower_friend_index'

This file stores the follow relations between users. Note that, all the numbers are indicating user_index numbers.

for example:

Follower_index Friend_index
"0" "180"

(User_index 0 is following User_index 180)

'twitter_tweets_relation_index'

This file stores the retweet and reply relations between tweets. Note that, all numbers are indicating tweet_index numbers.

first column: original_tweet_index | second column: retweet/reply tweet_index | third column: relation type (1 for reply, 2 for retweet)

for example:

original_tweet_index retweet/reply tweet_index relation type
"35" "40" "2"
"95" "103" "1"

Here in relation type, 1 is for reply, 2 is for retweet. (tweet_index 35 is retweeted by tweet_index 40, and tweet_index 95 is replied by tweet_index 103)

'twitter_user_mention_index'

This file stores the mention relation. User is mentioned by a tweet.

first column: mentioned User_index | second column: tweet_index

for example:

mentioned User_index tweet_index
"3" "1856312"

(user_index 3 is mentioned by tweet_index 1856312)

Note that, due to the request from Twitter, we can not publish the content of Twitter.

References

If you use this data set for research, please cite the following paper(s):

@InProceedings{10.1007/978-3-030-02922-7_10,
author="Li, Peiyao
and Zhao, Weiliang
and Yang, Jian",
editor="Hacid, Hakim
and Cellary, Wojciech
and Wang, Hua
and Paik, Hye-Young
and Zhou, Rui",
title="CoRank: A Coupled Dual Networks Approach to Trust Evaluation on Twitter",
booktitle="Web Information Systems Engineering -- WISE 2018",
year="2018",
publisher="Springer International Publishing",
pages="145--160",
isbn="978-3-030-02922-7"
}
@article{10.1007/s11280-020-00829-4,
author = "Li, Peiyao
and Zhao, Weiliang
and Yang, Jian
and Sheng, Quan Z.
and Wu, Jia",
journal = "World Wide Web",
pages = "1573-1413",
title = "Let’s CoRank: trust of users and tweets on social networks",
year = "2020"
}

In addition, if you are interested in the topic of Social Network trust, you may want to know another work of ours: "CoTrRank: Trust Evaluation of Users and Tweets"

If you have any questions, please feel free to contact us.

About

This is the dataset used for our research project: Trust Evaluation on Twitter. The proposed Co-Rank method and detailed experimental results can be found in our paper: "CoRank: A Coupled Dual Networks Approach to Trust Evaluation on Twitter", which is published on WISE 2018.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages