Skip to content

Gender detection in Twitter@Author Profiling at PAN2018

License

Notifications You must be signed in to change notification settings

pan-webis-de/gopalpatra18

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multimodal Author Profiling @ PAN 2018

Detailed description of this task can be found @PAN 2018. This code only analyzes tweets in English.

Dataset:The dataset used for this experiments can be downloaded from the PAN 2018.

Dependencies:

  1. gensim
  2. sklearn
  3. nltk

Other requirements:

The GloVe models (100d & 200d) are required for word embeddings.

For image captioning, image caption generation using chainer was used. Need to extract image captions before using the above tool and store it in a csv file (format:imageid \t text).

Running the code

python master.py training_input_add test_input_add test_output_add

Output will be a xml file:

Reference

Please cite the following paper if you find this code is useful.

B. G. Patra, G. Das, and D. Das. 2018. Multimodal Author Profiling for Twitter - Notebook for PAN at CLEF 2018. In Proceedings of the PAN 2018 at CLEF-2018, Avignon, France. link

If you have any query please e-mail us. We welcome bug fixes and new features.

About

Gender detection in Twitter@Author Profiling at PAN2018

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%