First download other data, then modify the path ('输出文件目录') in +g.py at line 2156, and finally run +g.py.
- Link: https://pan.baidu.com/s/1rdd0c4Cwku9Y5hTO9P4MLw
- Password: b4j4
- Determining the initial dataset: a+文档集 (manuscripts), a+作者编号_论文内容 (reviewers), a+论文号_作者号广义表.txt、a+作者号_论文号广义表.txt, 文档-标准作者排名.txt (groundtruth.txt in a+文档集)
- Run LDAGibbsSampling-master to calculate "a+作者编号_论文内容", rename the resulting 5 files and put them in the corresponding folders:
- 词号_词次数_词表.txt → 3-词号_词次数_词表.txt
- 文档编号_文档名表.txt → 3-文档编号_文档名表.txt
- phi后缀文件 → 3-主题_词分布.txt
- theta后缀文件 → 3-作者_主题分布.txt
- tassign后缀文件 → 3-tassign文件.txt
- Modify the corresponding parameter path and run: 4_12.py, to get 4 files
- Modify the corresponding parameter path and run: +e.py, to get 2 files
- Documentation required: 论文_方向_年份_作者_引用论文_被引论文广义表 (Refer to the other data)
- Format: {paperID:[{Field:None,..},{Year:None,..},{Author:None,..},{citationPaper:None,..},{citationAuthor:None,..},{citedPaper:None,..},{citedPaper:None,..}]}
- Documentation required: 论文_方向_年份_作者_引用论文_被引论文广义表 (Refer to the other data)
- Modify the corresponding parameter path and run: +f.py, to get 2 files
- Modify the corresponding parameter path and run: +g.py, to get results
- The other data provides all the intermediate files for the data in this paper, so you can perform the last step directly.
- If you need the code in English, you can use tool to translate it.
- Install jdk1.8 and eclipse
- eclipse → File → Open Projects from File System.. → Select the path of LDAGibbsSampling-master to open
- Modify the corresponding path in LDAGibbsSampling-master/src/liuyang/nlp/lda/conf/PathConfig.java, as well as the parameters in LdaParameters.txt
- Run LDAGibbsSampling-master/src/liuyang/nlp/lda/main/LdaGibbsSampling.java
If you find this code useful, please cite the following paper:
@article{tan2021improved,
title = {Improved reviewer assignment based on both word and semantic features},
author = {Tan, Shicheng and Duan, Zhen and Zhao, Shu and Chen, Jie and Zhang, Yanping},
journal = {Information Retrieval Journal},
year = {2021},
type = {Journal Article}
}