play0137/Chinese_ConceptNet

CC BY-SA 4.0

Overview

This dataset is a refined and expanded version of Chinese ConceptNet.
The original ConceptNet dataset is available at commonsense/conceptnet5.

ConceptNet collects commonsense knowledge through crowdsourcing from volunteer web users around the world.
It covers a wide range of real-world domains and can be applied to tasks such as analogy, commonsense reasoning, and natural language understanding.

Crowdsourced knowledge tends to be noisy, redundant, or meaningless, especially in unguided projects that rely on unsupervised voluntary participants. We therefore refined ConceptNet to reduce its error rate.
At the same time, we improved its quality, by which we mean correctness, coverage, and number of concepts.

Please check the reports for more information.

Assertion examples

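| ID | Start | End | Relation | SurfaceText | Weight |
| --- | --- | --- | --- | --- | --- |
| 438595 | 水銀 | 有毒 | HasProperty | [[水銀]] 是 [[有毒]] 的。 | 1.0 |
| 183195 | 上課 | 睏 | Causes | [[上課]] 會令人 [[睏]]。 | 1.0 |
| 241669 | 魚 | 游泳 | CapableOf | [[魚]] 會 [[游泳]]。 | 1.0 |
| 154481 | 出門 | 上學 | MotivatedByGoal | [[出門]] 是為了 [[上學]]。 | 1.0 |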
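
The columns above map naturally onto a small record type, and the `[[...]]` markers in SurfaceText can be pulled out with a regular expression. The following is a minimal sketch only: the `Assertion` class, `concepts_in`, and `plain_sentence` are hypothetical names introduced for illustration, and the on-disk format of the dataset files is not described here, so the example builds a record by hand instead of reading a file.

```python
import re
from dataclasses import dataclass
from typing import List

# Matches each [[concept]] span in a SurfaceText string (non-greedy).
CONCEPT_PATTERN = re.compile(r"\[\[(.+?)\]\]")


@dataclass(frozen=True)
class Assertion:
    """Hypothetical record mirroring the columns of the example table."""
    id: int
    start: str
    end: str
    relation: str
    surface_text: str
    weight: float


def concepts_in(surface_text: str) -> List[str]:
    """Return the bracketed concepts in the order they appear."""
    return CONCEPT_PATTERN.findall(surface_text)


def plain_sentence(surface_text: str) -> str:
    """Strip the [[ ]] markers, leaving the sentence itself."""
    return surface_text.replace("[[", "").replace("]]", "")


# Row taken from the example table; constructed by hand, not loaded from disk.
example = Assertion(438595, "水銀", "有毒", "HasProperty", "[[水銀]] 是 [[有毒]] 的。", 1.0)

print(concepts_in(example.surface_text))     # ['水銀', '有毒']
print(plain_sentence(example.surface_text))  # 水銀 是 有毒 的。
```

Comparing `concepts_in(a.surface_text)` with `[a.start, a.end]` is one simple consistency check a user might run on loaded assertions.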

References

If you use this version of Chinese ConceptNet in your work, please cite this thesis:

Ying-Ren Chen (2021). Generate coherent text using semantic embedding, common sense templates and Monte-Carlo tree search methods (Master's thesis, National Tsing Hua University, Hsinchu, Taiwan).

BibTeX:

@mastersthesis{Chen:2021:generate_coherent_text,
     author = "Ying-Ren Chen",
     title = "Generate coherent text using semantic embedding, common sense templates and Monte-Carlo tree search methods",
     school = "National Tsing Hua University",
     pages = 136,
     year = 2021
}

The data cleaning of Chinese ConceptNet is described in Chapters 3.1 and 4.1 and in Appendices A and B.

Please also cite the ConceptNet paper:

Robyn Speer, Joshua Chin, and Catherine Havasi. 2017. "ConceptNet 5.5: An Open Multilingual Graph of General Knowledge." In Proceedings of AAAI 31.

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
