Skip to content

Latest commit

 

History

History
17 lines (14 loc) · 1.31 KB

README.md

File metadata and controls

17 lines (14 loc) · 1.31 KB

Quantifying-Character-Similarity

arXiv

This repo includes

  • Tutorials: notebook
  • Homoglyphs Dictionaries: Simplified Chinese, Traditional Chinese, Japanese, Korean.
  • CJK Fonts are here.
  • Scripts for japanese supply chain matching
  • Scripts for Synthetic Chinese, Japanese, Korean placenames matching
  • Scripts for ViT training and Inference for homoglyphs dicts: coming soon...
  • Scripts for ViT training and Inference for Ancient Chinese: coming soon...

For the Ancient Chinese Character Dataset: