Skip to content

zweien/yuliao_pyqt

Repository files navigation

yuliao_pyqt

自制语料分析小工具,使用正则表达式提取关键词,PyQt5 制作 UI.

使用方法

  • Requirements

    • Python>=3.6
    • pyqt5>=5.12
    • pandas
  • 将所需语料以 txt 格式存储,编码须为 GB2312。

  • 有两种方式标识对话

    • 在文件开头标识,如
    1-法国
    2-中国
    
    1你好,我叫亚当。
    2你好,我叫张明。
    
    • 在每段对话行首标识,如
    1(法国)你好,我叫亚当。
    
    2(中国)你好,我叫张明。
    
    1很高兴认识你。
    2很高兴认识你。
    
    • 可以在国别后可加上姓名,便于统计,如1-法国-亚当
  • 命令行启动 python main.py

功能介绍

  • 搜索语料库出现词语
  • 包括国家/地区、剔除国家/地区
  • 修改语料库
  • 将搜索结果导出至 excel 文件
  • 按国家统计字数

Authors

yuliao_pyqt's authors are Zweien and Lei.

Licence

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/gpl-3.0.txt.

Change Log

  • 20191207
    • fix bug: 搜索得到结果后,再次搜索为空时,出错
    • add fun: 增加统计功能
    • add fun: plot bar
    • add fun: delete shortcut

About

语料分析小工具

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages