Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support cos similarity match & bug fix #26

Merged
merged 2 commits into from
Dec 8, 2018
Merged

Conversation

MingliangT
Copy link
Contributor

@MingliangT MingliangT commented Dec 8, 2018

  • 支持余弦相似度匹配,test 对比多种 cos 算法,譬如 tf-idf + cos、忽略权重 cos、计数权重 cos、、simhash、minhash 等,对比后效果最好的是 忽略权重 cos,第一个和第二个相似度差距明显。速度上和 chunk 差不多,一会快一点,一会慢一点,批量回放30+条记录 cos 快2-3s。准确性以发单接口为例,101个下流调用,忽略权重 cos 和 chunk 算法的结果一致,其他算法有个位数匹配失败。

  • readable 切分 fix bug

  • outboundBypassPorts 支持多个端口

  • evgarg 统一日志相关初始化到 initLog 函数中

  • 原来 chunk 算法保持不变

@taowen taowen merged commit 64d70c8 into v2pro:master Dec 8, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants