Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于数据标签的问题 #25

Closed
JieJayCao opened this issue May 11, 2022 · 1 comment
Closed

关于数据标签的问题 #25

JieJayCao opened this issue May 11, 2022 · 1 comment

Comments

@JieJayCao
Copy link

您好,非常感谢您复现了deep-packet。
在使用您preprocess.py文件时,我发现您对流量数据的标签按照应用(app)分类时只包括了Non-VPN,即

AIM chat

'aim_chat_3a': 0,
'aim_chat_3b': 0,
'aimchat1': 0,
'aimchat2': 0,

但是ISCXVPN2016原数据集中还包括了例如vpn_aim_chat,这一部分vpn数据是不考虑在app分类中吗?

非常期待您的回答

@munhouiani
Copy link
Owner

Hi,

在處理資料的時候我盡可能處理得跟 Paper 寫的一樣,在 section 4.2.1 Labeling Dataset 中是這麼說的:

For application identification, all pcap files labelled as a particular application which were collected during a nonVPN session, are aggregated into a single file.

因此我並沒有把 vpn 放到 application classification 的資料集中。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants