New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automatically learn the relationship of malware samples at scale using deep learning technique #22

Open
feuerchop opened this Issue Jan 16, 2018 · 5 comments

Comments

Projects
None yet
6 participants
@feuerchop

feuerchop commented Jan 16, 2018

The Holmes Project has recently acquired a large dataset of labeled malware artifacts, which can be used for deep learning based malware relationship mining. This labeled dataset of over 20k samples should be a big help for students attempting to do Malware Relationship Detection. Besides, as a result of the previous GSoC’18, we also have an efficient data model for the malware relationships. New potential GSoC students can immediately start with the machine learning part without concerns for optimal data modeling and distributed storage. As a follow-up project, students are expected to come up with decent learning model to detect malware relationship and create better visualisation frontend.

@pabitralenka

This comment has been minimized.

pabitralenka commented Jan 25, 2018

Hi @feuerchop . I am a senior undergraduate at IIIT Bhubaneswar, India. I am interested to contribute to The Holmes Project. It would be great if you can let me know more about the dataset and a bit of concrete explanation on this issue. This will really help me to get a good start.

Thank you

@ayushb666

This comment has been minimized.

ayushb666 commented Feb 14, 2018

Hi @feuerchop , I am a master student at University of Minnesota, USA. I am interested to contribute to The Holmes project as part of GSOC 2018. Can you please let me know more about the dataset and what things needed to be done.

Thanks

@cli0

This comment has been minimized.

Contributor

cli0 commented Feb 14, 2018

Hey @ayushb666 and @pabitralenka , I would suggest you guys join the Honeynet GSoC Slack chatroom https://gsoc-slack.honeynet.org/ and the #holmesprocessing channel (the project's channel). This way we can answer all your questions somewhere all interested students can easily access. Virtually all students so far have had this same question. Fyi, the mentor is on slack too :)

@ritwikagarwal

This comment has been minimized.

ritwikagarwal commented Mar 26, 2018

I can't see any dataset or any mockup of the data-model provided by the Holmes project.
Anyway I have written my proposal keeping in view the standard format of data used for machine-learning projects and assignments.Can you have a look at it.

@shelldragoon1104

This comment has been minimized.

shelldragoon1104 commented Nov 9, 2018

Hi @feuerchop . I am a second year undergraduate at LNMIIT, India. I am interested to contribute to The Holmes Project as part of GSOC 2019. Can you let me know more about the dataset and a bit of concrete explanation on this issue. This will really help me to get a good start.

Thank you

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment