
Data mining and knowledge discovery from social media: implementing data analysis methods on data collected from Twitter


itsoum/ML-DsUniPi


ML-DsUniPi

Data mining and knowledge discovery from social media:

Implementing data analysis methods on data collected from Twitter

About

This repository contains the source code of an attempt at:

  • collecting and storing tweets from Twitter's streaming API
  • clustering any corpus of tweets (unsupervised)
  • extracting the main topics of those tweets
  • measuring the effectiveness of my machine learning efforts.

I also developed a method that removes duplicate documents from the corpus (the retweet problem).
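
The README does not describe how the deduplication works, so the following is only a sketch of one simple approach, not the method used in this repository. It assumes that a retweet differs from its original only by an `RT @user:` prefix, shortened URLs, case, and punctuation; `normalize` and `drop_duplicates` are hypothetical names.

```python
import re

# Patterns for the parts of a tweet that typically differ between copies.
_RT_PREFIX = re.compile(r"^\s*rt\s+@\w+:?\s*", re.IGNORECASE)
_URL = re.compile(r"https?://\S+")
_NON_WORD = re.compile(r"[^\w\s]")


def normalize(text):
    """Reduce a tweet to a comparison key (assumed normalization rules)."""
    text = _RT_PREFIX.sub("", text)         # drop a leading "RT @user:"
    text = _URL.sub("", text)               # drop links, which vary per copy
    text = _NON_WORD.sub("", text.lower())  # drop punctuation, ignore case
    return " ".join(text.split())           # collapse whitespace


def drop_duplicates(tweets):
    """Keep the first tweet for each normalized key, preserving order."""
    seen = set()
    unique = []
    for tweet in tweets:
        key = normalize(tweet)
        if key not in seen:
            seen.add(key)
            unique.append(tweet)
    return unique


# Hypothetical sample data, for illustration only.
sample = [
    "Big news today! http://t.co/abc",
    "RT @bob: Big news today! http://t.co/xyz",
    "Something else",
]
unique = drop_duplicates(sample)
print(unique)
```

Exact matching on a normalized key misses retweets that were edited or truncated; a similarity threshold on TF-IDF vectors would be one way to catch those, at a higher cost.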

This material formed the implementation part of my Bachelor Dissertation.

Links

Bachelor Dissertation Page (GR)

Bachelor Dissertation Presentation (GR)

Requirements

SciKit-Learn

Python Twitter Tools (PTT)
