Skip to content

A distributed tweets analysing system on cloud. Fetching tweets from official API, store to CouchDB then show on web after analysing.

Notifications You must be signed in to change notification settings

leo-texuanw/tweets-analyser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cluster_Assign2

Overview

This is a project for assignment 2 of Cluster and Cloud Computing, semester one, 2018, The University of Melbourne.
Here is a link to a introduction video of this project.

Components

  • app/ - the front-end website to visulize this application.
  • crawler/ - the application to crawl raw data from AURIN API and the Internet (Twitter API).
  • database/ - the application to store data (CouchDB, even Hadoop/Spark can be considered).
  • analyser/ - the application for analysing data.
  • deploy/ - for autimate deployment

Authors

DEADLINE - 10th May 1 pm

  • [* ] Twitter harvesting application, multiple instances are expected (AURIN, AURIN openAPI).
  • [* ] CouchDB Database, by using MapReduce (single node or cluster)
  • [* ] A range of analytic scenarios, must support sentiment analysis
  • [* ] A ReSTful front-end web application for visulising these data sets/scenarios (25% for these above todos)
  • [* ] (10%) Proper handling of the errors and removal of duplicate tweets.
  • [* ] (25%) Dynamic deployment, using Ansible.
  • [* ] (20%) Detailed documentation on the system architecture and design
  • [* ] (20%) Collective Report, including pros and cons of the NeCTAR Research Cloud and suppoting twitter data analytics, more detail in Final packaging and delivery part of assignment pdf. (20-25 pages).
  • [* ] A video of our system to be uploaded to YouTube.

Some notice

  • Details about each part will be updated in the README.md file of each part.
  • We can also use OpenStack, Docker, Hadoop, Spark or any pre-existing software sysmtems. Include sentiment analysis libraries, gender identification libraries, and machine learning systems as well as front-end Javascript libraries and visualisation capabilities, e.g. Googlemaps.
  • To record all the problems and challenges encountered during thi project are prefered, to make it easier for our final report.

About

A distributed tweets analysing system on cloud. Fetching tweets from official API, store to CouchDB then show on web after analysing.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •