Real-time-images-detection-and-hashtags-generation-for-tweets

In this project, we use PySpark and PyTorch to build a stream-processing pipeline that fetches images from the Twitter API and performs real-time image detection and hashtag generation for tweets.

Platform

We trained all models on Google Cloud Platform (GCP) using an NVIDIA Tesla V100 GPU.
To combine PySpark and PyTorch, we implemented the system on Databricks running on AWS, which provides an environment where both can run.

Code

Model training

collect_images.ipynb: Collects images from the Twitter API for semi-supervised learning.
train_model.ipynb: Trains a ResNet50 model.

Stream Processing

sender.ipynb: Creates a TCP connection and requests images with specific hashtags from the Twitter API.
spark_receiver.ipynb: Receives a DStream from the sender and processes the data to predict a label for each image. It then stores 4 images per window as one npy file for display.
display.ipynb: Displays the results.
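The hand-off between the sender and the receiver can be illustrated with a minimal sketch. It assumes the sender listens on a TCP port and writes one newline-terminated text record per image, which is the kind of source a Spark Streaming socket text stream connects to. The names `serve_lines` and `read_lines`, the example URLs, and the use of a plain client socket in place of Spark are all hypothetical and are not taken from the notebooks.

```python
import socket
import threading


def serve_lines(server_sock, lines):
    """Accept one client and send each item as a newline-terminated UTF-8 line."""
    conn, _addr = server_sock.accept()
    with conn:
        for line in lines:
            conn.sendall((line + "\n").encode("utf-8"))


def read_lines(host, port):
    """Stand-in for the streaming receiver: connect and read lines until EOF."""
    with socket.create_connection((host, port)) as sock:
        with sock.makefile("r", encoding="utf-8") as reader:
            return [line.rstrip("\n") for line in reader]


# Hypothetical image URLs standing in for results from the Twitter API.
image_urls = [
    "https://example.com/img/1.jpg",
    "https://example.com/img/2.jpg",
    "https://example.com/img/3.jpg",
]

server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
server.bind(("127.0.0.1", 0))  # port 0 lets the OS pick a free port
server.listen(1)
port = server.getsockname()[1]

# The sender runs in a background thread; the receiver connects to it.
sender = threading.Thread(target=serve_lines, args=(server, image_urls))
sender.start()
received = read_lines("127.0.0.1", port)
sender.join()
server.close()
print(received)
```

In the real pipeline the receiving side would be the Spark streaming job rather than `read_lines`; the sender side would look much the same.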
Run sender and spark_receiver first. Once they have received images and started processing, run display, which shows the images by reading the npy files in the folder. WARNING: display deletes all npy files in order to display in real time. If you want to keep all results, do not run this notebook.
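The read-then-delete behaviour described in the warning can be sketched roughly as follows. This is an illustration under assumptions, not the code in display.ipynb: the function name `consume_npy_files`, the file names, and the choice to delete each file right after loading it are hypothetical. The loader is passed in as a parameter so the sketch runs without NumPy; in practice it would likely be `numpy.load`.

```python
import tempfile
from pathlib import Path


def consume_npy_files(folder, load):
    """Load every .npy file in `folder` in name order, deleting each after loading."""
    results = []
    for path in sorted(Path(folder).glob("*.npy")):
        results.append((path.name, load(path)))
        path.unlink()  # destructive: the file is gone once it has been read
    return results


with tempfile.TemporaryDirectory() as tmp:
    # Placeholder files standing in for the per-window npy outputs.
    for i in range(2):
        (Path(tmp) / f"window_{i}.npy").write_bytes(b"placeholder")

    # Stand-in loader that just reads raw bytes.
    shown = consume_npy_files(tmp, load=lambda p: p.read_bytes())
    remaining = list(Path(tmp).glob("*.npy"))

print([name for name, _ in shown], remaining)
```

To keep results while still using a display loop like this, one option is to copy the npy files to a separate folder before the display step reads them.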

Demo

Implementation details

For all implementation details and analysis, please see our report (Project Report.pdf).

cjdsj/Real-time-images-detection-and-hashtags-generation-for-tweets
