This application searches a large geo-coded Twitter dataset to identify tweet hotspots around Melbourne. Its key purpose is to experiment with parallel programming in an HPC environment and to geo-process big Twitter data. It is written in Python, with mpi4py as the key module. Run the application with:
mpiexec -n 8 python app.py
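The actual implementation lives in app.py in this repository. Purely for orientation, below is a minimal mpi4py sketch of the general pattern (partition the tweet file across ranks, count points per grid cell, reduce onto rank 0). The input file name, the JSON fields and the grid-cell rounding are assumptions, not the real implementation.

```python
# sketch.py - illustrative mpi4py pattern only; see app.py for the real code
import json
from collections import Counter
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()

counts = Counter()
with open('tweets.json') as f:              # hypothetical input file name
    for i, line in enumerate(f):
        if i % size != rank:                # round-robin split of lines across ranks
            continue
        try:
            tweet = json.loads(line.rstrip(',\n'))
            x, y = tweet['coordinates']['coordinates']   # assumed GeoJSON point layout
        except (ValueError, KeyError, TypeError):
            continue
        # map the point to a coarse grid cell (placeholder for the real grid lookup)
        counts[(round(x, 1), round(y, 1))] += 1

# combine per-rank counters on rank 0 and report the hottest cells
totals = comm.reduce(counts, op=MPI.SUM, root=0)
if rank == 0:
    for cell, n in totals.most_common(5):
        print(cell, n)
```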
To run on the SPARTAN HPC cluster, submit one of the job scripts:
sbatch job_1n1c.sh
sbatch job_1n8c.sh
sbatch job_2n8c.sh
sbatch job_2n8c-sym.sh
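The actual job scripts are in the repository. As a rough sketch of what a script like job_1n8c.sh typically contains (the module name and walltime here are assumptions and will differ on Spartan):

```bash
#!/bin/bash
# Sketch of a 1-node, 8-core SLURM job (illustrative; see job_1n8c.sh for the real script)
#SBATCH --nodes=1
#SBATCH --ntasks=8
#SBATCH --time=0-00:10:00
#SBATCH --job-name=tweet-hotspots

# assumed module name; check `module avail` on Spartan for the correct one
module load Python/3.5.2-GCC-6.2.0

mpiexec -n 8 python app.py
```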
NOTE: the difference between job_2n8c.sh and job_2n8c-sym.sh is that the latter (-sym) ensures 4 cores per node by using --ntasks-per-node=4, making the task placement symmetrical across the two nodes, as illustrated below.
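For example, the relevant SLURM directives in the two scripts would look roughly like this (a sketch, assuming the rest of each script matches the other job files):

```bash
# job_2n8c.sh     - 2 nodes, 8 tasks, SLURM free to place them unevenly
#SBATCH --nodes=2
#SBATCH --ntasks=8

# job_2n8c-sym.sh - 2 nodes, 4 tasks pinned to each node, i.e. a symmetrical layout
#SBATCH --nodes=2
#SBATCH --ntasks-per-node=4
```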
Utility scripts are also included: env.sh prepares the input data and the environment setup on the cluster, and clean.sh clears outputs and log files between consecutive runs if desired.
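As a rough sketch (the real scripts are in the repository and may differ), clean.sh might simply remove the generated artifacts so the next run starts from a clean state:

```bash
#!/bin/bash
# Sketch only - the actual clean.sh may differ.
# Remove SLURM logs and previous outputs before the next run.
rm -f slurm-*.out
rm -f output/*.txt   # assumed output location
```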
To monitor or cancel a submitted job:
squeue -u [your_username]
scontrol show jobid -dd [your_job_id]
scancel [your_job_id]
This work was done for the COMP90024 Cluster and Cloud Computing assignment 1 assessment, 2017 SM1, The University of Melbourne. You can read the report for background context, though it focuses more on the data that I worked with. You may also want to read the related tutorials mpi4py-tute and mpjexpress-tute. The implementation still has room for improvement. You may wish to cite this work as follows.
LaTeX/BibTeX:
@misc{sanl1,
  author = {Lin, San Kho},
  title = {Tweet Hotspots - HPC Twitter GeoProcessing},
  year = {2017},
  url = {https://github.com/victorskl/tweet-hotspots},
  urldate = {yyyy-mm-dd}
}