ParSoDA (Parallel Social Data Analytics) is a Java library for social media analytics


ParSoDA (Parallel Social Data Analytics) is a Java programming library that simplifies the development of parallel social media mining applications executed on High Performance Computing systems. ParSoDA defines a general framework for a social media analysis application that includes seven steps (data acquisition, filtering, mapping, partitioning, reduction, analysis, and visualization), and provides a predefined (but extensible) set of functions for each data processing step. An application developed with ParSoDA is therefore expressed as concise code that specifies the functions invoked at each step. User applications based on the ParSoDA library can be run on both Apache Hadoop and Spark clusters. The current version of the library (v. 1.3.0 dated October 25, 2018) contains more than forty predefined functions organized in seven packages, corresponding to the seven ParSoDA steps.
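To make the seven-step structure concrete, the following is a minimal sketch of the same sequence of steps written with plain Java streams. It does not use the ParSoDA API and does not run on Hadoop or Spark: the class `SevenStepSketch`, the `Post` type, the sample data, and every method name are hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;
import java.util.stream.Collectors;

// Illustrative only: NOT the ParSoDA API. All names here are hypothetical.
public class SevenStepSketch {

    /** Hypothetical stand-in for one social media item. */
    static final class Post {
        final String user;
        final String text;
        Post(String user, String text) { this.user = user; this.text = text; }
    }

    // Step 1, data acquisition: a fixed in-memory list instead of a real crawler.
    static List<Post> acquire() {
        List<Post> posts = new ArrayList<>();
        posts.add(new Post("ann", "Sunset at the #colosseum"));
        posts.add(new Post("bob", "Queue at the #colosseum again"));
        posts.add(new Post("ann", "Coffee near #trevi"));
        posts.add(new Post("cal", "No tags here"));
        return posts;
    }

    // Helper for the mapping step: extract lower-cased hashtags from a text.
    static List<String> hashtags(String text) {
        List<String> tags = new ArrayList<>();
        for (String token : text.split("\\s+")) {
            if (token.startsWith("#") && token.length() > 1) {
                tags.add(token.toLowerCase());
            }
        }
        return tags;
    }

    // Steps 2 to 5: filter, map, partition, and reduce in one stream pipeline.
    static Map<String, Long> countHashtags(List<Post> posts) {
        return posts.stream()
            .filter(p -> p.text.contains("#"))          // Step 2, filtering
            .flatMap(p -> hashtags(p.text).stream())    // Step 3, mapping
            .collect(Collectors.groupingBy(             // Step 4, partitioning by hashtag
                Function.identity(),
                TreeMap::new,
                Collectors.counting()));                // Step 5, reduction
    }

    // Step 6, analysis: pick the hashtag with the highest count.
    static String mostFrequent(Map<String, Long> counts) {
        return counts.entrySet().stream()
            .max(Map.Entry.comparingByValue())
            .map(Map.Entry::getKey)
            .orElse("");
    }

    // Step 7, visualization: print the results as plain text.
    public static void main(String[] args) {
        Map<String, Long> counts = countHashtags(acquire());
        counts.forEach((tag, n) -> System.out.println(tag + " " + n));
        System.out.println("most frequent: " + mostFrequent(counts));
    }
}
```

In this sketch each step is a single function, which mirrors the idea that an application is assembled by naming the function used at each step; how ParSoDA itself exposes those steps is defined by the library's own packages.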