Analytics reporting for MediaWiki edit conflicts and the TwoColConflict extension.
reports: SWAP notebooks to explore the data.
src: Scala Spark code to refine EventLogging traces into a more usable form.
Build the jar:
sbt package
Process data:
spark2-submit \
--class BuildConflictMetadataApp \
--master yarn \
--executor-memory 8G --executor-cores 4 --driver-memory 16G \
--conf spark.dynamicAllocation.maxExecutors=64 \
./target/scala-2.11/edit-conflict-analytics_2.11-0.2.jar
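The jar path passed to spark2-submit follows sbt's default artifact naming: target/scala-<binary version>/<name>_<binary version>-<version>.jar. The following is a minimal sketch, assuming the project name, Scala binary version, and project version that appear in the command above. The jar_path helper and the JAR variable are hypothetical and not part of this repository. It derives the path and checks that the jar exists before you submit the job:

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository): compose the default
# output path of `sbt package`.
# Arguments: project name, Scala binary version, project version.
jar_path() {
    printf './target/scala-%s/%s_%s-%s.jar\n' "$2" "$1" "$2" "$3"
}

# Values assumed from the spark2-submit command in this README.
JAR="$(jar_path edit-conflict-analytics 2.11 0.2)"

# Only report whether the jar is present; submitting is left to the
# spark2-submit command shown above.
if [ -f "$JAR" ]; then
    echo "found $JAR"
else
    echo "missing $JAR (run 'sbt package' first)"
fi
```

If the project version or Scala version changes, the jar name changes with it, so the last argument of spark2-submit needs to be updated to match.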