As remarked in ldbc/ldbc_snb_interactive_v1_impls#173:
Scaling the Interactive workload SF3000 is not trivial: the Hadoop-based Datagen breaks for SF1000+ data sets (with an NPE) and the old parameter generator has scalability issues (it's a single-threaded Python2 script – for SF1000, it already requires about a day to finish).
It would be worth trying to resolve that NPE. This would allow us to generate the data set and parameters for SF3000.