Skip to content

Releases: sola-st/WasmBench

Dataset of WebAssembly Binaries and Metadata

01 Dec 19:22
Compare
Choose a tag to compare

We offer the following two variants of the data set, including the WebAssembly binaries and metadata about them. If you are only interested in metadata (e.g., number of binaries, file sizes, language extensions used etc.) and not the raw binaries, you can get the metadata only from https://github.com/sola-st/WasmBench/tree/main/dataset-metadata.

  • all-binaries-metadata.7z contains the full set of 23,413 WebAssembly binaries before filtering, but after deduplication. The archive contains a directory with all .wasm files, where the filename is the SHA256 hash of the contents. Unpacked size is 6497 MiB.
  • filtered-binaries-metadata.7z contains the filtered set of 8,461 unique WebAssembly binaries. Unpacked size is 5611 MiB.

Both archives are packed with 7-zip and can be uncompressed, e.g., on Linux with 7z x archive.7z. Besides the binaries, they also contain the metadata in the same format as described in https://github.com/sola-st/WasmBench/tree/main/dataset-metadata.