coderrect-inc/sec3-solana-txs

With financial backing from the Solana Foundation, Sec3 collaborated with SolanaFM to curate this dataset of parsed transactions from the Solana Mainnet Beta. The dataset covers non-voting transactions for September 2023, with instruction data decoded into a readable form where possible. Voting transactions are excluded to conserve storage space.

Update (May 29, 2024)

  • Fixed the issue of incorrect inner instruction order in some transactions.
  • Stored each transaction as a file, facilitating the retrieval of individual transactions.
  • Marked bigint and BigNumber values for easier processing of numerical values.

Data Range

The data contains transactions from slot 214,824,000 (8/31/2023 at 23:55:16 UTC) to 220,802,999 (10/1/2023 at 00:01:42 UTC).

Schema

Each zip file contains the transactions in a group of approximately 1,000 slots. An example of a directory within a zip file is:

214824000
├── 214824000
│   ├── 2GPWc78mVoQTYCSY8Bu8SFLmSNpwJDXrWuxqHtXGBMCMUET47F4tuRLz2LSA6nrU6k5udyRMkBurEEbnShLYieW.json
│   ├── 5csTNgtebRZJAYiP9tp5Tysno6FELEgAxeCidjeWWVxoQq54tjoS1bBA8Uh5QL4vwA9x7CPAGDHUKbx3FLNHwVu7.json
│   └── ...
├── 214824001
│   ├── 54odvSJSeZ32Q7gMGooc9YMGrp47JQ3PCNeznq5oMQ3iusn6LrXcFhPo2Z7g2CNkDati4CDm2xD5YfT2ewSxDG5R.json
│   └── ...
├── 214824002
│   └── ...
└── ...
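
The layout above can be read without unpacking the archive. The following is a minimal sketch, assuming every entry follows the `<group>/<slot>/<signature>.json` pattern shown in the tree; `iter_transactions` is a hypothetical helper name, not part of this repository.

```python
import json
import zipfile
from pathlib import PurePosixPath


def iter_transactions(zip_path, slot=None):
    """Yield (slot, signature, transaction) for each JSON file in the archive."""
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            path = PurePosixPath(name)
            if path.suffix != ".json":
                continue  # skip directory entries and anything unexpected
            # Assumed layout: <group>/<slot>/<signature>.json
            entry_slot = int(path.parent.name)
            if slot is not None and entry_slot != slot:
                continue
            with archive.open(name) as handle:
                yield entry_slot, path.stem, json.load(handle)
```

Passing `slot=214824001` would restrict the iteration to the files under that slot's directory.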

The structure of each file follows the output of Solana's official RPC getTransaction method, with additional parsing of instruction data. If an instruction's data can be parsed, the instruction includes a parsed field with two subfields:

  • type: Specifies the instruction name.
  • info: Contains the arguments of the instruction.

If the instruction data cannot be parsed, the original data field is retained, containing base58 encoded data.

PublicKeys are stored as strings.

BigInt and BigNumber values are stored as {$bn: string}.

The index of the transactions in each slot is denoted as indexInSlot.
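
One way to consume these conventions is sketched below. It assumes the `$bn` string is a base-10 integer; the README does not state the base, so `base` is left as a parameter in case the values turn out to be hexadecimal. The function names are hypothetical.

```python
import json


def decode_bn(obj, base=10):
    """Replace {"$bn": "<digits>"} wrappers with Python ints."""
    # Assumption: the string is an integer literal in the given base.
    if set(obj) == {"$bn"} and isinstance(obj["$bn"], str):
        return int(obj["$bn"], base)
    return obj


def load_transaction(text, base=10):
    """Parse one transaction file's text, unwrapping $bn values."""
    # object_hook runs on every decoded JSON object, innermost first.
    return json.loads(text, object_hook=lambda obj: decode_bn(obj, base))


def describe_instruction(ix):
    """Return a short label for a parsed or unparsed instruction."""
    if "parsed" in ix:
        return f"parsed:{ix['parsed']['type']}"
    # Unparsed instructions keep the original base58-encoded data field.
    return f"raw:{ix['data']}"
```

Python integers have arbitrary precision, so large values are preserved exactly once unwrapped.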

Download instructions

Prerequisites

Before you begin the download process, please ensure you meet the following requirements:

  • Storage Requirements

The data is packaged in ZIP files. Each file covers about 1,000 slots of transactions and is about 250 MB in size. If you intend to download the entire dataset for the 30-day period, the total size will be approximately 1,704 GB. Please ensure you have sufficient disk space available.

Download

Individual Slots

If you need the transaction data for specific slots, you can download them directly based on their slot numbers.

  • Calculating File Name

Round down the slot number to the nearest thousand to determine the file name. For example, the data for slot number 214825123 would be stored in 214825000.zip.

  • Finding the URL

Once you have identified the file name, locate the corresponding complete URL (in https://) in the provided URL list.

  • Downloading

To download the file, fetch the URL directly. For example:

DOWNLOAD_URL=<Download URL>
curl --remote-name "${DOWNLOAD_URL}"
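
The file-name calculation and URL lookup described above can be sketched as follows. This is an illustration only: it assumes the URL list is the urls.txt file with one URL per line, and that each URL's final path segment is the ZIP file name (which `curl --remote-name` also relies on). Both function names are hypothetical.

```python
from pathlib import Path

SLOTS_PER_FILE = 1000  # each ZIP covers a group of about 1,000 slots


def zip_name_for_slot(slot):
    """Round the slot down to the nearest thousand and append .zip."""
    return f"{slot - slot % SLOTS_PER_FILE}.zip"


def find_url(slot, url_list="urls.txt"):
    """Return the first URL in the list whose path ends with the ZIP name."""
    target = "/" + zip_name_for_slot(slot)
    for line in Path(url_list).read_text().splitlines():
        url = line.strip()
        # Assumption: the file name is the last path segment of the URL.
        if url.split("?", 1)[0].endswith(target):
            return url
    return None
```

For slot 214825123, `zip_name_for_slot` returns `214825000.zip`, matching the example above; `find_url` returns `None` when no line in the list matches.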

Complete Dataset

If you require the full dataset:

  • Save the URL List

First, ensure you have the urls.txt file, which contains all the download URLs, saved to your local machine. This file can be found in the cloned repo or directly downloaded from GitHub.

  • Download the Dataset

Because the full dataset comprises 5979 ZIP files, we recommend a download tool or command that supports parallel downloads. The aria2 and curl commands below are provided as examples.

  • Using aria2

    aria2c \
      --input-file=urls.txt \
      --max-concurrent-downloads=5 \
      --continue
  • Using curl + xargs

    cat urls.txt | \
      xargs \
        --max-procs=5 \
        --max-args=1 \
        curl --remote-name
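
After a bulk download, it can help to check that no file was skipped. The sketch below derives the expected file names from the slot range stated under Data Range, assuming every group starts at a multiple of 1,000 and is named `<first slot>.zip`. It only checks that each file is present, not that it is complete or uncorrupted. The function names are hypothetical.

```python
from pathlib import Path

FIRST_SLOT = 214_824_000  # first slot in the dataset
LAST_SLOT = 220_802_999   # last slot in the dataset
SLOTS_PER_FILE = 1000


def expected_zip_names():
    """List the ZIP names for every 1,000-slot group in the data range."""
    starts = range(FIRST_SLOT, LAST_SLOT + 1, SLOTS_PER_FILE)
    return [f"{start}.zip" for start in starts]


def missing_zips(download_dir="."):
    """Return expected ZIP names that are not present in download_dir."""
    present = {p.name for p in Path(download_dir).glob("*.zip")}
    return [name for name in expected_zip_names() if name not in present]
```

Under these assumptions the expected list has 5979 entries, which agrees with the file count given above.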
