Draft APIv2 request and responses #1545
Some comments on this:
I agree here. The API should not support it, but the UI should have a converter to provide nice UX. If you are programmatically calling our API you are probably also able to create the standard json.
In my opinion, to make it unambiguous, we should add a "metadata" field. We would avoid errors in case of multiple metadata files and make it explicit that it is a required parameter. Then, the question would be if we pass the metadata either as pure JSON, or as string-encoded (which would be parsed by the server as it is now). I'm leaning more towards the second option as I see this endpoint mostly as a backward compatibility feature.
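For illustration, a request body under the second option might look roughly like this (a sketch only; the exact field names are assumptions, not the final spec):

{
  // string-encoded metadata.json, parsed server-side as it is now (assumed required field)
  "metadata": "{\"compiler\":{\"version\":\"0.8.12+commit.f00d7308\"}, ...}",
  // source files keyed by path, as in the existing "files" concept
  "files": {
    "contracts/Storage.sol": "// SPDX-License-Identifier: MIT\n..."
  }
}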
500 only if the server experiences an error while handling the request. So it should be a 200 if the server is able to query the status of the provided verificationId. I would also not use the 202 here in case it is pending, and rather return 200 for all statuses. The reason is that you query the status of the job, not the contract as a resource. Also, I would not return null in case the verification failed. Rather I would return "failed". I think we should avoid mixing types in the same field.
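As a sketch of what that could look like (the `status` field name and its values are assumptions):

// GET /verify/{verificationId} -> 200 for every job state
{
  "verificationId": "3fa85f64-5717-4562-b3fc-2c963f66afa6",
  "status": "failed" // "pending" | "success" | "failed", instead of null
}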
We can probably mostly orient ourselves by the Blockscout API. The biggest question here is whether we want to return the source code in the response, which will make the response payload quite big. However, in my opinion, it is easier to have just one endpoint that returns everything for a contract. We can still have a status endpoint that only returns the verification status of an address.
Some discussion points during our call:
One important question is what to return in the `GET /contract/{chainId}/{address}` endpoint. The response by default will contain the minimal response:

{
match: "exact_match" | "match" | null,
creationMatch: "exact_match" | "match" | null,
runtimeMatch: "exact_match" | "match" | null,
chainId: "11155111",
address: "0xfe34567890123456789012345678901234567890",
verifiedAt: "2024-07-24T12:00:00Z"
}

In addition, users can ask for other detailed fields in a query parameter. We decided to define each field ourselves and not return a 1-to-1 reflection of the DB schema. Now the tricky part is how to organize the bytecode information. We have the following data about the contract's bytecodes:
The question is how we organize this data in an understandable way. Considering the available information, here's my suggested output structure:

{
// minimal info, returned always, can't be omitted
match: "exact_match" | "match" | null,
creationMatch: "exact_match" | "match" | null,
runtimeMatch: "exact_match" | "match" | null,
chainId: "11155111",
address: "0xfe34567890123456789012345678901234567890",
verifiedAt: "2024-07-24T12:00:00Z",
// bytecode details
creationBytecode: { // not available if creationMatch is `null`
onchainBytecode: "0x608060405234801561001057...610150806100206000396000f3",
recompiledBytecode: "0x608060405234801561001057...610150806100206000396000f3",
// artifacts from the compiler output
sourceMap: "368:7042:14:-:0;;;;;;;;;;;;;;;;;;;",
linkReferences: {
"/home/home/dotfiles/poof-core/contracts/MerkleTreeWithHistory.sol": {
"Hasher": [
{
"start": 910,
"length": 20
},
],
},
},
cborAuxdata: {
"1": {
"value": "0xa26469706673582212201e80049ede18eadf4ab7f0dea2c32f2375c33b5aef0b1a16cc5223dbc681559364736f6c63430007060033",
"offset": 5471
}
},
transformations: [
{
"id": "sources/lib/MyLib.sol:MyLib",
"type": "replace",
"offset": 582,
"reason": "library"
},
{
"id": "0",
"type": "replace",
"offset": 1269,
"reason": "cborAuxdata"
},
{
"type": "insert",
"offset": 1322,
"reason": "constructorArguments"
}
],
transformationValues: {
"library": {
"sources/lib/MyLib.sol:MyLib": "0x40b70a4904fad0ff86f8c901b231eac759a0ebb0"
},
"constructorArguments": "0x00000000000000000000000085fe79b998509b77bf10a8bd4001d58475d29386",
"cborAuxdata": {
"0": "0xa26469706673582212201c37bb166aa1bc4777a7471cda1bbba7ef75600cd859180fa30d503673b99f0264736f6c63430008190033"
}
}
},
runtimeBytecode: { // not available if runtimeMatch is `null`
onchainBytecode: "0x608060405234801561001057...610150806100206000396000f3",
recompiledBytecode: "0x608060405234801561001057...610150806100206000396000f3",
// artifacts from the compiler output
sourceMap: "340:4705:14:-:0;;;;685:37..........59:1;4355;:5;;;;;;;4217:150;-1:-1:-1;;;4217:150:8:o;3136:155::-;3194:7;3226:1;3221;:6;;3213:49;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;-1:-1:-1;3279:5:8;;;3136:155::o",
linkReferences: {
"contracts/AmplificationUtils.sol": {
"AmplificationUtils": [
{
"start": 3078,
"length": 20
},
],
},
"contracts/SwapUtils.sol": {
"SwapUtils": [
{
"start": 2931,
"length": 20
},
],
},
},
cborAuxdata: {
"1": {
"value": "0xa2646970667358221220fc5c3958995f3020259e8da7b7cfc439316c92fecdab62bef66bd9e7b1c0582864736f6c634300060c0033",
"offset": 2026
}
},
immutableReferences: {
"1050": [
{
"start": 312,
"length": 32
},
{
"start": 2631,
"length": 32
}
]
},
// transformations applied to the recompiled bytecode to reach the onchain bytecode
transformations: [
{
"id": "20",
"type": "replace",
"offset": 137, // in bytes
"reason": "immutable"
},
{
"id": "0",
"type": "replace",
"offset": 1002,
"reason": "auxdata"
}
],
transformationValues: {
"immutables": {
"20": "0x000000000000000000000000b5d83c2436ad54046d57cd48c00d619d702f3814"
},
"cborAuxdata": {
"0": "0xa26469706673582212205817b060c918294cc8107069c4fd0a74dbfc35b0617214043887fe9d4e17a4a864736f6c634300081a0033"
}
}
},
deployment: {
transactionHash: "0xa0483d0b361b247883268883842e66b248952148898849884884884884884884",
blockNumber: 35453621,
transactionIndex: 2,
deployer: "0x1f9840a85d5aF5bf1D1762F925BDADdC4201F984",
},
compilation: {
sources: {
"contracts/Storage.sol": {
id: 0, // The AST identifier of the source
content: "// SPDX-License-Identifier: MIT\npragma solidity ^0.8.0;\n\ncontract Storage {\n uint256 number;\n\n function setNumber(uint256 newNumber) public {\n number = newNumber;\n }\n\n function getNumber() public view returns (uint256) {\n return number;\n }\n}\n",
},
"contracts/Owner.sol": {
id: 1,
content: "// SPDX-License-Identifier: MIT\npragma solidity ^0.8.0;\n\ncontract Owner {\n address public owner;\n\n constructor() {\n owner = msg.sender;\n }\n}\n"
}
},
language: "Solidity",
compilerVersion: "v0.8.12+commit.f00d7308",
compilerSettings: {},
name: "Storage",
fullyQualifiedName: "contracts/Storage.sol:Storage",
},
abi: [],
userDoc: {},
devDoc: {},
storageLayout: {},
metadata: {},
}
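As a usage sketch of the field selection (the query parameter name `fields` is an assumption, not the final spec):

// Default: only the minimal response above
GET /contract/11155111/0xfe34567890123456789012345678901234567890

// Explicitly opt in to the heavier fields
GET /contract/11155111/0xfe34567890123456789012345678901234567890?fields=creationBytecode,deployment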
Please do a final short review on the Draft here: https://sourcify.stoplight.io/docs/sourcify-apiv2/branches/main
Also, another big doubt: will we support …
To me, this looks good to get external feedback after solving Marco's questions.
I would have expected that the verification job is kept forever actually.
I like that
I agree that we can give more detailed error codes, but this was probably out of scope for this draft.
Very good point. I think either we disable them as read services, or things will get more complicated.

Lastly, a comment regarding the 500s returned from the verify endpoints: I don't think we should return 500s for compiler errors. Compilation should be part of the verification, and therefore it would be shown in the result of the job. I think it is fine to remove the 500s from this specification, as these would be thrown as fallbacks when we encounter bugs on our end. Not sure if we would be able to show custom codes in these cases. In the end, the verify endpoints should do some basic input validation and throw 400 errors if this fails. Otherwise they should just start the job and return the job id.
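Summarizing that behavior as a sketch (the payload shapes and the 202 for job creation are assumptions):

// POST /verify/... with malformed input -> 400
{ "error": "invalid_request", "message": "missing field: address" }

// POST /verify/... with valid input -> job started, e.g. 202
{ "verificationId": "3fa85f64-5717-4562-b3fc-2c963f66afa6" }

// Compiler errors surface in the job result instead of a 500:
// GET /verify/{verificationId} -> 200
{ "status": "failed", "error": "compilation_error" }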
I thought of the job as something ephemeral, because in my mind it was related to an entity in a queue (it gets in the queue, it's processed, then deleted). I see your point of keeping it as a database entity, but what are the pros? There is no additional information that we are not already storing.
It's just difficult to find a point in time when the job info is not needed anymore. Let's imagine someone verifies a contract via the Remix plugin, but closes Remix before the verification job status returned successfully. If this person only opens Remix one month later and the job info has been removed by then, the verification would be shown as failed in the plugin.
Are Etherscan and Blockscout keeping that information forever?
I checked Etherscan for a receipt id from one and a half months ago and it is still returned.
Ok, then I just had the wrong idea about that 😆
Let's keep the verificationJobs indefinitely, if not forever. The amount of data should be trivial. If it grows big, we can start deleting, e.g., jobs older than 1 year.
I actually think there should not be the concept of the minimal proxies in the DB. See #1624 and the VerA TG chat.
I'd say no:
Mayyybe we can return the …
For now let's keep it like that, but yes, the …
It'd be really difficult to maintain the APIv2 for a filesystem. For one, we have the whole concept of the … However, the recent use case that came up with the VSCode extension for Sourcify might be a good reason to have filesystem support. Still, I think this will overcomplicate things for a small use case, and we can start thinking about it maybe after we complete the APIv2 MVP.
Hi, I’ve reviewed the drafts and appreciate the efforts in the new API design, particularly the shift toward standard JSON. Here’s some feedback from a developer tooling perspective:

It would be fantastic if the API returned solc-compliant standard JSON input and output directly. This enhancement would be highly beneficial for developer tools. Many start with local development environments, directly consuming output from the Solidity compiler or tools like Foundry/Hardhat. As they expand to support remote environments, they must fetch source code from external providers, which introduces complexity. Developers often face challenges with unavailable data due to unverified contracts, network issues, or missing compilation flags for sourcemaps. Sometimes, we even need to recompile contracts to gather the necessary information.

Additionally, dealing with the differences between data from external sources like Sourcify or Blockscout and the standard JSON output from solc requires implementing compatibility layers. This process is resource-intensive due to the variety of APIs and the subtle differences in all their dialects that accumulate over time. It seems to me that standard JSON is really the smallest common denominator from which to start.

If Sourcify could provide solc-compliant output directly, it would address these issues by ensuring a consistent format across local and remote environments. This would eliminate the need for extensive compatibility layers, reducing development time and complexity, and would make dev tool integration with Sourcify way simpler. Just to give you a better feeling of what I mean, here is a small example:

// Response to: GET /contract/{chainId}/{address}
{
"verifyInput": {
// Standard JSON input that was used to verify the contract
// It's important that this input strictly follows the standard JSON schema
// and can be directly fed back into the compiler
},
"input": {
// Standard JSON input that was used to recompile the contract
// The input should contain additional flags for sourcemaps, and function debug information
// that is commonly needed by developer tools
},
"output": {
// Standard JSON output that was produced during recompilation
// including the recompiled bytecodes, source maps, and function debug information
},
// Everything below is additional information that cannot be computed from the input/output.
// Redundant information, such as listing recompiled bytecode should be avoided
"match": "match",
"address": "0xDFEBAd708F803af22e81044aD228Ff77C83C935c",
"chainId": "11155111",
"deployment": {
"deployer": "0xDFEBAd708F803af22e81044aD228Ff77C83C935c",
"blockNumber": 0,
"transactionHash": "0xb6ee9d528b336942dd70d3b41e2811be10a473776352009fd73f85604f5ed206",
"transactionIndex": 0
},
"verifiedAt": "2024-07-24T12:00:00Z",
"compilation": {
// No need to repeat sources, language here. It should be part of the standard json
"name": "MyContract",
"compilerVersion": "v0.8.12+commit.f00d7308",
"fullyQualifiedName": "contracts/MyContract.sol:MyContract"
},
"runtimeMatch": "match", // Should this be moved under runtimeBytecode?
"creationMatch": "match", // Should this be moved under creationBytecode?
"runtimeBytecode": {
// No need to repeat source map or recompiled bytecode here. It should be part of the standard json output
"onchainBytecode": "...",
"transformations": [ /* ... */],
"transformationValues": {
},
"creationBytecode": {
// No need to repeat source map or recompiled bytecode here. It should be part of the standard json output
"onchainBytecode": "...",
"transformations": [ /* ... */],
"transformationValues" []
}
}
@RaoulSchaffranek Thanks for the suggestion! I really liked the direction of the comment and agree with it. We also had a difficult time deciding how to organize the fields, and we hadn't thought about organizing them around the compiler's standard JSON. For me it's not totally clear what the differences might be between `verifyInput` and `input`. Also:
By additional flags, do you mean the `outputSelection`?
That would be great.

Yes, I meant the `outputSelection`. Another argument in favor of the std-json approach is that it ensures long-term alignment with the Solidity compiler output. If the solc team decides to add new fields to the format, it should be relatively straightforward to add them to the Sourcify response. This might become relevant very soon, when the compiler starts outputting the EthDebug annotations.
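For example (a sketch; the exact selectors available depend on the solc version), the recompilation input could simply extend the verified input's `outputSelection` with the artifacts tooling commonly needs:

"settings": {
  "outputSelection": {
    "*": {
      "*": [
        "abi",
        "metadata",
        "evm.bytecode.object",
        "evm.bytecode.sourceMap",
        "evm.bytecode.functionDebugData",
        "evm.deployedBytecode.object",
        "evm.deployedBytecode.sourceMap",
        "evm.deployedBytecode.functionDebugData"
      ]
    }
  }
}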
Ok, even though we don't save the standard JSON input itself, we will be extracting the fields from there without touching anything, except the `outputSelection`. But yes, point taken: the `input` and `output` in our response should strictly conform to the standard JSON format.

Just a small side note for the future us (and in case anyone is informed about this): the compatibility with the Vyper format. AFAIK the formats are fairly compatible, and the user can expect a different, Vyper-conforming format if the contract is a Vyper contract.
In accordance with the feedback from @RaoulSchaffranek, the updated response is below. Some considerations:

The modified response format:

{
// minimal info, returned always, can't be omitted
match: "exact_match" | "match" | null,
creationMatch: "exact_match" | "match" | null,
runtimeMatch: "exact_match" | "match" | null,
chainId: "11155111",
address: "0xfe34567890123456789012345678901234567890",
verifiedAt: "2024-07-24T12:00:00Z",
// The fields from the compiler's std json output
input: {
language: "Solidity",
sources: {
"contracts/Storage.sol": {
"keccak256": "0x1234567890123456789012345678901234567890123456789012345678901234",
content: "// SPDX-License-Identifier: MIT\npragma solidity ^0.8.0;\n\ncontract Storage {\n uint256 number;\n\n function setNumber(uint256 newNumber) public {\n number = newNumber;\n }\n\n function getNumber() public view returns (uint256) {\n return number;\n }\n}\n",
},
"contracts/Owner.sol": {
"keccak256": "0x1234567890123456789012345678901234567890123456789012345678901234",
content: "// SPDX-License-Identifier: MIT\npragma solidity ^0.8.0;\n\ncontract Owner {\n address public owner;\n\n constructor() {\n owner = msg.sender;\n }\n}\n"
}
},
settings: {...},
},
// the fields from the compiler's output json
output: {
sources: {
"contracts/Storage.sol": {
"id": 0, // The AST identifier of the source
},
"contracts/Owner.sol": {
"id": 1, // The AST identifier of the source
}
},
contracts: {
// Only the output for the compilation target contract here
"contracts/Storage.sol": {
"Storage": {
"abi": [],
"userDoc": {},
"devDoc": {},
"storageLayout": {},
"metadata": "{}", // serialised JSON metadata
"evm": {
// In Sourcify we refer to this more explicitly as "creation bytecode"
"bytecode": {
// The compiler's output does not have the 0x prefix.
"object": "608060405234801561001057...610150806100206000396000f3",
"sourceMap": "368:7042:14:-:0;;;;;;;;;;;;;;;;;;;",
"linkReferences": {
"/home/home/dotfiles/poof-core/contracts/MerkleTreeWithHistory.sol": {
"Hasher": [
{
"start": 910,
"length": 20
},
],
},
},
},
// In Sourcify we refer to this more explicitly as "runtime bytecode"
"deployedBytecode": {
"object": "608060405234801561001057...610150806100206000396000f3",
"sourceMap": "340:4705:14:-:0;;;;685:37..........59:1;4355;:5;;;;;;;4217:150;-1:-1:-1;;;4217:150:8:o;3136:155::-;3194:7;3226:1;3221;:6;;3213:49;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;-1:-1:-1;3279:5:8;;;3136:155::o",
"immutableReferences": {
"1050": [
{
"start": 312,
"length": 32
},
{
"start": 2631,
"length": 32
}
]
},
"linkReferences": {
"contracts/AmplificationUtils.sol": {
"AmplificationUtils": [
{
"start": 3078,
"length": 20
},
],
},
"contracts/SwapUtils.sol": {
"SwapUtils": [
{
"start": 2931,
"length": 20
},
],
},
},
}
}
}
}
},
// bytecode details
creationBytecode: { // not available if creationMatch is `null`
onchainBytecode: "0x608060405234801561001057...610150806100206000396000f3",
cborAuxdata: {
"1": {
"value": "0xa26469706673582212201e80049ede18eadf4ab7f0dea2c32f2375c33b5aef0b1a16cc5223dbc681559364736f6c63430007060033",
"offset": 5471
}
},
transformations: [
{
"id": "sources/lib/MyLib.sol:MyLib",
"type": "replace",
"offset": 582,
"reason": "library"
},
{
"id": "0",
"type": "replace",
"offset": 1269,
"reason": "cborAuxdata"
},
{
"type": "insert",
"offset": 1322,
"reason": "constructorArguments"
}
],
transformationValues: {
"library": {
"sources/lib/MyLib.sol:MyLib": "0x40b70a4904fad0ff86f8c901b231eac759a0ebb0"
},
"constructorArguments": "0x00000000000000000000000085fe79b998509b77bf10a8bd4001d58475d29386",
"cborAuxdata": {
"0": "0xa26469706673582212201c37bb166aa1bc4777a7471cda1bbba7ef75600cd859180fa30d503673b99f0264736f6c63430008190033"
}
}
},
runtimeBytecode: { // not available if runtimeMatch is `null`
onchainBytecode: "0x608060405234801561001057...610150806100206000396000f3",
// artifacts from the compiler output
cborAuxdata: {
"1": {
"value": "0xa2646970667358221220fc5c3958995f3020259e8da7b7cfc439316c92fecdab62bef66bd9e7b1c0582864736f6c634300060c0033",
"offset": 2026
}
},
// transformations applied to the recompiled bytecode to reach the onchain bytecode
transformations: [
{
"id": "20",
"type": "replace",
"offset": 137, // in bytes
"reason": "immutable"
},
{
"id": "0",
"type": "replace",
"offset": 1002,
"reason": "auxdata"
}
],
transformationValues: {
"immutables": {
"20": "0x000000000000000000000000b5d83c2436ad54046d57cd48c00d619d702f3814"
},
"cborAuxdata": {
"0": "0xa26469706673582212205817b060c918294cc8107069c4fd0a74dbfc35b0617214043887fe9d4e17a4a864736f6c634300081a0033"
}
}
},
deployment: {
transactionHash: "0xa0483d0b361b247883268883842e66b248952148898849884884884884884884",
blockNumber: 35453621,
transactionIndex: 2,
deployer: "0x1f9840a85d5aF5bf1D1762F925BDADdC4201F984",
},
compilation: {
compilerVersion: "v0.8.12+commit.f00d7308",
name: "Storage",
fullyQualifiedName: "contracts/Storage.sol:Storage",
},
// Should we have a separate abi field?
abi: [],
}
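To illustrate the round-trip this format enables for tooling, a minimal sketch assuming solc-js and the response above (the URL path is illustrative; the solc-js version must match `compilation.compilerVersion`, and error handling is omitted):

// Recompile a verified contract locally from the API response
const solc = require('solc')

fetch('https://sourcify.dev/server/contract/11155111/0xfe34567890123456789012345678901234567890') // illustrative URL
  .then((r) => r.json())
  .then((response) => {
    // `input` is valid standard JSON input, so it can be fed straight into the compiler
    const output = JSON.parse(solc.compile(JSON.stringify(response.input)))
    const [source, name] = response.compilation.fullyQualifiedName.split(':')
    console.log(output.contracts[source][name].abi)
  })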
Looking at the Vyper output, it seems fully compatible except:
But these should be fine, as these are expected to be in Vyper format if the contract is a Vyper contract. This is assuming the Vyper docs are up to date with these fields:
Would there be a difference between the top-level `abi` field and the `abi` nested under `output.contracts`?
Yes, that's a bit annoying and should be clearly documented. I think adding a code snippet to the docs here can go a long way. Something along the following lines:

const qualifiedName = response.compilation.fullyQualifiedName
const [source, name] = qualifiedName.split(':')
const abi = response.output.contracts[source][name].abi
I feel that these granular selection options are rarely used. Take the `outputSelection` of solc, for example.
We also don't have any hard data, but we often get asked how to get the contract ABI, so I'm assuming this is a use case. For the response filtering: both bandwidth and I/O. Bandwidth-wise, particularly the sources can make the payload large.
@acuarica Thanks for the input. Yes, we can include it as a …
Finalizing the updates from our call:
Please see https://sourcify.stoplight.io/docs/sourcify-apiv2/branches/main/cqx62aqefyrje-get-verified-contract
See #1470
Currently in the process of drafting the API via Stoplight: https://sourcify.stoplight.io/docs/sourcify-apiv2/branches/main
Open questions:
- For the /verify/metadata endpoint, should we have a separate "metadata" field (required) or just put everything in the "files" field?
- For the /verify/etherscan endpoint, should we make the API key optional or required? Optional meaning by default it uses our own API key. Keep in mind we are easily reaching the rate limits of the API key.
- What to return in the GET /verify/{verificationId} endpoint if the verification failed? Still 200 or 500 etc.?
- What to return in the GET /contract/{chainId}/{address} endpoint and in which format?

Action Items: