Roadmap update for TUF support #5247

LucidOne · 2019-01-04T15:46:09Z

Is it possible to get an update on the development roadmap about when TUF or other encryption support might be deployed? Thanks!

Also, it appears that tomorrow will be 6 years since January 5, 2013.

nealmcb · 2019-02-19T01:22:44Z

Thanks - good question.
I note that TUF is one of the options noted under Add support for API keys · Issue #994 · pypa/warehouse which appears under the "Cool but not urgent" milestone.
I agree that TUF addresses many of the nicely described PyPI-specific concerns that @dstufft wrote way back on 23 July 2013: Why Package Signing is not the Holy Grail

LucidOne · 2019-02-19T02:43:12Z

I believe that the package signing issues stated can be resolved and in 2019 our internet depends on the security of infrastructure such as PyPI. If there is not a roadmap for TUF support, I'm going to look at solving the problem.

MyNameIsCosmo · 2019-02-19T03:28:57Z

Perhaps we can evaluate integration with third-party services for verification?
Keybase.io helps create trust by building a verified profile attached to a GPG key across multiple services.
GitHub handles gpg signature verification for commits, refs, releases, etc. (although, your GH pgp key is kept in your GH settings... 2fa could help mitigate unauthorized account access)
Falling back, you can have a PGP key on MIT's PGP keybase which forwards a public key to key servers around the world.

Now, this does assume putting trust in third-party services themselves (and the author of the actual PGP key), but the chances are very slim that your PGP key would be replaced on multiple services at once. The biggest issue is an author being unable to secure their private key (no password, weak password, bad private key handling, etc...), or their account becoming compromised (e.g. GitHub).

This doesn't solve the problem on PyPi's side of verifying someone, although PyPi shouldn't have to verify anyone. PyPi should host the package. It should display information about the package being signed (or unsigned), the package's origins and contents, and whoever is downloading the package should check that key against their trusted sources.
In the past, PyPi has hosted malicious code. Of course, this happens. It is expected. Other package managers suffer as well.

I'm sure we all can go on about this.
Sure, PGP signing doesn't guarantee security. It doesn't guarantee verification. It doesn't even guarantee safe, robust, reliable, ... code. PGP is just a part of a solution to of a complex problem.
As signing becomes supported by major services (package maintainers, source code repositories, document services, ...), a trust network (keybase, key servers, ...) can be utilized to help security-minded users track where their packages are coming from.

If a user or a maintainer doesn't care about signing, it doesn't have to be enforced at the beginning, If the feature is there, people will use it.

Edit:
A great talk - How Much Do You Trust That Package?

LucidOne · 2019-02-19T04:39:46Z

I am convinced that this problem is tractable and I think the key here is that Eigen Trust can provide us a mathematical model to get started.

We already have some data in the git commit history. When someone signs a git commit there is an eigentrust metric that can be calculated for all of the previous signed commits. There is reason to believe that causality and techniques like Bayesian inference may also be useful here where multiple commits are signed by a key we do not yet trust, but are temporally followed by a key we do trust. We can also federate our memories of git history and checksums to detect anomalies and attacks on infrastructure.

Sites like github and sr.ht can already provide trust metrics by validating that an email address is connected to a GPG key. We can formalize and automate our existing manual heuristics for validating packages.

Further I think we should start thinking about metadata for developers and organizations that produce software. Organizations can maintain a META-DEV repository that contains the PGP signing keys of active developers, key revocation information, and per repository release manager designation. Developers can also maintain a META-DEV repository that contains yaml (or whatever) metadata linking to a developers twitter / blog / instagram / keybase / identi.ca / matrix / Mastodon / xmpp / whatever where the PGP fingerprint can be posted.

Perhaps we need to start building keyrings in OS native packaging formats (.deb, .rpm, etc) so that trust can at least be established for the most critical python packages.

This is a complex problem but we need to take what small steps forward that we can instead of waiting another year to secure PyPI or even bother to figure out certificate pinning.

brainwane · 2019-03-22T15:49:53Z

Thank you to everyone who's raised this issue and shared their thoughts and useful resources! And sorry for the slow response!

Short answer: we'll be discussing TUF & Warehouse much more in April.

Longer answer:

The folks working on Warehouse have gotten funding to concentrate on improving Warehouse's security, and have kicked off work (funded by the Open Technology Fund) towards multi-factor auth, API keys, and an audit trail. And -- to quote the blog post --

Facebook... has provided the Python Software Foundation with a monetary gift that will be used to fund the development and deployment of enhanced security features to PyPI....

The PSF Packaging Working Group plans to use these funds to implement highly requested security features in PyPI such as cryptographic signing and verification of files uploaded and installed from the index. Additionally, systems for the automated detection of malicious uploads will lower the time to response and improve the resiliency of PyPI against attacks such as "pytosquatting".

This work will be undertaken in the second half of 2019 but planning will begin in the second quarter of the year.

We anticipate that in mid-April (so, basically within about a month) we'll be announcing a formal Request For Information to ask people to tell us about their interest in being contracted to do this work, and that part of that discussion will be further, more detailed conversations about whether TUF is the right tool for this job. So please watch for that, on this issue and on https://discuss.python.org/c/packaging .

(cc @pradyunsg since I think you're interested in this.)

JustinCappos · 2019-03-28T21:06:37Z

From the TUF side, we're very interested in moving this forward. Let us know what we can do to help!

LucidOne · 2019-04-03T04:24:04Z

https://motherboard.vice.com/en_us/article/pan9wn/hackers-hijacked-asus-software-updates-to-install-backdoors-on-thousands-of-computers

trishankatdatadog · 2019-04-03T16:03:48Z

Same, happy to help with this, just let us know how.

westurner · 2019-04-18T19:59:14Z

"PyPI security work: multifactor auth progress & help needed"
https://discuss.python.org/t/pypi-security-work-multifactor-auth-progress-help-needed/1042/10

brainwane · 2019-05-23T02:54:23Z

At PyCon sprints several people spoke about the potential future of TUF in Warehouse and Python packaging, and put notes at https://docs.google.com/document/d/1Wz2-ECkicJgAmQDxMFivWmU2ZunKvPZ2UfQ59zDGj7g/ .

trishankatdatadog · 2019-05-23T13:39:40Z

Thanks, Sumana! I'm happy to help with the design and coding for this project @lukpueh @awwad

ofek · 2019-05-23T13:51:11Z

I'll also devote whatever time is necessary to get this done

lukpueh · 2019-05-24T11:02:57Z

Thanks for putting together our notes, @brainwane! It was a pleasure meeting you guys at PyCon. @ewdurbin, any news on the RFI?

@ofek and @trishankatdatadog, your help will be very much appreciated.

brainwane · 2019-08-28T20:58:30Z

Please check out the newly posted Request for Interest regarding upcoming work implementing cryptographic signing and malware detection on PyPI.

Our current timeline:

Date	Milestone
August 28	Request for Information period opens.
September 18	Request for Information period closes.
September 23	Request for Proposal period opens.
October 16	Request for Proposal period closes.
October 29	Date proposals will have received a decision.
November 30	Contracts for accepted proposals should be finalized.
December 2	Contract work commences.

And then we intend to complete the project over a three to five month period, beginning December 2019.

We're hoping to get participation from potential participants and other experts in the discussion forum, especially about implementation questions, including which of the TUF PEPs (if either) to implement!

brainwane · 2019-09-26T00:35:32Z

See the PSF's new blog post & the open RFP. Later this year, PyPI wants to start:

Implementation of PEP 458 once accepted to add integration of The Update Framework to PyPI

Development of either a stand alone service or code in the Warehouse codebase to create, sign, serve, and handle caching concerns for TUF metadata

Development of necessary code in the Warehouse codebase to integrate TUF metadata and signing

This means we need to move PEP 458 from "Deferred" to "Accepted" status. Per @ewdurbin's guidance, this means we'll need to get PEP 458 revised, as necessary, to pin down specifics, such as key distribution (who, where, how many?) plus any technical choices that TUF leaves up to implementations. To revise PEP 458 and get it accepted, we'll need to collaborate with previous implementers and other experts.

Given the RFP timeline the latest we should get the PEP accepted is 2 December 2019, but I'd much prefer we get it accepted by mid-October.

trishankatdatadog · 2019-09-26T14:18:56Z

Thanks for the update, @brainwane!

@JustinCappos @lukpueh Ok, so we have our work ahead of us. I have work obligations to meet, but can devote whatever time I can for this. Let's plan ASAP.

ofek · 2019-09-26T14:20:23Z

Let me know if you need more assistance, I'd be glad to help!

Facebook Research has now funded implementation of cryptographic signing of packages on PyPI. Per pypi/warehouse#5247 (comment) this means that PEP 458 now moves out of Deferred status and into Draft status. Since the PEP was created, the BDFL-Delegate for PyPI-related PEPs has shifted, and Donald Stufft is now the Delegate.

brainwane · 2019-09-26T19:22:03Z

Now that the PEP is back in Draft status*, I think the next steps are for one or more of the PEP authors to:

refresh the References/Discussion headers on the PEP with relevant discussions from https://discuss.python.org/c/python-software-foundation/pypi-q4-rfi and https://discuss.python.org/t/prerequisites-vetoes-improving-packaging-security/2196 and anywhere else that the PEP was discussed
post to https://discuss.python.org/c/packaging about the PEP (and cross-post to pypa-dev and distutils-sig) to start a new discussion and, eventually, ask for acceptance

@dstufft is now the BDFL-Delegate for this PEP so it'll have to be the other authors (@trishankatdatadog, @vladimir-v-diaz, @JustinCappos) who push this forward. If we want to get any revisions done and get Donald to accept the PEP by mid-October then you should start the steps above in the next couple days, in my opinion.

* the version at python.org needs to be re-generated, but python/peps#1177 was accepted

brainwane · 2019-09-30T18:41:39Z

A few of us had a chat today and are working to update the PEP (python/peps#1178 is part of that), and one or more of the PEP authors will be reaching out to @ewdurbin with a few questions.

pradyunsg · 2019-09-30T19:07:55Z

@brainwane FYI - I'm happy to help with implementing functionality on the client side (i.e. pip) when we get to that point.

I think we'd want to create a tracking issue on pip's issue tracker to have implementation related discussions there, after the PEP is accepted (AFAICT how clients interact with TUF-enabled PyPI is covered by the PEP and would be discussed in the discussions on discuss.python.org).

trishankatdatadog · 2019-10-01T00:50:14Z

Hi @ewdurbin and @dstufft, we have a few questions for which we could use your help:

By "key distribution," do you mean who manages how many keys and where on the PyPI side? Do you also mean how package managers such as pip would determine which keys to trust in the first place?
What is the current deployment process for Warehouse? This will help us determine how to edit the PEP to discuss how to update the TUF-specific code in Warehouse.
What else would you like to see more about, or see changed in the PEP?

Thanks for your time!

trishankatdatadog · 2019-10-01T17:44:10Z

@ewdurbin @dstufft I have also sent you both an email about a conference call this week, if possible. Thanks!

brainwane · 2019-11-05T16:37:55Z

Current status: python/peps#1203 is awaiting review from @dstufft to revise PEP 458. After that, there needs to be a discussion on https://discuss.python.org/c/packaging to get the PEP from "Draft" to "Accepted".

In order to make implementation easier, Dustin wants to work towards implementing #726 (removing Test PyPI from our infrastructure will make key stuff far easier). @di will be speaking more on that in the relevant issue soon.

And, starting in December, @ewdurbin will be managing the contractors who will implement TUF on PyPI. Then the first big key ceremony will be in April at PyCon North America -- if you haven't put PyCon on your calendar yet, you probably should! Conference registration will open later this month.

brainwane · 2020-11-03T14:10:23Z

@jku has some related work people might want to give feedback on, in pip: pypa/pip#8585 and pypa/pip#9041 .

brainwane · 2021-01-28T22:00:05Z

I see #7488 (comment) mentions a few blockers that people are currently working on ("some bugs in the TUF reference implementation, namely missing roledb state when reloading the repository").

theupdateframework/tuf#574

theupdateframework/tuf#1045

theupdateframework/tuf#1048

A few people are working on those, including @sechkova and @lukpueh and @trishankatdatadog. I'm sure they would welcome help.

I believe all of this is still true except that theupdateframework/tuf#1045 is closed.

joshuagl · 2021-01-28T22:15:46Z

support updating individual metadata upon addition of target file theupdateframework/python-tuf#1048

We addressed all of the blockers for Warehouse integration of TUF mentioned above. The remaining, recently filed, issue in TUF is the addition of an abstract signing interface to support the use of signing keys stored in Hashicorp Vault.

That work is being discussed in theupdateframework/python-tuf#1263

brainwane · 2021-02-24T18:24:17Z

I'm having trouble following some of the twists and turns in the linked issues and pull requests, so please forgive my ignorance -- what is left in order to finalize TUF support on PyPI? Just theupdateframework/python-tuf#574 ?

(And then attention ought to move to pypa/pip#8585 to finish up the pip side, I believe.)

woodruffw · 2021-02-24T18:28:55Z

On the TUF side, abstract signer support is still needed. secure-systems-lab/securesystemslib#319 added it to SSLib, but I don't believe that work's been integrated into TUF itself yet. Once it is, I'll be able to continue work on the various Vault interfaces that Warehouse will use to sign metadata.

joshuagl · 2021-02-25T15:47:52Z

On the TUF side, abstract signer support is still needed. secure-systems-lab/securesystemslib#319 added it to SSLib, but I don't believe that work's been integrated into TUF itself yet.

Correct, though there's a PR which I'm planning to review next week theupdateframework/python-tuf#1272

westurner · 2021-03-02T06:33:27Z

It may not be necessary, but is there a milestone or a project board to collect the issues for this epic?

"Package signing & detection/verification" says "78%" complete, but milestones can't include issues from other repos?
https://github.com/pypa/warehouse/milestone/16

Project boards can reference issues from multiple repos.
https://github.com/pypa/warehouse/projects

It's not clear who would create and update a GH project board if even necessary for these issues

woodruffw · 2021-03-02T17:57:06Z

I think a project board would certainly help! I only have triage permissions on this repo, so @brainwane or someone else with more permissions might need to either grant me access or do it.

brainwane · 2021-07-02T18:22:25Z

sorry, I don't have time to look into this - @ewdurbin could you see about giving Will project board permissions for this repo? Thanks.

trishankatdatadog · 2021-07-02T23:36:08Z

Speaking of which, do we have updates about the integration? Have not heard updates in a while...

Cc @joshuagl @mnm678

abitrolly · 2021-08-31T08:31:00Z

Sorry for joining late at the party. I tried to understand how TUF compares to blockchain protection mechanisms against take over and tampering, and to me TUF claims seem misleading.

First TUF main page at https://theupdateframework.io/ claims it provides protection from repo take over.

The Update Framework (TUF) helps developers maintain the security of software update systems, providing protection even against attackers that compromise the repository or signing keys.

And then in https://theupdateframework.io/overview/#how-does-tuf-secure-updates it says this.

TUF identifies the updates, downloads them, and checks them against the metadata that it also downloads from the repository.

In the blockhain world that means that attacker can rehash the content of the repo, and trick clients that the signed content is legit, because clients don't even a copy of Merkle Tree hash to validate the repo at any point in history. How TUF is protects from that? Could someone explain it like I am five?

If the security (by the spec) is provided by offline keys and out-of-band keys distribution, then I don't see how that security can be implemented, or if it worths the complication. For example, https://fwupd.org/ distributes firmware updates for hardware on Linux, is simple and secure without TUF. If analogy with the blockchain is hard, this can be used as an alternative baseline.

JustinCappos · 2021-08-31T13:08:50Z

How this TUF protection is supposed to work? Could someone explain it like I am five?

@marina Moore ***@***.***> has put together a helpful set of blog posts ( https://ssl.engineering.nyu.edu/blog/ ) that use Santa Claus and Calvinball (from Calvin and Hobbes) as examples. Let us know if this helps. :)

abitrolly · 2021-08-31T13:31:47Z

@JustinCappos I am afraid that a distraction for 5 years olds, not an explanation really. :D

mnm678 · 2021-08-31T13:57:57Z

In the blockhain world that means that attacker can rehash the content of the repo, and trick clients that the signed content is legit, because clients don't even a copy of Merkle Tree hash to validate the repo at any point in history. How TUF is protects from that? Could someone explain it like I am five?

TUF and blockchains are based on different threat models. A blockchain uses decentralized nodes so that an attacker would have to compromise a lot of these nodes to gain control. However, it's not always practical to have a network of trusted nodes for software distribution, and it takes a lot of computation to do the proof-of-work necessary to add new items to a blockchain. TUF instead takes the existing package manager approach, and uses offline keys, revocation, and pinned keys to ensure that a compromise of the repository can be detected and recovered from. Using the blockchain analogy, TUF uses pinned root keys instead of a Merkle Tree hash to validate the state of the repository. This has the advantage that the pinned root keys remain valid through updates to the repository.

If the security (by the spec) is provided by offline keys and out-of-band keys distribution, then I don't see how that security can be implemented, or if it worths the complication. For example, https://fwupd.org/ distributes firmware updates for hardware on Linux, is simple and secure without TUF. If analogy with the blockchain is hard, this can be used as an alternative baseline.

This model has already been mostly adopted by PyPI, as well as many others, so it is certainly possible to implement. @trishankatdatadog can provide more insight into deploying TUF in production.

I don't know the details about https://fwupd.org/ specifically, but numerous supply chain security compromises of production systems occur because of a repository or key compromise. TUF mitigates these risks through the use of offline keys, threshold delegations, and namespacing.

JustinCappos · 2021-08-31T14:20:54Z

Also a blockchain doesn't deal with the problems of how do you figure out what to put on there and who can change it if it is wrong / stale / keys are compromised. TUF handles those cases.

…

On Tue, Aug 31, 2021 at 9:58 PM Marina Moore ***@***.***> wrote: In the blockhain world that means that attacker can rehash the content of the repo, and trick clients that the signed content is legit, because clients don't even a copy of Merkle Tree hash to validate the repo at any point in history. How TUF is protects from that? Could someone *explain it like I am five*? TUF and blockchains are based on different threat models. A blockchain uses decentralized nodes so that an attacker would have to compromise a lot of these nodes to gain control. However, it's not always practical to have a network of trusted nodes for software distribution, and it takes a lot of computation to do the proof-of-work necessary to add new items to a blockchain. TUF instead takes the existing package manager approach, and uses offline keys, revocation, and pinned keys to ensure that a compromise of the repository can be detected and recovered from. Using the blockchain analogy, TUF uses pinned root keys instead of a Merkle Tree hash to validate the state of the repository. This has the advantage that the pinned root keys remain valid through updates to the repository. If the security (by the spec) is provided by offline keys and out-of-band keys distribution, then I don't see how that security can be implemented, or if it worths the complication. For example, https://fwupd.org/ distributes firmware updates for hardware on Linux, is simple and secure without TUF. If analogy with the blockchain is hard, this can be used as an alternative baseline. This model has already been mostly adopted by PyPI <https://pyfound.blogspot.com/2020/10/key-generation-and-signing-ceremony-for.html>, as well as many others <https://theupdateframework.io/adoptions/>, so it is certainly possible to implement. @trishankatdatadog <https://github.com/trishankatdatadog> can provide more insight into deploying TUF in production. I don't know the details about https://fwupd.org/ specifically, but numerous supply chain security compromises <https://github.com/cncf/tag-security/tree/main/supply-chain-security/compromises> of production systems occur because of a repository or key compromise. TUF mitigates these risks through the use of offline keys, threshold delegations, and namespacing. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#5247 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAGROD6N5FBEQEUXMELTYMTT7TNXFANCNFSM4GNHO6PQ> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

trishankatdatadog · 2021-08-31T14:23:20Z

Also a blockchain doesn't deal with the problems of how do you figure out what to put on there and who can change it if it is wrong / stale / keys are compromised. TUF handles those cases.

I agree. I have no idea why TUF is being compared to a blockchain without the reader doing their due research.

If you must, see this article we wrote comparing and contrasting a centralized "blockchain" (transparent/tamper-evident logs) to TUF, and why you probably want to use both.

abitrolly · 2021-08-31T15:01:56Z

@mnm678 first, thanks for the explanation. Some nerdy guys like me are completely senseless to people when it comes to "defending the truth". ) I try not to criticize, but when I fail, please forgive me.

However, it's not always practical to have a network of trusted nodes for software distribution,

That's an mistake no.1 (hope you don't mind the terminology, but I don't know another word). Nodes in blockchain are not trusted. They follow the consensus rules. Good nodes do not listen to those who do not follow the consensus. Validating the consensus in that every node does when receiving the block. This way you have near real-time sync of package info and threat detection.

and it takes a lot of computation to do the proof-of-work necessary to add new items to a blockchain.

Mistake no.2 (again I don don't blame anyone - it took me several years to separate blockchain technology from blockchain hype). Proof-of-work is a consensus algorithm for ledgers (accounting books) which designed to solve double spending problem. PyPI is not a ledger, so it is totally irrelevant here. Blockchain is a signed chain of signed blocks. In case of PyPI, one block can be just one package data. The agreement, who can add the blocks is the consensus. "Every user with an account in PyPI can add block" may be a valid rule. "Only blocks that are signed with offline keys" may be a valid rule (although this can be extended to "keys that are signed by offline keys"). "Only users who have the most balance" is not the valid rule for PyPI, but it is another consensus for public ledgers called proof-of-stake.

TUF instead takes the existing package manager approach, and uses offline keys, revocation, and pinned keys to ensure that a compromise of the repository can be detected and recovered from. Using the blockchain analogy, TUF uses pinned root keys instead of a Merkle Tree hash to validate the state of the repository. This has the advantage that the pinned root keys remain valid through updates to the repository.

How packagers sign their packages if TUF private keys are offline?

What are pinned keys? My 5 years old have just read from Wikipedia that public keys pinning for HTTP was considered deprecated, and my search query for pinned keys show a lot of articles that do not recommend this technique.

Unfortunately, without understanding if TUF pinned keys are similar to HTTP pinning keys, I can not comment on if they can really replace Merkle Tree in validating the current state of repository. The state in Merkle Tree not only covers specific package, it coverts the state of all packages at the moment it is generated. If the pinned key signs the state of repository, then it brings another question.

Who owns pinned keys? (package maintainer, PyPI admin, TUF admin)

This model has already been mostly adopted by PyPI, as well as many others, so it is certainly possible to implement.

This doesn't answer the original question - how specifically to sign PyPI packages with offline keys, and which out-of-band channels PyPI users should use for keys distribution. A simple example for a monkey who wants to upload package to PyPI in the most secure manner would do.

I don't know the details about https://fwupd.org/ specifically, but numerous supply chain security compromises of production systems occur because of a repository or key compromise. TUF mitigates these risks through the use of offline keys, threshold delegations, and namespacing.

I haven't found https://fwupd.org/ in the list, so it is hard for to me to accept this argument against it. If TUF security is provided by offline keys, so does the https://fwupd.org/ but without the complications imposed by TUF. https://fwupd.org/ distributes packages signed/encrypted by vendor key (TUF offline keys), and BIOS and other hardware (according to UEFI spec) will not update itself if the signature doesn't match hardcoded public key (TUF out-of-band channel). So the security of TUF and https://fwupd.org/ are equivalent.

JustinCappos · 2021-08-31T15:22:06Z

I haven't found https://fwupd.org/ in the list, so it is hard for to me to accept this argument against it. If TUF security is provided by offline keys, so does the https://fwupd.org/ but without the complications imposed by TUF. https://fwupd.org/ distributes packages signed/encrypted by vendor key (TUF offline keys), and BIOS and other hardware (according to UEFI spec) will not update itself if the signature doesn't match hardcoded public key (TUF out-of-band channel). So the security of TUF and https://fwupd.org/ are equivalent.

From the https://fwupd.org/lvfs/docs/developers site under "Is updating firmware secure?" In both the LVFS and fwupd, GPG crypto is being performed using GnuPG and PKCS#7 crypto is using GnuTLS. The fwupd daemon has no network access and only acts as the mechanism for clients using D-DBus and PolicyKit. Some devices also have additional hardware signature verification schemes implemented by the device manufacturer. The LVFS and fwupd codebases have had several independent security audits. The LVFS has a huge number of tests run for each commit <https://travis-ci.org/hughsie/lvfs-website>, and fwupd has a comprehensive test suite <https://travis-ci.org/hughsie/fwupd>, and is regularly scanned using both clang and Coverity <https://scan.coverity.com/projects/10744>. The threat model implied here is that they sign something and are careful with the key. There is no talk about how they handle key revocation, etc. TUF focuses on dealing with compromises. Not only just keys, but of servers and other parts of the infrastructure. Of course, we've had audits too (which you can find linked on the project site), but the system is designed to resist and securely recover from a compromise of keys, servers, etc. So the threat model and goals are very different. (You can find a lot more about TUF's goals by reading this page, especially the Mitigating Key Risk portion https://theupdateframework.io/security/ ) The website also has a lot of technical papers that describe the security differences in much greater detail over solutions that use a single key for signing, such as the project you mentioned. How packagers sign their packages if TUF private keys are offline? There are different keys in TUF. Some keys (like the root keys and some targets keys) are offline. Others are held by the developers. To try to give the five year old version. If you have a small project on your own, you have your key for your project. If you have a group project, you can choose if one person has the key, if multiple people have to use keys, etc. Just so you're not confused about pinning, this isn't HTTP pinning. The reasons why HTTP pinning is deprecated don't make sense in this context because you don't have hundreds of potentially valid roots of trust (trusted CAs) and have to deal with the problems with having something incorrectly pinned to the incorrect version. PyPI's targets role handles this namespacing unambiguously.

…

On Tue, Aug 31, 2021 at 11:02 PM Anatoli Babenia ***@***.***> wrote: @mnm678 <https://github.com/mnm678> first, thanks for the explanation. Some nerdy guys like me are completely senseless to people when it comes to "defending the truth". ) I try not to criticize, but when I fail, please forgive me. However, it's not always practical to have a network of trusted nodes for software distribution, That's an mistake no.1 (hope you don't mind the terminology, but I don't know another word). Nodes in blockchain are not trusted. They follow the consensus rules. Good nodes do not listen to those who do not follow the consensus. Validating the consensus in that every node does when receiving the block. This way you have near real-time sync of package info and threat detection. and it takes a lot of computation to do the proof-of-work necessary to add new items to a blockchain. Mistake no.2 (again I don don't blame anyone - it took me several years to separate blockchain technology from blockchain hype). Proof-of-work is a consensus algorithm for ledgers (accounting books) which designed to solve double spending problem. PyPI is not a ledger, so it is totally irrelevant here. Blockchain is a signed chain of signed blocks. In case of PyPI, one block can be just one package data. The agreement, who can add the blocks is the consensus. "Every user with an account in PyPI can add block" may be a valid rule. "Only blocks that are signed with offline keys" may be a valid rule (although this can be extended to "keys that are signed by offline keys"). "Only users who have the most balance" is not the valid rule for PyPI, but it is another consensus for public ledgers called proof-of-stake. TUF instead takes the existing package manager approach, and uses offline keys, revocation, and pinned keys to ensure that a compromise of the repository can be detected and recovered from. Using the blockchain analogy, TUF uses pinned root keys instead of a Merkle Tree hash to validate the state of the repository. This has the advantage that the pinned root keys remain valid through updates to the repository. How packagers sign their packages if TUF private keys are offline? What are pinned keys? My 5 years old have just read from Wikipedia <https://en.wikipedia.org/wiki/HTTP_Public_Key_Pinning> that public keys pinning for HTTP was considered deprecated, and my search query for pinned keys <https://www.google.com/search?client=firefox-b-d&q=pinned+keys> show a lot of articles that do not recommend this technique. Unfortunately, without understanding if TUF pinned keys are similar to HTTP pinning keys, I can not comment on if they can really replace Merkle Tree in validating the current state of repository. The state in Merkle Tree not only covers specific package, it coverts the state of all packages at the moment it is generated. If the pinned key signs the state of repository, then it brings another question. Who owns pinned keys? (package maintainer, PyPI admin, TUF admin) This model has already been mostly adopted by PyPI, as well as many others, so it is certainly possible to implement. This doesn't answer the original question - how specifically to sign PyPI packages with offline keys, and which out-of-band channels PyPI users should use for keys distribution. A simple example for a monkey who wants to upload package to PyPI in the most secure manner would do. I don't know the details about https://fwupd.org/ specifically, but numerous supply chain security compromises of production systems occur because of a repository or key compromise. TUF mitigates these risks through the use of offline keys, threshold delegations, and namespacing. I haven't found https://fwupd.org/ in the list, so it is hard for to me to accept this argument against it. If TUF security is provided by offline keys, so does the https://fwupd.org/ but without the complications imposed by TUF. https://fwupd.org/ distributes packages signed/encrypted by vendor key (TUF offline keys), and BIOS and other hardware (according to UEFI spec) will not update itself if the signature doesn't match hardcoded public key (TUF out-of-band channel). So the security of TUF and https://fwupd.org/ are equivalent. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#5247 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAGRODZNNGUXZ645HQTIKALT7TVHBANCNFSM4GNHO6PQ> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

abitrolly · 2021-08-31T15:29:37Z

Also a blockchain doesn't deal with the problems of how do you figure out what to put on there and who can change it if it is wrong / stale / keys are compromised. TUF handles those cases.

@JustinCappos that's a valid point, and a tough problem. if anyone can explain in those childish terms how the handling is done, that would clear up my doubts. Right now I understand this as if offline keys are lost, everything is lost. The same way if private key on the blockchain is lost. On the blockchain the problem is solved with multisignature keys, so that you need 3 out of 5 signatures to make multisig valid. There are also extensions that allow to migrate valid multisig to another multisig with no missing keys.

If you must, see this article we wrote comparing and contrasting a centralized "blockchain" (transparent/tamper-evident logs) to TUF, and why you probably want to use both.

@trishankatdatadog yes, I've read it (after asking), and I like TL very much. I think it is a way to go. So far it seemd that TUF adds a very high level of complexity even if TL + TUF is the most secure. It is hard to explain, unlike blockchain concepts, which many people have already learned. It also hard to me to wrap my head how the TUF absence of immutable history and third-party auditing can provide security in the world, where the state of dependency tree is often more important than the state of packages you own.

While I don't think that TUF is the way to go, I think it may have some good ideas on how to manage signing keys, so instead of following TUF or using combined TF+TUF, there may be a slimmed down version of security framework, that reuses components from both and also leverages some best practices from blockchain technology (like real-time sync, notifications and caching).

ewdurbin · 2021-08-31T15:49:48Z

Hello @abitrolly! I appreciate your engagement and concern with the security of PyPI shown in this discussion.

However, there are multiple points in the recent conversation where you have chosen to belittle or dismiss things as "childish" and comparing efforts of those involved to 5 year olds. This isn't very respectful of the effort and time that people have put into this work.

I'd like to ask you to review the PSF Code of Conduct, which this repository and discussion adheres, before further disrespectful behavior becomes an issue.

trishankatdatadog · 2021-08-31T15:58:38Z

While I don't think that TUF is the way to go, I think it may have some good ideas on how to manage signing keys, so instead of following TUF or using combined TF+TUF, there may be a slimmed down version of security framework, that reuses components from both and also leverages some best practices from blockchain technology (like real-time sync, notifications and caching).

Consider that if something looks complicated, there might be good reasons for it, especially if it was designed with a threat model with nation-state attackers in mind. Feel free to use TLs all you like, but the PyPA consensus has been for TUF, with the community free to record TUF metadata on TLs if they wish, thus getting the best of both worlds.

abitrolly · 2021-09-03T08:41:58Z

@ewdurbin all I wanted is to receive a layperson-friendly explanation as it happens in https://www.reddit.com/r/explainlikeimfive/ which I've subscribed to. I apologize that I haven't referenced it in the first place. Does that clarify that the phrase "explain me like I am five" is not done to belittle or dismiss things as "childish", or offend those who put many efforts in developing and promoting TUF?

I'd like to ask you to review the PSF Code of Conduct, which this repository and discussion adheres, before further disrespectful behavior becomes an issue.

I acknowledge time and effort that people put into developing TUF and trying to include it into Python distribution index. If the critics of TUF framework itself is seen as disrespectful behavior, then it will be better for me to leave the people to their business.

westurner · 2021-09-03T15:35:05Z

Thanks for your feedback.

IMHO, Sigstore should be (1) at least rooted in a trustless blockchain; and (2) using ld-proofs and W3C CCG Cryptographic Signature Suite URIs for future-proofing. That aside, how can Sigstore and TUF work together?

Is there a good ELI5 graphic of the PyPI TUF package build and release workflow, and maybe also a complete sequence diagram?
https://en.wikipedia.org/wiki/Sequence_diagram

https://www.sigstore.dev

JustinCappos · 2021-09-03T15:35:37Z

In a rush, but quickly wanted to point out that Sigstore uses TUF... https://dlorenc.medium.com/using-the-update-framework-in-sigstore-dc393cfe6b52

…

On Fri, Sep 3, 2021 at 11:32 PM Wes Turner ***@***.***> wrote: Thanks for your feedback. IMHO, Sigstore *should* be (1) at least rooted in a *trustless* blockchain; and (2) using ld-proofs and W3C Signature Suite URIs. That aside, how can Sigstore and TUF work together? Is there a good ELI5 graphic of the PyPI TUF package build and release workflow, and maybe also a complete sequence diagram? https://en.wikipedia.org/wiki/Sequence_diagram ![Sigstore architecture summary] (https://www.sigstore.dev/img/system_architecture_summary-01.svg) On Fri, Sep 3, 2021, 04:42 Anatoli Babenia ***@***.***> wrote: > @ewdurbin <https://github.com/ewdurbin> all I wanted is to receive a > layperson-friendly explanation as it happens in > https://www.reddit.com/r/explainlikeimfive/ which I've subscribed to. I > apologize that I haven't referenced it in the first place. Does that > clarify that the phrase "explain me like I am five" is not done to belittle > or dismiss things as "childish", or offend those who put many efforts in > developing and promoting TUF? > > I'd like to ask you to review the PSF Code of Conduct, which this > repository and discussion adheres, before further disrespectful behavior > becomes an issue. > > I acknowledge time and effort that people put into developing TUF and > trying to include it into Python distribution index. If the critics of TUF > framework itself is seen as disrespectful behavior, then it will be better > for me to leave the people to their business. > > — > You are receiving this because you commented. > Reply to this email directly, view it on GitHub > <#5247 (comment)>, > or unsubscribe > < https://github.com/notifications/unsubscribe-auth/AAAMNSZ2CYOUPZMLY3R6GG3UACC6DANCNFSM4GNHO6PQ > > . > Triage notifications on the go with GitHub Mobile for iOS > < https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 > > or Android > < https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub >. > > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#5247 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAGROD3S22WGH66XULSHY7DUADS7JANCNFSM4GNHO6PQ> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

trishankatdatadog · 2021-09-03T15:38:35Z

IMHO, Sigstore should be (1) at least rooted in a trustless blockchain; and (2) using ld-proofs and W3C Signature Suite URIs. That aside, how can Sigstore and TUF work together?

I'm not: (1) sure that there is such a thing as a "trustless" blockchain, and (2) familiar with these technologies. However, Marina and I wrote a blog post about how you can combine TUF and transparent/tamper-evident logs such as sigstore. TLDR: you can publish TUF timestamp metadata on sigstore. My upcoming talk at SupplyChainSecurityCon will discuss how the Datadog Agent integrations is the first transparent, compromise-resilient software publication pipeline in the world.

Is there a good ELI5 graphic of the PyPI TUF package build and release workflow, and maybe also a complete sequence diagram?

I don't think there is one right now, but perhaps you could help us make one using the descriptions from PEPs 458 and 480?

trishankatdatadog · 2021-09-03T15:39:58Z

In a rush, but quickly wanted to point out that Sigstore uses TUF...

Yes. This will be a separate TUF repository for open source projects, distinct from the one for PyPI right now. However, PyPI can publish its own TUF timestamps to sigstore.

di · 2022-02-01T22:36:02Z

I've published a roadmap for the PEP 458 rollout in #10672. Please note that this issue is for PEP 458-related development work only, and is not a place to discuss TUF alternatives, issues with TUF as a framework, or ask for explanations on how TUF works.

brainwane added the needs discussion a product management/policy issue maintainers and users should discuss label Mar 14, 2019

brainwane added this to the Package signing & detection/verification milestone Jun 19, 2019

brainwane mentioned this issue Jun 23, 2019

No Signing of Packages pypa/packaging-problems#15

Open

brainwane mentioned this issue Sep 10, 2019

TUF deployment roadmap for PyPI theupdateframework/python-tuf#816

Closed

brainwane mentioned this issue Sep 26, 2019

PEP 458: Move to Draft status and update Delegate python/peps#1177

Merged

This comment has been minimized.

Sign in to view

pypi locked as off-topic and limited conversation to collaborators Sep 4, 2021

di closed this as completed Feb 1, 2022

Roadmap update for TUF support #5247

Roadmap update for TUF support #5247

Comments

LucidOne commented Jan 4, 2019

nealmcb commented Feb 19, 2019

LucidOne commented Feb 19, 2019

MyNameIsCosmo commented Feb 19, 2019 • edited Loading

LucidOne commented Feb 19, 2019

brainwane commented Mar 22, 2019

JustinCappos commented Mar 28, 2019

LucidOne commented Apr 3, 2019

trishankatdatadog commented Apr 3, 2019

westurner commented Apr 18, 2019

brainwane commented May 23, 2019

trishankatdatadog commented May 23, 2019 via email

ofek commented May 23, 2019

lukpueh commented May 24, 2019

brainwane commented Aug 28, 2019

brainwane commented Sep 26, 2019

trishankatdatadog commented Sep 26, 2019

ofek commented Sep 26, 2019

brainwane commented Sep 26, 2019 • edited Loading

brainwane commented Sep 30, 2019

pradyunsg commented Sep 30, 2019

trishankatdatadog commented Oct 1, 2019

trishankatdatadog commented Oct 1, 2019

brainwane commented Nov 5, 2019

brainwane commented Nov 3, 2020

brainwane commented Jan 28, 2021

joshuagl commented Jan 28, 2021

brainwane commented Feb 24, 2021

woodruffw commented Feb 24, 2021

joshuagl commented Feb 25, 2021

westurner commented Mar 2, 2021

woodruffw commented Mar 2, 2021

brainwane commented Jul 2, 2021

trishankatdatadog commented Jul 2, 2021

abitrolly commented Aug 31, 2021 • edited Loading

JustinCappos commented Aug 31, 2021 via email

abitrolly commented Aug 31, 2021

mnm678 commented Aug 31, 2021

JustinCappos commented Aug 31, 2021 via email

trishankatdatadog commented Aug 31, 2021 • edited Loading

abitrolly commented Aug 31, 2021

JustinCappos commented Aug 31, 2021 via email

abitrolly commented Aug 31, 2021

ewdurbin commented Aug 31, 2021

trishankatdatadog commented Aug 31, 2021

abitrolly commented Sep 3, 2021

westurner commented Sep 3, 2021 • edited Loading

JustinCappos commented Sep 3, 2021 via email

trishankatdatadog commented Sep 3, 2021 • edited Loading

trishankatdatadog commented Sep 3, 2021

This comment has been minimized.

di commented Feb 1, 2022

MyNameIsCosmo commented Feb 19, 2019 •

edited

Loading

brainwane commented Sep 26, 2019 •

edited

Loading

abitrolly commented Aug 31, 2021 •

edited

Loading

trishankatdatadog commented Aug 31, 2021 •

edited

Loading

westurner commented Sep 3, 2021 •

edited

Loading

trishankatdatadog commented Sep 3, 2021 •

edited

Loading