Packet-based enc/dec cipher streams #49896

albertzaharovits · 2019-12-06T00:24:59Z

This adds a new bare snapshot repository project which contains the classes implementing encryption (and decryption) input stream decorators that support mark and reset.

Relates #48221 , #46170

Edit:
Extract from javadocs explaining how encryption works:

An {@code EncryptionPacketsInputStream} wraps another input stream and encrypts its contents.The method of encryption is AES/GCM/NoPadding, which is a type of authenticated encryption. The encryption works packet wise, i.e. the stream is segmented into fixed-size byte packets which are separately encrypted using a unique {@link Cipher}. As an exception, only the last packet will have a different size, possibly zero. Note that the encrypted packets are larger compared to the plaintext packets, because they contain a 16 byte length trailing authentication tag. The resulting encrypted and authenticated packets are assembled back into the resulting stream. The packets are encrypted using the same {@link SecretKey} but using a different initialization vector. The IV is 12 bytes wide and it's comprised of an integer {@code nonce}, the same for every packet in a stream, but which MUST not otherwise be repeated for the same {@code SecretKey} across other streams, and a monotonically increasing long counter. When assembling the resulting stream, the IV is prepended to the corresponding packet's ciphertext.
The packet size is preferably a large multiple of the AES block size (128 bytes), but any positive
integer value smaller than {@link EncryptedRepository#MAX_PACKET_LENGTH_IN_BYTES} is valid.
This input stream supports the {@code mark} and {@code reset} operations, but only if the wrapped stream supports them as well. A {@code mark} call will trigger the memory buffering of the current packet and will also trigger a {@code mark} call on the wrapped input stream on the next packet boundary. Upon a {@code reset} call, the buffered packet will be replayed and new packets will be generated starting from the marked packet boundary on the wrapped stream.
The {@code close} call will close the encryption input stream and any subsequent {@code read},
{@code skip}, {@code available} and {@code reset} calls will throw {@code IOException}s.
This is NOT thread-safe, multiple threads sharing a single instance must synchronize access.

and how decryption works:

A {@code DecryptionPacketsInputStream} wraps an encrypted input stream and decrypts
its contents. This is designed (and tested) to decrypt only the encryption format that
{@link EncryptionPacketsInputStream} generates. No decrypted bytes are returned before
they are authenticated.
The same parameters, namely {@code secretKey}, {@code nonce} and {@code packetLength},
that have been used during encryption must also be used for decryption, otherwise
decryption will fail.
This implementation buffers the encrypted packet in memory. The maximum packet size it can
accommodate is {@link EncryptedRepository#MAX_PACKET_LENGTH_IN_BYTES}.
This implementation does not support {@code mark} and {@code reset}.
The {@code close} call will close the decryption input stream and any subsequent {@code read},
{@code skip}, {@code available} and {@code reset} calls will throw {@code IOException}s.
This is NOT thread-safe, multiple threads sharing a single instance must synchronize access.

…herstream-2

elasticmachine · 2019-12-06T00:25:01Z

Pinging @elastic/es-security (:Security/Security)

…arch/repositories/encrypted/DecryptionPacketsInputStream.java Co-Authored-By: Tim Vernum <tim@adjective.org>

albertzaharovits · 2020-01-01T20:38:39Z

Thank you for another thorough review @tvernum !
I have addressed all the issues you've pointed out. Please take another look.

tvernum

LGTM

albertzaharovits · 2020-01-06T09:24:07Z

@elasticmachine update branch

…herstream-2

This adds a new bare snapshot repository project which contains the classes implementing encryption (and decryption) input stream decorators that support mark and reset. Relates #48221 , #46170

This adds a new bare snapshot repository project which contains the classes implementing encryption (and decryption) input stream decorators that support mark and reset. Relates elastic#48221 , elastic#46170

This builds upon the data encryption streams from #49896 to create an encrypted snapshot repository. The repository encryption works with the following existing repository types: FS, Azure, S3, GCS (possibly works with HDFS and URL, but these are not tested). The encrypted repository is protected by a password stored on every node's keystore. The repository keys (KEK - key encryption key) are generated from the password using the PBKDF2 function, and are used to encrypt (using the AES Wrap algorithm) other symmetric keys (referred to as DEK - data encryption keys) which are themselves used to encrypt the blobs of the regular snapshot. The platinum or enterprise licenses are required to snapshot to the encrypted repository, but no license is required to list or restore already encrypted snapshots.

The client-side encrypted repository is a new type of snapshot repository that internally delegates to the regular variants of snapshot repositories (of types Azure, S3, GCS, FS, and maybe others but not yet tested). After the encrypted repository is set up, it is transparent to the snapshot and restore APIs (i.e. all snapshots stored in the encrypted repository are encrypted, no other parameters required). The encrypted repository is protected by a password stored on every node's keystore (which must be the same across the nodes). The password is used to generate a key encrytion key (KEK), using the PBKDF2 function, which is used to encrypt (using the AES Wrap algorithm) other symmetric keys (referred to as DEK - data encryption keys), which themselves are generated randomly, and which are ultimately used to encrypt the snapshot blobs. For example, here is how to set up an encrypted FS repository: ------ 1) make sure that the cluster runs under at least a "platinum" license (simplest test configuration is to put `xpack.license.self_generated.type: "trial"` in the elasticsearch.yml file) 2) identical to the un-encrypted FS repository, specify the mount point of the shared FS in the elasticsearch.yml conf file (on all the cluster nodes), e.g. `path.repo: ["/tmp/repo"]` 3) store the repository password inside the elasticsearch.keystore, *on every cluster node*. In order to support changing password on existing repository (implemented in a follow-up), the password itself must be names, e.g. for the "test_enc_key" repository password name: `./bin/elasticsearch-keystore add repository.encrypted.test_enc_pass.password` *type in the password* 4) start up the cluster and create the new encrypted FS repository, named "test_enc", by calling: ` curl -X PUT "localhost:9200/_snapshot/test_enc?pretty" -H 'Content-Type: application/json' -d' { "type": "encrypted", "settings": { "location": "/tmp/repo/enc", "delegate_type": "fs", "password_name": "test_enc_pass" } } ' ` 5) the snapshot and restore APIs work unmodified when they refer to this new repository, e.g. ` curl -X PUT "localhost:9200/_snapshot/test_enc/snapshot_1?wait_for_completion=true"` Related: #49896 #41910 #50846 #48221 #65768

The client-side encrypted repository is a new type of snapshot repository that internally delegates to the regular variants of snapshot repositories (of types Azure, S3, GCS, FS, and maybe others but not yet tested). After the encrypted repository is set up, it is transparent to the snapshot and restore APIs (i.e. all snapshots stored in the encrypted repository are encrypted, no other parameters required). The encrypted repository is protected by a password stored on every node's keystore (which must be the same across the nodes). The password is used to generate a key encrytion key (KEK), using the PBKDF2 function, which is used to encrypt (using the AES Wrap algorithm) other symmetric keys (referred to as DEK - data encryption keys), which themselves are generated randomly, and which are ultimately used to encrypt the snapshot blobs. For example, here is how to set up an encrypted FS repository: ------ 1) make sure that the cluster runs under at least a "platinum" license (simplest test configuration is to put `xpack.license.self_generated.type: "trial"` in the elasticsearch.yml file) 2) identical to the un-encrypted FS repository, specify the mount point of the shared FS in the elasticsearch.yml conf file (on all the cluster nodes), e.g. `path.repo: ["/tmp/repo"]` 3) store the repository password inside the elasticsearch.keystore, *on every cluster node*. In order to support changing password on existing repository (implemented in a follow-up), the password itself must be names, e.g. for the "test_enc_key" repository password name: `./bin/elasticsearch-keystore add repository.encrypted.test_enc_pass.password` *type in the password* 4) start up the cluster and create the new encrypted FS repository, named "test_enc", by calling: ` curl -X PUT "localhost:9200/_snapshot/test_enc?pretty" -H 'Content-Type: application/json' -d' { "type": "encrypted", "settings": { "location": "/tmp/repo/enc", "delegate_type": "fs", "password_name": "test_enc_pass" } } ' ` 5) the snapshot and restore APIs work unmodified when they refer to this new repository, e.g. ` curl -X PUT "localhost:9200/_snapshot/test_enc/snapshot_1?wait_for_completion=true"` Related: elastic#49896 elastic#41910 elastic#50846 elastic#48221 elastic#65768

The client-side encrypted repository is a new type of snapshot repository that internally delegates to the regular variants of snapshot repositories (of types Azure, S3, GCS, FS, and maybe others but not yet tested). After the encrypted repository is set up, it is transparent to the snapshot and restore APIs (i.e. all snapshots stored in the encrypted repository are encrypted, no other parameters required). The encrypted repository is protected by a password stored on every node's keystore (which must be the same across the nodes). The password is used to generate a key encrytion key (KEK), using the PBKDF2 function, which is used to encrypt (using the AES Wrap algorithm) other symmetric keys (referred to as DEK - data encryption keys), which themselves are generated randomly, and which are ultimately used to encrypt the snapshot blobs. For example, here is how to set up an encrypted FS repository: ------ 1) make sure that the cluster runs under at least a "platinum" license (simplest test configuration is to put `xpack.license.self_generated.type: "trial"` in the elasticsearch.yml file) 2) identical to the un-encrypted FS repository, specify the mount point of the shared FS in the elasticsearch.yml conf file (on all the cluster nodes), e.g. `path.repo: ["/tmp/repo"]` 3) store the repository password inside the elasticsearch.keystore, *on every cluster node*. In order to support changing password on existing repository (implemented in a follow-up), the password itself must be names, e.g. for the "test_enc_key" repository password name: `./bin/elasticsearch-keystore add repository.encrypted.test_enc_pass.password` *type in the password* 4) start up the cluster and create the new encrypted FS repository, named "test_enc", by calling: ` curl -X PUT "localhost:9200/_snapshot/test_enc?pretty" -H 'Content-Type: application/json' -d' { "type": "encrypted", "settings": { "location": "/tmp/repo/enc", "delegate_type": "fs", "password_name": "test_enc_pass" } } ' ` 5) the snapshot and restore APIs work unmodified when they refer to this new repository, e.g. ` curl -X PUT "localhost:9200/_snapshot/test_enc/snapshot_1?wait_for_completion=true"` Related: #49896 #41910 #50846 #48221 #65768

albertzaharovits added 21 commits November 30, 2019 12:57

Polished main

92a3b34

First successful tests

d07c05f

More tests

de603a7

More tests

7cc62e0

BufferOnMarkInputStreamBug

cbd3c50

A few bugs...

9eb9bcf

BufferOnMark bug

5919a11

More more more bugs!

47f6aea

Mad tests

8263062

Manic testing

cf97ba2

BufferOnMarkInputStreamTests completed

3c82ba9

Checkstyle

76d8271

Merge branch 'repository-encrypted-client-side' into packet-based-cip…

6b30902

…herstream-2

BufferOnMarkInputStream javadocs

4e9778e

merge fallout

24d6d27

PrefixInputStream tests

3cd79bd

WIP

c610fe8

CountingInputStreamTests

e4f8564

Renaming and more javadocs

c816c45

Refactor ChainingInputStream

29c484b

Scarce EncryptionPacketsInputStream javadocs

db5f58e

albertzaharovits added >feature :Security/Security Security issues without another label labels Dec 6, 2019

albertzaharovits requested review from tvernum and jkakavas December 6, 2019 00:24

albertzaharovits self-assigned this Dec 6, 2019

albertzaharovits added 3 commits December 8, 2019 23:12

ChainingInputStream polishing and tests

d6dc875

ChainingInputStreamTests

7cb48f6

ChainingInputStreamTests without mark/reset

83e028b

albertzaharovits and others added 12 commits December 30, 2019 07:27

Update x-pack/plugin/repository-encrypted/src/main/java/org/elasticse…

e85aefe

…arch/repositories/encrypted/DecryptionPacketsInputStream.java Co-Authored-By: Tim Vernum <tim@adjective.org>

no iv instance variable

8a0773a

Nit

07d7ac8

Exception messages

fd10914

Fix tests with exception names

2e41d4f

Test for reader of fewer bytes

97f5917

Adjust counting input stream docs

4fd6dcc

RingBuffer

3d1daf4

WIP

4fcd49d

WIP

9ef136e

More javadoc to the ring buffer inner

cb966b2

Small test polishing

0f9f77c

tvernum self-requested a review January 6, 2020 04:50

tvernum approved these changes Jan 6, 2020

View reviewed changes

Merge branch 'repository-encrypted-client-side' into packet-based-cip…

08fb26c

…herstream-2

albertzaharovits merged commit a863f76 into elastic:repository-encrypted-client-side Jan 6, 2020

albertzaharovits deleted the packet-based-cipherstream-2 branch January 10, 2020 09:51

albertzaharovits mentioned this pull request Jan 10, 2020

Encrypted blob store repository #50846

Closed

albertzaharovits mentioned this pull request Mar 10, 2020

Encrypted blob store reuse DEK #53352

Merged

albertzaharovits mentioned this pull request Dec 23, 2020

Client-side encrypted snapshot repository (feature flag) #66773

Merged

albertzaharovits mentioned this pull request Dec 23, 2020

BACKPORT 7x Client-side encrypted snapshot repository (feature flag) (#66773) #66809

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Packet-based enc/dec cipher streams #49896

Packet-based enc/dec cipher streams #49896

albertzaharovits commented Dec 6, 2019 •

edited

Loading

elasticmachine commented Dec 6, 2019

albertzaharovits commented Jan 1, 2020

tvernum left a comment

albertzaharovits commented Jan 6, 2020

Packet-based enc/dec cipher streams #49896

Packet-based enc/dec cipher streams #49896

Conversation

albertzaharovits commented Dec 6, 2019 • edited Loading

elasticmachine commented Dec 6, 2019

albertzaharovits commented Jan 1, 2020

tvernum left a comment

Choose a reason for hiding this comment

albertzaharovits commented Jan 6, 2020

albertzaharovits commented Dec 6, 2019 •

edited

Loading