Stream Cipher Used to Encrypt Last File Block #9

Open
lipnitsk opened this Issue Aug 26, 2014 · 53 comments

lipnitsk commented Aug 26, 2014

From: https://defuse.ca/audits/encfs.htm

Exploitability: Unknown
Security Impact: High

As reported in [1], EncFS uses a stream cipher mode to encrypt the last file block. The change log says that the ability to add random bytes to a block was added as a workaround for this issue. However, it does not solve the problem, and is not enabled by default.

EncFS needs to use a block mode to encrypt the last block.

EncFS's stream encryption is unorthodox:

1. Run "Shuffle Bytes" on the plaintext.
    N[J+1] = Xor-Sum(i = 0 TO J) { P[i] }
    (N = "shuffled" plaintext value, P = plaintext)
2. Encrypt with (setIVec(IV), key) using CFB mode.
3. Run "Flip Bytes" on the ciphertext.
    This reverses bytes in 64-byte chunks.
4. Run "Shuffle Bytes" on the ciphertext.
5. Encrypt with (setIVec(IV + 1), key) using CFB mode.

Where setIVec(IV) = HMAC(globalIV || (IV), key), and:
    - 'globalIV' is an IV shared across the entire filesystem.
    - 'key' is the encryption key.
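
As a rough illustration, the "Shuffle Bytes" and "Flip Bytes" steps can be sketched in Python. This is a toy sketch of the transforms as described above, not EncFS's actual implementation:

```python
def shuffle_bytes(buf: bytes) -> bytes:
    # "Shuffle Bytes": each output byte is the running XOR of all
    # plaintext bytes up to and including that position.
    out = bytearray(len(buf))
    acc = 0
    for i, b in enumerate(buf):
        acc ^= b
        out[i] = acc
    return bytes(out)

def flip_bytes(buf: bytes) -> bytes:
    # "Flip Bytes": reverse the byte order within each 64-byte chunk.
    out = bytearray()
    for i in range(0, len(buf), 64):
        out.extend(buf[i:i + 64][::-1])
    return bytes(out)
```

Note that on a one-byte buffer both transforms return their input unchanged, which is why the single-byte attack quoted below goes through.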

This should be removed and replaced with something more standard. As far as I can see, this provides no useful security benefit; however, it is relied upon to prevent the attacks in [1]. This is security by obscurity.

Edit: [1] may be unavailable, so here it is from archive.org:

[Full-disclosure] Multiple Vulnerabilities in EncFS
From: Micha Riser (micha[at]povworld.org)
Date: Thu Aug 26 2010 - 07:05:18 CDT
(...)
3. Last block with single byte is insecure 
------------------------------------------------------- 
The CFB cipher mode is insecure if it is used twice with the same
initialization vector. In CFB, the first block of the plain text is XOR-ed with
the encrypted IV:
  C0 = P0 XOR Ek(IV)
Therefore, for two cipher blocks C0 and C0' encrypted with the same IV, it
holds that:
  C0 XOR C0' = (P0 XOR Ek(IV)) XOR (P0' XOR Ek(IV)) = P0 XOR P0'
This means that an attacker gets the XOR of the two plain texts. EncFs uses a 
modified version of CFB which additionally shuffles and reverses bytes. It is not 
clear however, if the modifications generally help against this problem. 

A security problem arises definitely if the last block contains only a single 
byte and an attacker has two versions of the last block. Operating on a single 
byte, the shuffle and reverse operation do nothing. What remains is a double 
encryption with CFB and XOR-ing the two cipher bytes gives the XOR of the 
two plain text bytes due to the reason described above. Encrypting the last 
block with a stream cipher instead of a block cipher saves at most 16 bytes 
(one cipher block). We think it would be better to sacrifice these bytes and in 
exchange rely only on a single encryption mode for all blocks which simplifies 
both the crypto analysis and the implementation.
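
The XOR relation above is easy to reproduce. In the sketch below, a SHA-256 hash of (key, IV) stands in for the block-cipher output Ek(IV) (an assumption for illustration only); any keystream reused under the same IV cancels out the same way:

```python
import hashlib

def ek(key: bytes, iv: bytes) -> int:
    # Stand-in for the first byte of Ek(IV); any fixed function of
    # (key, IV) exhibits the same IV-reuse problem.
    return hashlib.sha256(key + iv).digest()[0]

key, iv = b"k" * 16, b"same-iv-reused"
p0, p0_prime = 0x41, 0x5A          # two versions of a single-byte last block

c0 = p0 ^ ek(key, iv)              # C0  = P0  XOR Ek(IV)
c0_prime = p0_prime ^ ek(key, iv)  # C0' = P0' XOR Ek(IV), same IV

# XOR-ing the two ciphertext bytes recovers P0 XOR P0'.
assert c0 ^ c0_prime == p0 ^ p0_prime
```
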

vgough (Owner) commented Aug 29, 2014

Plan is to eliminate use of stream mode entirely in EncFS 2.x (for new filesystems). No plan for EncFS 1.x.

rfjakob (Collaborator) commented Oct 18, 2014

Do you already have a plan for what mode to use? CBC with ciphertext stealing seems to be a good option.

rfjakob (Collaborator) commented Oct 18, 2014

The other option would be to go with CTR for the whole file. With CTR, however, an attacker can flip single bits at will, so it would need to go with MACs enabled by default. If ecryptfs has MACs enabled by default (will check), we should probably do so too, anyway.

rfjakob (Collaborator) commented Oct 19, 2014

CTR has the additional problem that the XOR of two ciphertext files copied at two different times is the XOR of the plaintexts. To fix that leak you'd need random per-block IVs.

vgough (Owner) commented Oct 23, 2014

For Encfs2, I'm leaning towards GCM mode (as used in ZFS).

generalmanager commented Mar 1, 2015

@vgough Salsa20+Poly1305 would also be a viable (and very fast) alternative, as outlined by Thomas Ptacek in his blog:
http://sockpuppet.org/blog/2014/04/30/you-dont-want-xts/

rfjakob (Collaborator) commented Mar 1, 2015

Actually, I don't think large changes like that are necessary. Blockwise CBC works fine for everything but the last 16 bytes (the AES block size).
By padding the plaintext with 16 zero bytes, that problem goes away, at the cost of wasting 16 bytes.
I think this is the way to go.

lachesis (Contributor) commented Mar 21, 2015

Please don't invent a padding scheme; just pad with PKCS#7 like everyone else. :)
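
For reference, PKCS#7 always appends 1 to `block` bytes, each equal to the pad length; a minimal sketch:

```python
def pkcs7_pad(data: bytes, block: int = 16) -> bytes:
    # Always pads: a full extra block is added if len(data) is a multiple.
    n = block - (len(data) % block)
    return data + bytes([n]) * n

def pkcs7_unpad(padded: bytes) -> bytes:
    # The last byte states how many pad bytes to strip.
    n = padded[-1]
    if not 1 <= n <= len(padded) or padded[-n:] != bytes([n]) * n:
        raise ValueError("invalid PKCS#7 padding")
    return padded[:-n]
```

Note that the plaintext length is only recoverable from the final decrypted byte, which is relevant to the stat() discussion that follows.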

rfjakob (Collaborator) commented Mar 21, 2015

Thanks for the pointer! However, PKCS#7 seems to require that you read the last bytes of the ciphertext to get the plaintext length. This is one additional seek for every stat(); we should really avoid that, as it kills rsync performance.

rfjakob (Collaborator) commented Mar 21, 2015

(It's probably more than one seek, because the filesystem has to parse its internal data structures first to locate the data.)
So I think what we need is a "headerless" scheme, where you don't have to read any ciphertext to get the length.
Unconditionally adding 16 zero bytes (or any value) would do that:

pppppppppp 0000000000000000
                    ^---- 16 bytes zero padding
    ^-------------------- 10 bytes plaintext

AES encryption (16 byte blocks) ->

cccccccccccccccc 0000000000
                     ^--- 10 bytes of zeros
     ^------------------- 16 bytes encrypted data
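
The layout above can be sketched as follows (toy Python; the only assumption is that exactly one 16-byte block of zeros is appended). Because the pad guarantees the trailing partial block is all zeros, the ciphertext length is always plaintext length + 16, so no header read is needed:

```python
BS = 16  # AES block size

def pad_headerless(plain: bytes) -> bytes:
    # Unconditionally append one block of zeros.
    return plain + bytes(BS)

plain = b"p" * 10
buf = pad_headerless(plain)            # 26 bytes: 1 full block + 10 leftover
full_blocks, rem = divmod(len(buf), BS)

assert buf[full_blocks * BS:] == bytes(rem)  # leftover bytes are all zero
assert len(buf) == len(plain) + BS           # stat()-friendly length mapping
```
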

djtm commented May 14, 2015

Isn't it a security issue if you know that the last bytes will be (padded with) zero bytes? Maybe random bytes would be better?

rfjakob (Collaborator) commented May 14, 2015

RogerThiede commented May 14, 2015

akerl commented May 14, 2015

Trying to predict how to modify ciphers based on what vulnerabilities might be discovered in the future quickly becomes a wild goose chase. I suspect if you submitted a PR that improved the padding without affecting backwards compat, it would fare better.

JanKanis commented Jul 21, 2015

A random idea I just thought of:
Encode the file length (and other small useful metadata) in the encrypted filename. That would reduce the maximum filename length even more than it is now, so if that maximum is reached, substitute a hash of the filename and add the real file name to the end of the file data. That would encode metadata in the file contents only in the (rare) case where the filename is too long, so it wouldn't hurt rsync et al in the common case. And this would resolve the limited filename length problem as well.

vgough (Owner) commented Jul 24, 2015

In order to make lookups simple, it is preferable that encrypted filenames can be directly computed from plaintext filenames. That way a call to open("foo.txt") doesn't require a directory scan in order to find the encrypted file. Instead, we encrypt "foo.txt" and attempt to open the encrypted name.

Allowing hashed names, to extend allowable file lengths, doesn't hurt too badly since it could still be done without a directory traversal. Encoding metadata into filenames would thwart this, since I'm not aware of any portable way to do a prefix match or otherwise avoid walking the entire directory listing.
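
The "directly computable" property boils down to the name encryption being deterministic: the same plaintext name always maps to the same ciphertext name, so open() needs no directory scan. A toy sketch of just that property (this is not EncFS's actual name cipher; the XOR keystream here is purely illustrative):

```python
import hashlib

def _keystream(key: bytes, n: int) -> bytes:
    # Deterministic keystream derived only from the key, so the same
    # name always encrypts to the same result.
    out = bytearray()
    ctr = 0
    while len(out) < n:
        out.extend(hashlib.sha256(key + ctr.to_bytes(4, "big")).digest())
        ctr += 1
    return bytes(out[:n])

def encrypt_name(name: str, key: bytes) -> str:
    raw = name.encode()
    return bytes(a ^ b for a, b in zip(raw, _keystream(key, len(raw)))).hex()

def decrypt_name(enc: str, key: bytes) -> str:
    raw = bytes.fromhex(enc)
    return bytes(a ^ b for a, b in zip(raw, _keystream(key, len(raw)))).decode()

# open("foo.txt") can go straight to one backing file name:
key = b"secret-key"
assert encrypt_name("foo.txt", key) == encrypt_name("foo.txt", key)
assert decrypt_name(encrypt_name("foo.txt", key), key) == "foo.txt"
```
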

JanKanis commented Jul 24, 2015

Of course. I should have thought it through a bit longer.

vgough (Owner) commented Jul 24, 2015

No worries, I appreciate the ideas. I've wanted to do the same myself, just didn't figure out a way to make that work.

wasgehetdichdasan commented Aug 2, 2015

Is there a chance that there will - maybe ;-) - be a fix for the current version any time soon?

wasgehetdichdasan commented Aug 13, 2015

Is there no one who thinks they can make a quick fix?

rfjakob (Collaborator) commented Aug 14, 2015

wasgehetdichdasan commented Aug 18, 2015

Uhh. And what about a non-backwards-compatible version which is not 2.0?

h0nIg commented Oct 24, 2016

"However, it does not solve the problem, and is not enabled by default."

Could you please clarify which commit introduced the fix and which option is used to work around this issue?

ping @vgough

benrubson (Collaborator) commented Mar 10, 2017

"Could you please clarify which commit introduced the fix and which option is used to work around this issue?"

When you configure encfs in expert mode:

Add random bytes to each block header?
This adds a performance penalty, but ensures that blocks
have different authentication codes.  Note that you can
have the same benefits by enabling per-file initialization
vectors, which does not come with as great of performance
penalty.
Select a number of bytes, from 0 (no random bytes) to 8:

However, as the audit stated, it does not solve the problem.

danim7 commented Mar 13, 2017

Any ideas on what data format shall be implemented?

A disk format based on GCM mode could also help to fix the issues related to MAC headers.

benrubson (Collaborator) commented Apr 28, 2018

This is the most "important" EncFS security report.

"By padding the plaintext with 16 zero bytes, that problem goes away, at the cost of wasting 16 bytes. I think this is the way to go."

I like the idea, @rfjakob.
Rather simple, without changing all the existing work.
It could be a transitional change, before moving to a whole new format.
Did you perhaps already work on a patch for this, or would you, please?

We would then kill two birds with one stone, as we would also be able to close #10 👍

rfjakob (Collaborator) commented Apr 28, 2018

No, sorry, no plans of working on this. People who don't mind a format change can move to gocryptfs IMO.

benrubson (Collaborator) commented Apr 28, 2018

I'll see if I can work on this later on :)
I'm also wondering if such a modification would be reverse-write compatible.

"By padding the plaintext with 16 zero bytes, that problem goes away, at the cost of wasting 16 bytes. I think this is the way to go."

Actually, I think that padding with 15 bytes should be OK?

rfjakob (Collaborator) commented Apr 29, 2018

Right now I don't see why reverse-write would be a problem.

And yes, actually, 15 bytes should be enough.

benrubson (Collaborator) commented May 3, 2018

I then tried to implement the cipherBlockSize - 1 padding (15 bytes in the examples below), according to what you described above @rfjakob, adding 15 bytes at the end of each file (except 0-byte files).

Leading to:

pppppppppp 000000000000000
                    ^---- 15 bytes zero padding
    ^-------------------- 10 bytes plaintext

AES encryption (16 byte blocks) ->

cccccccccccccccc 000000000
                     ^---  9 bytes of zeros
     ^------------------- 16 bytes encrypted data

So, I have a working algorithm in the following situations:

  • normal mode: read and write;
  • reverse mode: read and write.

But reverse write seems impossible to achieve correctly. Below are some complicated situations.
Let's assume the block size is 4 KB.

  • We receive a 4 KB write request which comes at the end of the existing backing file. It could then be the last block of the file, so we would have to crop the last 15 bytes after decryption. But we are not sure this is the last block to be written, so we are not sure we should crop...
  • We receive a 1020-byte request which comes at the end of the existing backing file. If we assume this is the last block of the file, we should then crop 12 bytes, decrypt, crop 3 bytes and append. But are we sure this is the last block? Perhaps the calling application will come with another write call to complete the 1020 bytes already received...
  • We receive a 10-byte write request which comes at the end of the existing backing file. Is it some padding? Should we then crop 5 bytes of the previous block? Once again, here we are not sure...

benrubson (Collaborator) commented May 5, 2018

I think I will go with OneAndZeroes padding of each block, with a cipherBlockSize - 1 bytes padding for the last block.
We would then still be able to get the size of files without having to read the last block.
We would also be able to properly reverse-write, at a cost of one byte per blockSize.
I think it's worth it.

Any thoughts?

Thx 👍

Last block:

pppppppppp 100000000000000
                    ^---- 15 bytes OneAndZeroes padding
    ^-------------------- 10 bytes plaintext

AES encryption (16 byte blocks) ->

cccccccccccccccc 000000000
                     ^---  9 bytes of zeros
     ^------------------- 16 bytes encrypted data

Other blocks:

ppppppppppppppp 1
                ^--------  1 byte OneAndZeroes padding
    ^-------------------- 15 bytes plaintext

AES encryption (16 byte blocks) ->

cccccccccccccccc
     ^------------------- 16 bytes encrypted data
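
A OneAndZeroes (ISO/IEC 7816-4 style) pad is commonly implemented as a single 0x80 byte (the "1" bit) followed by zero bytes, which makes it unambiguous to strip; a sketch under that assumption:

```python
def one_and_zeroes_pad(data: bytes, pad_len: int) -> bytes:
    # Append 0x80 (the "1") then pad_len - 1 zero bytes.
    assert pad_len >= 1
    return data + b"\x80" + bytes(pad_len - 1)

def one_and_zeroes_unpad(padded: bytes) -> bytes:
    # Strip trailing zeros, then the single mandatory 0x80 marker.
    stripped = padded.rstrip(b"\x00")
    if not stripped.endswith(b"\x80"):
        raise ValueError("invalid OneAndZeroes padding")
    return stripped[:-1]

# Last block: 10 bytes of plaintext + 15 bytes of padding.
assert one_and_zeroes_unpad(one_and_zeroes_pad(b"p" * 10, 15)) == b"p" * 10
# Other blocks: 15 bytes of plaintext + 1 byte of padding.
assert one_and_zeroes_unpad(one_and_zeroes_pad(b"q" * 15, 1)) == b"q" * 15
```
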

rfjakob (Collaborator) commented May 5, 2018

Let's look at the difficulties; I think this should all work:

"But we are not sure this is the last block to be written, so we are not sure we should crop..."

Yes, we have to stat() the file to find out.

"But are we sure this is the last block? Perhaps the calling application will come with another write call to complete the 1020 bytes already received..."

Again, we can stat() the file to determine if it is the last block. Forward mode has to do this as well, right?

rfjakob (Collaborator) commented May 5, 2018

Another note:

"Perhaps the calling application will come with another write call to complete the 1020 bytes already received..."

This does not matter. In forward mode, the file always has to be consistent on disk. The user application may crash at any time and stop writing, but the data it has already written must be safe.

benrubson (Collaborator) commented May 5, 2018

Thx for your feedback @rfjakob 👍

I agree, if the cipher file is fully available locally.
You may be in a situation where the cipher file is not locally available, so you would not be able to stat() it (and so would not be able to know whether the block you have been asked to write is the last one of the file).
Think about, for example, downloading (or syncing, whatever the method used) some remote cipher files directly into a reverse-mounted EncFS.

rfjakob (Collaborator) commented May 5, 2018

Forward mode would not work either in this case, right?

benrubson (Collaborator) commented May 5, 2018

It would, because here you encode data, so you don't expect it to be a multiple of cipherBlockSize. If the block you are writing is at the end of the local (cipher) file, you assume this is the last block and compute a cipherBlockSize - 1 bytes padding.

rfjakob (Collaborator) commented May 5, 2018

"I agree, if the cipher file is fully available locally."

Can't we stat() the plaintext file instead?

benrubson (Collaborator) commented May 5, 2018

Unfortunately, this would not help.
Let's assume we receive a 4 KB (blockSize) cipher block.
According to the write call received, we have to write it at the end of the plaintext file. Perfect.
It could then be the last block of the plain file. But how can we be sure?
How can we then remove the last padding bytes that may exist?
Without padding every block as proposed above, I don't see how :|

rfjakob (Collaborator) commented May 5, 2018

If the write expanded the file, it must be the last block, and it must have padding.

rfjakob (Collaborator) commented May 5, 2018

(otherwise forward mode is buggy)

benrubson (Collaborator) commented May 5, 2018

Not necessarily. Think about a cipher file being downloaded directly into a reverse-write EncFS (so that it is written decrypted directly to the local disk).
Every block received and written will expand the plain file. But only the last one received (and written) will be the real last block of the plain file.

rfjakob (Collaborator) commented May 5, 2018

Then every block must have padding.

benrubson (Collaborator) commented May 5, 2018

A 15-byte padding?
Or a OneAndZeroes padding of each block, with a cipherBlockSize - 1 bytes padding for the last block?

rfjakob (Collaborator) commented May 5, 2018

Yes, 15 bytes.

Collaborator

rfjakob commented May 5, 2018

At that moment, it's the last block, right?

Collaborator

benrubson commented May 5, 2018

Look at these use cases :

Backup :
plain local -> EncFS reverse -> rsync to remote location

Restore :
rsync from remote location -> EncFS reverse -> plain local

I'm not sure the backup path would need to insert a 15-byte padding after every block.

Collaborator

rfjakob commented May 5, 2018

Interesting use case, but there are other problems:

plain local -> EncFS reverse -> rsync to remote location -> ciphertext

Now, let's assume the ciphertext contains 1000 bytes, and rsync happens to write() a chunk of data that ends at byte 1000. What does

EncFS reverse -> plain local

do?

Repository owner deleted a comment from rfjakob May 5, 2018

Collaborator

benrubson commented May 5, 2018

// strange duplicate part of your message above deleted

Yes, I think this is the last tricky case.
I already thought about this, and I think we need an additional internal buffer.

Let's take your example.

1000 % 16 = 8
We crop the last 8 bytes.
We decode.
We remove padding bytes if it looks like we can.
We write the plain data at the end of the plain file.
We return that we wrote 1000 bytes.
As 1000 < 4096, we keep the 1000 bytes in an internal buffer, as we may receive the next bytes of the block.

If we receive a write request with the next 1000 bytes, we will not read the previous 1000 bytes of the block from the plain file (as we have cropped some bytes), but will take them from our internal buffer.
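The crop-and-buffer idea above could be sketched like this (a hypothetical illustration; names and structure are invented, not EncFS code):

```python
# Hypothetical sketch of the crop + internal-buffer idea described above.
# Names are invented for illustration; this is not EncFS code.
CIPHER_BLOCK = 16    # AES block size
FS_BLOCK = 4096      # EncFS file-block size

def croppable(n: int) -> int:
    """How many bytes of a partial write can be CBC-decrypted right away."""
    return n - (n % CIPHER_BLOCK)

class PartialBlockBuffer:
    """Keeps the partial ciphertext of the current file block so a later
    write can complete the block without re-reading cropped plaintext."""
    def __init__(self):
        self.buf = b""

    def add(self, data: bytes):
        self.buf += data
        if len(self.buf) >= FS_BLOCK:
            full, self.buf = self.buf[:FS_BLOCK], self.buf[FS_BLOCK:]
            return full  # a complete block, ready for normal CBC handling
        return None      # still partial: keep buffering
```

With a 1000-byte write, croppable(1000) is 992, so the remaining 8 bytes (1000 % 16) have to wait; the buffer then supplies the first 1000 bytes when the next 1000 arrive, without re-reading the plain file.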

Collaborator

rfjakob commented May 5, 2018

I was curious if that use case really works, so I did:

a/zero -> reverse -> b/eNZPWSyw0rxU7T37UwNN3,n9  ----> cp
d/zero -> reverse -> c/eNZPWSyw0rxU7T37UwNN3,n9  <---/

And it seems to work at first glance:

$ md5sum a/zero d/zero 
2d56b031dc8683c233c016429084f870  a/zero
2d56b031dc8683c233c016429084f870  d/zero

So that was easy, let's overwrite the middle of the file with itself:

dd if=b/eNZPWSyw0rxU7T37UwNN3,n9 of=c/eNZPWSyw0rxU7T37UwNN3,n9 bs=123 seek=43 skip=43 count=1

Random garbage:

$ md5sum a/zero d/zero 
2d56b031dc8683c233c016429084f870  a/zero
a22fc0525129c3eb2fe1af2e4bc9fd5d  d/zero
Collaborator

rfjakob commented May 5, 2018

However, this (note the odd block size):

dd if=b/eNZPWSyw0rxU7T37UwNN3,n9 of=c/eNZPWSyw0rxU7T37UwNN3,n9 bs=123

works, and I'm not sure why.

$ md5sum a/zero d/zero 
2d56b031dc8683c233c016429084f870  a/zero
2d56b031dc8683c233c016429084f870  d/zero

On decryption, we have to know if it is the last block, because the last block is handled differently. Where do we have this information from?

Collaborator

benrubson commented May 5, 2018

I think every 123-byte block is written using the stream cipher (so this creates garbage), until we are ready to write enough bytes (up to blockSize) to read (stream-encode) them again and re-decode the whole block correctly using CBC.

Confirmed (here blockSize is 1024) :

VERBOSE FileNode::write offset 984, data size 123 [FileNode.cpp:247]
VERBOSE streamRead(data, 984, IV) [CipherFileIO.cpp:350]
VERBOSE Called blockWrite [CipherFileIO.cpp:420]
VERBOSE Called streamWrite [CipherFileIO.cpp:429]
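The split visible in the log can be summarized with a tiny sketch (simplified and hypothetical; the real logic lives in CipherFileIO.cpp):

```python
# Simplified, hypothetical sketch of the blockWrite/streamWrite split
# seen in the log above; not the actual CipherFileIO.cpp logic.
BLOCK_SIZE = 1024  # blockSize used in the log above

def choose_write_path(block_len: int) -> str:
    # A full block goes through the block cipher (blockWrite);
    # a short final block falls back to the stream cipher (streamWrite).
    return "blockWrite" if block_len == BLOCK_SIZE else "streamWrite"
```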
Collaborator

benrubson commented May 5, 2018

Strangely, in your failing example above, the file gets truncated by dd at the end of the 123-byte written block (I reproduced it).
There is a bug somewhere :)

Collaborator

rfjakob commented May 5, 2018

Oh, my bad! You are right, the truncation is what causes the garbage:

dd if=b/eNZPWSyw0rxU7T37UwNN3,n9 of=c/eNZPWSyw0rxU7T37UwNN3,n9 \
 bs=123 seek=43 skip=43 count=1 conv=notrunc

$ md5sum a/zero d/zero 
2d56b031dc8683c233c016429084f870  a/zero
2d56b031dc8683c233c016429084f870  d/zero

@benrubson benrubson referenced a pull request that will close this issue May 12, 2018

Open

Handle CBC padding #521
