Issue #6114 - hot deploy of symlink results in symlink name #6160

joakime · 2021-04-12T20:38:44Z

Using getAbsolutePath not canonical (too many corner cases with it to be reliable)

Signed-off-by: Joakim Erdfelt joakim.erdfelt@gmail.com

joakime · 2021-04-12T20:39:44Z

This PR also addresses a disabled testcase in Issue #5684

jetty-util/src/main/java/org/eclipse/jetty/util/Scanner.java

Signed-off-by: Joakim Erdfelt <joakim.erdfelt@gmail.com>

janbartel

I'm questioning the need for this change.

The javadoc for File.getCanonicalPath says:

A canonical pathname is both absolute and unique. The precise definition of canonical form is system-dependent. This method first converts this pathname to absolute form if necessary, as if by invoking the getAbsolutePath() method, and then maps it to its unique form in a system-dependent way. This typically involves removing redundant names such as "." and ".." from the pathname, resolving symbolic links (on UNIX platforms), and converting drive letters to a standard case (on Microsoft Windows platforms).

Therefore a canonical path is an absolute path.

joakime · 2021-04-27T14:04:31Z

There are 2 major reasons.

Canonical path requires filesystem permissions for all paths necessary to create the canonical path and it follows symlinks.

Example:

C:\sites\customer1\webapp\

Where C:\sites is set up so that the running server user has no permissions on it.
But C:\sites\customer1 does grant permissions to the running server user.
If you call new File("C:\sites\customer1\webapp\").getCanonicalPath(), it will fail.
If you call new File("C:\sites\customer1\webapp\").getAbsolutePath(), it will work.

The use of getCanonicalPath above will result in either an IOException or a more specific FileSystemException.
With getAbsolutePath it will just work.
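The API-level difference described above can be sketched as follows. This is a minimal illustration, not the Scanner code: the class name `AbsoluteVsCanonical` and the relative path are made up for the example, and the restricted-permission scenario itself is not reproduced here. It only shows that getAbsolutePath is a lexical operation that declares no checked exception, while getCanonicalPath declares IOException because it may consult the filesystem.

```java
import java.io.File;
import java.io.IOException;

final class AbsoluteVsCanonical
{
    /** Lexical only: prefixes the working directory; ".." and symlinks are left untouched. */
    static String absolute(String path)
    {
        return new File(path).getAbsolutePath();
    }

    /** May consult the filesystem, which is why the signature carries IOException. */
    static String canonical(String path) throws IOException
    {
        return new File(path).getCanonicalPath();
    }

    public static void main(String[] args)
    {
        // Hypothetical relative path; nothing at this location needs to exist.
        String path = "no-such-dir" + File.separator + ".." + File.separator + "webapp";
        System.out.println("absolute : " + absolute(path));
        try
        {
            System.out.println("canonical: " + canonical(path));
        }
        catch (IOException e)
        {
            // This is the failure mode callers of getCanonicalPath have to plan for.
            System.out.println("canonical failed: " + e);
        }
    }
}
```

Any caller of getCanonicalPath has to decide what an IOException means for it, which is the extra burden the comment above is pointing at.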

An absolute path does not require extra filesystem permissions and does not follow links; it merely resolves them in place (as a directory or a file).

This gets even more complicated on filesystems that differ from the default, such as network shares, virtual filesystems (git/gitlab/github/perforce/docker to name a few), and even alternate filesystem implementations (seen with increasing frequency in data centers, for example).

An alternative would be Path.toRealPath(LinkOption...) as that would allow for better control over symlink behavior (follow vs resolve) and even detects filesystem loops (FileSystemLoopException) with symlinks, and even invalid symlinks (NotLinkException).
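As a rough sketch of the control that Path.toRealPath offers, the example below creates a symlink in a temporary directory and asks for the real path twice: once following links (the default) and once with LinkOption.NOFOLLOW_LINKS. The class name `RealPathDemo` and the file names are hypothetical, and the sketch assumes a platform and user that are allowed to create symbolic links (typically the case on Linux, not always on Windows).

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.LinkOption;
import java.nio.file.Path;

final class RealPathDemo
{
    /** Returns {file name with links followed, file name with NOFOLLOW_LINKS}. */
    static String[] realNames()
    {
        try
        {
            Path dir = Files.createTempDirectory("realpath-demo");
            Path target = Files.createFile(dir.resolve("foo-1.0.war"));
            Path link = Files.createSymbolicLink(dir.resolve("bar.war"), target);

            // Default behavior: the symlink is followed to its target.
            String followed = link.toRealPath().getFileName().toString();
            // NOFOLLOW_LINKS: the link itself is kept as the final name.
            String notFollowed = link.toRealPath(LinkOption.NOFOLLOW_LINKS).getFileName().toString();

            // Clean up the temporary files.
            Files.delete(link);
            Files.delete(target);
            Files.delete(dir);
            return new String[] {followed, notFollowed};
        }
        catch (IOException e)
        {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args)
    {
        String[] names = realNames();
        System.out.println("followed    : " + names[0]);
        System.out.println("not followed: " + names[1]);
    }
}
```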

The Scanner operating on getCanonicalPath breaks the ability to have symlinks in the /webapps/ directory and deploy under the name of the symlink; it forces the name to be the canonical name, which is what we don't want.
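The naming effect can be reproduced with plain java.io.File, outside of Jetty. The sketch below is an illustration under assumptions: `SymlinkDeployName`, the temporary directories, and the names `bar.war` and `foo-1.0.war` are invented for the example, and it assumes symlink creation is permitted on the host. It does not show how the Scanner or the deployer actually derive names; it only shows which file name each File method reports for a symlink.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

final class SymlinkDeployName
{
    /** Returns {name seen via getAbsolutePath, name seen via getCanonicalPath}. */
    static String[] names()
    {
        try
        {
            // Stand-ins for a webapps directory and a separate storage location.
            Path webapps = Files.createTempDirectory("webapps");
            Path store = Files.createTempDirectory("store");
            Path target = Files.createFile(store.resolve("foo-1.0.war"));
            Path link = Files.createSymbolicLink(webapps.resolve("bar.war"), target);

            File file = link.toFile();
            // Absolute path keeps the symlink's own name.
            String byAbsolute = new File(file.getAbsolutePath()).getName();
            // Canonical path resolves the link and reports the target's name.
            String byCanonical = new File(file.getCanonicalPath()).getName();

            Files.delete(link);
            Files.delete(target);
            Files.delete(store);
            Files.delete(webapps);
            return new String[] {byAbsolute, byCanonical};
        }
        catch (IOException e)
        {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args)
    {
        String[] n = names();
        System.out.println("absolute  name: " + n[0]);
        System.out.println("canonical name: " + n[1]);
    }
}
```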

In short, getCanonicalPath is a bad API for:

- users that care about filesystem permissions on servers
- users that would like to use symlink deployment as the symlink name (be it an XML deployable, a war, or a directory)
- users that have filesystems that are not default for the OS and/or local

janbartel · 2021-04-28T05:21:10Z

I'm cautious about making this change at all. It seems more secure to be both absolute and canonical for the majority of operating systems. If some operating systems have different behaviours with canonicalization what is the risk? That a webapp won't be deployed? That a webapp will be redeployed multiple times for every scan?

BTW the Scanner has always worked on canonical paths. That being the case, how was it possible for the use case with multiple symlinks in the /webapps directory to have produced multiple deployments, or is this a new use case?

joakime · 2021-04-28T09:04:44Z

The testcase for this PR isn't new, it was created back when we underwent the Scanner > PathWatcher > Scanner changes a few years ago.
It was disabled when the Scanner was rewritten during the change back to Scanner.
We used to support proper symlink behavior; the getCanonicalPath change broke it.
The scanInfoMap in the Scanner was added around this time to detect filesystem loops and "have we seen this" behaviors, something that the PathWatcher (and the nio.Files/nio.Paths) already support built-in.

getCanonicalPath (and getCanonicalFile) has a history with this project where we remove it when it becomes a problem (either reported or discovered). It's been done in the WebAppClassloader already, in the Resource layer already, and in jetty-start already.

Do we have usages of getCanonicalPath still around? Yes, but the vast majority is our own test cases (where we don't have to worry about the variety of environments it runs on).

The knowledge of the canonical path in the Scanner has no value, so why break legit usage by insisting on it?
Absolute is sufficient, and causes fewer problems.

gregw · 2021-04-28T10:51:30Z

I'm also cautious of this change. The removal of getCanonicalPath has already caused one significant CVE.
If we do remove it, I think using toRealPath would be better as it allows control over symlink handling.

Most importantly, the handling must be the same in the Scanner as it is in WebAppProvider, but the Scanner is used elsewhere, so changing it may break other things.

Thus I think what we need to do is create an interface like:

    public interface Normalizer
    {
        String normalize(File f) throws IOException;
    }

This can optionally be passed into the constructor of Scanner. If not passed, it is initialised to File::getCanonicalPath.
This keeps the Scanner exactly the same for existing uses. Then the WebAppProvider can have some configuration to say whether it uses canonical paths or not, and/or whether it follows symlinks. This config can be used to select a Normalizer, which will then be used both by the Scanner and the WebAppProvider.
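One way the proposal could look in code is sketched below. Only the Normalizer interface comes from the comment above; `NormalizerSketch`, `SketchScanner`, `keyFor`, and the `CANONICAL`/`ABSOLUTE` constants are hypothetical names used for illustration and are not part of Jetty. The sketch assumes the scanner only needs a string key per file and wraps IOException for brevity.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Objects;

final class NormalizerSketch
{
    /** The interface as proposed in the comment above. */
    interface Normalizer
    {
        String normalize(File f) throws IOException;
    }

    // Hypothetical named strategies; both are plain method references.
    static final Normalizer CANONICAL = File::getCanonicalPath;
    static final Normalizer ABSOLUTE = File::getAbsolutePath;

    /** Hypothetical stand-in for Scanner; only the path-key logic is sketched. */
    static final class SketchScanner
    {
        private final Normalizer normalizer;

        /** Existing callers keep the canonical behavior by default. */
        SketchScanner()
        {
            this(CANONICAL);
        }

        SketchScanner(Normalizer normalizer)
        {
            this.normalizer = Objects.requireNonNull(normalizer);
        }

        /** Key under which a scanned file would be tracked. */
        String keyFor(File f)
        {
            try
            {
                return normalizer.normalize(f);
            }
            catch (IOException e)
            {
                throw new UncheckedIOException(e);
            }
        }
    }

    public static void main(String[] args)
    {
        File f = new File("webapps" + File.separator + "bar.war");
        System.out.println("default : " + new SketchScanner().keyFor(f));
        System.out.println("absolute: " + new SketchScanner(ABSOLUTE).keyFor(f));
    }
}
```

A deployer could hold one Normalizer instance and hand the same instance to the scanner it creates, so both sides agree on path keys regardless of which strategy is configured.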

gregw

see previous comment

joakime · 2021-05-11T11:28:27Z

The Normalizer would then need to be OS and FileSystem specific.
There would be no good "default" that could be documented.

There would just be one using getCanonicalPath for users on Linux with ext3 or ext4, and one using getAbsolutePath for everyone else (Windows, OSX, docker, zfs, etc.).

gregw · 2021-05-11T12:03:15Z

@joakime, it may not even be that. It might be a normalizer with getCanonicalPath for any other direct usages of the Scanner and a normalizer with getAbsolutePath for using the scanner in the DeployerProvider.

It's not about different OSes. It's about not changing the Scanner just so it syncs with the Deployer. By having it pluggable, the deployer can ensure that the same impl is used and is not vulnerable if the Scanner is subsequently changed. It is about non-fragile code.

janbartel · 2021-05-24T00:55:03Z

@joakime when we resolve this issue, can you also take a look at 2 @Disabled tests that touch on this very subject at https://github.com/eclipse/jetty.project/blob/jetty-9.4.x/jetty-deploy/src/test/java/org/eclipse/jetty/deploy/providers/WebAppProviderTest.java#L104 and https://github.com/eclipse/jetty.project/blob/jetty-9.4.x/jetty-deploy/src/test/java/org/eclipse/jetty/deploy/providers/WebAppProviderTest.java#L121 and then tick them off the list at #5684

joakime · 2021-06-04T15:08:42Z

This will need to be reworked, in light of the changes in #6317

Issue #6114 - hot deploy of symlink results in symlink name

e2d5e23

+ Using getAbsolutePath not canonical (too many corner cases with it to be reliable) Signed-off-by: Joakim Erdfelt <joakim.erdfelt@gmail.com>

joakime requested a review from gregw April 12, 2021 20:38

joakime self-assigned this Apr 12, 2021

joakime added the Enhancement label Apr 12, 2021

joakime marked this pull request as draft April 12, 2021 20:40

gregw requested a review from janbartel April 12, 2021 22:46

janbartel requested changes Apr 13, 2021

jetty-util/src/main/java/org/eclipse/jetty/util/Scanner.java

Issue #6114 - more canonical to absolute changes in Scanner

3ee21e1

Signed-off-by: Joakim Erdfelt <joakim.erdfelt@gmail.com>

janbartel requested changes Apr 27, 2021

gregw requested changes May 7, 2021

gregw mentioned this pull request May 24, 2021

Fix #6114 Deploy symlink webapps #6317

Merged

joakime closed this Jun 4, 2021

joakime deleted the jetty-10.0.x-5684-cleanup-webappprovidertest branch June 17, 2021 12:45
