[Apache] Traverse the include path to the root where relevant #7642

joohoi · 2019-12-13T14:29:45Z

This PR allows Certbot to use the full include tree for searches that need to know about a block's ancestors. The search for an enclosing mod_macro block is included in this PR. The functionality also accounts for the possibility of multiple includes.

Because of the limitations of the current parsing engine, we have to search includes from the root up. This also introduces a fairly large performance penalty, especially if done dynamically. This is why a dictionary of Include and IncludeOptional directives and their values is saved to the parser object and refreshed when new includes are added.
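To make the caching idea concrete, here is a minimal sketch of a lazily built dictionary that is dropped whenever a new include is added. The class name `IncludeCache`, the `scan` callable and the `scans` counter are all hypothetical names for illustration and are not the code in this PR.

```python
class IncludeCache:
    """Hypothetical cache of include directives and their values."""

    def __init__(self, scan):
        # `scan` is an assumed callable that performs the expensive search
        # and returns a mapping of directive path -> directive value.
        self._scan = scan
        self._cache = None
        self.scans = 0  # counts how often the expensive search actually ran

    def get(self):
        # Run the expensive search only when no cached copy exists.
        if self._cache is None:
            self._cache = dict(self._scan())
            self.scans += 1
        return self._cache

    def invalidate(self):
        # Call this whenever a new include is added to the configuration.
        self._cache = None
```

The weak point of any such design is a missed `invalidate()` call: lookups keep returning the old dictionary until the cache is explicitly dropped.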

ohemorange · 2020-01-03T00:05:35Z

Before getting into the code, I'd love some high-level clarifications about this PR, probably mostly since it's been a while since before the holidays.

Basically, I'm wondering how much this is worth it. As I understand it, this code is a) pretty specific to the augeas parser implementation, which makes it relatively temporary and b) it involves caching results, which is a prime breeding ground for bugs. This combination might be worth it, but it seems like we're doing this to get a more correct implementation than we previously had (searching down include paths, because the augeas path won't have the full logical path in it). And on top of that, are there searches other than the macro search that's changed in this PR?

Were there other factors I'm missing here? Because based on just these, I'm surprised to see this PR.

joohoi · 2020-01-06T16:35:05Z

The main need for this PR is to allow runtime assertions to be used without having to re-implement, in the apacheparser implementation, a minor bug that has existed in the code for a long time.

The bug exists because we're simply reading the Augeas path of VirtualHost block to determine if it's wrapped in a <Macro> block. While this catches the majority of use cases, there may be configurations that it's not able to parse correctly because of how Augeas paths work.

This would not trigger the bug:

<Macro VHost $domain>
  <VirtualHost *:80>
    ServerName $domain
    ...
  </VirtualHost>
</Macro>
Use VHost example.com
Use VHost example.org

But this would:

# vhost.conf
<VirtualHost *:80>
  ServerName $domain
  ...
</VirtualHost>
# end of vhost.conf

# main.conf
<Macro VHost $domain>
  IncludeOptional vhost.conf
</Macro>
Use VHost example.com
Use VHost example.org
# end of main.conf

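In the buggy example, we would be checking the Augeas path /path/to/vhost.conf/VirtualHost to determine whether the VirtualHost is in a mod_macro block. That path contains no Macro component, because the enclosing <Macro> block lives in main.conf, so the check fails.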
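The difference between the two cases can be sketched as follows. This is an illustration under assumptions, not the code in this PR: the function names, the `includers` mapping and the example Augeas paths are all hypothetical.

```python
def in_macro_by_path(aug_path):
    """Path-only check: look for a Macro component in the Augeas path."""
    return "/macro/" in aug_path.lower()


def in_macro_with_includes(aug_path, includers):
    """Also follow Include/IncludeOptional directives towards the root.

    `includers` is an assumed mapping from a file path to the Augeas
    paths of every directive that includes that file.
    """
    seen = set()
    stack = [aug_path]
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # guard against include cycles
        seen.add(current)
        if in_macro_by_path(current):
            return True
        # Queue every directive that includes the file `current` lives in.
        for filename, include_paths in includers.items():
            if current.startswith("/files" + filename + "/"):
                stack.extend(include_paths)
    return False
```

Each file maps to a list of including directives, so a file that is included from several places is checked along every include path.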

ohemorange

The overall logic seems to be implemented correctly, but we should simplify it where possible. As it stands it's a bit tricky to follow, which makes it more error-prone than I'm comfortable with.

ohemorange · 2020-01-16T01:59:27Z

certbot-apache/certbot_apache/_internal/apache_util.py

+    # Removes ../ etc.
+    path = os.path.normpath(path)
+    if not os.path.isabs(path):
+        path = os.path.join(root, path)


It's probably not a problem, but I'd feel more comfortable if the normpath call came after the join, just in case the root isn't already normalized.
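A small sketch of the difference between the two orderings. The function names are hypothetical, and `posixpath` is used in place of `os.path` only so the result is the same on every platform.

```python
import posixpath


def resolve_normalize_first(root, path):
    # Ordering from the diff: normalize the path, then join it to the root.
    path = posixpath.normpath(path)
    if not posixpath.isabs(path):
        path = posixpath.join(root, path)
    return path


def resolve_join_first(root, path):
    # Suggested ordering: join first, then normalize the combined path.
    if not posixpath.isabs(path):
        path = posixpath.join(root, path)
    return posixpath.normpath(path)


if __name__ == "__main__":
    # A root that is not normalized survives the first ordering untouched.
    print(resolve_normalize_first("/etc/apache2/sites/..", "vhost.conf"))
    print(resolve_join_first("/etc/apache2/sites/..", "vhost.conf"))
```

With the join-first ordering, any `..` components in the root, or at the start of a relative path, are collapsed in the final result. With the normalize-first ordering they can remain.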

ohemorange · 2020-01-17T00:56:54Z