Improve consistency between Git and API formula handling #12936

Bo98 · 2022-02-28T14:36:56Z

I intended to open a PR for this a while back but never did.

It's WIP and I'll hopefully revisit it within the next month. There have been minimal changes since I last presented this (the main change is that keg_only_reason is now supported).

BrewTestBot · 2022-02-28T14:37:11Z

Review period will end on 2022-03-01 at 14:36:56 UTC.

MikeMcQuaid

This is amazing. A bunch of comments but would really like to land this sooner rather than later.

MikeMcQuaid · 2022-02-28T15:18:10Z

Library/Homebrew/brew.sh

@@ -758,7 +758,7 @@ then
  export HOMEBREW_DEVELOPER_MODE="1"
 fi

-if [[ -n "${HOMEBREW_INSTALL_FROM_API}" && -n "${HOMEBREW_DEVELOPER_COMMAND}" ]]
+if [[ -n "${HOMEBREW_INSTALL_FROM_API}" && -n "${HOMEBREW_DEVELOPER_COMMAND}" && "${HOMEBREW_COMMAND}" != "irb" ]]


Wondering if we should turn this into a more explicit denylist or just remove this entirely?

Could probably remove this, since I want to allow brew ruby too. @Rylan12 will have a better idea about this.

Yeh, that makes sense to me. bump, command, dispatch-build-bottle, generate-man-completions, install-bundler-gems, irb, linkage, pr-publish, prof, release, rubocop, ruby, sh, sponsors, style, tap-new, tests, typecheck, unpack, update-license-data, update-maintainers, update-test, vendor-gems all look like they should work without a tapped homebrew/core.

Yeah, there's no reason to exclude HOMEBREW_INSTALL_FROM_API from devs for those commands (except maybe brew style formula and unpack), it's just easier to maintain that developers shouldn't use it. Otherwise, we need to maintain this list to make sure that if commands get an online component they are removed from the list (and vice versa).
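The allowlist idea discussed here could be sketched in Ruby roughly as follows. This is a minimal illustration, not Homebrew's actual implementation: the constant `API_SAFE_DEVELOPER_COMMANDS` and the method `refuse_developer_command?` are hypothetical names, and the command list is only a subset of the commands suggested above.

```ruby
# Hypothetical allowlist; the entries are a subset of the commands suggested
# in the comment above, not a list taken from Homebrew itself.
API_SAFE_DEVELOPER_COMMANDS = %w[irb ruby sh rubocop typecheck tests].freeze

# Returns true when a developer command should be refused because
# HOMEBREW_INSTALL_FROM_API is set and the command is not on the allowlist.
def refuse_developer_command?(env, command, developer_command:)
  return false if env["HOMEBREW_INSTALL_FROM_API"].to_s.empty?
  return false unless developer_command

  !API_SAFE_DEVELOPER_COMMANDS.include?(command)
end
```

The maintenance cost mentioned above shows up here directly: any command that gains or loses a dependency on a local homebrew/core clone requires a change to the list.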

Another, better option, is probably just to complain on a per-command basis. We can remove the restriction on developers and just fail immediately in brew bump-formula-pr and friends if the user has HOMEBREW_INSTALL_FROM_API.

brew style formula could still work for non-core formulae and brew unpack could still be useful to unpack even without a local formula IMO.

We can remove the restriction on developers and just fail immediately in brew bump-formula-pr and friends if the user has HOMEBREW_INSTALL_FROM_API.

This would work for me 👍🏻
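A per-command check along these lines might look like the sketch below. The error class `InstallFromAPIUnsupportedError` and the helper `ensure_not_install_from_api!` are hypothetical names chosen for illustration; only the environment variable name comes from the discussion.

```ruby
# Hypothetical error class, for illustration only.
class InstallFromAPIUnsupportedError < StandardError; end

# Raises immediately when HOMEBREW_INSTALL_FROM_API is set, so that a command
# needing a full homebrew/core clone fails before doing any work.
def ensure_not_install_from_api!(command, env = ENV)
  return if env["HOMEBREW_INSTALL_FROM_API"].to_s.empty?

  raise InstallFromAPIUnsupportedError,
        "`brew #{command}` needs a full homebrew/core clone; " \
        "unset HOMEBREW_INSTALL_FROM_API to use it."
end
```

Each affected command would call the helper at the top of its entry point, which keeps the restriction next to the code that actually needs the clone.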

Library/Homebrew/cmd/update.sh

Library/Homebrew/formulary.rb

MikeMcQuaid · 2022-02-28T15:25:58Z

Library/Homebrew/formulary.rb

+    if !CoreTap.instance.installed? &&
+       Homebrew::EnvConfig.install_from_api? &&


I wonder whether we want a testing mode where you can always install from the API, even if the CoreTap is installed?

This makes sense, though I'm not sure if it should be a part of this PR or not.

Yeh, can punt on that being part of this PR if necessary. Just think it'd be a nice decoupling at some point.

Library/Homebrew/api/formula.rb

BrewTestBot · 2022-03-01T15:08:08Z

Review period ended.

Rylan12

This looks really good to me!

Have you tested this with an install yet? I feel like something is missing still. Before, the bottles were downloaded using Homebrew::API::Bottle.fetch_bottles. This added some caching thing to Formulary so that BottleLoader would recognize the ref (I'm pretty sure) and load there. The API load happens last in Formulary::loader_for, so I worry that there will be some unexpected consequences of that.

Also, we can probably remove Formulary.map_formula_name_to_local_bottle_path and the methods in Homebrew::API::Bottle (and maybe the entire API), right?

Rylan12

I've spent today taking a much more thorough look at this PR for the API project. Here are a few questions.

One thing that I'm still looking into is caching in Formulary. Currently, when you load a formula from a path, the formula class is cached in Formulary#cache under the path name. However, when loading from the API (as is currently set up), the class is still cached but in a different way. Instead of caching in Formulary#cache, we simply check the Formulary::FormulaNamespaceAPI module to see if the class is already defined and, if so, return it (although only after re-updating the build flags). I think this has the same effect, but I wonder if we should be consistent and cache in Formulary#cache like we do "normally."

Library/Homebrew/formulary.rb

Library/Homebrew/api/formula.rb

Bo98 · 2022-06-13T20:55:46Z

Currently, when you load a formula from a path, the formula class is cached in Formulary#cache under the path name. However, when loading from the API (as is currently set up), the class is still cached but in a different way. Instead of caching in Formulary#cache, we simply check the Formulary::FormulaNamespaceAPI module to see if the class is already defined and, if so, return it (although only after re-updating the build flags). I think this has the same effect, but I wonder if we should be consistent and cache in Formulary#cache like we do "normally."

Caching is something I've largely ignored. I feel like we should probably investigate what we have as we currently have three caching mechanisms: factory caching, Formulary#cache and the namespace management. The latter is needed for marshalling reasons etc and wasn't introduced for caching (but is able to work as one), so it's worth investigating whether the former two are actually adding much on top of that.

One thing to remember: we should make sure a formula loaded from file and a formula created via API are not seen as the same cache entry. The scenario where this might happen is a build-from-source flow (theoretically, as this doesn't actually exist yet).

MikeMcQuaid · 2022-06-14T07:54:09Z

Caching is something I've largely ignored. I feel like we should probably investigate what we have as we currently have three caching mechanisms: factory caching, Formulary#cache and the namespace management. The latter is needed for marshalling reasons etc and wasn't introduced for caching (but is able to work as one), so it's worth investigating whether the former two are actually adding much on top of that.

Agreed that this is worth investigating 👍🏻

Rylan12 · 2022-06-14T17:58:04Z

I've looked a bit more into the caching and it looks like we do two things:

We always cache when loading a formula from a file path. When loading a formula from a file, we first check to see if the path is in the cache and if so simply return the class that is cached. Here, the formula class (e.g. Formulary::FormulaNamespaceb684604c8244a5905bc797f4e22cc31f::Wget) is cached
We sometimes cache all formulae, regardless of how they're loaded. This is the "factory cache" and needs to be explicitly enabled (which is only done in uses, deps, and unbottled at the moment). When enabled, we create a cache key from the parameters passed to ::factory and compare that with the factory cache. If there's a match, we return that formula and skip the rest of the loading process. Here, the formula instance is cached
If we load from the API or a formula's contents (i.e. from a bottle), we don't have any caching

I'd suggest that we scope the cache to be type-dependent. Meaning, having a separate cache for loading from path and from the API. That way, there's no risk of accidental overlap if we somehow try to load the same formula from the API and a file.
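A type-scoped cache of this kind could be sketched as below. `ScopedFormulaCache` is a hypothetical class written for illustration and is not how Formulary is implemented; the `:path` and `:api` type names are likewise assumptions.

```ruby
# Hypothetical cache keyed first by load type (e.g. :path or :api) and then
# by an identifier, so entries of different types can never collide.
class ScopedFormulaCache
  def initialize
    @store = Hash.new { |hash, type| hash[type] = {} }
  end

  # Returns the cached value for (type, key), computing it with the block
  # on a miss and storing the result.
  def fetch(type, key)
    bucket = @store[type]
    return bucket[key] if bucket.key?(key)

    bucket[key] = yield
  end
end

cache = ScopedFormulaCache.new
cache.fetch(:path, "wget") { :loaded_from_file }
cache.fetch(:api, "wget") { :loaded_from_api } # separate entry, no overlap
```

Because the type is part of the lookup, loading the same formula name from a file and from the API produces two independent entries.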

We also could add caching when loading from a bottle, potentially using the bottle path as a cache key. We could also use e.g. a hash of the contents as a cache key. This might help speed up loading from a bottle since we won't need to read the file contents each time, but is also probably outside the scope of this PR.
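For the content-hash variant, a cache key could be derived as in this small sketch. `contents_cache_key` is a hypothetical helper name, and SHA-256 is an assumed choice of digest rather than anything decided in this thread.

```ruby
require "digest"

# Hypothetical helper: derive a stable cache key from formula source text.
def contents_cache_key(contents)
  Digest::SHA256.hexdigest(contents)
end
```

One trade-off worth noting: a content hash still requires reading the contents to compute the key, whereas keying on the bottle path does not, so the two options differ in what work they actually save.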

Rylan12 · 2022-06-14T22:36:13Z

Okay, I think I'm done with this for today. At the moment, loading from the API does work. I was successfully able to uninstall and reinstall formulae.

For consistency, one important thing to note is that loading formulae and casks from the API will take precedence over loading from an installed keg/cask. This is intentional since it mimics the way things work without the API: the most recent version is loaded, even if an installed version is older. Doing this will allow lots of if Homebrew::EnvConfig.install_from_api? calls to be removed since the formula that's loaded will always be assumed to be the latest version.
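The precedence described here can be illustrated with a small sketch. `pick_source` and its keyword arguments are hypothetical and greatly simplified; the real loader selection in Formulary involves more cases than these two.

```ruby
# Hypothetical, simplified source selection: when installing from the API,
# API data wins over an installed keg, mirroring how the latest formula file
# wins over an older installed version without the API.
def pick_source(name, api_formulae:, installed_kegs:, install_from_api:)
  return :api if install_from_api && api_formulae.include?(name)
  return :keg if installed_kegs.include?(name)

  nil
end
```

With this ordering, callers can assume the loaded formula is the latest known version without checking the environment themselves.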

I'm still working through all of those changes, and I'll mark this PR as "ready" once I've made those and have done more testing. But, I'll gladly accept feedback on what's been done so far since I've made some more substantial changes to the original commits.

With these changes, I've also been able to remove the Homebrew::API::Bottle code since we don't really need the bottle API anymore. Eventually, I think it may make sense to remove that API altogether since it has several flaws (e.g. it doesn't include build/test dependencies, doesn't know that different OSes can have different information, etc.) and I don't think it really provides any information that can't be found using our other APIs. That can be a conversation for the future, though.

Overall, I'm very pleased with how this approach is looking since I've been able to remove a ton of those conditionals. It makes everything feel much more integrated and less like an add-on.

Bo98 · 2022-06-15T00:27:16Z

For consistency, one important thing to note is that loading formulae and casks from the API will take precedence over loading from an installed keg/cask. This is intentional since it mimics the way things work without the API: the most recent version is loaded, even if an installed version is older.

Yeah I agree. Formula files in kegs can get stale. Casks use their equivalent a lot more than the formula side and there have been countless bugs caused by that due to our deprecation turnover. One of the goals here was to avoid needing to use them.

+ we need the latest information anyway for brew outdated etc

Overall, I'm very pleased with how this approach is looking since I've been able to remove a ton of those conditionals. It makes everything feel much more integrated and less like an add-on.

Excellent. That's exactly what I wanted to see the API code become. It's a lot easier to maintain not having to have two different code paths everywhere. The idea was to fix brew info etc without actually touching cmd/info.rb.

Rylan12

The PR is now ready to move out of the draft stage. I've done some testing locally, and haven't encountered any issues. It feels super seamless and much more stable. Plus, a ton of code was able to be removed so that the only places where we need to check whether HOMEBREW_INSTALL_FROM_API is set are in Formulary, Cask::CaskLoader, Caskroom, and a few places only to make sure that having homebrew/core and homebrew/cask untapped isn't an issue.

There are still a few things that I want to work on that should happen in separate PRs:

Removing the restrictions on HOMEBREW_INSTALL_FROM_API for developers (except for certain commands that need full clones)
The new brew update process will need to be looked at to make sure that things like tap/formula migrations still are noticed, and potentially failing brew update if the cached formula.json file can't be downloaded
There are certain commands (e.g. brew update) that feel like they run slower now since they need to parse the huge formula.json file. I'm not sure yet what the best solution is, but I wonder if we can further improve performance for some of these commands.
Adding a way to test without needing to move homebrew/core and homebrew/cask so that they aren't installed

Bo98 · 2022-06-15T23:15:51Z

The new brew update process will need to be looked at to make sure that things like tap/formula migrations still are noticed

Is there even an API endpoint for that yet?

There are certain commands (e.g. brew update) that feel like they run slower now since they need to parse the huge formula.json file.

How long does parsing the file take?

One thing to address at some point (not now) is download integrity.

This could be using standards like JWS, or potentially something more custom if we really want to shoehorn it into existing endpoints.
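As a rough idea of what JWS-based verification involves, the sketch below checks a compact JWS signed with RS256 using only Ruby's standard library. Everything here is an assumption for illustration: the helper names are hypothetical, RS256 is just one possible algorithm, and key distribution, rotation, and the actual endpoint format are not addressed.

```ruby
require "openssl"
require "json"

# Base64url helpers (unpadded), as used by the JWS compact serialization.
def b64url_encode(bytes)
  [bytes].pack("m0").tr("+/", "-_").delete("=")
end

def b64url_decode(text)
  padded = text + ("=" * ((4 - (text.length % 4)) % 4))
  padded.tr("-_", "+/").unpack1("m0")
end

# Hypothetical verifier: returns the payload string when the RS256 signature
# is valid for the given public key, or nil otherwise.
def verify_jws_rs256(token, public_key)
  header_b64, payload_b64, signature_b64 = token.split(".")
  return nil if [header_b64, payload_b64, signature_b64].any?(&:nil?)

  header = JSON.parse(b64url_decode(header_b64))
  return nil unless header["alg"] == "RS256"

  # The signature covers the encoded header and payload joined by a dot.
  signing_input = "#{header_b64}.#{payload_b64}"
  signature = b64url_decode(signature_b64)
  valid = public_key.verify(OpenSSL::Digest.new("SHA256"), signature, signing_input)
  valid ? b64url_decode(payload_b64).force_encoding(Encoding::UTF_8) : nil
rescue ArgumentError, JSON::ParserError
  nil
end
```

A client would call the verifier on the downloaded document before parsing it, and treat a nil result as a failed download.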

Rylan12 · 2022-06-15T23:40:27Z

Is there even an API endpoint for that yet?

Nope

How long does parsing the file take?

Here are the results of some tests I just ran using hyperfine:

| Command | Average Time Without API | Average Time With API |
| --- | --- | --- |
| `brew outdated` | 702.9 ms | 5.076 s |
| `brew ruby -e 'Formulary.factory("abcde")'` | 935.7 ms | 1.416 s |

I wonder if there's something else going on in brew outdated that explains why it takes roughly 7 times longer. More testing can definitely be done.
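For reference, the slowdown factors implied by the timings in the table can be worked out with a short calculation. The `slowdown` helper is a hypothetical name; the numbers are copied from the table, converted to seconds.

```ruby
# Ratio of the mean time with the API to the mean time without it,
# rounded to one decimal place. Both arguments are in seconds.
def slowdown(without_api, with_api)
  (with_api / without_api).round(1)
end

outdated_factor = slowdown(0.7029, 5.076) # brew outdated
factory_factor  = slowdown(0.9357, 1.416) # Formulary.factory("abcde")

puts "brew outdated: #{outdated_factor}x"
puts "Formulary.factory: #{factory_factor}x"
```

This gives about 7.2x for `brew outdated` and about 1.5x for the single `Formulary.factory` call, which suggests most of the extra time in `brew outdated` is not explained by loading one formula.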

One thing to address at some point (not now) is download integrity.

This could be using standards like JWS, or potentially something more custom if we really want to shoehorn it into existing endpoints.

Good point, thanks for bringing it up. I don't really know anything about this kind of thing so I'll have to look into it more in the future

Bo98 · 2022-06-16T00:01:41Z

Good point, thanks for bringing it up. I don't really know anything about this kind of thing so I'll have to look into it more in the future

I know about JWS at least so if we go that route feel free to ask me about it at the time.

MikeMcQuaid · 2022-06-16T09:29:41Z

For consistency, one important thing to note is that loading formulae and casks from the API will take precedence over loading from an installed keg/cask. This is intentional since it mimics the way things work without the API: the most recent version is loaded, even if an installed version is older.

Yeah I agree. Formula files in kegs can get stale. Casks use their equivalent a lot more than the formula side and there have been countless bugs caused by that due to our deprecation turnover. One of the goals here was to avoid needing to use them.

we need the latest information anyway for brew outdated etc

Also agreed 👍🏻

I wonder if there's something else going on in brew outdated that explains why it takes roughly 7 times longer. More testing can definitely be done.

Tried playing with brew prof here? If the answer is "I'm not sure how to do that": shout and I'll give you a hand.

MikeMcQuaid · 2022-06-16T09:34:59Z

Library/Homebrew/brew.sh

@@ -764,7 +764,7 @@ then
  export HOMEBREW_DEVELOPER_MODE="1"
 fi

-if [[ -n "${HOMEBREW_INSTALL_FROM_API}" && -n "${HOMEBREW_DEVELOPER_COMMAND}" ]]
+if [[ -n "${HOMEBREW_INSTALL_FROM_API}" && -n "${HOMEBREW_DEVELOPER_COMMAND}" && "${HOMEBREW_COMMAND}" != "irb" ]]


👍🏻 for now. I'm thinking we may want to have a longer list of commands we allow here.

I already have a PR in the works for this

Library/Homebrew/cmd/update.sh

Library/Homebrew/formulary.rb

MikeMcQuaid · 2022-06-16T09:39:39Z

Great work @Rylan12 and @Bo98. Happy to see this merged as-is and we can iterate further!

Co-authored-by: Mike McQuaid <mike@mikemcquaid.com>

Rylan12 · 2022-06-16T20:12:01Z

Great! Thanks for getting this started and helping out, @Bo98!

Bo98 added the in progress Maintainers are working on this label Feb 28, 2022

BrewTestBot added the waiting for feedback Merging is blocked until sufficient time has passed for review label Feb 28, 2022

MikeMcQuaid approved these changes Feb 28, 2022

BrewTestBot removed the waiting for feedback Merging is blocked until sufficient time has passed for review label Mar 1, 2022

BrewTestBot approved these changes Mar 1, 2022

Rylan12 reviewed Mar 1, 2022

Bo98 mentioned this pull request Mar 30, 2022

Skip build deps to avoid downloading bottles #13065

Merged

7 tasks

Bo98 mentioned this pull request Apr 6, 2022

set prefer_loading_from_api: true for brew fetch #13089

Merged

7 tasks

Bo98 changed the title ~~Support offline usage under HOMEBREW_INSTALL_FROM_API~~ Improve consistency between Git and API formula handling Jun 13, 2022

Rylan12 reviewed Jun 13, 2022

Bo98 and others added 7 commits June 14, 2022 16:06

Support offline usage under HOMEBREW_INSTALL_FROM_API

1d36c42

Add bottle rebuild when loading from API

944d7ee

Align API loading with other formula loading

827acd3

Remove unnecessary code

e53ccbc

Remove Bottle API

89483ab

Fix style

90c6aef

Update cached formula json file when needed

43f7fa4

Rylan12 force-pushed the api-offline branch from 0cc825d to 43f7fa4 Compare June 14, 2022 20:12

Cleanup

ccd46af

Rylan12 added 3 commits June 15, 2022 16:35

Streamline loading casks from API

1e53621

Remove unnecessary HOMEBREW_INSTALL_FROM_API checks

cff0122

Don't ignore errors when loading from the API

996ca83

Rylan12 marked this pull request as ready for review June 15, 2022 21:12

Rylan12 reviewed Jun 15, 2022

Add test

0113774

MikeMcQuaid approved these changes Jun 16, 2022

Rylan12 and others added 7 commits June 16, 2022 13:26

Cleanup

dd81ca5

Expand Formulary test coverage

dd516e4

Add more API test coverage

8c8c696

Fix dependency check test

2adfdae

Fix dependency check test again

78aa927

Clarify TODO in brew update

98f8a86

Co-authored-by: Mike McQuaid <mike@mikemcquaid.com>

Fix style

f724dde

Rylan12 merged commit d23dba6 into Homebrew:master Jun 16, 2022

Bo98 deleted the api-offline branch June 16, 2022 20:12

This was referenced Jun 16, 2022

Formulary Improvements with HOMEBREW_INSTALL_FROM_API #13437

Merged

Allow more developer commands with HOMEBREW_INSTALL_FROM_API #13439

Merged

github-actions bot added the outdated PR was locked due to age label Jul 17, 2022

github-actions bot locked as resolved and limited conversation to collaborators Jul 17, 2022
