cmd/go: 'go test' fails to cache test result due to modified `.git` directory #52573

matthewmueller · 2022-04-26T16:39:02Z

What version of Go are you using (`go version`)?

$ go version
go version go1.18 darwin/amd64

Does this issue reproduce with the latest release?

Yes

What operating system and processor architecture are you using (`go env`)?

go env Output

$ go env
GO111MODULE="on"
GOARCH="amd64"
GOBIN=""
GOCACHE="/Users/m/Library/Caches/go-build"
GOENV="/Users/m/Library/Application Support/go/env"
GOEXE=""
GOEXPERIMENT=""
GOFLAGS=""
GOHOSTARCH="amd64"
GOHOSTOS="darwin"
GOINSECURE=""
GOMODCACHE="/Users/m/dev/pkg/mod"
GONOPROXY=""
GONOSUMDB=""
GOOS="darwin"
GOPATH="/Users/m/dev"
GOPRIVATE=""
GOPROXY="https://proxy.golang.org,direct"
GOROOT="/usr/local/go"
GOSUMDB="sum.golang.org"
GOTMPDIR=""
GOTOOLDIR="/usr/local/go/pkg/tool/darwin_amd64"
GOVCS=""
GOVERSION="go1.18"
GCCGO="gccgo"
GOAMD64="v1"
AR="ar"
CC="clang"
CXX="clang++"
CGO_ENABLED="1"
GOMOD="/Users/m/dev/src/github.com/matthewmueller/hack/test-cache/go.mod"
GOWORK=""
CGO_CFLAGS="-g -O2"
CGO_CPPFLAGS=""
CGO_CXXFLAGS="-g -O2"
CGO_FFLAGS="-g -O2"
CGO_LDFLAGS="-g -O2"
PKG_CONFIG="pkg-config"
GOGCCFLAGS="-fPIC -arch x86_64 -m64 -pthread -fno-caret-diagnostics -Qunused-arguments -fmessage-length=0 -fdebug-prefix-map=/var/folders/4f/tcxcr6_55v9bp38d8g4hjlf80000gn/T/go-build1711552986=/tmp/go-build -gno-record-gcc-switches -fno-common"

What did you do?

This took me a while to whittle down. My tests were running unbearably slowly and never seemed to cache. I'm still investigating the slowness (this issue describes the symptoms), but I'm able to reproduce the caching issue:

1. Create the following directory tree:

.
└── one_test.go

With the following one_test.go:

package test

import (
	"os"
	"testing"
)

func TestUncacheable(t *testing.T) {
	_, err := os.ReadDir(".")
	if err != nil {
		t.Fatal(err)
	}
}

2. Run go mod init
3. Run git init
4. Run go test ./one_test.go multiple times

$ go test ./one_test.go 
ok      command-line-arguments  0.094s
$ go test ./one_test.go 
ok      command-line-arguments  0.093s

What did you expect to see?

I expected to see it cache after the second run:

$ go test ./one_test.go 
ok      command-line-arguments  (cached)

What did you see instead?

It never caches. If you remove .git, it starts caching again. Oddly, the test still caches after adding other kinds of directories, such as a/b or .a/b. There seems to be something special about .git.

Why do you need this?

My test suite needs to look up the go.mod of the current project directory. With this logic in place, my tests never cache.


matthewmueller · 2022-04-26T17:23:07Z

Narrowing it down a bit further, I ran GODEBUG=gocachetest=1 go test ./one_test.go -run "TestUncacheable", then removed the irrelevant logs.

Here's a diff between runs. The modtime of .git changes every time.

  HASH[open]: "go1.18"
  HASH[open]: "stat 160 800001ed 2022-04-26 12:07:04.587272446 -0500 CDT true\n"
  HASH[open]: "file .git "
- HASH[open]: "stat 288 800001ed 2022-04-26 12:07:22.744082806 -0500 CDT true\n"
+ HASH[open]: "stat 288 800001ed 2022-04-26 12:07:42.335762299 -0500 CDT true\n"
  HASH[open]: "file go.mod "
  HASH[open]: "stat 58 1a4 2022-04-26 11:34:12.067721337 -0500 CDT false\n"
  HASH[open]: "file one_test.go "
  HASH[open]: "stat 144 1a4 2022-04-26 11:53:56.630154997 -0500 CDT false\n"
- HASH[open]: 8e895090d73fda1474da8cd9819d3196677d2ffd2cc3e6f7a401b2c6cc09abf4
- HASH[testInputs]: "open /Users/m/dev/src/github.com/matthewmueller/hack/test-cache 8e895090d73fda1474da8cd9819d3196677d2ffd2cc3e6f7a401b2c6cc09abf4\n"
- HASH[testInputs]: 397c0fa5c27c402f4b5be6f2ce32780bbb718ae956556f493dc111fb38089581
+ HASH[open]: 53356a05f8ddd1917cffb837b75f5de58859b18675069e571929e60b2ff7aa5e
+ HASH[testInputs]: "open /Users/m/dev/src/github.com/matthewmueller/hack/test-cache 53356a05f8ddd1917cffb837b75f5de58859b18675069e571929e60b2ff7aa5e\n"
+ HASH[testInputs]: c43d67ffb3fff2baa47ae3a2caef13367214dcf1f3217124453bc035daa4db87

It looks like the logic is in here:

go/src/cmd/go/internal/test/test.go

Line 1742 in 1158520

fmt.Fprintf(h, "file %s ", f.Name())
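
To illustrate why a single modtime flips the whole result, here is a minimal sketch of a directory hash built from per-entry stat data. It is modeled on the HASH[open] lines in the log above and is not the actual cmd/go source; the names dirHash and demo, and the exact format strings, are assumptions made for this example.

```go
package main

import (
	"crypto/sha256"
	"fmt"
	"os"
	"path/filepath"
	"time"
)

// dirHash hashes the name and stat data (size, mode, modtime, is-dir) of
// every entry in dir, similar in spirit to the "file"/"stat" log lines.
func dirHash(dir string) (string, error) {
	entries, err := os.ReadDir(dir)
	if err != nil {
		return "", err
	}
	h := sha256.New()
	for _, e := range entries {
		info, err := e.Info()
		if err != nil {
			return "", err
		}
		fmt.Fprintf(h, "file %s ", e.Name())
		fmt.Fprintf(h, "stat %d %x %v %v\n", info.Size(), uint64(info.Mode()), info.ModTime(), info.IsDir())
	}
	return fmt.Sprintf("%x", h.Sum(nil)), nil
}

// demo builds a throwaway directory holding go.mod and .git, hashes it,
// moves only the modtime of .git, and hashes it again.
func demo() (before, after string, err error) {
	root, err := os.MkdirTemp("", "test-cache-demo")
	if err != nil {
		return "", "", err
	}
	defer os.RemoveAll(root)

	gitDir := filepath.Join(root, ".git")
	if err := os.Mkdir(gitDir, 0o755); err != nil {
		return "", "", err
	}
	if err := os.WriteFile(filepath.Join(root, "go.mod"), []byte("module example\n"), 0o644); err != nil {
		return "", "", err
	}

	// Pin .git to a known modtime so the comparison does not depend on
	// filesystem timestamp granularity.
	t0 := time.Date(2022, 4, 26, 12, 0, 0, 0, time.UTC)
	if err := os.Chtimes(gitDir, t0, t0); err != nil {
		return "", "", err
	}
	if before, err = dirHash(root); err != nil {
		return "", "", err
	}

	// Nothing else changes: same entries, same sizes, same modes.
	t1 := t0.Add(20 * time.Second)
	if err := os.Chtimes(gitDir, t1, t1); err != nil {
		return "", "", err
	}
	if after, err = dirHash(root); err != nil {
		return "", "", err
	}
	return before, after, nil
}

func main() {
	before, after, err := demo()
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		return
	}
	fmt.Println("before:", before)
	fmt.Println("after: ", after)
	fmt.Println("changed:", before != after)
}
```

Because the modtime of each entry is part of the hashed text, touching .git alone is enough to produce a different digest, even though go.mod and one_test.go are untouched.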

seankhliao · 2022-04-26T17:35:52Z

can you try -buildvcs=false and/or 1.18.1?

matthewmueller · 2022-04-26T17:46:51Z

Update: could be related to my ~/.zshrc config since I'm not able to reproduce this in /bin/sh or in docker

Edit: thanks for the message @seankhliao, I'll try that next if my current lead doesn't pan out.

matthewmueller · 2022-04-26T17:54:33Z

Welp, that was a tough one, it was related to my git prompt.

# Add git to the terminal prompt
git_prompt() {
  # Don't go any further if we're not in a git repo
#  git rev-parse --is-inside-work-tree &> /dev/null || return

  # Stylize
  echo -n "$color_red"
#  echo -n "$(git_branch)"
  echo -n "$color_reset"
  echo -n "$color_purple"
#  echo -n "$(git_progress)"
  echo -n "$color_reset"
  echo -n "$color_yellow"
  echo -n "$(git_dirty)"
  echo -n "$color_reset"
}


# fastest possible way to check if repo is dirty
git_dirty() {
  # If the git status has *any* changes (e.g. dirty), echo our character
  if [[ -n "$(git status --porcelain 2> /dev/null)" ]]; then
    echo " !"
  fi
}

It appears that git status --porcelain changes the modtime on the .git directory.
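
A small Go sketch for checking this kind of thing: it records a directory's modtime, runs an action, and reports whether the modtime moved. The helper names modTimeChanged and selfCheck are made up for this example. Creating or removing an entry directly inside a directory updates that directory's modtime; whether that is what git status does inside .git is an assumption here, not something confirmed in this thread.

```go
package main

import (
	"fmt"
	"os"
	"os/exec"
	"path/filepath"
	"time"
)

// modTimeChanged reports whether dir's modtime differs before and after
// running action.
func modTimeChanged(dir string, action func() error) (bool, error) {
	before, err := os.Stat(dir)
	if err != nil {
		return false, err
	}
	if err := action(); err != nil {
		return false, err
	}
	after, err := os.Stat(dir)
	if err != nil {
		return false, err
	}
	return !after.ModTime().Equal(before.ModTime()), nil
}

// selfCheck exercises modTimeChanged on a throwaway directory, without git:
// listing the directory should not move its modtime, creating a file should.
func selfCheck() (onWrite, onRead bool, err error) {
	dir, err := os.MkdirTemp("", "modtime-demo")
	if err != nil {
		return false, false, err
	}
	defer os.RemoveAll(dir)

	// Start from an old, known modtime so timestamp granularity cannot
	// hide the change.
	old := time.Date(2022, 4, 26, 12, 0, 0, 0, time.UTC)
	if err := os.Chtimes(dir, old, old); err != nil {
		return false, false, err
	}
	onRead, err = modTimeChanged(dir, func() error {
		_, err := os.ReadDir(dir)
		return err
	})
	if err != nil {
		return false, false, err
	}
	onWrite, err = modTimeChanged(dir, func() error {
		return os.WriteFile(filepath.Join(dir, "scratch.lock"), nil, 0o644)
	})
	if err != nil {
		return false, false, err
	}
	return onWrite, onRead, nil
}

func main() {
	// Only meaningful when run from the root of a git repository.
	if _, err := os.Stat(".git"); err != nil {
		fmt.Println("no .git directory here; nothing to check")
		return
	}
	changed, err := modTimeChanged(".git", func() error {
		return exec.Command("git", "status", "--porcelain").Run()
	})
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		return
	}
	fmt.Println(".git modtime changed:", changed)
}
```

If two runs happen within the filesystem's timestamp granularity, the comparison against a live .git may miss a change, so a false result is weaker evidence than a true one.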

Thanks for the suggestion @seankhliao!

matthewmueller · 2022-04-27T02:39:29Z

I'd actually like to re-open this one, but perhaps more like a feature request. Why is go test looking at .git in the first place?

I might be missing something, but I'd expect go test caching to work something like this:

"Walk the Go files starting from the test, then its imports, imports of imports, hashing all the files it encounters. Now that we have embedded files, we can discover those as well".

A similar solution is how go mod tidy expects to find dependencies that it can reach. If you've got tools, you need to manually link them into the dependency tree.

I can share more code, but here's the recursive function to hopefully show what I mean:

func findDeps(fset *fileSet, module *gomod.Module, dir string) (err error) {
	imported, err := build.Default.Import(".", dir, build.ImportMode(0))
	if err != nil {
		return err
	}
	// Add all the Go files
	for _, path := range imported.GoFiles {
		fset.Add(filepath.Join(dir, path))
	}
	// Add all the embeds
	// TODO: resolve patterns
	for _, path := range imported.EmbedPatterns {
		fset.Add(filepath.Join(dir, path))
	}
	// Traverse imports and compute a hash
	eg := new(errgroup.Group)
	for _, importPath := range imported.Imports {
		importPath := importPath
		eg.Go(func() error {
			if !shouldWalk(module, importPath) {
				return nil
			}
			dir, err := module.ResolveDirectory(importPath)
			if err != nil {
				return err
			}
			relPath, err := filepath.Rel(module.Directory(), dir)
			if err != nil {
				return err
			}
			if err := findDeps(fset, module, relPath); err != nil {
				return err
			}
			return nil
		})
	}
	return eg.Wait()
}

Feel free to close again or I can open a different issue. Just after spending some time debugging the test cache, it feels like it could be a lot faster.

bcmills · 2022-04-28T20:35:14Z

The stat of .git is AFAICT a direct consequence of the test calling os.ReadDir:
https://cs.opensource.google/go/go/+/master:src/os/dir_unix.go;l=128;drc=master

The git prompt may be #51767, which should be fixed in Go 1.18.1.

@matthewmueller, does that explain the behavior you were seeing? Is there anything left to be done here?

matthewmueller · 2022-04-29T00:34:44Z

Thanks for the response @bcmills!

Yep, I eventually learned that os.ReadDir(".") is stating .git. I also learned that .git gets modified every time you run git status.

The reason I re-opened this is because I still wonder:

Why does go test care about the modtime of .git (or any non-dependency) in its caching decision? I feel like the Go team took a very pragmatic approach with go mod tidy, where it discovers all dependencies by traversing imports. Any unknown tool needs to be explicitly depended upon in a tools.go. Would it make sense to do the same with go test? I understand that go test caching came before go modules, but is it worth proposing this as a future improvement?

While the above seems wasteful, it isn't my top-level problem. My top-level problem is that, because all these files are considered in the caching decision, the setup and teardown of these tests take longer than the tests themselves. I commented on that in this issue.

seankhliao · 2022-06-05T17:13:34Z

Your test results may depend on the result of os.ReadDir("."). It is a dependency, and go can't tell if it actually mattered, so the caches are (conservatively) invalidated.
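
If a test only needs to locate go.mod, one possible way to narrow what it touches is to stat that specific file in each parent directory instead of listing a directory. The sketch below is illustrative only: findGoMod and demo are hypothetical names, and it assumes go test records each os.Stat call against its own path, which has not been verified here against cmd/go.

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"path/filepath"
)

// findGoMod walks upward from start and returns the first go.mod it finds.
// It stats one specific path per directory and never lists a directory,
// so sibling entries such as .git are not inspected.
func findGoMod(start string) (string, error) {
	dir, err := filepath.Abs(start)
	if err != nil {
		return "", err
	}
	for {
		candidate := filepath.Join(dir, "go.mod")
		info, err := os.Stat(candidate)
		if err == nil && !info.IsDir() {
			return candidate, nil
		}
		if err != nil && !errors.Is(err, os.ErrNotExist) {
			return "", err
		}
		parent := filepath.Dir(dir)
		if parent == dir {
			return "", errors.New("go.mod not found")
		}
		dir = parent
	}
}

// demo builds root/go.mod, root/.git and root/a/b in a throwaway directory,
// then looks up go.mod starting from root/a/b.
func demo() (found, want string, err error) {
	tmp, err := os.MkdirTemp("", "find-gomod-demo")
	if err != nil {
		return "", "", err
	}
	defer os.RemoveAll(tmp)

	root, err := filepath.Abs(tmp)
	if err != nil {
		return "", "", err
	}
	nested := filepath.Join(root, "a", "b")
	if err := os.MkdirAll(nested, 0o755); err != nil {
		return "", "", err
	}
	if err := os.Mkdir(filepath.Join(root, ".git"), 0o755); err != nil {
		return "", "", err
	}
	want = filepath.Join(root, "go.mod")
	if err := os.WriteFile(want, []byte("module example\n"), 0o644); err != nil {
		return "", "", err
	}
	found, err = findGoMod(nested)
	if err != nil {
		return "", "", err
	}
	return found, want, nil
}

func main() {
	found, want, err := demo()
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		return
	}
	fmt.Println("found:", found)
	fmt.Println("match:", found == want)
}
```

The trade-off is that the test now depends on the existence and stat data of each go.mod candidate it probes, which is still a dependency, just a narrower one than the full listing returned by os.ReadDir(".").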

matthewmueller closed this as completed Apr 26, 2022

matthewmueller reopened this Apr 27, 2022

bcmills added the WaitingForInfo label (Issue is not actionable because of missing required information, which needs to be provided.) Apr 28, 2022

bcmills changed the title ~~go/test: Uncacheable Test~~ cmd/go: 'go test' fails to cache test result due to modified .git directory Apr 28, 2022

seankhliao closed this as completed Jun 5, 2022

golang locked and limited conversation to collaborators Jun 5, 2023

gopherbot added the FrozenDueToAge label Jun 5, 2023
