
x/tools/gopls: inconsistent performance on hashicorp/terraform-provider-aws #60621

Closed

findleyr opened this issue Jun 6, 2023 · 5 comments

Assignee: adonovan
Labels: gopls (Issues related to the Go language server, gopls), NeedsFix (The path to resolution is known, but the work has not been done), Soon (This needs to be done soon: regressions, serious bugs, outages), Tools (This label describes issues relating to any tools in the x/tools repository)
Milestone: gopls/v0.12.3

findleyr (Contributor) commented Jun 6, 2023

Discovered by way of a user survey: the gopls analysis driver in v0.12.0 shows very inconsistent performance.

Repro:

  1. Clone https://github.com/hashicorp/terraform-provider-aws.
  2. Open a small package, e.g. ./internal/types. Everything is great, and gopls uses much less memory than v0.11.0, as expected.
  3. Open ./internal/provider, and everything goes boom: analysis uses ~50GB of memory (and counting...?).

Given that gopls can type-check the repository expediently, this is likely a bug in the new analysis driver.

CC @adonovan

@findleyr findleyr added the NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. label Jun 6, 2023
@findleyr findleyr added this to the gopls/v0.12.3 milestone Jun 6, 2023
@gopherbot gopherbot added Tools This label describes issues relating to any tools in the x/tools repository. gopls Issues related to the Go language server, gopls. labels Jun 6, 2023
@adonovan adonovan self-assigned this Jun 6, 2023
gopherbot commented

Change https://go.dev/cl/501207 mentions this issue: gopls/internal/lsp/cache: use forEachPackage for analysis

adonovan (Member) commented Jun 7, 2023

This is a really fascinating issue that has been distracting us from conference talk prep! Long story short, the analysis driver exhibits pathological memory allocation in some larger workspaces, because its simple one-pass, top-down recursion repeatedly decodes the same import and fact data over and over again. The solution is something conceptually equivalent to the "batching" done by the main type-checking loop, which uses a two-pass (bottom-up) approach. The two-pass approach allows a "batch" of type-checking operations to share the same graph of symbols, rather than each unit being a singleton batch, permitting re-use of already-decoded type export data. (For analysis, this would apply to decoded facts too.)
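
To make the difference concrete, here is a toy Go sketch (not gopls code; the package graph, decode counter, and function names are all invented for illustration) contrasting the one-pass top-down recursion, where each analysis unit re-decodes its whole import graph, with a bottom-up pass that shares one set of decoded packages across the batch:

```go
// Toy model contrasting singleton-batch top-down decoding with a
// shared bottom-up batch. "Decoding" stands in for reading export
// data and facts for an imported package.
package main

import "fmt"

type pkg struct {
	id      string
	imports []*pkg
}

var decodes int // counts simulated export-data/fact decodes

// topDown decodes the transitive imports of p from scratch, the way a
// singleton-batch unit would: shared dependencies are decoded once per path.
func topDown(p *pkg) {
	decodes++
	for _, imp := range p.imports {
		topDown(imp)
	}
}

// bottomUp decodes each package at most once, memoizing results in a
// set shared by the whole batch.
func bottomUp(p *pkg, seen map[string]bool) {
	if seen[p.id] {
		return
	}
	seen[p.id] = true
	for _, imp := range p.imports {
		bottomUp(imp, seen)
	}
	decodes++
}

func main() {
	// A small diamond-heavy graph: many paths reach the same leaf.
	leaf := &pkg{id: "leaf"}
	var mids []*pkg
	for i := 0; i < 5; i++ {
		mids = append(mids, &pkg{id: fmt.Sprintf("mid%d", i), imports: []*pkg{leaf}})
	}
	root := &pkg{id: "root", imports: mids}

	decodes = 0
	topDown(root)
	fmt.Println("top-down decodes:", decodes) // 11: leaf decoded once per path

	decodes = 0
	bottomUp(root, map[string]bool{})
	fmt.Println("bottom-up decodes:", decodes) // 7: each package decoded once
}
```

On a graph with heavily shared dependencies the top-down count grows with the number of import paths, while the bottom-up count stays equal to the number of packages, which is the effect described above.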

One way to implement this would be to add batching to the analysis driver itself. Another would be to use the main type-checking loop (forEachPackage) directly, though at the cost of the source+export+facts-based pruning that the analysis driver already does. (To be clear, that pruning is a second-order benefit compared to the cost of not batching.) We quickly sketched the latter in the attached CL and found that it greatly improves analysis warm-up time. However, in our experimental haste, we deleted the optimization that applies only a subset of fact-using analyzers to dependencies, and it turns out this is surprisingly important.
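
For reference, a minimal sketch of that kind of filtering (assuming the standard x/tools analysis API; factProducers and the analyzer choices are illustrative, not what gopls actually does): only analyzers that declare FactTypes need to run on dependency packages, because only their facts can influence diagnostics in the packages the user has open.

```go
package main

import (
	"fmt"

	"golang.org/x/tools/go/analysis"
	"golang.org/x/tools/go/analysis/passes/printf"
	"golang.org/x/tools/go/analysis/passes/unusedresult"
)

// factProducers keeps only the analyzers that export facts; on a
// dependency package, the others cannot affect any importing package.
func factProducers(all []*analysis.Analyzer) []*analysis.Analyzer {
	var out []*analysis.Analyzer
	for _, a := range all {
		if len(a.FactTypes) > 0 {
			out = append(out, a)
		}
	}
	return out
}

func main() {
	all := []*analysis.Analyzer{printf.Analyzer, unusedresult.Analyzer}
	for _, a := range factProducers(all) {
		// printf exports printf-wrapper facts, so it must still run on
		// dependencies; unusedresult exports none, so it is skipped there.
		fmt.Println("run on dependencies:", a.Name)
	}
}
```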

There is clearly more work to be done here to achieve the performance goals we wanted for 0.12, but so far, other than this opt-out survey, we don't have any direct communication from users or issues filed to suggest that there's a wider problem. (It's not clear why the problem manifests so clearly in this hashicorp repo but not in k8s, which has very similar graph metrics: nodes, edges, median and p95 arity, etc. Perhaps there are some unusually large types.Packages in this project.)

findleyr (Contributor, Author) commented

Another instance of this in #60711.

@findleyr findleyr added NeedsFix The path to resolution is known, but the work has not been done. Soon This needs to be done soon. (regressions, serious bugs, outages) and removed NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. labels Jun 10, 2023
gopherbot commented

Change https://go.dev/cl/503195 mentions this issue: gopls/internal/lsp/cache: reduce importing in analysis

2uasimojo commented

Can confirm the fix via gopls 0.12.3 for previously problematic scenarios with https://github.com/openshift/hive/.

Thanks!
