Added non-conflicting hash for install files by MarconZet · Pull Request #1454 · bazel-contrib/rules_jvm_external

MarconZet · 2025-09-29T16:01:28Z

Summary

This commit introduces lock file version 3 with per-artifact hashing instead of a single global hash.

This per-artifact hashing approach can reduce the number of merge conflicts when multiple people update the canonical versions in a large monorepo.

The code still supports reading v2 lock files - it checks for v3 first, then falls back to v2, then v1. Users with older lock files will see a message to repin.
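The fallback order described above could be sketched roughly as follows. This is a Python illustration, not the Starlark in the PR: `detect_lock_file_version` and `needs_repin` are hypothetical names, and treating "anything else" as v1 is an assumption. Only the shapes of the v2 and v3 hash fields come from the description.

```python
from typing import Any, Dict


def detect_lock_file_version(lock_file: Dict[str, Any]) -> int:
    """Return 3, 2 or 1 depending on the shape of the parsed lock file."""
    input_hash = lock_file.get("__INPUT_ARTIFACTS_HASH")
    # v3: the hash field is a dictionary keyed by artifact coordinate.
    if isinstance(input_hash, dict):
        return 3
    # v2: the hash field is a single integer for the whole file.
    if isinstance(input_hash, int):
        return 2
    # Assumption for this sketch: anything else is handled as v1.
    return 1


def needs_repin(lock_file: Dict[str, Any]) -> bool:
    """Older lock files still load, but the user is asked to repin."""
    return detect_lock_file_version(lock_file) < 3
```

Checking for the newest format first means a v3 file is never misread as an older one.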

Key Changes

Lock File Format Change (v2 → v3)

Before (v2): __INPUT_ARTIFACTS_HASH and __RESOLVED_ARTIFACTS_HASH were single integer values
After (v3): Both are now dictionaries mapping each artifact coordinate to its individual hash

Example in maven_install.json:

// Old format 
"__INPUT_ARTIFACTS_HASH": 1994476565, 
"__RESOLVED_ARTIFACTS_HASH": -274973469,
// New format
"__INPUT_ARTIFACTS_HASH": { "com.google.guava:guava": 733518530, "junit:junit": -652553691, "..." }, 
"__RESOLVED_ARTIFACTS_HASH": { "com.google.guava:guava": -1587873388, "..." }

Hash Computation Changes (private/rules/v3_lock_file.bzl:53-108)

The new _compute_lock_file_hash_v3 function computes individual hashes per artifact that include:

The artifact's own info (coordinates, SHA sums)
The repository it came from
Hashes of all transitive dependencies (dependency-aware hashing)
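A minimal sketch of dependency-aware hashing, assuming each artifact's hash combines its own info with the hashes of everything it transitively depends on. This is Python rather than the PR's Starlark, and the function names, the SHA-256 based `stable_hash`, and the input shapes are all assumptions made for illustration, not the contents of `v3_lock_file.bzl`.

```python
import hashlib
import json
from typing import Dict, List, Set


def stable_hash(value: object) -> int:
    """Deterministic signed 32-bit hash of a JSON-serializable value."""
    data = json.dumps(value, sort_keys=True).encode("utf-8")
    unsigned = int.from_bytes(hashlib.sha256(data).digest()[:4], "big")
    return unsigned - (1 << 32) if unsigned >= (1 << 31) else unsigned


def transitive_deps(key: str, deps: Dict[str, List[str]]) -> Set[str]:
    """Collect every artifact reachable from `key`, tolerating cycles."""
    seen: Set[str] = set()
    stack = list(deps.get(key, []))
    while stack:
        current = stack.pop()
        if current in seen or current == key:
            continue
        seen.add(current)
        stack.extend(deps.get(current, []))
    return seen


def per_artifact_hashes(
    artifacts: Dict[str, dict], deps: Dict[str, List[str]]
) -> Dict[str, int]:
    """Hash each artifact together with its transitive dependencies."""
    # Hash of the artifact's own info (coordinates, SHA sums, repository).
    own = {key: stable_hash(info) for key, info in artifacts.items()}
    result: Dict[str, int] = {}
    for key in sorted(artifacts):
        closure = sorted(dep for dep in transitive_deps(key, deps) if dep in own)
        # Fold in the own-hash of every transitive dependency, in sorted order.
        result[key] = stable_hash([own[key]] + [[dep, own[dep]] for dep in closure])
    return result
```

With hashes built this way, changing one artifact only changes the entries for that artifact and for the artifacts that depend on it, so two edits to unrelated subgraphs touch different lines of the lock file.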

Input Hash Changes (private/rules/coursier.bzl:334-386)

compute_dependency_inputs_signature now returns a dictionary of per-artifact hashes plus backward-compatible v1/v2 signatures.

shs96c · 2025-10-07T09:56:31Z

This is looking really good. I like the idea of only having conflicts if the transitive deps have changed.

vinnybod · 2025-10-09T16:21:32Z

-    for (String key : keys) {
-      toHash.put(key, rendered.get(key));
+  @SuppressWarnings("unchecked")
+  private static Map<String, Integer> calculateArtifactHash(Map<String, Object> rendered) {


@shs96c question for a potential breaking change in the next major update.

It seems like this code and the code in v3_lock_file.bzl are similar. IIRC, the Starlark implementation exists for the case where the user doesn't have a lockfile.
If that is the case, is there a possibility to consolidate around the Java code (which is easiest to test, tbh) by forcing lockfile usage?

We'd likely want to consolidate on the starlark version of the code, since that's the one that's used by people when they verify the signatures.

I agree that it would be the best solution, but it's not simple to implement.

Starlark code runs in the analysis phase, while this Java code runs in the execution phase, so it's impossible to do without a minor rewrite of the flow.

shs96c · 2025-11-04T12:18:32Z

@MarconZet, I'm waiting until you move this out of draft before reviewing. Please LMK when you're ready!

MarconZet · 2025-12-08T11:41:50Z

@shs96c any progress on the review?

thomasbao12 · 2025-12-08T16:57:28Z

Could we add a description to the PR like:

Summary

This commit introduces lock file version 3 with per-artifact hashing instead of a single global hash. The main purpose is to create "non-conflicting" hashes that allow more granular change detection in the maven dependency lock files.

Key Changes

Lock File Format Change (v2 → v3)

Before (v2): __INPUT_ARTIFACTS_HASH and __RESOLVED_ARTIFACTS_HASH were single integer values
After (v3): Both are now dictionaries mapping each artifact coordinate to its individual hash

Example in maven_install.json:
// Old format
"__INPUT_ARTIFACTS_HASH": 1994476565,
"__RESOLVED_ARTIFACTS_HASH": -274973469,

// New format
"__INPUT_ARTIFACTS_HASH": {
"com.google.guava:guava": 733518530,
"junit:junit": -652553691,
...
},
"__RESOLVED_ARTIFACTS_HASH": {
"com.google.guava:guava": -1587873388,
...
}

File Renames

v2_lock_file.bzl → v3_lock_file.bzl
V2LockFile.java → V3LockFile.java
V2LockFileTest.java → V3LockFileTest.java

Hash Computation Changes (private/rules/v3_lock_file.bzl:53-108)

The new _compute_lock_file_hash_v3 function computes individual hashes per artifact that include:

The artifact's own info (coordinates, SHA sums)
The repository it came from
Hashes of all transitive dependencies (dependency-aware hashing)

Input Hash Changes (private/rules/coursier.bzl:334-386)

compute_dependency_inputs_signature now returns a dictionary of per-artifact hashes plus backward-compatible v1/v2 signatures.

Command-line Interface Change (pin_dependencies.bzl)

Changed from --input_hash (single value) to --input-hash-path (path to JSON file containing the hash dictionary).
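To illustrate the shape of this change, here is a hedged Python sketch of a tool that accepts `--input-hash-path` and loads the hash dictionary from the JSON file it points to. Only the flag name comes from the PR; `parse_input_hashes` and the validation are hypothetical, and the real consumer of the flag is not shown in this thread.

```python
import argparse
import json
from typing import Dict, List


def parse_input_hashes(argv: List[str]) -> Dict[str, int]:
    """Read the per-artifact input hashes from the file named on the command line."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--input-hash-path",
        required=True,
        help="Path to a JSON file mapping artifact coordinates to hashes",
    )
    args = parser.parse_args(argv)
    with open(args.input_hash_path, encoding="utf-8") as handle:
        hashes = json.load(handle)
    if not isinstance(hashes, dict):
        raise ValueError("expected a JSON object of coordinate -> hash")
    return {str(key): int(value) for key, value in hashes.items()}
```

Passing a path instead of a value avoids putting a potentially very large dictionary on the command line.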

Backward Compatibility

The code still supports reading v2 lock files - it checks for v3 first, then falls back to v2, then v1. Users with older lock files will see a message to repin.

Purpose

This per-artifact hashing approach allows the system to detect exactly which artifacts changed, rather than just knowing "something changed." This is useful for incremental updates and more precise cache invalidation.

honnix · 2025-12-18T06:21:57Z

We tried this patch and so far it has been working well. There is one thing though. In case of mismatched signature,

rules_jvm_external/private/rules/coursier.bzl

Line 568 in e95b9d7

    
           "%s_install.json contains an invalid signature (expected %s and got %s) and may be corrupted. " % (

prints out a huge single line of artifact SHAs, one for each and every artifact. In our case it causes ~2GB of logs.
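One way to keep such a message bounded is to report only the entries that differ and cap how many are shown. The sketch below is a Python illustration under that assumption. `describe_mismatch` and its `limit` parameter are hypothetical and are not the change that was later made in coursier.bzl.

```python
from typing import Dict


def describe_mismatch(
    expected: Dict[str, int], actual: Dict[str, int], limit: int = 10
) -> str:
    """Summarize differing per-artifact hashes, showing at most `limit` entries."""
    keys = sorted(
        key
        for key in set(expected) | set(actual)
        if expected.get(key) != actual.get(key)
    )
    lines = [
        "%s: expected %s, got %s" % (key, expected.get(key), actual.get(key))
        for key in keys[:limit]
    ]
    if len(keys) > limit:
        lines.append("... and %d more" % (len(keys) - limit))
    return "\n".join(lines)
```

Artifacts whose hashes match never appear in the output, so the message grows with the size of the change rather than the size of the lock file.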

MarconZet · 2026-01-07T15:48:42Z

@honnix I changed the code. It should print errors better now.

honnix · 2026-01-08T09:18:47Z

@honnix I changed the code. It should print errors better now.

Nice! Thank you. We will take the new patch and try it out.

vinnybod · 2026-01-12T22:50:38Z

FWIW, we've been using this at Confluent for a month now and it has been working well.

shs96c · 2025-10-21T07:24:47Z

-    for (String key : keys) {
-      toHash.put(key, rendered.get(key));
+  @SuppressWarnings("unchecked")
+  private static Map<String, Integer> calculateArtifactHash(Map<String, Object> rendered) {


We'd likely want to consolidate on the starlark version of the code, since that's the one that's used by people when they verify the signatures.

shs96c · 2025-12-05T10:48:50Z


+def _add_to_hash_dictionary(dictionary, artifact, salt):
+    artifact_dict = json.decode(artifact)
+    key = artifact_dict["group"] + ":" + artifact_dict["artifact"]


You should use the same logic that's in Coordinates.asKey() to get a stable key that includes things like the classifier. That's already in coordinates.bzl as to_key

hmm, this key has a direct mapping to what appears in the lock file

My thinking was that I don't want things like classifier and packaging in the key, for two reasons:

It looked ugly in the lock file. When I tried it in my repo, the size of __INPUT_ARTIFACTS_HASH doubled because of the :sources entries, which carried no extra information and were just noise.

I think that at this level, we don't want to allow a non-conflicting merge.

shs96c · 2025-12-05T10:49:40Z

    if boms and len(boms):
        for bom in sorted(boms):
            artifact_inputs.append(_stable_artifact(bom))
+            _add_to_hash_dictionary(all_hashes, bom, "bom")


Why is the salt needed for artifacts and boms? They should have unique coordinates no matter what.

I did this thinking about artifact and excluded_artifact.

In the v2 hash, excluding an artifact would change the hash because the hash order changed. In the group:artifact: HASH notation, excluding an artifact would not change the hash, so we need a salt.

The rest is about code design principles: adding a salt everywhere is easier than adding a salt only to excluded_artifact.
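The argument for the salt can be illustrated with a small Python sketch. It assumes that entries sharing a `group:artifact` key are folded into one value and that the salt is mixed into each entry's hash. The folding by addition, the `stable_hash` helper, and the salt strings `"artifact"` and `"excluded_artifact"` are assumptions for illustration; only the key construction and the `"bom"` salt appear in the diff.

```python
import hashlib
import json
from typing import Dict


def stable_hash(text: str) -> int:
    """Deterministic unsigned 32-bit hash of a string."""
    return int.from_bytes(hashlib.sha256(text.encode("utf-8")).digest()[:4], "big")


def add_to_hash_dictionary(dictionary: Dict[str, int], artifact: str, salt: str) -> None:
    """Fold one JSON-encoded artifact into the entry for its group:artifact key."""
    artifact_dict = json.loads(artifact)
    key = artifact_dict["group"] + ":" + artifact_dict["artifact"]
    # The salt records which list the entry came from (artifact, bom, exclusion).
    entry = stable_hash(salt + "|" + artifact)
    # Assumption: entries with the same key are combined order-independently.
    dictionary[key] = (dictionary.get(key, 0) + entry) & 0xFFFFFFFF


guava = json.dumps({"group": "com.google.guava", "artifact": "guava"}, sort_keys=True)

as_dependency: Dict[str, int] = {}
add_to_hash_dictionary(as_dependency, guava, "artifact")

as_exclusion: Dict[str, int] = {}
add_to_hash_dictionary(as_exclusion, guava, "excluded_artifact")
```

Both dictionaries have the single key `com.google.guava:guava`, but the values differ, so moving a coordinate from the artifact list to the exclusion list still changes the recorded hash.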

shs96c · 2026-01-19T17:44:15Z

Let me handle the rebase, and I'll merge this when I've done so.

shs96c · 2026-01-20T16:10:30Z

Ah! I can't do the rebase. Could you please handle that?

shs96c · 2026-01-21T16:04:24Z

The test failures look related to this change. The V3LockFileTest is failing.

MarconZet · 2026-01-21T20:46:04Z

@shs96c I forgot to add some files. It should be OK now.

shs96c

Let's go! LGTM

* master: (25 commits)
  fix: use forward slash separator in Maven purl format (bazel-contrib#1530)
  Load rules from specific bzl files and add sh_test imports (bazel-contrib#1529)
  Added non-conflicting hash for install files (bazel-contrib#1454)
  Update the maven and coursier resolver tests to create a class index file. (bazel-contrib#1519)
  [ci] Drop Bazel 6 and ensure we run on Bazel 7 and 8 (bazel-contrib#1525)
  Only allow modules specified in known_contributing_modules to contribute artifacts or boms to the root module (bazel-contrib#1523)
  [gradle] Fix false resolution failures when BOM upgrades dependency version (bazel-contrib#1520)
  [gradle] Fix Gradle resolver to respect force_version and include runtime dependencies (bazel-contrib#1516)
  Correctly merge BOMs from non-root modules (bazel-contrib#1518)
  Update more lock files
  Filter test_only artifacts out of artifacts merged into root repos and print a warning when a root artifact version is overridden by a non_root bazel_dep (bazel-contrib#1511)
  Fix SHA mismatch for conflicting dependency versions (bazel-contrib#1513)
  [gradle] Plumb through the force_version attribute (bazel-contrib#1515)
  [gradle] Add dep exclusions to only that dep (bazel-contrib#1514)
  [gradle] Handle aggregating dependencies and relocation version conflicts (bazel-contrib#1512)
  BOM Fixes (bazel-contrib#1506)
  Allow an optional index of dep -> class to be created (bazel-contrib#1492)
  Put files in `ResolutionResult` (bazel-contrib#1484)
  Optimize dependency graph building with O(1) lookups (bazel-contrib#1483)
  Provide a mechanism to list all resolved direct deps for a workspace (bazel-contrib#1510)
  ...

* master:
  Add presubmit check for prebuilt jars (bazel-contrib#1486)
  Upload artifacts in parallel (address artifactorys "Maven Snapshot Version Behaviour") (bazel-contrib#1524)
  feat: Support COURSIER_SHA256 environment variable (bazel-contrib#1527)
  fix: Do not add coursier opts when run other tools (bazel-contrib#1531)
  fix: add string attributes to `amend_artifact` for explicit unset state (bazel-contrib#1499)
  fix: use forward slash separator in Maven purl format (bazel-contrib#1530)
  Load rules from specific bzl files and add sh_test imports (bazel-contrib#1529)
  Added non-conflicting hash for install files (bazel-contrib#1454)
  Update the maven and coursier resolver tests to create a class index file. (bazel-contrib#1519)
  [ci] Drop Bazel 6 and ensure we run on Bazel 7 and 8 (bazel-contrib#1525)
  Only allow modules specified in known_contributing_modules to contribute artifacts or boms to the root module (bazel-contrib#1523)
  [gradle] Fix false resolution failures when BOM upgrades dependency version (bazel-contrib#1520)

MarconZet commented Oct 2, 2025

Comment thread tests/custom_maven_install/regression_testing_gradle_install.json

vinnybod reviewed Oct 9, 2025

Comment thread private/rules/v3_lock_file.bzl Outdated

vinnybod reviewed Oct 9, 2025

MarconZet force-pushed the master branch from 6cd5c90 to 490fcea Compare October 22, 2025 13:03

MarconZet closed this Oct 22, 2025

MarconZet force-pushed the master branch from 490fcea to 61cd272 Compare October 22, 2025 13:05

MarconZet reopened this Oct 22, 2025

MarconZet closed this Nov 19, 2025

MarconZet force-pushed the master branch from 1d1bf99 to 61cd272 Compare November 19, 2025 11:57

MarconZet reopened this Nov 19, 2025

MarconZet marked this pull request as ready for review November 19, 2025 13:17

MarconZet requested review from cheister, jin and shs96c as code owners November 19, 2025 13:17

Added non-conflicting hash for install files

c275cf3

MarconZet force-pushed the master branch from 3e6dc14 to c275cf3 Compare November 21, 2025 15:21

vinnybod mentioned this pull request Dec 10, 2025

V3 lockfiles confluentinc/rules_jvm_external#69

Merged

MarconZet mentioned this pull request Dec 11, 2025

maven_install.json should be mergeable by Git when two non-overlapping dependency subgraphs are updated #758

Closed

fixed error message lenght

4511397

shs96c reviewed Jan 14, 2026

MarconZet added 2 commits January 14, 2026 19:46

fixed typo

6be2e2e

Merge remote-tracking branch 'main/master' into mc

4f02a3e

fixed test^

8d6eb56

shs96c mentioned this pull request Jan 16, 2026

Prepare for 6.10 release #1521

Merged

4 tasks

Merge remote-tracking branch 'main/master'

f691948

forgot^

1ad1395

shs96c enabled auto-merge (squash) January 21, 2026 21:04

shs96c approved these changes Jan 21, 2026

shs96c merged commit 50b2011 into bazel-contrib:master Jan 21, 2026
6 checks passed

5 participants
