Conversation

@paulojmdias
Member

@paulojmdias paulojmdias commented Sep 15, 2025

Description

This PR adds support for a configurable refresh_interval to the resourcedetectionprocessor.
When set, resource detection re-runs periodically in the background, so resource attributes (e.g., cloud metadata, container tags) can be updated without restarting the collector.

Refresh is disabled by default, so the current behavior is preserved.
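
For context, the new option would surface in the processor's Go config roughly as follows (a sketch based on the description above; field names, tags, and defaults are assumptions, not copied from the diff):

```go
package resourcedetectionprocessor

import "time"

// Config sketch: only the parts relevant to this PR are shown.
type Config struct {
	// Detectors lists the detectors to run, e.g. []string{"ec2"}.
	Detectors []string `mapstructure:"detectors"`

	// RefreshInterval, when non-zero, re-runs resource detection
	// periodically in the background. The zero-value default disables
	// refreshing and preserves the current one-shot behavior.
	RefreshInterval time.Duration `mapstructure:"refresh_interval"`
}
```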

Link to tracking issue

Fixes #42663

Testing

I tested this in AWS by updating tags while the processor was running.

Unit tests covering the new behavior were also added.

Documentation

Updated README.md to document the new parameter.

@paulojmdias paulojmdias marked this pull request as ready for review September 18, 2025 22:07
@paulojmdias paulojmdias requested review from a team and dashpole as code owners September 18, 2025 22:07
@github-actions github-actions bot added the processor/resourcedetection Resource detection processor label Sep 18, 2025
@github-actions github-actions bot requested a review from Aneurysm9 September 18, 2025 22:07
@atoulme
Contributor

atoulme commented Sep 19, 2025

This seems like a neat enhancement.

@github-actions
Contributor

This PR was marked stale due to lack of activity. It will be closed in 14 days.

@github-actions github-actions bot added the Stale label Oct 10, 2025
@paulojmdias
Member Author

/label -stale

@mx-psi
Member

mx-psi commented Oct 10, 2025

@dashpole @Aneurysm9 this is waiting for your review

@paulojmdias
Member Author

/label -stale

@github-actions github-actions bot removed the Stale label Oct 11, 2025
@paulojmdias
Member Author

Tests are failing due to #43625

Comment on lines 888 to 890
- It may take some time for newly detected resource attributes to be applied.
- Changes to resource attributes can result in the creation of new metric time series.
- Frequent refreshes can increase resource usage and should be configured carefully.
Member

This documentation could use a clarification pass. What is "some time"? How frequent is too frequent to refresh detected resource attributes? Should there be a lower bound on the refresh_interval?

Member Author

Good points! I've clarified the documentation:

- Specific timing: newly detected attributes are applied within "up to refresh_interval duration".
- Performance guidance: values below 5 minutes can increase resource usage; values below 1 minute are strongly discouraged.
- No enforced minimum, but clear recommendations are provided.
- Added context about the metric cardinality impact.
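
For reference, the clarified README notes might read along these lines (a sketch of the wording, not the exact diff):

- Newly detected resource attributes may take up to refresh_interval to be applied.
- Changes to resource attributes can create new metric time series, increasing cardinality.
- Values below 5 minutes can increase resource usage; values below 1 minute are strongly discouraged. No minimum is enforced.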

defer p.mu.RUnlock()
res = p.detectedResource
if res == nil {
	return pcommon.NewResource(), "", nil
Contributor

Should we initialize res as pcommon.NewResource() so we don't need this check?

Member Author

I've refactored this to initialize the return values upfront with their defaults, so we don't need the nil check anymore.
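
A minimal sketch of that shape, assuming the provider type and snapshot fields seen in the surrounding snippets:

```go
package internal

import (
	"sync"

	"go.opentelemetry.io/collector/pdata/pcommon"
)

// resourceSnapshot mirrors the fields referenced in the snippets above
// (assumed shape).
type resourceSnapshot struct {
	resource  pcommon.Resource
	schemaURL string
	err       error
}

type ResourceProvider struct {
	mu               sync.RWMutex
	detectedResource *resourceSnapshot
}

// Get initializes its return values with safe defaults upfront, so no nil
// check on the result is needed when nothing has been detected yet.
func (p *ResourceProvider) Get() (res pcommon.Resource, schemaURL string, err error) {
	res = pcommon.NewResource()

	p.mu.RLock()
	defer p.mu.RUnlock()

	if p.detectedResource != nil {
		res = p.detectedResource.resource
		schemaURL = p.detectedResource.schemaURL
		err = p.detectedResource.err
	}
	return res, schemaURL, err
}
```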


p.detectedResource.resource = res
p.detectedResource.schemaURL = mergedSchemaURL
// Only fail if ALL detectors failed.
Contributor

Is this the current behavior? It seems like we should fail if any of the detectors failed...

Member Author

I've rephrased the comment to be clearer about the partial success behavior.
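
To illustrate the partial-success semantics (a sketch; the detector interface below is a minimal stand-in for the processor's internal one, not its actual definition):

```go
package internal

import (
	"context"
	"errors"

	"go.opentelemetry.io/collector/pdata/pcommon"
)

// detector is a minimal stand-in for the processor's detector interface.
type detector interface {
	Detect(ctx context.Context) (pcommon.Resource, string, error)
}

// detectAll tolerates individual detector failures and merges whatever
// succeeded; it returns an error only when every detector failed.
func detectAll(ctx context.Context, detectors []detector) (pcommon.Resource, string, error) {
	merged := pcommon.NewResource()
	var schemaURL string
	var errs []error

	for _, d := range detectors {
		res, url, err := d.Detect(ctx)
		if err != nil {
			errs = append(errs, err)
			continue
		}
		if url != "" {
			schemaURL = url
		}
		// Merge this detector's attributes into the combined resource.
		res.Attributes().Range(func(k string, v pcommon.Value) bool {
			v.CopyTo(merged.Attributes().PutEmpty(k))
			return true
		})
	}

	// Only fail if ALL detectors failed; partial success is success.
	if len(detectors) > 0 && len(errs) == len(detectors) {
		return merged, schemaURL, errors.Join(errs...)
	}
	return merged, schemaURL, nil
}
```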

// Keep the last good snapshot if the refresh errored OR returned an empty resource.
if hadPrevSuccess && (err != nil || IsEmptyResource(res)) {
	p.logger.Warn("resource refresh yielded empty or error; keeping previous snapshot", zap.Error(err))
	return prev.resource, prev.schemaURL, prev.err
Contributor

If we return the previous error, won't that cause refreshLoop to log the previous error? That seems very confusing if the error differs from our current error.

Member Author

Yeah, you're right, thanks for pointing that out. Since prev.err is guaranteed to be nil here (from the hadPrevSuccess check), I've changed it to explicitly return nil, which makes the intent clear and avoids any confusion in the logs.
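
The adjusted block would then look roughly like this (same fragment as above, with only the return value changed):

```go
// Keep the last good snapshot if the refresh errored OR returned an empty resource.
if hadPrevSuccess && (err != nil || IsEmptyResource(res)) {
	p.logger.Warn("resource refresh yielded empty or error; keeping previous snapshot", zap.Error(err))
	// hadPrevSuccess implies prev.err == nil, so return nil explicitly
	// instead of prev.err to avoid resurfacing a stale error in the logs.
	return prev.resource, prev.schemaURL, nil
}
```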

	rdp.wg.Add(1)
	go rdp.refreshLoop(client)
}
return nil
Contributor

Should we move the initial attempt at detection to Start so that we don't need the initOnce logic in the resource provider?

Member Author

I've moved the initial detection to Start(), which now calls Refresh() directly, removed the initOnce field entirely, and made Get() truly read-only with no initialization logic. Tests have been updated accordingly to call Refresh() first for initialization. Please let me know what you think about this change 🙏
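
A sketch of that flow (a fragment, not a compilable unit; Refresh, refreshLoop, and refreshInterval are assumed names following the discussion):

```go
// Start performs the initial detection synchronously, then launches the
// periodic refresh goroutine only when refresh_interval is configured.
func (rdp *resourceDetectionProcessor) Start(ctx context.Context, _ component.Host) error {
	// Initial detection now happens here, so the provider no longer
	// needs initOnce and Get() stays purely read-only.
	if err := rdp.provider.Refresh(ctx); err != nil {
		return err
	}
	if rdp.refreshInterval > 0 {
		rdp.wg.Add(1)
		go rdp.refreshLoop(ctx)
	}
	return nil
}
```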


stopCh  chan struct{}
wg      sync.WaitGroup
current atomic.Pointer[resourceSnapshot]
Contributor

I think a big part of why the code is so confusing is that you are caching the latest resource here and in internal/resourcedetection.go. Please only store the current value in one place, and concentrate the refresh goroutine logic and handling of errors there.

Member Author

I've refactored the code to store the resource in a single location (the provider's detectedResource). I removed the current atomic.Pointer from the processor and moved all refresh logic into the provider with StartRefreshing() and StopRefreshing() methods. Now the processor just calls provider.Get() to retrieve the cached resource, and the provider handles all the refresh goroutine logic and error handling internally. This eliminates the duplicate state and makes the architecture much cleaner. Let me know if it makes sense to you 🙏
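
A sketch of the consolidated provider (StartRefreshing/StopRefreshing are the names from the comment above; the loop body is an assumption, and stopCh/wg are the fields shown earlier, now living on the provider):

```go
// StartRefreshing launches the background loop that periodically re-runs
// detection; the provider is now the single owner of the cached resource
// and of the refresh goroutine.
func (p *ResourceProvider) StartRefreshing(ctx context.Context, interval time.Duration) {
	p.stopCh = make(chan struct{})
	p.wg.Add(1)
	go func() {
		defer p.wg.Done()
		ticker := time.NewTicker(interval)
		defer ticker.Stop()
		for {
			select {
			case <-ticker.C:
				// Refresh re-runs detection, swaps in the new snapshot,
				// and handles/logs errors internally.
				p.Refresh(ctx)
			case <-p.stopCh:
				return
			}
		}
	}()
}

// StopRefreshing signals the loop to exit and waits for it to finish.
func (p *ResourceProvider) StopRefreshing() {
	close(p.stopCh)
	p.wg.Wait()
}
```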
