Skip to content

[mmu probing] pr01.docs: Add MMU threshold probing framework design#22539

Merged
StormLiangMS merged 1 commit intosonic-net:masterfrom
XuChen-MSFT:xuchen3/mmu_probe/pr01-documentation
Mar 25, 2026
Merged

[mmu probing] pr01.docs: Add MMU threshold probing framework design#22539
StormLiangMS merged 1 commit intosonic-net:masterfrom
XuChen-MSFT:xuchen3/mmu_probe/pr01-documentation

Conversation

@XuChen-MSFT
Copy link
Contributor

@XuChen-MSFT XuChen-MSFT commented Feb 23, 2026

Description of PR

Summary:

Add comprehensive design document for MMU threshold probing framework,
including:

  • Three-phase probing algorithms (upper/lower bound + threshold detection)
  • Executor protocol for platform abstraction
  • Observer pattern for metrics tracking
  • Template method pattern for test case framework
  • Integration with SONiC QoS test infrastructure

Fixes # (issue)

Type of change

  • Bug fix
  • Testbed and Framework(new/improvement)
  • New Test case
    • Skipped for non-supported platforms
  • Test case improvement

Back port request

  • 202205
  • 202305
  • 202311
  • 202405
  • 202411
  • 202505
  • 202511

Approach

What is the motivation for this PR?

qos refactoring

How did you do it?

How did you verify/test it?

Any platform specific information?

Supported testbed topology if it's a new test case?

Documentation

relevant PRs:
[mmu probing] pr01.docs: Add MMU threshold probing framework design
[mmu probing] pr02.probe: Add core probing algorithms with essential data structures
[mmu probing] pr03.probe: Add probing executors and executor registry
[mmu probing] pr04.probe: Add observer pattern for metrics tracking
[mmu probing] pr05.probe: Add stream manager and buffer occupancy controller
[mmu probing] pr06.probe: Add base framework and all probing implementations
[mmu probing] pr07.test: Add comprehensive unit tests for probe framework
[mmu probing] pr08.test: Add integration tests for end-to-end probing workflows
[mmu probing] pr09.test: Add production probe test and infrastructure updates

@mssonicbld
Copy link
Collaborator

/azp run

@azure-pipelines
Copy link

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

Add comprehensive design document for MMU threshold probing framework,
including:
- Three-phase probing algorithms (upper/lower bound + threshold detection)
- Executor protocol for platform abstraction
- Observer pattern for metrics tracking
- Template method pattern for test case framework
- Integration with SONiC QoS test infrastructure

Signed-off-by: Xu Chen <[email protected]>
@XuChen-MSFT XuChen-MSFT force-pushed the xuchen3/mmu_probe/pr01-documentation branch from 320c509 to 2ed9f72 Compare March 24, 2026 04:59
@mssonicbld
Copy link
Collaborator

/azp run

@azure-pipelines
Copy link

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

Copy link
Collaborator

@StormLiangMS StormLiangMS left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ LGTM — Excellent design documentation

Comprehensive 1,700-line design doc that thoroughly covers the probing methodology. Highlights:

  • Quantitative evidence: Real hardware data from Mellanox SN4600C and Arista 7260CX3 backs every design decision
  • Error accumulation analysis (§3.4): Clear demonstration of why Range-Based composite probing fails (up to 528% error on TD3) and why Step-by-Step Point Probing is necessary
  • Step-size optimization (§3.4.8): Empirical comparison on physical hardware — step_size=2 as optimal balance (0.39% error, 61.8 min) is well-justified
  • Architecture diagrams: Four-role separation (ProbingBase/Algorithm/Executor/Observer) is clearly explained with UML and sequence diagrams

Minor: Some UTF-8 box-drawing characters may render with artifacts depending on the viewer, but the content is clear regardless.

Copy link
Collaborator

@StormLiangMS StormLiangMS left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@StormLiangMS StormLiangMS merged commit 1dfc9a1 into sonic-net:master Mar 25, 2026
7 checks passed
ravaliyel pushed a commit to ravaliyel/sonic-mgmt that referenced this pull request Mar 27, 2026
Add comprehensive design document for MMU threshold probing framework,
including:
- Three-phase probing algorithms (upper/lower bound + threshold detection)
- Executor protocol for platform abstraction
- Observer pattern for metrics tracking
- Template method pattern for test case framework
- Integration with SONiC QoS test infrastructure

Signed-off-by: Xu Chen <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants