Investigate incident in NHN 25.02.2025 #37

@havardelnan

On 25.02, a rollout of agent v2 resulted in an outage of the ror-api.

Root cause:
MongoDB stopped replying to queries in a timely manner due to missing indexes on the resourcev2 collection.
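
A startup check of the following kind could catch a missing index before a rollout. This is a minimal sketch, not the ror-api implementation: the field names `uid` and `kind` are hypothetical (the fields actually queried on resourcev2 are not stated in this issue), and the helper `missing_indexes` is introduced here for illustration. It compares the indexes the application needs against a dictionary in the shape returned by PyMongo's `Collection.index_information()`.

```python
from typing import Any, List, Mapping, Sequence, Tuple

IndexKey = Tuple[Tuple[str, int], ...]

# Hypothetical requirements: the real query fields on resourcev2 are
# not given in this issue, so these names are placeholders.
REQUIRED_INDEXES: Sequence[IndexKey] = (
    (("uid", 1),),
    (("kind", 1), ("uid", 1)),
)


def missing_indexes(
    required: Sequence[IndexKey],
    index_info: Mapping[str, Mapping[str, Any]],
) -> List[IndexKey]:
    """Return the required index keys that are absent from index_info.

    index_info maps index names to specs whose "key" entry is a list of
    (field, direction) pairs.
    """
    existing = {
        tuple((field, direction) for field, direction in spec["key"])
        for spec in index_info.values()
    }
    return [key for key in required if key not in existing]
```

With PyMongo, `index_info` would come from `db["resourcev2"].index_information()`, and each missing key could be passed to `create_index(list(key))`. Failing the rollout or logging loudly when the returned list is non-empty is one way to surface the problem before queries slow down.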

Other issues observed:
RabbitMQ had a queue with no consumers, which caused it to fill the memory allocated to it.
The API starts slowly because the LDAP servers looked up via DNS do not respond.
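
A queue that accumulates messages without a consumer can be spotted from the queue listing of the RabbitMQ management API (`GET /api/queues`), whose entries include `name`, `messages` and `consumers` fields. The sketch below is an assumption about how such a check might look, not something taken from the ror deployment; the function `orphaned_queues` and its threshold parameter are hypothetical.

```python
import json
from typing import List


def orphaned_queues(queues_json: str, min_messages: int = 1) -> List[str]:
    """Return names of queues with zero consumers and a message backlog.

    queues_json is the JSON array returned by the management API's
    queue listing; only the name, messages and consumers fields are used.
    """
    queues = json.loads(queues_json)
    return [
        q["name"]
        for q in queues
        if q.get("consumers", 0) == 0 and q.get("messages", 0) >= min_messages
    ]
```

Running a check like this periodically and alerting on a non-empty result would flag the condition early. Independently of monitoring, a queue's growth could be bounded with the `x-max-length` or `x-message-ttl` queue arguments, if dropping old messages is acceptable for that queue.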
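
Slow startup caused by unresponsive directory servers can be limited by probing each candidate with a short timeout and keeping only those that answer. The following is a minimal sketch under that assumption; it does not reflect how ror-api actually connects to LDAP, and the function `responsive_servers` and its 1-second default are illustrative. It checks TCP reachability only, not that the server speaks LDAP.

```python
import socket
from typing import Iterable, List, Tuple

Server = Tuple[str, int]


def responsive_servers(servers: Iterable[Server], timeout: float = 1.0) -> List[Server]:
    """Return the (host, port) pairs that accept a TCP connection in time."""
    alive: List[Server] = []
    for host, port in servers:
        try:
            # create_connection raises OSError (including timeouts and
            # name resolution failures) when the server is unreachable.
            with socket.create_connection((host, port), timeout=timeout):
                alive.append((host, port))
        except OSError:
            continue
    return alive
```

The probes run sequentially, so the worst case is the timeout multiplied by the number of servers; running them concurrently would bound the startup delay to roughly a single timeout.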

Remediation:
