Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Failed-Request-Alert-Tuning] Investigate and Mitigate Excessive failed-request-too-high Alerts in the Alerting System #3344

Closed
5 tasks done
quiet-node opened this issue Dec 20, 2024 · 0 comments
Assignees
Labels
bug Something isn't working Epic
Milestone

Comments

@quiet-node
Copy link
Member

quiet-node commented Dec 20, 2024

Problem

The current Alerting System on Grafana is generating an excessive number of failed-request-too-high alerts. To ensure the system remains reliable and actionable, it is crucial to investigate the root cause of the elevated failure rates and assess whether the alerting thresholds or mechanisms need refinement.

Upon analysis, the eth_getBlockByHash and eth_getBlockByNumber endpoints have been identified as the primary drivers of the issue, contributing significantly to the recurring errors observed in the system.

Solution

The proposed solution involves analyzing the logs to determine the root cause of the failed requests and identifying the specific response codes being returned. Based on these findings, address the underlying issues by locating and resolving the bug causing the failures. This approach will help mitigate the problem and effectively reduce the volume of white-noise alerts, ensuring the alerting system remains focused on critical issues.

Alternatives

No response

Tasks

Preview Give feedback
  1. bug
    quiet-node
  2. bug
    quiet-node
  3. enhancement
    quiet-node
  4. internal
    quiet-node
@quiet-node quiet-node added bug Something isn't working Epic labels Dec 20, 2024
@quiet-node quiet-node added this to the 0.63.0 milestone Dec 20, 2024
@quiet-node quiet-node self-assigned this Dec 20, 2024
@quiet-node quiet-node moved this from Backlog to Epics In Progress in Smart Contract Sprint Board Dec 20, 2024
@quiet-node quiet-node changed the title Investigate and Mitigate Excessive failed-request-too-high Alerts in the Alerting System [Failed-Request-Alert-Tuning] Investigate and Mitigate Excessive failed-request-too-high Alerts in the Alerting System Dec 20, 2024
@quiet-node quiet-node modified the milestones: 0.63.0, 0.63.1 Dec 26, 2024
@quiet-node quiet-node moved this from Epics In Progress to In Review in Smart Contract Sprint Board Jan 6, 2025
@quiet-node quiet-node moved this from In Review to Epics In Progress in Smart Contract Sprint Board Jan 6, 2025
@github-project-automation github-project-automation bot moved this from Epics In Progress to Done in Smart Contract Sprint Board Jan 9, 2025
@quiet-node quiet-node reopened this Jan 9, 2025
@github-project-automation github-project-automation bot moved this from Done to Sprint Backlog in Smart Contract Sprint Board Jan 9, 2025
@quiet-node quiet-node moved this from Sprint Backlog to Epics In Progress in Smart Contract Sprint Board Jan 10, 2025
@github-project-automation github-project-automation bot moved this from Epics In Progress to Done in Smart Contract Sprint Board Jan 10, 2025
@quiet-node quiet-node reopened this Jan 15, 2025
@github-project-automation github-project-automation bot moved this from Done to Sprint Backlog in Smart Contract Sprint Board Jan 15, 2025
@quiet-node quiet-node moved this from Sprint Backlog to Epics In Progress in Smart Contract Sprint Board Jan 15, 2025
@github-project-automation github-project-automation bot moved this from Epics In Progress to Done in Smart Contract Sprint Board Jan 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Epic
Projects
Status: Done
Development

No branches or pull requests

1 participant