Skip to content
This repository has been archived by the owner on Nov 12, 2024. It is now read-only.

INCIDENT 029 | Heartbeat publication not completed by COOP #32

Open
Christian-MK opened this issue Apr 16, 2024 · 0 comments
Open

INCIDENT 029 | Heartbeat publication not completed by COOP #32

Christian-MK opened this issue Apr 16, 2024 · 0 comments
Labels
ada-usd Incident affected the ADA-USD feed under-review Incident is under-review since being logged and continue to be monitored

Comments

@Christian-MK
Copy link
Contributor

Trigger

  • ⬛ suspected malware infections
  • ⬛ access violations
  • ✔️ anomalous system behaviors
  • ⬛ human errors
  • ⬛ unauthorized access attempts

Date

2024-04-14

Summary

COOP didn't complete publication of the 23:00 (UTC) heartbeat on 14 April. The issue self-corrected at the next interval, i.e. 00:00 (UCT) on 15 April.

Status

Under Review

Assessment

It is still unclear as to why the heartbeat was missed. The Orcfax team continues to work towards converting the COOP component coop-sock to systemd. Until this conversion is complete, logging remains incomplete.

Additional Notes

Most recently, similar failures for COOP to complete publishing on-chain have been caused by the sync state of the Plutus Chain Index which is to be replaced with important reliability changes in COOP v2.

Persistent logging of COOP issues will be added with the completed COOP work as the project seeks to address a number of concerns in a holistic upgrade of the Orcfax network.

Technical improvements

We are investigating:

  1. Completing the transition from coop-sock to systemd (currently active in preprod).
  2. Implementing improved logging to better understand these issues.
  3. Coverage for colleagues monitoring the network during weekend periods so that datum can be published manually once the issue arises.

Documentation improvements

  1. The issue will be added to devops documentation to assist future team members with triaging like incidents.
@Christian-MK Christian-MK changed the title INCIDENT 029 | Heartbeat publication not completed by COOP resulting in heartbeat datum not arriving on-chain INCIDENT 029 | Heartbeat publication not completed by COOP Apr 16, 2024
@Christian-MK Christian-MK added under-review Incident is under-review since being logged and continue to be monitored ada-usd Incident affected the ADA-USD feed labels May 2, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
ada-usd Incident affected the ADA-USD feed under-review Incident is under-review since being logged and continue to be monitored
Projects
None yet
Development

No branches or pull requests

1 participant