-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[pkg/component] - A component's health should also depend on health of units #5386
Comments
Pinging @elastic/elastic-agent (Team:Elastic-Agent) |
Please share your thoughts! |
I think the original idea on the split health status of an overall component versus a unit was that we can see the component itself is healthy but the unit that is running is not. I think in practice most users just review the component health and don't look at individual units. I think taking an aggregated approach of make the unit status reflect the component status does make sense. I am +1 for this type of change, if done correctly. I don't want to lose the context of the component health, so maybe we need to have two status levels for a component. The overall health of the component (including the aggregation of the units) and then a single health state for the components communication with the Elastic Agent. |
From a user perspective I agree. The only case that worries me is the upgrade watcher: elastic-agent/internal/pkg/agent/application/upgrade/watcher.go Lines 222 to 228 in d8bdd71
If we made this change today, and had a single failed unit set the component state to failed, agent would begin rolling back upgrades because of unit level errors. We need to decide if this is behavior we want. My preference is to leave this unchanged so we ignore unit errors when deciding to roll back. |
Describe the enhancement:
elastic-agent/pkg/component/runtime/command.go
Lines 185 to 192 in fd477ec
Describe a specific use case for the enhancement or feature:
beats-monitoring
component is Healthy (state 2), but the monitoring unit is actually in a degraded state.What is the definition of done?
The text was updated successfully, but these errors were encountered: