Request: a new metric that exposes detailed stack-level failure data, to enable targeted alerts, ownership routing, and integration with the requester's monitoring tools.
Current workaround:
Using a notification policy to capture stack failure events.
Also using spacelift_current_stacks_count_by_state in Prometheus to display counts in New Relic dashboards.
The workaround is too generic: it only shows counts, with no per-stack metadata.
Limitations:
No "resolved" signal to indicate when a failure is fixed.
No per-stack detail (ID, name, owner, commit info).
No ownership assignment for routing alerts (e.g., to team Slack/email).
No observability integration beyond counts, so it can’t drive SLOs or MTTR tracking.
Can’t easily track historical trends or failure rates.
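To illustrate the missing "resolved" signal, here is a minimal sketch. The stack names and state strings are made up for illustration and are not taken from Spacelift's API. Two different sets of per-stack states collapse to the same count-by-state, so a recovery in one stack that coincides with a new failure in another is invisible in a count-only view.

```python
from collections import Counter

# Hypothetical per-stack states at two scrape times (names and states are illustrative).
before = {"network": "FAILED", "database": "FINISHED", "frontend": "FINISHED"}
after = {"network": "FINISHED", "database": "FAILED", "frontend": "FINISHED"}


def count_by_state(stacks):
    """Collapse per-stack states into the count-only view the workaround provides."""
    return Counter(stacks.values())


def transitions(prev, curr):
    """Per-stack view: which stacks newly failed and which recovered."""
    failed = sorted(s for s in curr if curr[s] == "FAILED" and prev.get(s) != "FAILED")
    resolved = sorted(s for s in curr if prev.get(s) == "FAILED" and curr[s] != "FAILED")
    return failed, resolved


# The count-only view cannot tell the two snapshots apart...
same_counts = count_by_state(before) == count_by_state(after)
# ...while the per-stack view exposes one new failure and one resolution.
newly_failed, newly_resolved = transitions(before, after)
```

With per-stack data, `newly_failed` and `newly_resolved` could each drive a routed alert, and the time between a stack's failure and its resolution would give a per-stack recovery time.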
Requested metric should include:
stack_id
stack_name
commit_id
commit_author
labels (e.g., team) for routing alerts to correct owners
Stack state (to enable “failure” and “resolved” signals)
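As a rough sketch of what such a metric could look like on the wire, the snippet below renders one sample in the Prometheus text exposition format. The metric name `spacelift_stack_state`, the `state` and `team` label names, and all values are assumptions made for illustration. They are not an existing Spacelift metric.

```python
def escape_label_value(value):
    """Escape backslash, double quote, and newline per the Prometheus text format."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_stack_state(stack):
    """Render one sample line for a hypothetical per-stack state metric."""
    labels = ",".join(f'{k}="{escape_label_value(str(v))}"' for k, v in stack.items())
    return f"spacelift_stack_state{{{labels}}} 1"


# Illustrative values only; the fields mirror the list requested above.
example = {
    "stack_id": "network-prod",
    "stack_name": "Network (prod)",
    "commit_id": "abc1234",
    "commit_author": "jane",
    "team": "platform",
    "state": "FAILED",
}
line = render_stack_state(example)
```

An alert rule could then match on `state="FAILED"` and route by the `team` label. One design consideration: labels such as `commit_id` change on every commit, so each new value creates a new time series. Whether that cardinality is acceptable would depend on how many stacks and commits are involved.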
🔭 Discovery
💡 Feature Requests
Stacks
6 months ago