Alerting and configurable options for Watchdog

Currently, any watchdog job created in the UI runs every hour to check for errors. For example, I use dataflow errors to monitor the execution of critical flows. These flows may run weekly, daily or hourly.
When a daily job fails, the watchdog will send the same error message every hour, resulting in up to 24 messages (if integrated with a Slack webhook) about the failed dataflow. While this might be appropriate for hourly jobs, I expect to receive only one error message per failure for daily jobs.
The main issue with the alerting system is its frequency—receiving too many alerts can lead to them being ignored. Ideally, the following configurable parameters should be added:
- alert me once job is failing/ alert me every time watchdog runs
- frequency of watchdog executions should be configurable (e.g., every day, every 5 minutes, every hour, etc.).
With these two parameters, we could better fine-tune the alerting system.
Categories
- All Categories
- 2K Product Ideas
- 2K Ideas Exchange
- 1.6K Connect
- 1.3K Connectors
- 311 Workbench
- 7 Cloud Amplifier
- 9 Federated
- 3K Transform
- 114 SQL DataFlows
- 655 Datasets
- 2.2K Magic ETL
- 4.1K Visualize
- 2.5K Charting
- 809 Beast Mode
- 80 App Studio
- 45 Variables
- 763 Automate
- 189 Apps
- 480 APIs & Domo Developer
- 76 Workflows
- 18 DomoAI
- 40 Predict
- 17 Jupyter Workspaces
- 23 R & Python Tiles
- 408 Distribute
- 119 Domo Everywhere
- 279 Scheduled Reports
- 10 Software Integrations
- 142 Manage
- 138 Governance & Security
- 8 Domo Community Gallery
- 48 Product Releases
- 12 Domo University
- 5.4K Community Forums
- 41 Getting Started
- 31 Community Member Introductions
- 114 Community Announcements
- 4.8K Archive