Tools such as SignalFX and NewRelic are great for observing metrics data in real time. Cultivate a “measure and monitor everything” mindset, with automation determining the response to detected alerts.

Application performance monitoring is essential to ensure that application-specific performance indicators, such as time to load a page, latencies of downstream services, or transitions, are monitored in addition to basic system metrics such as CPU and memory utilization.

Build detectors for security (upgrades/patches/rolling credentials).
During sprints, plan to close actions from previous incident reviews, especially actions related to building missing monitors and automation.
Plan “war games” for the service to ensure monitors operate as expected and to uncover missing monitors.
Allocate time to build required dashboards and train team members to use them.
Build monitors to ensure dependent services operate as expected.
During development, build appropriate, high-quality alerts in the code that minimize mean time to detect (MTTD) and mean time to isolate (MTTI).
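
To tell whether new alerts actually reduce MTTD and MTTI, it helps to compute both from incident review data. The following is a minimal sketch rather than a prescribed method. The `Incident` record and its field names are hypothetical, and it assumes MTTD is measured from fault start to detection and MTTI from detection to isolation of the faulty component; teams that define these intervals differently would adjust the subtraction accordingly.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean


@dataclass
class Incident:
    """Hypothetical incident record taken from an incident review."""
    started: datetime    # when the fault began
    detected: datetime   # when an alert fired or someone noticed
    isolated: datetime   # when the faulty component was identified


def mean_time_to_detect(incidents):
    """Average gap between fault start and detection (assumed MTTD definition)."""
    gaps = [(i.detected - i.started).total_seconds() for i in incidents]
    return timedelta(seconds=mean(gaps))


def mean_time_to_isolate(incidents):
    """Average gap between detection and isolation (assumed MTTI definition)."""
    gaps = [(i.isolated - i.detected).total_seconds() for i in incidents]
    return timedelta(seconds=mean(gaps))


# Illustrative data only; these are not real incidents.
incidents = [
    Incident(
        started=datetime(2024, 1, 10, 10, 0),
        detected=datetime(2024, 1, 10, 10, 4),
        isolated=datetime(2024, 1, 10, 10, 14),
    ),
    Incident(
        started=datetime(2024, 1, 12, 12, 0),
        detected=datetime(2024, 1, 12, 12, 6),
        isolated=datetime(2024, 1, 12, 12, 12),
    ),
]

mttd = mean_time_to_detect(incidents)
mtti = mean_time_to_isolate(incidents)
print("MTTD:", mttd)
print("MTTI:", mtti)
```

Tracking these two numbers sprint over sprint gives a simple way to check whether the monitors and alerts added after incident reviews are paying off.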