Enhancing Reliability in Cloud Attendance Management with Datadog Implementation
OM Network Co., Ltd., based in Niigata, Japan, has made significant strides in enhancing the reliability of its cloud-based attendance management service, R-Kintai, by transitioning its monitoring infrastructure from Zabbix to Datadog. This pivotal change integrates infrastructure monitoring and Application Performance Monitoring (APM) into a unified surveillance system designed not only for efficiency but also for improved service stability.
Background and Challenges
R-Kintai operates alongside R-Shift, a shift management solution, serving a number of retail enterprises. As the number of companies utilizing R-Kintai grows, the demand for stable service operations has intensified. However, the previous monitoring framework encountered several challenges:
1.
Disjointed Monitoring of Infrastructure and Applications: Previously, server resource monitoring (CPU, memory, disk usage, etc.) and application performance monitoring were managed by separate tools, making it time-consuming to isolate issues when response times deteriorated.
2.
Insufficient Anomaly Detection Mechanisms: The prior operational model relied heavily on reactive measures, responding to performance drops only after they occurred, rather than proactively identifying early warning signs of issues.
3.
Subjective Performance Improvement Justifications: The assessment of performance enhancements relied on individual experiences and intuitions, presenting challenges in addressing intermittent latency issues without quantitative data.
Overview of Datadog Implementation
The recent implementation leverages Datadog’s advanced infrastructure monitoring and APM. For the infrastructure aspect, Datadog effectively gathers and visualizes real-time data on server CPU, memory, and disk usage. It includes alerts for thresholds that exceed predefined limits, along with an anomaly detection feature that identifies unusual patterns automatically.
On the application front, key performance metrics such as screen loading times and data processing periods are rendered visible, allowing for swift identification of any sluggish performance issues. Notably, crucial alerts are integrated with LINE WORKS, enabling engineers to respond promptly to any situations that arise.
Reasons for Choosing Datadog
Several monitoring tools were critically assessed before Datadog was selected, with the following reasons being significant:
- - Unified Management Platform: The ability to manage both infrastructure monitoring and APM through a single platform was a major advantage, eliminating the need to switch between different tools and enhancing efficiency in pinpointing issues during incidents.
- - Scalability and Future-readiness: The platform's capacity to expand to additional monitoring areas, such as log management and Real User Monitoring, is ideal for long-term operational strategies.
Changes Post-Implementation
With Datadog in place, the capacity for cross-sectional monitoring of infrastructure and application statuses has dramatically improved. The time required to pinpoint performance issues has been significantly shortened. Tasks that previously demanded the cross-referencing of various tools can now be completed using Datadog’s unified dashboard, facilitating both speed and accuracy in monitoring efforts.
Engineer’s Comments
One of the engineers commented, "The visualization of previously hidden issues has substantially expedited the identification of their causes. The most remarkable benefit is the shift in team mentality. Seeing the performance of the code I’ve written displayed in real-time graphs has heightened our commitment to quality. We continue to focus on enhancing customer assurance through our improvements."
Future Directions
This implementation of Datadog within R-Kintai is just the first step. The operational knowledge gained will be applied to other in-house products, such as the R-Shift. There are plans to gradually expand Datadog's functions, incorporating Real User Monitoring for actual user experience metrics and advanced database monitoring capabilities, all aimed at continuously enhancing service quality.
Company Overview
- - Company Name: OM Network Co., Ltd.
- - Location: Niigata, Niigata, Japan
- - CEO: Shinya Yamagishi
- - Business Overview: Development of business systems and the R-Shift shift management system.
- - Website: www.omnetwork.co.jp
Recommended Articles
- - "Why OM Network Transitioned from ChatGPT to Gemini: A Hands-on Report"
- - "AI Deciphers Complicated WAF Logs, Reducing Response Time by Over 90%"
- - "Developing New Products with Unfamiliar Tech Stacks: OM Network's Full Utilization of AI 'Claude Code'"