nightingale v6.5 releases: enterprise-level cloud-native monitoring system

Nightingale

Nightingale is an enterprise-level cloud-native monitoring system, which can be used as a drop-in replacement for Prometheus for alerting and management.

Nightingale is a cloud-native monitoring system by All-In-On design, that supports enterprise-class functional features with an out-of-the-box experience. We recommend upgrading yourPrometheus + AlertManager + Grafana combo solution to Nightingale.

  • Multiple prometheus data sources management: manage all alerts and dashboards in one centralized visually view;
  • Out-of-the-box alert rule: built-in multiple alert rules, reuse alert rules template by one-click import with a detailed explanation of metrics;
  • Multiple modes for visualizing data: out-of-the-box dashboards, instance customize views, expression browser, and Grafana integration;
  • Multiple collection clients: support using Prometheus Exporter、Telegraf、Datadog Agent to collect metrics;
  • Integration of multiple storage: support Prometheus, M3DB, VictoriaMetrics, Influxdb, TDEngine as storage solutions, and original support for PromQL;
  • Fault self-healing: support the ability to self-heal from failures by configuring webhook;

If you are using Prometheus and have one or more of the following requirement scenarios, it is recommended that you upgrade to Nightingale:

  • Multiple systems such as Prometheus, Alertmanager, Grafana, etc. are fragmented and lack a unified view, and cannot be used out of the box;
  • The way to manage Prometheus and Alertmanager by modifying configuration files has a big learning curve and is difficult to collaborate;
  • Too much data to scale-up your Prometheus cluster;
  • Multiple Prometheus clusters running in production environments, which faced high management and usage costs;

If you are using Zabbix and have the following scenarios, it is recommended that you upgrade to Nightingale:

  • Monitoring too much data and wanting a better scalable solution;
  • A high learning curve and a desire for better efficiency of collaborative use in a multi-person, multi-team model;
  • Microservice and cloud-native architectures with variable monitoring data lifecycles and high monitoring data dimension bases, which are not easily adaptable to the Zabbix data model;

If you are using open-falcon, we recommend you to upgrade to Nightingale:

A typical Nightingale deployment architecture

Changelog v6.5

  • feat: New version menu, with theme color settings support for the menu
  • feat: Dashboard now features color block charts
  • feat: Dashboard adds units packets/sec and dBm
  • feat: Business group tree structure level is no longer limited, team list can also be rendered as a tree structure
  • feat: Real-time query in log analysis now supports multiple tab queries
  • feat: Alarm rule Host type machine identifier filtering adds =~ and !~ operators
  • refactor: Dashboard non-fullscreen mode now allows theme mode switching
  • refactor: Optimized the data source style of the event list in the expanded active alarm card, solving potential occlusion issues
  • refactor: Real-time queries in Prometheus’ Graph mode default to SI format
  • refactor: Prometheus and Elasticsearch Proxy interfaces no longer intercept 401 and 403 status codes
  • refactor: Alarm subscription rules table adds rule remarks and enabled column
  • refactor: Renamed “hide” column to “enable” in notification settings table
  • fix: Fixed an issue where switching display modes in the dashboard table chart could cause rendering crashes
  • fix: Fixed overflow issue in the legend table content of Prometheus real-time query Graph mode
  • fix: Import Grafana dashboard
    • Adapt variable hidden configuration
    • Adapt global variables
    • Default value of time series graph curve transparency changed from 0.5 to 0

Install & Use

Copyright (C) 2017 Beijing Didi Infinity Technology and Development Co., Ltd. All rights reserved.