monitoring cgroups

You can monitor containers and virtual machines using cgroups.

cgroups (or control groups), are a Linux kernel feature that provides accounting and resource usage limiting for processes. When cgroups are bundled with namespaces (i.e. isolation), they form what we usually call containers.

cgroups are hierarchical, meaning that cgroups can contain child cgroups, which can contain more cgroups, etc. All accounting is reported (and resource usage limits are applied) also in a hierarchical way.

To visualize cgroup metrics netdata provides configuration for cherry picking the cgroups of interest. By default (without any configuration) netdata should pick systemd services, all kinds of containers (lxc, docker, etc) and virtual machines spawn by managers that register them with cgroups (qemu, libvirt, etc).

configuring netdata for cgroups

For each cgroup available in the system, netdata provides this configuration:

[plugin:cgroups]
    enable cgroup XXX = yes | no

But it also provides a few patterns to provide a sane default (yes or no).

Below we see, how this works.

how netdata finds the available cgroups

Linux exposes resource usage reporting and provides dynamic configuration for cgroups, using virtual files (usually) under /sys/fs/cgroup. netdata reads /proc/self/mountinfo to detect the exact mount point of cgroups. netdata also allows manual configuration of this mount point, using these settings:

[plugin:cgroups]
	check for new cgroups every = 10
	path to /sys/fs/cgroup/cpuacct = /sys/fs/cgroup/cpuacct
	path to /sys/fs/cgroup/blkio = /sys/fs/cgroup/blkio
	path to /sys/fs/cgroup/memory = /sys/fs/cgroup/memory
	path to /sys/fs/cgroup/devices = /sys/fs/cgroup/devices

netdata rescans these directories for added or removed cgroups every check for new cgroups every seconds.

hierarchical search for cgroups

Since cgroups are hierarchical, for each of the directories shown above, netdata walks through the subdirectories recursively searching for cgroups (each subdirectory is another cgroup).

For each of the directories found, netdata provides a configuration variable:

[plugin:cgroups]
	search for cgroups under PATH = yes | no

To provide a sane default for this setting, netdata uses the following pattern list (patterns starting with ! give a negative match and their order is important: the first matching a path will be used):

[plugin:cgroups]
	search for cgroups in subpaths matching =  !*/init.scope  !*-qemu  !/init.scope  !/system  !/systemd  !/user  !/user.slice  *

So, we disable checking for child cgroups in systemd internal cgroups (systemd services are monitored by netdata), user cgroups (normally used for desktop and remote user sessions), qemu virtual machines (child cgroups of virtual machines) and init.scope. All others are enabled.

enabled cgroups

To check if the cgroup is enabled, netdata uses this setting:

[plugin:cgroups]
	enable cgroup NAME = yes | no

To provide a sane default, netdata uses the following pattern list (it checks the pattern against the path of the cgroup):

[plugin:cgroups]
	enable by default cgroups matching =  !*/init.scope  *.scope  !*/vcpu*  !*/emulator  !*.mount  !*.partition  !*.service  !*.slice  !*.swap  !*.user  !/  !/docker  !/libvirt  !/lxc  !/lxc/*/ns  !/lxc/*/ns/*  !/machine  !/qemu  !/system  !/systemd  !/user  *

The above provides the default yes or no setting for the cgroup. However, there is an additional step. In many cases the cgroups found in the /sys/fs/cgroup hierarchy are just random numbers and in many cases these numbers are ephemeral: they change across reboots or sessions.

So, we need to somehow map the paths of the cgroups to names, to provide consistent netdata configuration (i.e. there is no point to say enable cgroup 1234 = yes | no, if 1234 is a random number that changes over time - we need a name for the cgroup first, so that enable cgroup NAME = yes | no will be consistent).

For this mapping netdata provides 2 configuration options:

[plugin:cgroups]
	run script to rename cgroups matching =  *.scope  *docker*  *lxc*  *qemu*  !/  !*.mount  !*.partition  !*.service  !*.slice  !*.swap  !*.user  *
	script to get cgroup names = /usr/libexec/netdata/plugins.d/cgroup-name.sh

The whole point for the additional pattern list, is to limit the number of times the script will be called. Without this pattern list, the script might be called thousands of times, depending on the number of cgroups available in the system.

The above pattern list is matched against the path of the cgroup. For matched cgroups, netdata calls the script cgroup-name.sh to get its name. This script queries docker, or applies heuristics to find give a name for the cgroup.

General

Running Netdata

Special Uses

netdata for IoT
lower netdata resource utilization
high performance netdata
netdata public on the internet

Notes on memory management

Memory deduplication
half netdata memory requirements
netdata virtual memory size

Database Replication and Mirroring

Replication Overview
monitoring ephemeral nodes
Use netdata to monitor auto-scaled cloud servers.
netdata proxies
Streaming netdata metrics between netdata servers.

Backends
archiving netdata collected metrics to a time-series database

netdata-backends
graphite, opentsdb, kairosdb, influxdb, elasticsearch, blueflood
netdata with prometheus
Walk Through: netdata with prometheus and grafana

Health monitoring - Alarms
alarms and alarm notifications in netdata

Overview
Reference
reference for writing alarms
Examples
simple how-to for writing alarms
Notifications Configuration
health API calls
troubleshooting alarms

Netdata Registry

mynetdata Menu Item

Monitoring Info

Monitoring web servers
The spectacles of a web server log file
monitoring ephemeral containers
Use netdata to monitor auto-scaled containers.
monitoring systemd services
monitoring cgroups
Use netdata to monitor containers and virtual machines.
monitoring IPMI
Use netdata to monitor enterprise server hardware
Monitoring disks
Monitoring Go Applications

Netdata Badges

Generating Badges

Data Collection

Add more charts to netdata
Internal Plugins
External Plugins
statsd
netdata is a fully featured statsd server
Third Party Plugins
netdata plugins distributed by third parties

Binary Modules

Python Modules

Node.js Modules

BASH Modules

General Info - charts.d

Active BASH Modules

Obsolete BASH Modules

API Documentation

Web Dashboards

Overview
Custom Dashboards
Custom Dashboard with Confluence
Chart Libraries
Learn how to create dashboards with charts from one or more netdata servers!

Running behind another web server

Package Maintainers

Netdata Package Maintainers

Donations

Donations netdata has received

Blog

December, 2016

Linux console tools, fail to report per process CPU usage properly
April, 2016

netdata v1.1.0 released

You should install QoS on all your servers (Linux QoS for humans)

Monitor application bandwidth with Linux QoS (Good to do it, anyway)

Monitoring SYNPROXY (Linux TCP Anti-DDoS)
March, 2016

Article: Introducing netdata (the design principles of netdata)

Other monitoring tools

monitoring cgroups

configuring netdata for cgroups

how netdata finds the available cgroups

hierarchical search for cgroups

enabled cgroups

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!