influxdb/monitor
Philip O'Toole 07bcd523f3 Tag Go memstats during writes to _internal 2015-09-18 14:16:39 -07:00
..
README.md Fix typo in monitor README 2015-09-14 12:39:38 -07:00
build_info.go Add build info to diagnostics 2015-09-09 12:43:51 -07:00
config.go By default write stats to default retention policy 2015-09-10 15:08:09 -07:00
config_test.go Fix compile error in monitor unit tests 2015-09-10 17:28:44 -07:00
go_runtime.go Add Go memstats to statistics 2015-09-04 13:20:58 -07:00
network.go Add Go memstats to statistics 2015-09-04 13:20:58 -07:00
service.go Tag Go memstats during writes to _internal 2015-09-18 14:16:39 -07:00
service_test.go Create and configure _internal retention policy 2015-09-15 14:05:00 -07:00
statement_executor.go actually move influxql.Row* -> models.Row* 2015-09-16 16:32:50 -05:00
system.go Add Go memstats to statistics 2015-09-04 13:20:58 -07:00

README.md

System Monitoring

This functionality should be considered experimental and is subject to change.

System Monitoring means all statistical and diagnostic information made availabe to the user of InfluxDB system, about the system itself. Its purpose is to assist with troubleshooting and performance analysis of the database itself.

Statistics vs. Diagnostics

A distinction is made between statistics and diagnostics for the purposes of monitoring. Generally a statistical quality is something that is being counted, and for which it makes sense to store persistently for historical analysis. Diagnostic information is not necessarily numerical, and may not make sense to store.

An example of statistical information would be the number of points received over UDP, or the number of queries executed. Examples of diagnostic information would be a list of current Graphite TCP connections, the version of InfluxDB, or the uptime of the process.

System Statistics

SHOW STATS displays statisics about subsystems within the running influxd process. Statistics include points received, points indexed, bytes written to disk, TCP connections handled etc. These statistics are all zero when the InfluxDB process starts.

All statistics are written, by default, by each node to a "monitor" database within the InfluxDB system, allowing analysis of aggregated statistical data using the standard InfluxQL language. This allows users to track the performance of their system. Importantly, this allows cluster-level statistics to be viewed, since by querying the monitor database, statistics from all nodes may be queried. This can be a very powerful approach for troubleshooting your InfluxDB system and understanding its behaviour.

System Diagnostics

SHOW DIAGNOSTICS displays various diagnostic information about the influxd process. This information is not stored persistently within the InfluxDB system.

Standard expvar support

All statistical information is available at HTTP API endpoint /debug/vars, in expvar format, allowing external systems to monitor an InfluxDB node. By default, the full path to this endpoint is http://localhost:8086/debug/vars.

Configuration

The monitor module allows the following configuration:

  • Whether to write statistical and diagnostic information to an InfluxDB system. This is enabled by default.
  • The name of the database to where this information should be written. Defaults to _internal. The information is written to the default retention policy for the given database.
  • The name of the retention policy, along with full configuration control of the retention policy, if the default retention policy is not suitable.
  • The rate at which this information should be written. The default rate is once every 10 seconds.

Design and Implementation

A new module named monitor supports all basic statistics and diagnostic functionality. This includes:

  • Allowing other modules to register statistics and diagnostics information, allowing it to be accessed on demand by the monitor module.
  • Serving the statistics and diagnostic information to the user, in response to commands such as SHOW DIAGNOSTICS.
  • Expose standard Go runtime information such as garbage collection statistics.
  • Make all collected expvar data via HTTP, for collection by 3rd-party tools.
  • Writing the statistical information to the "monitor" database, for query purposes.

Registering statistics and diagnostics

To export statistical information with the monitor system, code simply calls influxdb.NewStatistics() and receives an expvar.Map instance in response. This object can then be used to store statistics. To register diagnostic information, monitor.RegisterDiagnosticsClient is called, passing a influxdb.monitor.DiagsClient object to monitor.

expvar

Statistical information is gathered by each package using expvar. Each package registers a map using its package name.

Due to the nature of expvar, statistical information is reset to its initial state when a server is restarted.