338 lines
20 KiB
Markdown
338 lines
20 KiB
Markdown
---
|
|
title: InfluxDB glossary of terms
|
|
menu:
|
|
influxdb_1_5:
|
|
name: Glossary of terms
|
|
weight: 20
|
|
parent: Concepts
|
|
---
|
|
|
|
## aggregation
|
|
An InfluxQL function that returns an aggregated value across a set of points.
|
|
See [InfluxQL Functions](/influxdb/v1.5/query_language/functions/#aggregations) for a complete list of the available and upcoming aggregations.
|
|
|
|
Related entries: [function](/influxdb/v1.5/concepts/glossary/#function), [selector](/influxdb/v1.5/concepts/glossary/#selector), [transformation](/influxdb/v1.5/concepts/glossary/#transformation)
|
|
|
|
## batch
|
|
A collection of points in line protocol format, separated by newlines (`0x0A`).
|
|
A batch of points may be submitted to the database using a single HTTP request to the write endpoint.
|
|
This makes writes via the HTTP API much more performant by drastically reducing the HTTP overhead.
|
|
InfluxData recommends batch sizes of 5,000-10,000 points, although different use cases may be better served by significantly smaller or larger batches.
|
|
|
|
Related entries: [line protocol](/influxdb/v1.5/concepts/glossary/#line-protocol), [point](/influxdb/v1.5/concepts/glossary/#point)
|
|
|
|
## continuous query (CQ)
|
|
An InfluxQL query that runs automatically and periodically within a database.
|
|
Continuous queries require a function in the `SELECT` clause and must include a `GROUP BY time()` clause.
|
|
See [Continuous Queries](/influxdb/v1.5/query_language/continuous_queries/).
|
|
|
|
|
|
Related entries: [function](/influxdb/v1.5/concepts/glossary/#function)
|
|
|
|
## database
|
|
A logical container for users, retention policies, continuous queries, and time series data.
|
|
|
|
Related entries: [continuous query](/influxdb/v1.5/concepts/glossary/#continuous-query-cq), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp), [user](/influxdb/v1.5/concepts/glossary/#user)
|
|
|
|
## duration
|
|
The attribute of the retention policy that determines how long InfluxDB stores data.
|
|
Data older than the duration are automatically dropped from the database.
|
|
See [Database Management](/influxdb/v1.5/query_language/database_management/#create-retention-policies-with-create-retention-policy) for how to set duration.
|
|
|
|
Related entries: [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp)
|
|
|
|
## field
|
|
The key-value pair in InfluxDB's data structure that records metadata and the actual data value.
|
|
Fields are required in InfluxDB's data structure and they are not indexed - queries on field values scan all points that match the specified time range and, as a result, are not performant relative to tags.
|
|
|
|
*Query tip:* Compare fields to tags; tags are indexed.
|
|
|
|
Related entries: [field key](/influxdb/v1.5/concepts/glossary/#field-key), [field set](/influxdb/v1.5/concepts/glossary/#field-set), [field value](/influxdb/v1.5/concepts/glossary/#field-value), [tag](/influxdb/v1.5/concepts/glossary/#tag)
|
|
|
|
## field key
|
|
The key part of the key-value pair that makes up a field.
|
|
Field keys are strings and they store metadata.
|
|
|
|
Related entries: [field](/influxdb/v1.5/concepts/glossary/#field), [field set](/influxdb/v1.5/concepts/glossary/#field-set), [field value](/influxdb/v1.5/concepts/glossary/#field-value), [tag key](/influxdb/v1.5/concepts/glossary/#tag-key)
|
|
|
|
## field set
|
|
The collection of field keys and field values on a point.
|
|
|
|
Related entries: [field](/influxdb/v1.5/concepts/glossary/#field), [field key](/influxdb/v1.5/concepts/glossary/#field-key), [field value](/influxdb/v1.5/concepts/glossary/#field-value), [point](/influxdb/v1.5/concepts/glossary/#point)
|
|
|
|
## field value
|
|
The value part of the key-value pair that makes up a field.
|
|
Field values are the actual data; they can be strings, floats, integers, or booleans.
|
|
A field value is always associated with a timestamp.
|
|
|
|
Field values are not indexed - queries on field values scan all points that match the specified time range and, as a result, are not performant.
|
|
|
|
*Query tip:* Compare field values to tag values; tag values are indexed.
|
|
|
|
Related entries: [field](/influxdb/v1.5/concepts/glossary/#field), [field key](/influxdb/v1.5/concepts/glossary/#field-key), [field set](/influxdb/v1.5/concepts/glossary/#field-set), [tag value](/influxdb/v1.5/concepts/glossary/#tag-value), [timestamp](/influxdb/v1.5/concepts/glossary/#timestamp)
|
|
|
|
## function
|
|
InfluxQL aggregations, selectors, and transformations.
|
|
See [InfluxQL Functions](/influxdb/v1.5/query_language/functions/) for a complete list of InfluxQL functions.
|
|
|
|
Related entries: [aggregation](/influxdb/v1.5/concepts/glossary/#aggregation), [selector](/influxdb/v1.5/concepts/glossary/#selector), [transformation](/influxdb/v1.5/concepts/glossary/#transformation)
|
|
|
|
## identifier
|
|
Tokens that refer to continuous query names, database names, field keys,
|
|
measurement names, retention policy names, subscription names, tag keys, and
|
|
user names.
|
|
See [Query Language Specification](/influxdb/v1.5/query_language/spec/#identifiers).
|
|
|
|
Related entries:
|
|
[database](/influxdb/v1.5/concepts/glossary/#database),
|
|
[field key](/influxdb/v1.5/concepts/glossary/#field-key),
|
|
[measurement](/influxdb/v1.5/concepts/glossary/#measurement),
|
|
[retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp),
|
|
[tag key](/influxdb/v1.5/concepts/glossary/#tag-key),
|
|
[user](/influxdb/v1.5/concepts/glossary/#user)
|
|
|
|
## line protocol
|
|
The text based format for writing points to InfluxDB. See [Line Protocol](/influxdb/v1.5/write_protocols/).
|
|
|
|
## measurement
|
|
The part of InfluxDB's structure that describes the data stored in the associated fields.
|
|
Measurements are strings.
|
|
|
|
Related entries: [field](/influxdb/v1.5/concepts/glossary/#field), [series](/influxdb/v1.5/concepts/glossary/#series)
|
|
|
|
## metastore
|
|
Contains internal information about the status of the system.
|
|
The metastore contains the user information, databases, retention policies, shard metadata, continuous queries, and subscriptions.
|
|
|
|
Related entries: [database](/influxdb/v1.5/concepts/glossary/#database), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp), [user](/influxdb/v1.5/concepts/glossary/#user)
|
|
|
|
## node
|
|
An independent `influxd` process.
|
|
|
|
Related entries: [server](/influxdb/v1.5/concepts/glossary/#server)
|
|
|
|
## now()
|
|
The local server's nanosecond timestamp.
|
|
|
|
## point
|
|
The part of InfluxDB's data structure that consists of a single collection of fields in a series.
|
|
Each point is uniquely identified by its series and timestamp.
|
|
|
|
You cannot store more than one point with the same timestamp in the same series.
|
|
Instead, when you write a new point to the same series with the same timestamp as an existing point in that series, the field set becomes the union of the old field set and the new field set, where any ties go to the new field set.
|
|
For an example, see [Frequently Asked Questions](/influxdb/v1.5/troubleshooting/frequently-asked-questions/#how-does-influxdb-handle-duplicate-points).
|
|
|
|
Related entries: [field set](/influxdb/v1.5/concepts/glossary/#field-set), [series](/influxdb/v1.5/concepts/glossary/#series), [timestamp](/influxdb/v1.5/concepts/glossary/#timestamp)
|
|
|
|
## points per second
|
|
A deprecated measurement of the rate at which data are persisted to InfluxDB.
|
|
The schema allows and even encourages the recording of multiple metric values per point, rendering points per second ambiguous.
|
|
|
|
Write speeds are generally quoted in values per second, a more precise metric.
|
|
|
|
Related entries: [point](/influxdb/v1.5/concepts/glossary/#point), [schema](/influxdb/v1.5/concepts/glossary/#schema), [values per second](/influxdb/v1.5/concepts/glossary/#values-per-second)
|
|
|
|
## query
|
|
An operation that retrieves data from InfluxDB.
|
|
See [Data Exploration](/influxdb/v1.5/query_language/data_exploration/), [Schema Exploration](/influxdb/v1.5/query_language/schema_exploration/), [Database Management](/influxdb/v1.5/query_language/database_management/).
|
|
|
|
## replication factor
|
|
The attribute of the retention policy that determines how many copies of the data are stored in the cluster.
|
|
InfluxDB replicates data across `N` data nodes, where `N` is the replication factor.
|
|
|
|
{{% warn %}} Replication factors do not serve a purpose with single node instances.
|
|
{{% /warn %}}
|
|
|
|
Related entries: [duration](/influxdb/v1.5/concepts/glossary/#duration), [node](/influxdb/v1.5/concepts/glossary/#node), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp)
|
|
|
|
## retention policy (RP)
|
|
The part of InfluxDB's data structure that describes for how long InfluxDB keeps data (duration), how many copies of this data is stored in the cluster (replication factor), and the time range covered by shard groups (shard group duration).
|
|
RPs are unique per database and along with the measurement and tag set define a series.
|
|
|
|
When you create a database, InfluxDB automatically creates a retention policy called `autogen` with an infinite duration, a replication factor set to one, and a shard group duration set to seven days.
|
|
See [Database Management](/influxdb/v1.5/query_language/database_management/#retention-policy-management) for retention policy management.
|
|
|
|
{{% warn %}} Replication factors do not serve a purpose with single node instances.
|
|
{{% /warn %}}
|
|
|
|
Related entries: [duration](/influxdb/v1.5/concepts/glossary/#duration), [measurement](/influxdb/v1.5/concepts/glossary/#measurement), [replication factor](/influxdb/v1.5/concepts/glossary/#replication-factor), [series](/influxdb/v1.5/concepts/glossary/#series), [shard duration](/influxdb/v1.5/concepts/glossary/#shard-duration), [tag set](/influxdb/v1.5/concepts/glossary/#tag-set)
|
|
|
|
## schema
|
|
How the data are organized in InfluxDB.
|
|
The fundamentals of the InfluxDB schema are databases, retention policies, series, measurements, tag keys, tag values, and field keys.
|
|
See [Schema Design](/influxdb/v1.5/concepts/schema_and_data_layout/) for more information.
|
|
|
|
Related entries: [database](/influxdb/v1.5/concepts/glossary/#database), [field key](/influxdb/v1.5/concepts/glossary/#field-key), [measurement](/influxdb/v1.5/concepts/glossary/#measurement), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp), [series](/influxdb/v1.5/concepts/glossary/#series), [tag key](/influxdb/v1.5/concepts/glossary/#tag-key), [tag value](/influxdb/v1.5/concepts/glossary/#tag-value)
|
|
|
|
## selector
|
|
An InfluxQL function that returns a single point from the range of specified points.
|
|
See [InfluxQL Functions](/influxdb/v1.5/query_language/functions/#selectors) for a complete list of the available and upcoming selectors.
|
|
|
|
Related entries: [aggregation](/influxdb/v1.5/concepts/glossary/#aggregation), [function](/influxdb/v1.5/concepts/glossary/#function), [transformation](/influxdb/v1.5/concepts/glossary/#transformation)
|
|
|
|
## series
|
|
The collection of data in InfluxDB's data structure that share a measurement, tag set, and retention policy.
|
|
|
|
|
|
> **Note:** The field set is not part of the series identification!
|
|
|
|
Related entries: [field set](/influxdb/v1.5/concepts/glossary/#field-set), [measurement](/influxdb/v1.5/concepts/glossary/#measurement), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp), [tag set](/influxdb/v1.5/concepts/glossary/#tag-set)
|
|
|
|
## series cardinality
|
|
The number of unique database, measurement, tag set, and field key combinations in an InfluxDB instance.
|
|
|
|
For example, assume that an InfluxDB instance has a single database and one measurement.
|
|
The single measurement has two tag keys: `email` and `status`.
|
|
If there are three different `email`s, and each email address is associated with two
|
|
different `status`es then the series cardinality for the measurement is 6
|
|
(3 * 2 = 6):
|
|
|
|
| email | status |
|
|
| :-------------------- | :----- |
|
|
| lorr@influxdata.com | start |
|
|
| lorr@influxdata.com | finish |
|
|
| marv@influxdata.com | start |
|
|
| marv@influxdata.com | finish |
|
|
| cliff@influxdata.com | start |
|
|
| cliff@influxdata.com | finish |
|
|
|
|
Note that, in some cases, simply performing that multiplication may overestimate series cardinality because of the presence of dependent tags.
|
|
Dependent tags are tags that are scoped by another tag and do not increase series
|
|
cardinality.
|
|
If we add the tag `firstname` to the example above, the series cardinality
|
|
would not be 18 (3 * 2 * 3 = 18).
|
|
It would remain unchanged at 6, as `firstname` is already scoped by the `email` tag:
|
|
|
|
| email | status | firstname |
|
|
| :-------------------- | :----- | :-------- |
|
|
| lorr@influxdata.com | start | lorraine |
|
|
| lorr@influxdata.com | finish | lorraine |
|
|
| marv@influxdata.com | start | marvin |
|
|
| marv@influxdata.com | finish | marvin |
|
|
| cliff@influxdata.com | start | clifford |
|
|
| cliff@influxdata.com | finish | clifford |
|
|
|
|
See [SHOW CARDINALITY](/influxdb/v1.5/query_language/spec/#show-cardinality) to learn about the InfluxQL commands for series cardinality.
|
|
|
|
Related entries: [field key](#field-key),[measurement](#measurement), [tag key](#tag-key), [tag set](#tag-set)
|
|
|
|
## server
|
|
A machine, virtual or physical, that is running InfluxDB.
|
|
There should only be one InfluxDB process per server.
|
|
|
|
Related entries: [node](/influxdb/v1.5/concepts/glossary/#node)
|
|
|
|
## shard
|
|
|
|
A shard contains the actual encoded and compressed data, and is represented by a TSM file on disk.
|
|
Every shard belongs to one and only one shard group.
|
|
Multiple shards may exist in a single shard group.
|
|
Each shard contains a specific set of series.
|
|
All points falling on a given series in a given shard group will be stored in the same shard (TSM file) on disk.
|
|
|
|
Related entries: [series](/influxdb/v1.5/concepts/glossary/#series), [shard duration](/influxdb/v1.5/concepts/glossary/#shard-duration), [shard group](/influxdb/v1.5/concepts/glossary/#shard-group), [tsm](/influxdb/v1.5/concepts/glossary/#tsm-time-structured-merge-tree)
|
|
|
|
## shard duration
|
|
|
|
The shard duration determines how much time each shard group spans.
|
|
The specific interval is determined by the `SHARD DURATION` of the retention policy.
|
|
See [Retention Policy management](/influxdb/v1.5/query_language/database_management/#retention-policy-management) for more information.
|
|
|
|
For example, given a retention policy with `SHARD DURATION` set to `1w`, each shard group will span a single week and contain all points with timestamps in that week.
|
|
|
|
Related entries: [database](/influxdb/v1.5/concepts/glossary/#database), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp), [series](/influxdb/v1.5/concepts/glossary/#series), [shard](/influxdb/v1.5/concepts/glossary/#shard), [shard group](/influxdb/v1.5/concepts/glossary/#shard-group)
|
|
|
|
## shard group
|
|
|
|
Shard groups are logical containers for shards.
|
|
Shard groups are organized by time and retention policy.
|
|
Every retention policy that contains data has at least one associated shard group.
|
|
A given shard group contains all shards with data for the interval covered by the shard group.
|
|
The interval spanned by each shard group is the shard duration.
|
|
|
|
Related entries: [database](/influxdb/v1.5/concepts/glossary/#database), [retention policy](/influxdb/v1.5/concepts/glossary/#retention-policy-rp), [series](/influxdb/v1.5/concepts/glossary/#series), [shard](/influxdb/v1.5/concepts/glossary/#shard), [shard duration](/influxdb/v1.5/concepts/glossary/#shard-duration)
|
|
|
|
## subscription
|
|
Subscriptions allow [Kapacitor](/{{< latest "kapacitor" >}}/) to receive data from InfluxDB in a push model rather than the pull model based on querying data.
|
|
When Kapacitor is configured to work with InfluxDB, the subscription will automatically push every write for the subscribed database from InfluxDB to Kapacitor.
|
|
Subscriptions can use TCP or UDP for transmitting the writes.
|
|
|
|
## tag
|
|
The key-value pair in InfluxDB's data structure that records metadata.
|
|
Tags are an optional part of InfluxDB's data structure but they are useful for storing commonly queried metadata; tags are indexed so queries on tags are performant.
|
|
*Query tip:* Compare tags to fields; fields are not indexed.
|
|
|
|
Related entries: [field](/influxdb/v1.5/concepts/glossary/#field), [tag key](/influxdb/v1.5/concepts/glossary/#tag-key), [tag set](/influxdb/v1.5/concepts/glossary/#tag-set), [tag value](/influxdb/v1.5/concepts/glossary/#tag-value)
|
|
|
|
## tag key
|
|
The key part of the key-value pair that makes up a tag.
|
|
Tag keys are strings and they store metadata.
|
|
Tag keys are indexed so queries on tag keys are performant.
|
|
|
|
*Query tip:* Compare tag keys to field keys; field keys are not indexed.
|
|
|
|
Related entries: [field key](/influxdb/v1.5/concepts/glossary/#field-key), [tag](/influxdb/v1.5/concepts/glossary/#tag), [tag set](/influxdb/v1.5/concepts/glossary/#tag-set), [tag value](/influxdb/v1.5/concepts/glossary/#tag-value)
|
|
|
|
## tag set
|
|
The collection of tag keys and tag values on a point.
|
|
|
|
Related entries: [point](/influxdb/v1.5/concepts/glossary/#point), [series](/influxdb/v1.5/concepts/glossary/#series), [tag](/influxdb/v1.5/concepts/glossary/#tag), [tag key](/influxdb/v1.5/concepts/glossary/#tag-key), [tag value](/influxdb/v1.5/concepts/glossary/#tag-value)
|
|
|
|
## tag value
|
|
The value part of the key-value pair that makes up a tag.
|
|
Tag values are strings and they store metadata.
|
|
Tag values are indexed so queries on tag values are performant.
|
|
|
|
|
|
Related entries: [tag](/influxdb/v1.5/concepts/glossary/#tag), [tag key](/influxdb/v1.5/concepts/glossary/#tag-key), [tag set](/influxdb/v1.5/concepts/glossary/#tag-set)
|
|
|
|
## timestamp
|
|
The date and time associated with a point.
|
|
All time in InfluxDB is UTC.
|
|
|
|
For how to specify time when writing data, see [Write Syntax](/influxdb/v1.5/write_protocols/line_protocol_reference/).
|
|
For how to specify time when querying data, see [Data Exploration](/influxdb/v1.5/query_language/data_exploration/#time-syntax).
|
|
|
|
Related entries: [point](/influxdb/v1.5/concepts/glossary/#point)
|
|
|
|
## transformation
|
|
An InfluxQL function that returns a value or a set of values calculated from specified points, but does not return an aggregated value across those points.
|
|
See [InfluxQL Functions](/influxdb/v1.5/query_language/functions/#transformations) for a complete list of the available and upcoming aggregations.
|
|
|
|
Related entries: [aggregation](/influxdb/v1.5/concepts/glossary/#aggregation), [function](/influxdb/v1.5/concepts/glossary/#function), [selector](/influxdb/v1.5/concepts/glossary/#selector)
|
|
|
|
## tsm (Time Structured Merge tree)
|
|
The purpose-built data storage format for InfluxDB. TSM allows for greater compaction and higher write and read throughput than existing B+ or LSM tree implementations. See [Storage Engine](/influxdb/v1.5/concepts/storage_engine/) for more.
|
|
|
|
## user
|
|
There are two kinds of users in InfluxDB:
|
|
|
|
* *Admin users* have `READ` and `WRITE` access to all databases and full access to administrative queries and user management commands.
|
|
* *Non-admin users* have `READ`, `WRITE`, or `ALL` (both `READ` and `WRITE`) access per database.
|
|
|
|
When authentication is enabled, InfluxDB only executes HTTP requests that are sent with a valid username and password.
|
|
See [Authentication and Authorization](/influxdb/v1.5/query_language/authentication_and_authorization/).
|
|
|
|
## values per second
|
|
The preferred measurement of the rate at which data are persisted to InfluxDB. Write speeds are generally quoted in values per second.
|
|
|
|
To calculate the values per second rate, multiply the number of points written per second by the number of values stored per point. For example, if the points have four fields each, and a batch of 5000 points is written 10 times per second, then the values per second rate is `4 field values per point * 5000 points per batch * 10 batches per second = 200,000 values per second`.
|
|
|
|
Related entries: [batch](/influxdb/v1.5/concepts/glossary/#batch), [field](/influxdb/v1.5/concepts/glossary/#field), [point](/influxdb/v1.5/concepts/glossary/#point), [points per second](/influxdb/v1.5/concepts/glossary/#points-per-second)
|
|
|
|
## wal (Write Ahead Log)
|
|
The temporary cache for recently written points. To reduce the frequency with which the permanent storage files are accessed, InfluxDB caches new points in the WAL until their total size or age triggers a flush to more permanent storage. This allows for efficient batching of the writes into the TSM.
|
|
|
|
Points in the WAL can be queried, and they persist through a system reboot. On process start, all points in the WAL must be flushed before the system accepts new writes.
|
|
|
|
Related entries: [tsm](/influxdb/v1.5/concepts/glossary/#tsm-time-structured-merge-tree)
|
|
|
|
<!--
|
|
|
|
|
|
|
|
## shard
|
|
|
|
## shard group
|
|
-->
|