All Products
Search
Document Center

Terms

Last Updated: Aug 07, 2020

An Influx Query Language (InfluxQL) function that returns an aggregated value across a set of points. For more information about a complete list of the available and upcoming aggregate functions, see InfluxQL functions.

Relevant terms: function, selector, and transformation

Batch

A collection of points that are separated by line feeds (0x0A) and conform to the line protocol format. You can use a single HTTP request to write a batch of points to a database. This greatly reduces the HTTP overhead and improves the performance of data writing through the HTTP API. For Time Series Database (TSDB) for InfluxDB® instances, we recommend batch sizes of 5,000 to 10,000 points. You can adjust the batch size to achieve better performance in different scenarios.

Relevant terms: line protocol and point

Continuous query (CQ)

An InfluxQL query that automatically and periodically runs in a database. Continuous queries require a function in the SELECT clause and must include a GROUP BY time() clause.

Relevant terms: function

Database

A logical container for users, retention policies (RPs), continuous queries, and time series data.

Relevant terms: continuous query, retention policy, and user

Duration

An attribute of a retention policy that determines how long TSDB for InfluxDB® stores data. Data generated before the duration is automatically deleted from the database.

Relevant terms: retention policy

Field

A key-value pair that records the metadata and actual data in the data structure of TSDB for InfluxDB®. Fields are required in the data structure of TSDB for InfluxDB® and are not indexed. If you use a field value as a query condition, all points in the selected period must be scanned. This is not as efficient as a tag-based query.

Tip: Tags are indexed in the database. Therefore, tag-based queries are more efficient.

Relevant terms: field key, field set, field value, and tag

Field key

The key in the key-value pair of a field. A field key is a string and stores metadata.

Relevant terms: field, field set, field value, and tag key

Field set

The collection of field keys and field values on a point.

Relevant terms: field, field key, field value, and point

Field value

The value in the key-value pair of a field. A field value is the actual data that can be a string, a floating-point number, an integer, or a Boolean value. A field value is always associated with a timestamp.

A field value is not indexed in a database. If the field value is used as a query condition, all points in the selected period must be scanned. This makes the query inefficient.

Tip: Tag values are indexed in the database. Therefore, tag value-based queries are more efficient.

Relevant terms: field, field key, field set, tag value, and timestamp

Function

InfluxQL aggregations, selectors, and transformations. For more information about the complete list of InfluxQL functions, see InfluxQL functions.

Relevant terms: aggregation, selector, and transformation

Identifier

Tokens that refer to continuous query names, database names, field keys, measurement names, retention policy names, subscription names, tag keys, and user names. For more information, see InfluxQL reference.

Relevant terms: database, field key, measurement, retention policy, tag key, and user

Line protocol

The text format for writing points to TSDB for InfluxDB®. For more information, see Line protocol.

Measurement

A part of the data structure of TSDB for InfluxDB® that describes the data stored in related fields. The value of a measurement is a string.

Relevant terms: field and series

Metastore

The internal information that contains the system status. The metastore contains the user information, databases, retention policies, shard metadata, and continuous queries.

Relevant terms: database, retention policy, and user

Node

An independent TSDB for InfluxDB® instance.

Relevant terms: server

now()

The current nanosecond-level timestamp of the local server.

Point

A part in the data structure of TSDB for InfluxDB®, which consists of fields contained in a series. Each point is uniquely identified by its series and timestamp.

You cannot store multiple points that have the same timestamp in a series. If you write a point whose timestamp is the same as that of an existing point in the series, the field set of the timestamp changes to the union of the old and new field sets, and the union is returned when this point is queried.

Relevant terms: field set, series, and timestamp

Points per second

This term has been deprecated. It was used to measure the rate at which data is written to TSDB for InfluxDB®. The schema of TSDB for InfluxDB® allows or even encourages the recording of multiple metric values per point. Therefore, this term is ambiguous.

Values per second is a more precise metric that is now used to represent the data writing rate.

Relevant terms: point, schema, and values per second

Query

The operation that retrieves data from TSDB for InfluxDB®. For more information, see Data exploration, Schema exploration, and Database management.

Replication factor

An attribute of a retention policy that determines the number of data replicas stored in a cluster. If TSDB for InfluxDB® replicates data on N nodes, N is the replication factor.

Note: The replication factor does not apply to single-node instances.

Relevant terms: duration, node, and retention policy

Retention policy (RP)

A part in the data structure of TSDB for InfluxDB®, which describes how long the database keeps data (duration), how many data replicas to store in the cluster (replication factor), and the time range covered by shard groups (shard duration). A retention policy is unique in each database. A retention policy, a measurement, and a tag set define a series.

When you create a database, TSDB For InfluxDB® creates a retention policy named autogen.

Note: The replication factor does not apply to single-node instances.

Relevant terms: duration, measurement, replication factor, series, shard duration, and tag set

Schema

A schema describes how data is organized in TSDB for InfluxDB®. The schema of TSDB for InfluxDB® is based on the databases, retention policies, series, measurements, tag keys, tag values, and field keys.

Relevant terms: database, field key, measurement, retention policy, series, tag key, and tag value

Selector

An InfluxQL function that returns a point from certain points. For more information about the complete list of the available and upcoming selectors, see InfluxQL functions.

Relevant terms: aggregation, function, and transformation

Series

A set of points that contain the same measurements, tag sets, and retention policies in the data structure of TSDB for InfluxDB®.

Note: A field set does not identify any part of a series.

Relevant terms: field set, measurement, retention policy, and tag set

Series cardinality

The number of combinations of different databases, measurements, tag sets, and field keys in a TSDB for InfluxDB® instance.

Assume that a TSDB for InfluxDB® instance has a database and a measurement, and the measurement has two tag keys: email and status. If the email tag has three different values, and each email address is associated with two different status tag values, the series cardinality of this measurement is 6 (3 x 2 = 6).

email status
amy@alibaba-inc.com start
amy@alibaba-inc.com finish
jeff@alibaba-inc.com start
jeff@alibaba-inc.com finish
john@alibaba-inc.com start
john@alibaba-inc.com finish

In some cases, if you simply perform the multiplication, the series cardinality may be overestimated due to dependent tags. A dependent tag is a tag whose range is restricted by another tag. It does not increase the series cardinality. If we add the firstname tag in the preceding example, the series cardinality does not become 18 (3 x 2 x 3 = 18). Instead, it remains 6 because firstname has been overwritten by email.

email status firstname
amy@alibaba-inc.com start amy
amy@alibaba-inc.com finish amy
jeff@alibaba-inc.com start jeff
jeff@alibaba-inc.com finish jeff
john@alibaba-inc.com start john
john@alibaba-inc.com finish john

For more information about how to run InfluxQL statements to query the series cardinality, see InfluxQL reference.

Relevant terms: field key, measurement, tag key, and tag set

Server

A virtual machine or physical machine that runs TSDB for InfluxDB®.

Relevant terms: node

Shard

A shard contains the actual encoded data and compressed data and is represented by a Time-Structured Merge Tree (TSM) file on a disk. Each shard belongs to only one shard group. Multiple shards may exist in a single shard group. Each shard contains a specific set of series. All points in a series in a given shard group are stored in the same shard (TSM file) on a disk.

Relevant terms: series, shard duration, shard group, and TSM

Shard duration

A shard duration determines how long each shard group spans. The specific duration is determined by SHARD DURATION in the retention policy.

For example, if SHARD DURATION in the retention policy is set to 1w, each shard group spans for a week and contains all points of the week.

Relevant terms: database, retention policy, series, shard, and shard group

Shard group

Shard groups are logical containers for shards and are organized by time and retention policy. Each retention policy that contains data must contain at least one associated shard group. A shard group contains all shards that have data that is generated within the duration spanned by the shard group. This duration refers to the shard duration.

Relevant terms: database, retention policy, series, shard, and shard duration

Tag

The key-value pair that records metadata in the data structure of TSDB for InfluxDB®. Tags are optional in the data structure of TSDB for InfluxDB®. Tags are indexed in a database. Therefore, you can improve the query performance by using tags to store frequently queried metadata.

Tip: Fields are not indexed in a database.

Relevant terms: field, tag key, tag set, and tag value

Tag key

The key in the key-value pair of a tag. A tag key is a string and stores metadata. Tag keys are indexed in a database. Therefore, queries based on tag keys are efficient.

Tip: Field keys are not indexed in a database.

Relevant terms: field key, tag, tag set, and tag value

Tag set

The collection of tag keys and tag values on a point.

Relevant terms: point, series, tag, tag key, and tag value

Tag value

The value in the key-value pair of a tag. A tag value is a string and stores metadata. Tag values are indexed in a database. Therefore, queries based on tag values are efficient.

Relevant terms: tag, tag key, and tag set

Timestamp

The date and time associated with a point. TSDB for InfluxDB® uses the UTC+0 time zone.

For more information about how to specify the data writing time, see the write protocol. For more information about how to specify the data query time, see Data exploration.

Relevant terms: point

Transformation

An InfluxQL function that returns a value or a value set computed at specific points, but not an aggregated value of these points. For more information about a complete list of the available and upcoming aggregate functions, see InfluxQL functions.

Relevant terms: aggregation, function, and selector

Time Structured Merge Tree (TSM)

The dedicated data storage format of TSDB for InfluxDB®. Compared with the implementation of the B+ tree or log structured merge tree (LSM tree), TSM provides better compression and higher write and read throughput.

User

TSDB for InfluxDB® has two types of users:

  • Admin users have read and write permissions on all databases, and all permissions to manage queries and users.
  • Non-admin users have read-only, write-only, or read and write permissions on databases.

Values per second

The preferred measurement of the rate at which data is written to TSDB For InfluxDB®. The write rate is usually quoted in values per second.

To calculate the values per second rate, multiply the number of points written per second by the number of values stored for each point. For example, if a batch containing 5,000 points is written 10 times per second and each point contains four fields, the write rate in values per second is calculated as follows: 4 x 5000 x 10 = 200000.

Relevant terms: batch, field, point, and points per second

Write Ahead Log (WAL)

The temporary cache of recently written points. To reduce the frequency of accessing permanent storage files, TSDB for InfluxDB® does not stop caching the recently written points in WAL until the total data volume reaches the threshold or the data writing time exceeds a certain period. In this case, TSDB for InfluxDB® flushes the data in WAL to a storage space that can store data for a longer period. The written data can be effectively written into the TSM file in batches by using WAL.

Points in WAL can be queried and will not be lost after the system restarts. When TSDB for InfluxDB® starts, it must flush all points in WAL to the storage space before the system accepts new write requests.

Relevant terms: TSM


InfluxDB® is a trademark registered by InfluxData, which is not affiliated with, and does not endorse, TSDB for InfluxDB®.