23 KiB
| title | seotitle | list_title | description | menu | weight | metadata | related | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Get started querying data | Query data | Get started with InfluxDB Cloud Dedicated | Query data | Get started querying data in InfluxDB Cloud Dedicated by learning about SQL and InfluxQL, and using tools like InfluxDB client libraries. |
|
102 |
|
|
{{% cloud-name %}} supports multiple query languages:
- SQL: Traditional SQL powered by the Apache Arrow DataFusion query engine. The supported SQL syntax is similar to PostgreSQL.
- InfluxQL: An SQL-like query language designed to query time series data stored in InfluxDB.
This tutorial walks you through the fundamentals of querying data in InfluxDB and focuses on using SQL to query your time series data. The InfluxDB SQL implementation is built using Arrow Flight SQL, a protocol for interacting with SQL databases using the Arrow in-memory format and the Flight RPC framework. It leverages the performance of Apache Arrow with the simplicity of SQL.
{{% note %}} The examples in this section of the tutorial query the get-started database for data written in the Get started writing data section. {{% /note %}}
Tools to execute queries
{{% cloud-name %}} supports many different tools for querying data, including:
{{< req type="key" text="Covered in this tutorial" color="magenta" >}}
influx3data CLI{{< req "* " >}}- InfluxDB v3 client libraries
- Flight SQL clients{{< req "* " >}}
- Superset
- Grafana
- InfluxQL with InfluxDB v1 HTTP API
- [Chronograf](/{{< latest "Chronograf" >}}/)
SQL query basics
The {{% cloud-name %}} SQL implementation is powered by the Apache Arrow DataFusion query engine which provides an SQL syntax similar to PostgreSQL.
{{% note %}} This is a brief introduction to writing SQL queries for InfluxDB. For more in-depth details, see Query data with SQL. {{% /note %}}
InfluxDB SQL queries most commonly include the following clauses:
{{< req type="key" >}}
- {{< req "*">}}
SELECT: Identify specific fields and tags to query from a measurement or use the wildcard alias (*) to select all fields and tags from a measurement. - {{< req "*">}}
FROM: Identify the measurement to query. If coming from an SQL background, an InfluxDB measurement is the equivalent of a relational table. WHERE: Only return data that meets defined conditions such as falling within a time range, containing specific tag values, etc.GROUP BY: Group data into SQL partitions and apply an aggregate or selector function to each group.
{{% influxdb/custom-timestamps %}}
-- Return the average temperature and humidity within time bounds from each room
SELECT
avg(temp),
avg(hum),
room
FROM
home
WHERE
time >= '2022-01-01T08:00:00Z'
AND time <= '2022-01-01T20:00:00Z'
GROUP BY
room
{{% /influxdb/custom-timestamps %}}
Example SQL queries
Select all data in a measurement
SELECT * FROM home
Select all data in a measurement within time bounds
SELECT
*
FROM
home
WHERE
time >= '2022-01-01T08:00:00Z'
AND time <= '2022-01-01T20:00:00Z'
Select a specific field within relative time bounds
SELECT temp FROM home WHERE time >= now() - INTERVAL '1 day'
Select specific fields and tags from a measurement
SELECT temp, room FROM home
Select data based on tag value
SELECT * FROM home WHERE room = 'Kitchen'
Select data based on tag value within time bounds
SELECT
*
FROM
home
WHERE
time >= '2022-01-01T08:00:00Z'
AND time <= '2022-01-01T20:00:00Z'
AND room = 'Living Room'
Downsample data by applying interval-based aggregates
SELECT
DATE_BIN(INTERVAL '1 hour', time, '2022-01-01T00:00:00Z'::TIMESTAMP) as _time,
room
selector_max(temp, time)['value'] AS 'max temp',
FROM
home
GROUP BY
_time,
'max temp',
room
ORDER BY room, _time
Execute an SQL query
Get started with one of the following tools for querying data stored in an {{% cloud-name %}} database:
- InfluxDB v3 client libraries: Use language-specific (Python, Go, etc.) clients to execute queries in your terminal or custom code.
- influx3 CLI: Send queries from your terminal command-line.
- Grafana: Query InfluxDB v3 with the FlightSQL Data Source plugin and connect and visualize data.
For this example, use the following query to select all the data written to the get-started database between {{% influxdb/custom-timestamps-span %}} 2022-01-01T08:00:00Z and 2022-01-01T20:00:00Z. {{% /influxdb/custom-timestamps-span %}}
{{% influxdb/custom-timestamps %}}
SELECT
*
FROM
home
WHERE
time >= '2022-01-01T08:00:00Z'
AND time <= '2022-01-01T20:00:00Z'
{{% /influxdb/custom-timestamps %}}
{{< tabs-wrapper >}} {{% tabs %}} influx3 CLI Python Go {{% /tabs %}} {{% tab-content %}}
{{% influxdb/custom-timestamps %}}
Query InfluxDB v3 using SQL and the influx3 CLI.
The following steps include setting up a Python virtual environment already covered in Get started writing data. If your project's virtual environment is already running, skip to step 3.
-
Setup your Python virtual environment. Inside of your project directory:
python -m venv envs/virtual-env -
Activate the virtual environment.
source ./envs/virtual-env/bin/activate -
Install the CLI package (already installed in the Write data section).
pip install influxdb3-python-cliInstalling
influxdb3-python-clialso installs thepyarrowlibrary for working with Arrow data returned from queries. -
Create the
config.jsonconfiguration.influx3 config \ --name="config-dedicated" \ --database="get-started" \ --host="cluster-id.influxdb.io" \ --token="DATABASE_TOKEN" \ --org="ORG_ID"Replace the following:
DATABASE_TOKEN: a database token with read access to the get-started databaseORG_ID: any non-empty string (InfluxDB ignores this parameter, but the client requires it)
-
Enter the
influx3 sqlcommand and your SQL query statement.
influx3 sql "SELECT *
FROM home
WHERE time >= '2022-01-01T08:00:00Z'
AND time <= '2022-01-01T20:00:00Z'"
influx3 displays query results in your terminal.
{{% /influxdb/custom-timestamps %}}
{{% /tab-content %}} {{% tab-content %}}
{{% influxdb/custom-timestamps %}}
Use the influxdb_client_3 module to integrate {{< cloud-name >}} with your Python code.
This module supports writing data to InfluxDB and querying data using SQL or InfluxQL.
The following steps include setting up a Python virtual environment already covered in Get started writing data. If your project's virtual environment is already running, skip to step 3.
-
Open a terminal in the
influxdb_py_clientmodule directory you created in the Write data section:-
To create your Python virtual environment, enter the following command in your terminal:
python -m venv envs/virtual-env -
Activate the virtual environment.
source ./envs/virtual-env/bin/activate -
Install the following dependencies:
{{< req type="key" text="Already installed in the Write data section" color="magenta" >}}
influxdb3-python{{< req text="* " color="magenta" >}}: Provides the InfluxDBinfluxdb_client_3Python client library module and also installs thepyarrowpackage for working with Arrow data returned from queries.pandas: Providespandasfunctions, modules, and data structures for analyzing and manipulating data.tabulate: Provides thetabulatefunction for formatting tabular data.
In your terminal, enter the following command:
pip install influxdb3-python pandas tabulate -
In your terminal or editor, create a new file for your code--for example:
query.py.
-
-
In
query.py, enter the following sample code:from influxdb_client_3 import InfluxDBClient3 import os # INFLUX_TOKEN is an environment variable you assigned to your database READ token string TOKEN = os.getenv('INFLUX_TOKEN') client = InfluxDBClient3( host="cluster-id.influxdb.io", token=TOKEN, database="get-started", ) sql = ''' SELECT * FROM home WHERE time >= '2022-01-01T08:00:00Z' AND time <= '2022-01-01T20:00:00Z' ''' table = client.query(query=sql) print(reader.to_pandas().to_markdown())The sample code does the following:
-
Imports the
InfluxDBClient3constructor from theinfluxdb_client_3module. -
Calls the
InfluxDBClient3()constructor method with credentials to instantiate an InfluxDBclientwith the following credentials:- host: {{% cloud-name %}} cluster URL (without
https://protocol or trailing slash) - token: a database token with read access to the get-started database. For security reasons, we recommend setting this as an environment variable rather than including the raw token string.
- database: the name of the {{% cloud-name %}} database to query
- host: {{% cloud-name %}} cluster URL (without
-
Defines the SQL query to execute and assigns it to a
queryvariable. -
Calls the
client.query()method with the SQL query.query()sends a Flight request to InfluxDB, queries the database, retrieves result data from the endpoint, and then returns a pyarrow.Table assigned to thetablevariable. -
Calls the
to_pandas()method to convert the Arrow table to a pandas DataFrame. -
Calls the
pandas.DataFrame.to_markdown()method to convert the DataFrame to a markdown table. -
Calls the
print()method to print the markdown table to stdout.
-
-
In your terminal, enter the following command to run the program and query your {{% cloud-name %}} cluster:
python query.py
{{< expand-wrapper >}} {{% expand "View returned markdown table" %}}
| co | hum | room | temp | time | |
|---|---|---|---|---|---|
| 0 | 0 | 35.9 | Kitchen | 21 | 2022-01-01 08:00:00 |
| 1 | 0 | 36.2 | Kitchen | 23 | 2022-01-01 09:00:00 |
| 2 | 0 | 36.1 | Kitchen | 22.7 | 2022-01-01 10:00:00 |
| 3 | 0 | 36 | Kitchen | 22.4 | 2022-01-01 11:00:00 |
| 4 | 0 | 36 | Kitchen | 22.5 | 2022-01-01 12:00:00 |
| 5 | 1 | 36.5 | Kitchen | 22.8 | 2022-01-01 13:00:00 |
| 6 | 1 | 36.3 | Kitchen | 22.8 | 2022-01-01 14:00:00 |
| 7 | 3 | 36.2 | Kitchen | 22.7 | 2022-01-01 15:00:00 |
| 8 | 7 | 36 | Kitchen | 22.4 | 2022-01-01 16:00:00 |
| 9 | 9 | 36 | Kitchen | 22.7 | 2022-01-01 17:00:00 |
| 10 | 18 | 36.9 | Kitchen | 23.3 | 2022-01-01 18:00:00 |
| 11 | 22 | 36.6 | Kitchen | 23.1 | 2022-01-01 19:00:00 |
| 12 | 26 | 36.5 | Kitchen | 22.7 | 2022-01-01 20:00:00 |
| 13 | 0 | 35.9 | Living Room | 21.1 | 2022-01-01 08:00:00 |
| 14 | 0 | 35.9 | Living Room | 21.4 | 2022-01-01 09:00:00 |
| 15 | 0 | 36 | Living Room | 21.8 | 2022-01-01 10:00:00 |
| 16 | 0 | 36 | Living Room | 22.2 | 2022-01-01 11:00:00 |
| 17 | 0 | 35.9 | Living Room | 22.2 | 2022-01-01 12:00:00 |
| 18 | 0 | 36 | Living Room | 22.4 | 2022-01-01 13:00:00 |
| 19 | 0 | 36.1 | Living Room | 22.3 | 2022-01-01 14:00:00 |
| 20 | 1 | 36.1 | Living Room | 22.3 | 2022-01-01 15:00:00 |
| 21 | 4 | 36 | Living Room | 22.4 | 2022-01-01 16:00:00 |
| 22 | 5 | 35.9 | Living Room | 22.6 | 2022-01-01 17:00:00 |
| 23 | 9 | 36.2 | Living Room | 22.8 | 2022-01-01 18:00:00 |
| 24 | 14 | 36.3 | Living Room | 22.5 | 2022-01-01 19:00:00 |
| 25 | 17 | 36.4 | Living Room | 22.2 | 2022-01-01 20:00:00 |
{{% /expand %}} {{< /expand-wrapper >}} {{% /influxdb/custom-timestamps %}}
{{% /tab-content %}} {{% tab-content %}}
{{% influxdb/custom-timestamps %}}
-
In the
influxdb_go_clientdirectory you created in the Write data section, create a new file namedquery.go. -
In
query.go, enter the following sample code:package main import ( "context" "crypto/x509" "encoding/json" "fmt" "os" "github.com/apache/arrow/go/v12/arrow/flight/flightsql" "google.golang.org/grpc" "google.golang.org/grpc/credentials" "google.golang.org/grpc/metadata" ) func dbQuery(ctx context.Context) error { url := "cluster-id.influxdb.io:443" // INFLUX_TOKEN is an environment variable you created for your database READ token token := os.Getenv("INFLUX_TOKEN") database := "get-started" // Create a gRPC transport pool, err := x509.SystemCertPool() if err != nil { return fmt.Errorf("x509: %s", err) } transport := grpc.WithTransportCredentials(credentials.NewClientTLSFromCert(pool, "")) opts := []grpc.DialOption{ transport, } // Create query client client, err := flightsql.NewClient(url, nil, nil, opts...) if err != nil { return fmt.Errorf("flightsql: %s", err) } ctx = metadata.AppendToOutgoingContext(ctx, "authorization", "Bearer "+token) ctx = metadata.AppendToOutgoingContext(ctx, "database", database) // Execute query query := `SELECT * FROM home WHERE time >= '2022-01-01T08:00:00Z' AND time <= '2022-01-01T20:00:00Z'` info, err := client.Execute(ctx, query) if err != nil { return fmt.Errorf("flightsql flight info: %s", err) } reader, err := client.DoGet(ctx, info.Endpoint[0].Ticket) if err != nil { return fmt.Errorf("flightsql do get: %s", err) } // Print results as JSON for reader.Next() { record := reader.Record() b, err := json.MarshalIndent(record, "", " ") if err != nil { return err } fmt.Println("RECORD BATCH") fmt.Println(string(b)) if err := reader.Err(); err != nil { return fmt.Errorf("flightsql reader: %s", err) } } return nil } func main() { if err := dbQuery(context.Background()); err != nil { fmt.Fprintf(os.Stderr, "error: %v\n", err) os.Exit(1) } }The sample code does the following:
-
Imports the following packages:
contextcrypto/x509encoding/jsonfmtosgithub.com/apache/arrow/go/v12/arrow/flight/flightsqlgoogle.golang.org/grpcgoogle.golang.org/grpc/credentialsgoogle.golang.org/grpc/metadata
-
Creates a
dbQueryfunction that does the following:-
Defines variables for InfluxDB credentials.
- url: {{% cloud-name %}} region hostname and port (
:443) (no protocol) - token: a database token with read access to the specified database. For security reasons, we recommend setting this as an environment variable rather than including the raw token string.
- database: the name of the {{% cloud-name %}} database to query
- url: {{% cloud-name %}} region hostname and port (
-
Defines an
optsoptions list that includes a gRPC transport for communicating with {{% cloud-name %}} over the gRPC+TLS protocol. -
Calls the
flightsql.NewClient()method withurlandoptsto create a new Flight SQL client. -
Appends the following InfluxDB credentials as key-value pairs to the outgoing context:
- authorization: Bearer <INFLUX_TOKEN>
- database-name: Database name
-
Define the SQL query to execute.
-
Calls the
client.execute()method to send the query request. -
Calls the
client.doGet()method with the ticket from the query response to retrieve result data from the endpoint. -
Creates a reader to read the Arrow table returned by the endpoint and print the results as JSON.
-
-
Creates a
mainmodule function that executes thedbQueryfunction.
-
-
In your terminal, enter the following command to install all the necessary packages and run the program to query {{% cloud-name %}}.
go get ./...
go run ./query.go
{{< expand-wrapper >}} {{% expand "View program output" %}}
RECORD BATCH
[
{
"co": 0,
"hum": 35.9,
"room": "Kitchen",
"temp": 21,
"time": "2022-01-01 08:00:00"
},
{
"co": 0,
"hum": 36.2,
"room": "Kitchen",
"temp": 23,
"time": "2022-01-01 09:00:00"
},
{
"co": 0,
"hum": 36.1,
"room": "Kitchen",
"temp": 22.7,
"time": "2022-01-01 10:00:00"
},
{
"co": 0,
"hum": 36,
"room": "Kitchen",
"temp": 22.4,
"time": "2022-01-01 11:00:00"
},
{
"co": 0,
"hum": 36,
"room": "Kitchen",
"temp": 22.5,
"time": "2022-01-01 12:00:00"
},
{
"co": 1,
"hum": 36.5,
"room": "Kitchen",
"temp": 22.8,
"time": "2022-01-01 13:00:00"
},
{
"co": 1,
"hum": 36.3,
"room": "Kitchen",
"temp": 22.8,
"time": "2022-01-01 14:00:00"
},
{
"co": 3,
"hum": 36.2,
"room": "Kitchen",
"temp": 22.7,
"time": "2022-01-01 15:00:00"
},
{
"co": 7,
"hum": 36,
"room": "Kitchen",
"temp": 22.4,
"time": "2022-01-01 16:00:00"
},
{
"co": 9,
"hum": 36,
"room": "Kitchen",
"temp": 22.7,
"time": "2022-01-01 17:00:00"
},
{
"co": 18,
"hum": 36.9,
"room": "Kitchen",
"temp": 23.3,
"time": "2022-01-01 18:00:00"
},
{
"co": 22,
"hum": 36.6,
"room": "Kitchen",
"temp": 23.1,
"time": "2022-01-01 19:00:00"
},
{
"co": 26,
"hum": 36.5,
"room": "Kitchen",
"temp": 22.7,
"time": "2022-01-01 20:00:00"
},
{
"co": 0,
"hum": 35.9,
"room": "Living Room",
"temp": 21.1,
"time": "2022-01-01 08:00:00"
},
{
"co": 0,
"hum": 35.9,
"room": "Living Room",
"temp": 21.4,
"time": "2022-01-01 09:00:00"
},
{
"co": 0,
"hum": 36,
"room": "Living Room",
"temp": 21.8,
"time": "2022-01-01 10:00:00"
},
{
"co": 0,
"hum": 36,
"room": "Living Room",
"temp": 22.2,
"time": "2022-01-01 11:00:00"
},
{
"co": 0,
"hum": 35.9,
"room": "Living Room",
"temp": 22.2,
"time": "2022-01-01 12:00:00"
},
{
"co": 0,
"hum": 36,
"room": "Living Room",
"temp": 22.4,
"time": "2022-01-01 13:00:00"
},
{
"co": 0,
"hum": 36.1,
"room": "Living Room",
"temp": 22.3,
"time": "2022-01-01 14:00:00"
},
{
"co": 1,
"hum": 36.1,
"room": "Living Room",
"temp": 22.3,
"time": "2022-01-01 15:00:00"
},
{
"co": 4,
"hum": 36,
"room": "Living Room",
"temp": 22.4,
"time": "2022-01-01 16:00:00"
},
{
"co": 5,
"hum": 35.9,
"room": "Living Room",
"temp": 22.6,
"time": "2022-01-01 17:00:00"
},
{
"co": 9,
"hum": 36.2,
"room": "Living Room",
"temp": 22.8,
"time": "2022-01-01 18:00:00"
},
{
"co": 14,
"hum": 36.3,
"room": "Living Room",
"temp": 22.5,
"time": "2022-01-01 19:00:00"
},
{
"co": 17,
"hum": 36.4,
"room": "Living Room",
"temp": 22.2,
"time": "2022-01-01 20:00:00"
}
]
{{% /expand %}} {{< /expand-wrapper >}} {{% /influxdb/custom-timestamps %}}
{{% /tab-content %}} {{< /tabs-wrapper >}}
Congratulations! You've learned the basics of querying data in InfluxDB with SQL. For a deep dive into all the ways you can query {{% cloud-name %}}, see the Query data in InfluxDB section of documentation.
{{< page-nav prev="/influxdb/cloud-dedicated/get-started/write/" keepTab=true >}}