5143-Add Optimize Queries page with query analysis help (#5165)

* chore(test): Use my python client fork (pending approval) to allow custom headers.

* feature(query): Add Optimize Queries page with query analysis help

- Closes #5143 (Client library query traces: Python)
- Dedicated and Clustered examples for enabling query tracing and extracting headers
- System.queries table
- Explain and Analyze
- For now, skip tests for sample Flight responses until we add code samples.

* Update content/influxdb/clustered/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/cloud-dedicated/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/cloud-dedicated/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/cloud-dedicated/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/clustered/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/clustered/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/clustered/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/clustered/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/cloud-dedicated/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* Update content/influxdb/cloud-dedicated/query-data/execute-queries/optimize-queries.md

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>

* feat(v3): influx-trace-id for dedicated, tracing not ready for clustered (Client library query traces: Python #5143)

---------

Co-authored-by: Scott Anderson <sanderson@users.noreply.github.com>
Jason Stirnaman 2023-10-16 15:08:40 -05:00 committed by GitHub
parent 6be4bbd3bc
commit 5ad8e80361
10 changed files with 740 additions and 21 deletions


@@ -25,6 +25,7 @@ related:
- /influxdb/cloud-dedicated/query-data/sql/
- /influxdb/cloud-dedicated/reference/influxql/
- /influxdb/cloud-dedicated/reference/sql/
- /influxdb/cloud-dedicated/query-data/execute-queries/troubleshoot/
list_code_example: |
```py
@@ -305,7 +306,7 @@ and specify the following arguments:
#### Example {#execute-query-example}
The following examples shows how to use SQL or InfluxQL to select all fields in a measurement, and then output the results formatted as a Markdown table.
The following example shows how to use SQL or InfluxQL to select all fields in a measurement, and then use PyArrow functions to extract metadata and aggregate data.
{{% code-tabs-wrapper %}}
{{% code-tabs %}}


@@ -0,0 +1,442 @@
---
title: Optimize queries
description: >
Optimize your SQL and InfluxQL queries to improve performance and reduce their memory and compute (CPU) requirements.
weight: 401
menu:
influxdb_cloud_dedicated:
name: Optimize queries
parent: Execute queries
influxdb/cloud-dedicated/tags: [query, sql, influxql]
related:
- /influxdb/cloud-dedicated/query-data/sql/
- /influxdb/cloud-dedicated/query-data/influxql/
- /influxdb/cloud-dedicated/query-data/execute-queries/troubleshoot/
- /influxdb/cloud-dedicated/reference/client-libraries/v3/
---
Use the following tools to help you identify performance bottlenecks and troubleshoot problems in queries:
<!-- TOC -->
- [EXPLAIN and ANALYZE](#explain-and-analyze)
- [Enable trace logging](#enable-trace-logging)
- [Avoid unnecessary tracing](#avoid-unnecessary-tracing)
- [Syntax](#syntax)
- [Example](#example)
- [Tracing response header](#tracing-response-header)
- [Trace response header syntax](#trace-response-header-syntax)
- [Inspect Flight response headers](#inspect-flight-response-headers)
- [Retrieve query information](#retrieve-query-information)
<!-- /TOC -->
## EXPLAIN and ANALYZE
To view the query engine's execution plan and metrics for an SQL or InfluxQL query, prepend [`EXPLAIN`](/influxdb/cloud-dedicated/reference/sql/explain/) or [`EXPLAIN ANALYZE`](/influxdb/cloud-dedicated/reference/sql/explain/#explain-analyze) to the query.
The report can reveal query bottlenecks such as a large number of table scans or parquet files, and can help triage the question, "Is the query slow due to the amount of work required or due to a problem with the schema, compactor, etc.?"
The following example shows how to use the InfluxDB v3 Python client library and pandas to view `EXPLAIN` and `EXPLAIN ANALYZE` results for a query:
<!-- Import for tests and hide from users.
```python
import os
```
-->
{{% code-placeholders "DATABASE_(NAME|TOKEN)" %}}
<!--pytest-codeblocks:cont-->
```python
from influxdb_client_3 import InfluxDBClient3
import pandas as pd
import tabulate # Required for pandas.to_markdown()
# Instantiate an InfluxDB client.
client = InfluxDBClient3(token = f"DATABASE_TOKEN",
host = f"{{< influxdb/host >}}",
database = f"DATABASE_NAME")
sql_explain = '''EXPLAIN
SELECT temp
FROM home
WHERE time >= now() - INTERVAL '90 days'
AND room = 'Kitchen'
ORDER BY time'''
table = client.query(sql_explain)
df = table.to_pandas()
print(df.to_markdown(index=False))
assert df.shape == (2, 2), f'Expect 2 rows and 2 columns, got {df.shape}'
assert 'physical_plan' in df.plan_type.values, "Expect physical_plan"
assert 'logical_plan' in df.plan_type.values, "Expect logical_plan"
```
{{< expand-wrapper >}}
{{% expand "View EXPLAIN example results" %}}
| plan_type | plan |
|:--------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| logical_plan | Projection: home.temp |
| | Sort: home.time ASC NULLS LAST |
| | Projection: home.temp, home.time |
| | TableScan: home projection=[room, temp, time], full_filters=[home.time >= TimestampNanosecond(1688676582918581320, None), home.room = Dictionary(Int32, Utf8("Kitchen"))] |
| physical_plan | ProjectionExec: expr=[temp@0 as temp] |
| | SortExec: expr=[time@1 ASC NULLS LAST] |
| | EmptyExec: produce_one_row=false |
{{% /expand %}}
{{< /expand-wrapper >}}
<!--pytest-codeblocks:cont-->
```python
sql_explain_analyze = '''EXPLAIN ANALYZE
SELECT *
FROM home
WHERE time >= now() - INTERVAL '90 days'
ORDER BY time'''
table = client.query(sql_explain_analyze)
df = table.to_pandas()
print(df.to_markdown(index=False))
assert df.shape == (1,2)
assert 'Plan with Metrics' in df.plan_type.values, "Expect plan metrics"
client.close()
```
{{% /code-placeholders %}}
Replace the following:
- {{% code-placeholder-key %}}`DATABASE_NAME`{{% /code-placeholder-key %}}: your {{% product-name %}} database
- {{% code-placeholder-key %}}`DATABASE_TOKEN`{{% /code-placeholder-key %}}: a [database token](/influxdb/cloud-dedicated/admin/tokens/) with sufficient permissions to the specified database
{{< expand-wrapper >}}
{{% expand "View EXPLAIN ANALYZE example results" %}}
| plan_type | plan |
|:------------------|:-----------------------------------------------------------------------------------------------------------------------|
| Plan with Metrics | ProjectionExec: expr=[temp@0 as temp], metrics=[output_rows=0, elapsed_compute=1ns] |
| | SortExec: expr=[time@1 ASC NULLS LAST], metrics=[output_rows=0, elapsed_compute=1ns, spill_count=0, spilled_bytes=0] |
| | EmptyExec: produce_one_row=false, metrics=[]
{{% /expand %}}
{{< /expand-wrapper >}}
## Enable trace logging
When you enable trace logging for a query, InfluxDB propagates your _trace ID_ through system processes and collects additional log information.
InfluxDB Support can then use the trace ID that you provide to filter, collate, and analyze log information for the query run.
The tracing system follows the [OpenTelemetry traces](https://opentelemetry.io/docs/concepts/signals/traces/) model for providing observability into a request.
{{% warn %}}
#### Avoid unnecessary tracing
Only enable tracing for a query when you need to request troubleshooting help from InfluxDB Support.
To manage resources, InfluxDB has an upper limit for the number of trace requests.
Too many traces can cause InfluxDB to evict log information.
{{% /warn %}}
To enable tracing for a query, include the `influx-trace-id` header in your query request.
### Syntax
Use the following syntax for the `influx-trace-id` header:
```http
influx-trace-id: TRACE_ID:1112223334445:0:1
```
In the header value, replace the following:
- `TRACE_ID`: a unique string, 8-16 bytes long, encoded as hexadecimal (32 maximum hex characters).
The trace ID should uniquely identify the query run.
- `:1112223334445:0:1`: InfluxDB constant values (required, but ignored)
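As a minimal sketch using only Python's standard library, you can assemble a valid header value like this (the client examples below build the same value inside the query code):

```python
import secrets

# Generate a random 8-byte trace ID and encode it as hexadecimal.
trace_id = secrets.token_bytes(8).hex()

# Append the required InfluxDB constant values to form the header value.
header_value = f"{trace_id}:1112223334445:0:1"
print(header_value)
```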
### Example
The following examples show how to create and pass a trace ID to enable query tracing in InfluxDB:
{{< tabs-wrapper >}}
{{% tabs %}}
[Python with FlightCallOptions](#)
[Python with FlightClientMiddleware](#python-with-flightclientmiddleware)
{{% /tabs %}}
{{% tab-content %}}
<!---- BEGIN PYTHON WITH FLIGHTCALLOPTIONS ---->
Use the `InfluxDBClient3` InfluxDB Python client and pass the `headers` argument in the
`query()` method.
<!-- Import for tests and hide from users.
```python
import os
```
-->
{{% code-placeholders "DATABASE_(NAME|TOKEN)|APP_REQUEST_ID" %}}
<!--pytest-codeblocks:cont-->
```python
from influxdb_client_3 import InfluxDBClient3
import secrets
def use_flightcalloptions_trace_header():
print('# Use FlightCallOptions to enable tracing.')
client = InfluxDBClient3(token=f"DATABASE_TOKEN",
host=f"{{< influxdb/host >}}",
database=f"DATABASE_NAME")
# Generate a trace ID for the query:
# 1. Generate a random 8-byte value as bytes.
# 2. Encode the value as hexadecimal.
random_bytes = secrets.token_bytes(8)
trace_id = random_bytes.hex()
# Append required constants to the trace ID.
trace_value = f"{trace_id}:1112223334445:0:1"
# Encode the header key and value as bytes.
# Create a list of header tuples.
    headers = [(b"influx-trace-id", trace_value.encode('utf-8'))]
sql = "SELECT * FROM home WHERE time >= now() - INTERVAL '30 days'"
influxql = "SELECT * FROM home WHERE time >= -90d"
# Use the query() headers argument to pass the list as FlightCallOptions.
client.query(sql, headers=headers)
client.close()
use_flightcalloptions_trace_header()
```
{{% /code-placeholders %}}
<!---- END PYTHON WITH FLIGHTCALLOPTIONS ---->
{{% /tab-content %}}
{{% tab-content %}}
<!---- BEGIN PYTHON WITH MIDDLEWARE ---->
Use the `InfluxDBClient3` InfluxDB Python client and `flight.ClientMiddleware` to pass and inspect headers.
### Tracing response header
With tracing enabled and a valid trace ID in the request, InfluxDB's `DoGet` action response contains a header with the trace ID that you sent.
#### Trace response header syntax
```http
trace-id: TRACE_ID
```
### Inspect Flight response headers
To inspect Flight response headers when using a client library, pass a `FlightClientMiddleware` instance that defines a middleware callback function for the `onHeadersReceived` event (the particular function name you use depends on the client library language).
The following example uses Python client middleware that adds request headers and extracts the trace ID from the `DoGet` response headers:
<!-- Import for tests and hide from users.
```python
import os
```
-->
{{% code-placeholders "DATABASE_(NAME|TOKEN)|APP_REQUEST_ID" %}}
<!--pytest-codeblocks:cont-->
```python
import pyarrow.flight as flight
class TracingClientMiddleWareFactory(flight.ClientMiddleware):
# Defines a custom middleware factory that returns a middleware instance.
def __init__(self):
self.request_headers = []
self.response_headers = []
self.traces = []
def addRequestHeader(self, header):
self.request_headers.append(header)
def addResponseHeader(self, header):
self.response_headers.append(header)
def addTrace(self, traceid):
self.traces.append(traceid)
def createTrace(self, traceid):
# Append InfluxDB constants to the trace ID.
trace = f"{traceid}:1112223334445:0:1"
# To the list of request headers,
# add a tuple with the header key and value as bytes.
self.addRequestHeader((b"influx-trace-id", trace.encode('utf-8')))
def start_call(self, info):
return TracingClientMiddleware(info.method, self)
class TracingClientMiddleware(flight.ClientMiddleware):
# Defines middleware with client event callback methods.
def __init__(self, method, callback_obj):
self._method = method
self.callback = callback_obj
def call_completed(self, exception):
print('callback: call_completed')
if(exception):
print(f" ...with exception: {exception}")
def sending_headers(self):
print('callback: sending_headers: ', self.callback.request_headers)
if len(self.callback.request_headers) > 0:
return dict(self.callback.request_headers)
def received_headers(self, headers):
self.callback.addResponseHeader(headers)
# For the DO_GET action, extract the trace ID from the response headers.
if str(self._method) == "FlightMethod.DO_GET" and "trace-id" in headers:
trace_id = headers["trace-id"][0]
self.callback.addTrace(trace_id)
from influxdb_client_3 import InfluxDBClient3
import secrets
def use_middleware_trace_header():
print('# Use Flight client middleware to enable tracing.')
# Instantiate the middleware.
res = TracingClientMiddleWareFactory()
# Instantiate the client, passing in the middleware instance that provides
# event callbacks for the request.
client = InfluxDBClient3(token=f"DATABASE_TOKEN",
host=f"{{< influxdb/host >}}",
database=f"DATABASE_NAME",
flight_client_options={"middleware": (res,)})
# Generate a trace ID for the query:
# 1. Generate a random 8-byte value as bytes.
# 2. Encode the value as hexadecimal.
random_bytes = secrets.token_bytes(8)
trace_id = random_bytes.hex()
res.createTrace(trace_id)
sql = "SELECT * FROM home WHERE time >= now() - INTERVAL '30 days'"
client.query(sql)
client.close()
assert trace_id in res.traces[0], "Expect trace ID in DoGet response."
use_middleware_trace_header()
```
{{% /code-placeholders %}}
<!---- END PYTHON WITH MIDDLEWARE ---->
{{% /tab-content %}}
{{< /tabs-wrapper >}}
Replace the following:
- {{% code-placeholder-key %}}`DATABASE_NAME`{{% /code-placeholder-key %}}: your {{% product-name %}} database
- {{% code-placeholder-key %}}`DATABASE_TOKEN`{{% /code-placeholder-key %}}: a [database token](/influxdb/cloud-dedicated/admin/tokens/) with sufficient permissions to the specified database
{{% note %}}
Store or log your query trace ID to ensure you can provide it to InfluxDB Support for troubleshooting.
{{% /note %}}
After you run your query with tracing enabled, do the following:
- Remove the tracing header from subsequent runs of the query (to [avoid unnecessary tracing](#avoid-unnecessary-tracing)).
- Provide the trace ID in a request to InfluxDB Support.
## Retrieve query information
In addition to the SQL standard `information_schema`, {{% product-name %}} contains _system_ tables that provide access to
InfluxDB-specific information.
The information in each system table is scoped to the namespace you're querying;
you can only retrieve system information for that particular instance.
To get information about queries you've run on the current instance, use SQL to query the [`system.queries` table](/influxdb/cloud-dedicated/reference/internals/system-tables/#systemqueries-measurement), which contains information from the querier instance currently handling queries.
If you [enabled trace logging for the query](#enable-trace-logging), the `trace-id` appears in the `system.queries.trace_id` column for the query.
The `system.queries` table is an InfluxDB v3 **debug feature**.
To enable the feature and query `system.queries`, include an `"iox-debug"` header set to `"true"` and use SQL to query the table.
The following sample code shows how to use the Python client library to do the following:
1. Enable tracing for a query.
2. Retrieve the trace ID record from `system.queries`.
<!-- Import for tests and hide from users.
```python
import os
```
-->
{{% code-placeholders "DATABASE_(NAME|TOKEN)|APP_REQUEST_ID" %}}
<!--pytest-codeblocks:cont-->
```python
from influxdb_client_3 import InfluxDBClient3
import secrets
import pandas
def get_query_information():
print('# Get query information')
client = InfluxDBClient3(token = f"DATABASE_TOKEN",
host = f"{{< influxdb/host >}}",
database = f"DATABASE_NAME")
random_bytes = secrets.token_bytes(16)
trace_id = random_bytes.hex()
trace_value = (f"{trace_id}:1112223334445:0:1").encode('utf-8')
sql = "SELECT * FROM home WHERE time >= now() - INTERVAL '30 days'"
try:
client.query(sql, headers=[(b'influx-trace-id', trace_value)])
client.close()
except Exception as e:
print("Query error: ", e)
client = InfluxDBClient3(token = f"DATABASE_TOKEN",
host = f"{{< influxdb/host >}}",
database = f"DATABASE_NAME")
import time
df = pandas.DataFrame()
for i in range(0, 5):
time.sleep(1)
# Use SQL
# To query the system.queries table for your trace ID, pass the following:
# - the iox-debug: true request header
# - an SQL query for the trace_id column
reader = client.query(f'''SELECT compute_duration, query_type, query_text,
success, trace_id
FROM system.queries
WHERE issue_time >= now() - INTERVAL '1 day'
AND trace_id = '{trace_id}'
ORDER BY issue_time DESC
''',
headers=[(b"iox-debug", b"true")],
mode="reader")
df = reader.read_all().to_pandas()
if df.shape[0]:
break
  assert df.shape == (1, 5), "Expect a row for the query trace ID."
print(df)
get_query_information()
```
{{% /code-placeholders %}}
The output is similar to the following:
```text
compute_duration query_type query_text success trace_id
0 days sql SELECT compute_duration, quer... True 67338...
```


@@ -24,6 +24,7 @@ Learn how to handle responses and troubleshoot errors encountered when querying
- [Internal Error: Received RST_STREAM](#internal-error-received-rst_stream)
- [Internal Error: stream terminated by RST_STREAM with NO_ERROR](#internal-error-stream-terminated-by-rst_stream-with-no_error)
- [Invalid Argument: Invalid ticket](#invalid-argument-invalid-ticket)
- [Timeout: Deadline exceeded](#timeout-deadline-exceeded)
- [Unauthenticated: Unauthenticated](#unauthenticated-unauthenticated)
- [Unauthorized: Permission denied](#unauthorized-permission-denied)
- [FlightUnavailableError: Could not get default pem root certs](#flightunavailableerror-could-not-get-default-pem-root-certs)
@@ -80,7 +81,8 @@ SELECT co, delete, hum, room, temp, time
The Python client library outputs the following schema representation:
```py
<!--pytest.mark.skip-->
```python
Schema:
co: int64
-- field metadata --
@@ -175,7 +177,7 @@ _For a list of gRPC codes that servers and clients may return, see [Status codes
**Example**:
```sh
```structuredtext
Flight returned internal error, with message: Received RST_STREAM with error code 2. gRPC client debug context: UNKNOWN:Error received from peer ipv4:34.196.233.7:443 {grpc_message:"Received RST_STREAM with error code 2"}
```
@@ -192,11 +194,12 @@ Flight returned internal error, with message: Received RST_STREAM with error cod
**Example**:
<!--pytest.mark.skip-->
```sh
pyarrow._flight.FlightInternalError: Flight returned internal error, with message: stream terminated by RST_STREAM with error code: NO_ERROR. gRPC client debug context: UNKNOWN:Error received from peer ipv4:3.123.149.45:443 {created_time:"2023-07-26T14:12:44.992317+02:00", grpc_status:13, grpc_message:"stream terminated by RST_STREAM with error code: NO_ERROR"}. Client context: OK
```
**Potential Reasons**:
**Potential reasons**:
- The server terminated the stream, but there wasn't any specific error associated with it.
- Possible network disruption, even if it's temporary.
@@ -208,21 +211,35 @@ pyarrow._flight.FlightInternalError: Flight returned internal error, with messag
**Example**:
<!--pytest.mark.skip-->
```sh
pyarrow.lib.ArrowInvalid: Flight returned invalid argument error, with message: Invalid ticket. Error: Invalid ticket. gRPC client debug context: UNKNOWN:Error received from peer ipv4:54.158.68.83:443 {created_time:"2023-08-31T17:56:42.909129-05:00", grpc_status:3, grpc_message:"Invalid ticket. Error: Invalid ticket"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
**Potential Reasons**:
**Potential reasons**:
- The request is missing the database name or some other required metadata value.
- The request contains bad query syntax.
<!-- END -->
#### Timeout: Deadline exceeded
<!--pytest.mark.skip-->
```sh
pyarrow._flight.FlightTimedOutError: Flight returned timeout error, with message: Deadline Exceeded. gRPC client debug context: UNKNOWN:Deadline Exceeded {grpc_status:4, created_time:"2023-09-27T15:30:58.540385-05:00"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
**Potential reasons**:
- The server's response time exceeded the number of seconds allowed by the client.
See how to specify `timeout` in [FlightCallOptions](https://arrow.apache.org/docs/python/generated/pyarrow.flight.FlightCallOptions.html#pyarrow.flight.FlightCallOptions).
#### Unauthenticated: Unauthenticated
**Example**:
<!--pytest.mark.skip-->
```sh
Flight returned unauthenticated error, with message: unauthenticated. gRPC client debug context: UNKNOWN:Error received from peer ipv4:34.196.233.7:443 {grpc_message:"unauthenticated", grpc_status:16, created_time:"2023-08-28T15:38:33.380633-05:00"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
@@ -238,6 +255,7 @@ Flight returned unauthenticated error, with message: unauthenticated. gRPC clien
**Example**:
<!--pytest.mark.skip-->
```sh
pyarrow._flight.FlightUnauthorizedError: Flight returned unauthorized error, with message: Permission denied. gRPC client debug context: UNKNOWN:Error received from peer ipv4:54.158.68.83:443 {grpc_message:"Permission denied", grpc_status:7, created_time:"2023-08-31T17:51:08.271009-05:00"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
@@ -254,6 +272,7 @@ pyarrow._flight.FlightUnauthorizedError: Flight returned unauthorized error, wit
If unable to locate a root certificate for _gRPC+TLS_, the Flight client returns errors similar to the following:
<!--pytest.mark.skip-->
```sh
UNKNOWN:Failed to load file... filename:"/usr/share/grpc/roots.pem",
children:[UNKNOWN:No such file or directory


@@ -16,34 +16,68 @@ related:
InfluxDB system measurements contain time series data used by and generated from the
InfluxDB internal monitoring system.
Each InfluxDB Cloud Dedicated namespace includes the following system measurements:
Each {{% product-name %}} namespace includes the following system measurements:
- [queries](#_queries-system-measurement)
<!-- TOC -->
## queries system measurement
- [system.queries measurement](#systemqueries-measurement)
- [system.queries schema](#systemqueries-schema)
## system.queries measurement
The `system.queries` measurement stores log entries for queries executed for the provided namespace (database) on the node that is currently handling queries.
The following example shows how to list queries recorded in the `system.queries` measurement:
```python
from influxdb_client_3 import InfluxDBClient3
client = InfluxDBClient3(token = DATABASE_TOKEN,
host = HOSTNAME,
org = '',
database=DATABASE_NAME)
client.query('select * from home')
reader = client.query('''
SELECT *
FROM system.queries
WHERE issue_time >= now() - INTERVAL '1 day'
AND query_text LIKE '%select * from home%'
''',
language='sql',
headers=[(b"iox-debug", b"true")],
mode="reader")
print("# system.queries schema\n")
print(reader.schema)
```
```sql
SELECT issue_time, query_type, query_text, success FROM system.queries;
<!--pytest-codeblocks:expected-output-->
`system.queries` has the following schema:
```python
# system.queries schema
issue_time: timestamp[ns] not null
query_type: string not null
query_text: string not null
completed_duration: duration[ns]
success: bool not null
trace_id: string
```
_When listing measurements (tables) available within a namespace, some clients and query tools may include the `queries` table in the list of namespace tables._
`system.queries` reflects a process-local, in-memory, namespace-scoped query log.
The query log isn't shared across instances within the same deployment.
While this table may be useful for debugging and monitoring queries, keep the following in mind:
- Records stored in `system.queries` are volatile.
- Records are lost on pod restarts.
- Queries for one namespace can evict records from another namespace.
- Data reflects the state of a specific pod answering queries for the namespace.
- Data reflects the state of a specific pod answering queries for the namespace; the log view is scoped to the requesting namespace and queries aren't leaked across namespaces.
- A query for records in `system.queries` can return different results depending on the pod the request was routed to.
**Data retention:** System data can be transient and is deleted on pod restarts.
The log size per instance is limited and the log view is scoped to the requesting namespace.
### queries measurement schema
### system.queries schema
- **system.queries** _(measurement)_
- **fields**:


@@ -26,6 +26,7 @@ related:
- /influxdb/cloud-serverless/query-data/sql/
- /influxdb/cloud-serverless/reference/influxql/
- /influxdb/cloud-serverless/reference/sql/
- /influxdb/cloud-serverless/query-data/execute-queries/troubleshoot/
list_code_example: |
```py
@@ -33,7 +34,7 @@ list_code_example: |
# Instantiate an InfluxDB client
client = InfluxDBClient3(
host='cloud2.influxdata.com',
host='{{< influxdb/host >}}',
token='DATABASE_TOKEN',
database='DATABASE_NAME'
)
@@ -306,7 +307,7 @@ and specify the following arguments:
#### Example {#execute-query-example}
The following examples show how to use SQL or InfluxQL to select all fields in a measurement, and then output the results formatted as a Markdown table.
The following example shows how to use SQL or InfluxQL to select all fields in a measurement, and then use PyArrow functions to extract metadata and aggregate data.
{{% code-tabs-wrapper %}}
{{% code-tabs %}}


@@ -0,0 +1,106 @@
---
title: Optimize queries
description: >
Optimize your SQL and InfluxQL queries to improve performance and reduce their memory and compute (CPU) requirements.
weight: 401
menu:
influxdb_cloud_serverless:
name: Optimize queries
parent: Execute queries
influxdb/cloud-serverless/tags: [query, sql, influxql]
related:
- /influxdb/cloud-serverless/query-data/sql/
- /influxdb/cloud-serverless/query-data/influxql/
- /influxdb/cloud-serverless/query-data/execute-queries/troubleshoot/
- /influxdb/cloud-serverless/reference/client-libraries/v3/
---
## Troubleshoot query performance
Use the following tools to help you identify performance bottlenecks and troubleshoot problems in queries:
<!-- TOC -->
- [Troubleshoot query performance](#troubleshoot-query-performance)
- [EXPLAIN and ANALYZE](#explain-and-analyze)
- [Enable trace logging](#enable-trace-logging)
<!-- /TOC -->
### EXPLAIN and ANALYZE
To view the query engine's execution plan and metrics for an SQL query, prepend [`EXPLAIN`](/influxdb/cloud-serverless/reference/sql/explain/) or [`EXPLAIN ANALYZE`](/influxdb/cloud-serverless/reference/sql/explain/#explain-analyze) to the query.
The report can reveal query bottlenecks such as a large number of table scans or parquet files, and can help triage the question, "Is the query slow due to the amount of work required or due to a problem with the schema, compactor, etc.?"
The following example shows how to use the InfluxDB v3 Python client library and pandas to view `EXPLAIN` and `EXPLAIN ANALYZE` results for a query:
<!-- Import for tests and hide from users.
```python
import os
```
-->
<!--pytest-codeblocks:cont-->
{{% code-placeholders "BUCKET_NAME|API_TOKEN|APP_REQUEST_ID" %}}
```python
from influxdb_client_3 import InfluxDBClient3
import pandas as pd
import tabulate # Required for pandas.to_markdown()
def explain_and_analyze():
print('Use SQL EXPLAIN and ANALYZE to view query plan information.')
# Instantiate an InfluxDB client.
client = InfluxDBClient3(token = f"API_TOKEN",
host = f"{{< influxdb/host >}}",
database = f"BUCKET_NAME")
sql_explain = '''EXPLAIN SELECT *
FROM home
WHERE time >= now() - INTERVAL '90 days'
ORDER BY time'''
table = client.query(sql_explain)
df = table.to_pandas()
sql_explain_analyze = '''EXPLAIN ANALYZE SELECT *
FROM home
WHERE time >= now() - INTERVAL '90 days'
ORDER BY time'''
table = client.query(sql_explain_analyze)
# Combine the Dataframes and output the plan information.
df = pd.concat([df, table.to_pandas()])
assert df.shape == (3, 2) and df.columns.to_list() == ['plan_type', 'plan']
print(df[['plan_type', 'plan']].to_markdown(index=False))
client.close()
explain_and_analyze()
```
{{% /code-placeholders %}}
Replace the following:
- {{% code-placeholder-key %}}`BUCKET_NAME`{{% /code-placeholder-key %}}: your {{% product-name %}} database
- {{% code-placeholder-key %}}`API_TOKEN`{{% /code-placeholder-key %}}: a [database token](/influxdb/cloud-serverless/admin/tokens/) with sufficient permissions to the specified database
The output is similar to the following:
```markdown
| plan_type | plan |
|:------------------|:---------------------------------------------------------------------------------------------------------------------------------------------|
| logical_plan | Sort: home.time ASC NULLS LAST |
| | TableScan: home projection=[co, hum, room, sensor, temp, time], full_filters=[home.time >= TimestampNanosecond(1688491380936276013, None)] |
| physical_plan | SortExec: expr=[time@5 ASC NULLS LAST] |
| | EmptyExec: produce_one_row=false |
| Plan with Metrics | SortExec: expr=[time@5 ASC NULLS LAST], metrics=[output_rows=0, elapsed_compute=1ns, spill_count=0, spilled_bytes=0] |
| | EmptyExec: produce_one_row=false, metrics=[]
```
### Enable trace logging
Customers with an {{% product-name %}} [annual or support contract](https://www.influxdata.com/influxdb-cloud-pricing/) can [contact InfluxData Support](https://support.influxdata.com/) to enable tracing and request help troubleshooting your query.
With tracing enabled, InfluxDB Support can trace system processes and analyze log information for a query instance.
The tracing system follows the [OpenTelemetry traces](https://opentelemetry.io/docs/concepts/signals/traces/) model for providing observability into a request.


@@ -197,7 +197,7 @@ Flight returned internal error, with message: Received RST_STREAM with error cod
pyarrow._flight.FlightInternalError: Flight returned internal error, with message: stream terminated by RST_STREAM with error code: NO_ERROR. gRPC client debug context: UNKNOWN:Error received from peer ipv4:3.123.149.45:443 {created_time:"2023-07-26T14:12:44.992317+02:00", grpc_status:13, grpc_message:"stream terminated by RST_STREAM with error code: NO_ERROR"}. Client context: OK
```
**Potential Reasons**:
**Potential reasons**:
- The server terminated the stream, but there wasn't any specific error associated with it.
- Possible network disruption, even if it's temporary.
@@ -213,7 +213,7 @@ pyarrow._flight.FlightInternalError: Flight returned internal error, with messag
ArrowInvalid: Flight returned invalid argument error, with message: bucket "otel5" not found. gRPC client debug context: UNKNOWN:Error received from peer ipv4:3.123.149.45:443 {grpc_message:"bucket \"otel5\" not found", grpc_status:3, created_time:"2023-08-09T16:37:30.093946+01:00"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
**Potential reasons**:
- The specified bucket doesn't exist.
@@ -227,7 +227,7 @@ ArrowInvalid: Flight returned invalid argument error, with message: bucket "otel
pyarrow.lib.ArrowInvalid: Flight returned invalid argument error, with message: Invalid ticket. Error: Invalid ticket. gRPC client debug context: UNKNOWN:Error received from peer ipv4:54.158.68.83:443 {created_time:"2023-08-31T17:56:42.909129-05:00", grpc_status:3, grpc_message:"Invalid ticket. Error: Invalid ticket"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
**Potential reasons**:
- The request is missing the bucket name or some other required metadata value.
- The request contains bad query syntax.


@@ -0,0 +1,115 @@
---
title: Optimize queries
description: >
Optimize your SQL and InfluxQL queries to improve performance and reduce their memory and compute (CPU) requirements.
weight: 401
menu:
influxdb_clustered:
name: Optimize queries
parent: Execute queries
influxdb/clustered/tags: [query, sql, influxql]
related:
- /influxdb/clustered/query-data/sql/
- /influxdb/clustered/query-data/influxql/
- /influxdb/clustered/query-data/execute-queries/troubleshoot/
- /influxdb/clustered/reference/client-libraries/v3/
---
Use the following tools to help you identify performance bottlenecks and troubleshoot problems in queries:
<!-- TOC -->
- [EXPLAIN and ANALYZE](#explain-and-analyze)
<!-- /TOC -->
### EXPLAIN and ANALYZE
To view the query engine's execution plan and metrics for an SQL query, prepend [`EXPLAIN`](/influxdb/clustered/reference/sql/explain/) or [`EXPLAIN ANALYZE`](/influxdb/clustered/reference/sql/explain/#explain-analyze) to the query.
The report can reveal query bottlenecks, such as a large number of table scans or Parquet files, and can help answer the question, "Is the query slow because of the amount of work required, or because of a problem with the schema, compactor, etc.?"
The following example shows how to use the InfluxDB v3 Python client library and pandas to view `EXPLAIN` and `EXPLAIN ANALYZE` results for a query:
<!-- Import for tests and hide from users.
```python
import os
```
-->
{{% code-placeholders "DATABASE_(NAME|TOKEN)" %}}
<!--pytest-codeblocks:cont-->
```python
from influxdb_client_3 import InfluxDBClient3
import pandas as pd
import tabulate # Required for pandas.to_markdown()
# Instantiate an InfluxDB client.
client = InfluxDBClient3(token=f"DATABASE_TOKEN",
                         host=f"{{< influxdb/host >}}",
                         database=f"DATABASE_NAME")
sql_explain = '''EXPLAIN
SELECT temp
FROM home
WHERE time >= now() - INTERVAL '90 days'
AND room = 'Kitchen'
ORDER BY time'''
table = client.query(sql_explain)
df = table.to_pandas()
print(df.to_markdown(index=False))
assert df.shape == (2, 2), f'Expect 2 rows and 2 columns; got {df.shape}'
assert 'physical_plan' in df.plan_type.values, "Expect physical_plan"
assert 'logical_plan' in df.plan_type.values, "Expect logical_plan"
```
{{< expand-wrapper >}}
{{% expand "View EXPLAIN example results" %}}
| plan_type | plan |
|:--------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| logical_plan | Projection: home.temp |
| | Sort: home.time ASC NULLS LAST |
| | Projection: home.temp, home.time |
| | TableScan: home projection=[room, temp, time], full_filters=[home.time >= TimestampNanosecond(1688676582918581320, None), home.room = Dictionary(Int32, Utf8("Kitchen"))] |
| physical_plan | ProjectionExec: expr=[temp@0 as temp] |
| | SortExec: expr=[time@1 ASC NULLS LAST] |
| | EmptyExec: produce_one_row=false |
{{% /expand %}}
{{< /expand-wrapper >}}
<!--pytest-codeblocks:cont-->
```python
sql_explain_analyze = '''EXPLAIN ANALYZE
SELECT *
FROM home
WHERE time >= now() - INTERVAL '90 days'
ORDER BY time'''
table = client.query(sql_explain_analyze)
df = table.to_pandas()
print(df.to_markdown(index=False))
assert df.shape == (1, 2), f'Expect 1 row and 2 columns; got {df.shape}'
assert 'Plan with Metrics' in df.plan_type.values, "Expect plan metrics"
client.close()
```
{{% /code-placeholders %}}
Replace the following:
- {{% code-placeholder-key %}}`DATABASE_NAME`{{% /code-placeholder-key %}}: your {{% product-name %}} database
- {{% code-placeholder-key %}}`DATABASE_TOKEN`{{% /code-placeholder-key %}}: a [database token](/influxdb/clustered/admin/tokens/) with sufficient permissions to the specified database
{{< expand-wrapper >}}
{{% expand "View EXPLAIN ANALYZE example results" %}}
| plan_type | plan |
|:------------------|:-----------------------------------------------------------------------------------------------------------------------|
| Plan with Metrics | ProjectionExec: expr=[temp@0 as temp], metrics=[output_rows=0, elapsed_compute=1ns] |
| | SortExec: expr=[time@1 ASC NULLS LAST], metrics=[output_rows=0, elapsed_compute=1ns, spill_count=0, spilled_bytes=0] |
| | EmptyExec: produce_one_row=false, metrics=[] |
{{% /expand %}}
{{< /expand-wrapper >}}
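If you run `EXPLAIN` regularly, you can also inspect the plan text programmatically for operators that indicate heavy work. The following is a minimal sketch; `plan_signals` is a hypothetical helper, not part of the client library, and the operator names are taken from the example plans above:

```python
def plan_signals(plan_text):
    """Count plan operators that often indicate heavy work in a query plan.

    Many TableScan operators, or ParquetExec operators spanning many files,
    can point to a query scanning more data than expected.
    """
    return {
        "table_scans": plan_text.count("TableScan"),
        "parquet_execs": plan_text.count("ParquetExec"),
    }

# Example: join the `plan` column of an EXPLAIN result DataFrame into one
# string, then scan it:
#   signals = plan_signals("\n".join(df["plan"].astype(str)))
```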


@@ -197,7 +197,7 @@ Flight returned internal error, with message: Received RST_STREAM with error cod
pyarrow._flight.FlightInternalError: Flight returned internal error, with message: stream terminated by RST_STREAM with error code: NO_ERROR. gRPC client debug context: UNKNOWN:Error received from peer ipv4:3.123.149.45:443 {created_time:"2023-07-26T14:12:44.992317+02:00", grpc_status:13, grpc_message:"stream terminated by RST_STREAM with error code: NO_ERROR"}. Client context: OK
```
**Potential reasons**:
- The server terminated the stream, but there wasn't any specific error associated with it.
- Possible network disruption, even if it's temporary.
@@ -213,7 +213,7 @@ pyarrow._flight.FlightInternalError: Flight returned internal error, with messag
pyarrow.lib.ArrowInvalid: Flight returned invalid argument error, with message: Invalid ticket. Error: Invalid ticket. gRPC client debug context: UNKNOWN:Error received from peer ipv4:54.158.68.83:443 {created_time:"2023-08-31T17:56:42.909129-05:00", grpc_status:3, grpc_message:"Invalid ticket. Error: Invalid ticket"}. Client context: IOError: Server never sent a data message. Detail: Internal
```
**Potential reasons**:
- The request is missing the database name or some other required metadata value.
- The request contains bad query syntax.


@@ -1,5 +1,6 @@
## Code sample dependencies
# Temporary fork for passing headers in query options.
influxdb3-python @ git+https://github.com/jstirnaman/influxdb3-python@4abd41c710e79f85333ba81258b10daff54d05b0
pandas
## Tabulate for printing pandas DataFrames.
tabulate