added details to scalar values guide, updated glossary, resolves #512

2019-11-07 16:24:25 -07:00 · 2019-11-07 16:24:25 -07:00 · 34deb8a392
parent 84d21aee34
commit 34deb8a392
2 changed files with 223 additions and 22 deletions
--- a/content/v2.0/query-data/guides/scalar-values.md
+++ b/content/v2.0/query-data/guides/scalar-values.md
@ -10,9 +10,167 @@ weight: 210
 v2.0/tags: [scalar]
 ---

-placeholder
+Use Flux [stream and table functions](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/)
+to extract scalar values from Flux query output for things like dynamically setting
+variables using query results.
+
+**To extract scalar values from output:**
+
+1. [Extract a table](#extract-a-table)
+2. Use the extracted table and [extract an array of column values](#extract-a-column-from-the-table)
+   or [extract a row as an object](#extract-a-row-from-the-table).
+
+_The samples on this page use the [sample data provided below](#sample-data)._
+
+{{% warn %}}
+#### Current limitations
+- The InfluxDB user interface (UI) does not support raw scalar output, but you can
+  use [`map()`](/v2.0/reference/flux/stdlib/built-in/transformations/map/) to add
+  scalar values to data.
+- The [Flux REPL ](/v2.0/reference/cli/influx/repl) does not currently support
+  Flux stream and table functions (also known as "dynamic queries").
+  See [#15321](https://github.com/influxdata/influxdb/issues/15231).
+
+{{% /warn %}}
+
+## Extract a table
+Flux formats query results as a stream of tables.
+To extract a scalar value, first first use
+[`tableFind()`](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/tablefind/)
+to extract a single table from the stream of tables.
+
+{{% note %}}
+If query results include only one table, it is still formatted as a stream of tables.
+{{% /note %}}
+
+The `tableFind()` extracts the **first** table in a stream of tables whose
+[group key](/v2.0/reference/glossary/#group-key) values match the
+[predicate function](/v2.0/reference/glossary/#predicate-expression) defined by
+the `fn` parameter.
+The predicate function expects a `key` object, which represents the group key of
+each table.

 ```js
+sampleData
+  |> tableFind(fn: (key) =>
+      key._field == "temp" and
+      key.location == "sfo"
+  )
+```
+
+This returns a single table:
+
+| _time                | location | _field | _value |
+|:-----                |:--------:|:------:| ------:|
+| 2019-11-01T12:00:00Z | sfo      | temp   | 65.1   |
+| 2019-11-01T13:00:00Z | sfo      | temp   | 66.2   |
+| 2019-11-01T14:00:00Z | sfo      | temp   | 66.3   |
+| 2019-11-01T15:00:00Z | sfo      | temp   | 66.8   |
+
+{{% note %}}
+#### Minimize the number of input tables
+Flux functions do not guarantee the order in which tables are returned.
+`tableFind()` simply extracts the first table that matches the `fn` predicate.
+To ensure you retrieve the table that includes the data you actually want, filter
+and transform your data to minimize the number of tables piped-forward into `tableFind()`.
+{{% /note %}}
+
+## Extract a column from the table
+Use the [`getColumn()` function](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/getcolumn/)
+to output an array of values from a specific column in the extracted table.
+
+
+```js
+sampleData
+  |> tableFind(fn: (key) =>
+      key._field == "temp"  and
+      key.location == "sfo"
+  )
+  |> getColumn(column: "_value")
+
+// Returns [65.1, 66.2, 66.3, 66.8]
+```
+
+### Use extracted column values
+Use a variable to store the array of values.
+Reference a specific index (integer starting from `0`) in the array to return the
+value at that index.
+
+```js
+SFOTemps = sampleData
+  |> tableFind(fn: (key) =>
+      key._field == "temp" and
+      key.location == "sfo"
+  )
+  |> getColumn(column: "_value")
+
+SFOTemps
+// Returns [65.1, 66.2, 66.3, 66.8]
+
+SFOTemps[0]
+// Returns 65.1
+
+SFOTemps[2]
+// Returns 66.3
+```
+
+## Extract a row from the table
+Use the [`getRecord()` function](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/getrecord/)
+to output data from a single row in the extracted table.
+Specify the index of row to output using the `idx` parameter.
+The function outputs an object with key-value pairs for each column.
+
+```js
+sampleData
+  |> tableFind(fn: (key) =>
+      key._field == "temp" and
+      key.location == "sfo"
+  )
+  |> getRecord(idx: 0)
+
+// Returns {
+//   _time:2019-11-11T12:00:00Z,
+//   _field:"temp",
+//   location:"sfo",
+//   _value: 65.1
+// }
+```
+
+### Use an extracted row object
+Use a variable to store the extracted row object.
+Use [dot notation](/v2.0/query-data/get-started/syntax-basics/#objects) to reference
+keys in the object.
+
+```js
+tempInfo = sampleData
+  |> tableFind(fn: (key) =>
+      key._field == "temp" and
+      key.location == "sfo"
+  )
+  |> getRecord(idx: 0)
+
+tempInfo
+// Returns {
+//   _time:2019-11-11T12:00:00Z,
+//   _field:"temp",
+//   location:"sfo",
+//   _value: 65.1
+// }
+
+tempInfo._time
+// Returns 2019-11-11T12:00:00Z
+
+tempInfo.location
+// Returns sfo
+```
+
+## Example helper functions
+Use the process of extracting scalar values from query output to create custom
+helper functions that do it for you.
+
+##### Extract a scalar field value
+```js
+// Define a helper function to extract field values
 getFieldValue = (tables=<-, field) => {
  extract = tables
    |> tableFind(fn: (key) => key._field == field)
@ -20,31 +178,69 @@ getFieldValue = (tables=<-, field) => {
  return extract[0]
 }

-getRowObject = (tables=<-, field) => {
+lastJFKTemp = sampleData
+  |> filter(fn: (r) => r.location == "kjfk")
+  |> last()
+  |> getFieldValue(field: "temp")
+
+lastJFKTemp
+// Returns 71.2
+```
+
+##### Extract scalar row data
+```js
+// Define a helper function to extract a row as an object
+getRow = (tables=<-, field, idx=0) => {
  extract = tables
-    |> tableFind(fn: (key) => key._field == field)
-    |> getRecord(idx: 0)
+    |> tableFind(fn: (key) => true)
+    |> getRecord(idx: idx)
  return extract
 }

-n_cpus = from(bucket: "default")
-  |> range(start: -1h)
-  |> filter(fn: (r) => r._measurement == "system")
-  |> getFieldValue(field: "n_cpus")
+lastReported = sampleData
+  |> last()
+  |> getRow(idx: 0)

-row = from(bucket: "default")
-  |> range(start: -1h)
-  |> filter(fn: (r) => r._measurement == "system")
-  |> getRowObject(field: "n_cpus")
+"The last location to report was ${lastReported.location}.
+The temperature was ${string(v: lastReported._value)}."

-from(bucket: "default")
-  |> range(start: -1h)
-  |> filter(fn: (r) => r._measurement == "mem" and r._field == "used_percent")
-  |> keep(columns: ["_time", "_field", "_value"])
-  |> map(fn: (r) => ({ r with
-    n_cpus: n_cpus,
-    rowTime: row._time,
-    rowCPUs: row._value,
-    rowField: row._field
-  }))
+// Returns:
+// The last location to report was kord.
+// The temperature was 38.9.
 ```
+
+---
+
+## Sample data
+
+The following sample data set represents fictional temperature metrics collected
+from three locations.
+It's formatted in [annotated CSV](/v2.0/reference/syntax/annotated-csv/) and imported
+into the Flux query using the [`csv.from()` function](/v2.0/reference/flux/stdlib/csv/from/).
+
+Place the following at the beginning of your query to use the sample data:
+
+{{% truncate %}}
+```js
+import "csv"
+
+sampleData = csv.from(csv: "
+#datatype,string,long,dateTime:RFC3339,string,string,double
+#group,false,true,false,true,true,false
+#default,,,,,,
+,result,table,_time,location,_field,_value
+,,0,2019-11-01T12:00:00Z,sfo,temp,65.1
+,,0,2019-11-01T13:00:00Z,sfo,temp,66.2
+,,0,2019-11-01T14:00:00Z,sfo,temp,66.3
+,,0,2019-11-01T15:00:00Z,sfo,temp,66.8
+,,1,2019-11-01T12:00:00Z,kjfk,temp,69.4
+,,1,2019-11-01T13:00:00Z,kjfk,temp,69.9
+,,1,2019-11-01T14:00:00Z,kjfk,temp,71.0
+,,1,2019-11-01T15:00:00Z,kjfk,temp,71.2
+,,2,2019-11-01T12:00:00Z,kord,temp,46.4
+,,2,2019-11-01T13:00:00Z,kord,temp,46.3
+,,2,2019-11-01T14:00:00Z,kord,temp,42.7
+,,2,2019-11-01T15:00:00Z,kord,temp,38.9
+")
+```
+{{% /truncate %}}
--- a/content/v2.0/reference/glossary.md
+++ b/content/v2.0/reference/glossary.md
@ -383,6 +383,11 @@ Elements include a range, major and minor tick marks (within the range), and a p

 A diagram that visually depicts the relation between variable quantities measured along specified axes.

+### group key
+Group keys determine the schema and contents of tables in Flux output.
+A group key is a list of columns for which every row in the table has the same value.
+Columns with unique values in each row are not part of the group key.
+
 ### gzip

 gzip is a type of data compression that compress chunks of data, which is restored by unzipping compressed gzip files.