added details to scalar values guide, updated glossary, resolves #512

pull/592/head
Scott Anderson 2019-11-07 16:24:25 -07:00
parent 84d21aee34
commit 34deb8a392
2 changed files with 223 additions and 22 deletions

View File

@ -10,9 +10,167 @@ weight: 210
v2.0/tags: [scalar]
---
placeholder
Use Flux [stream and table functions](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/)
to extract scalar values from Flux query output for things like dynamically setting
variables using query results.
**To extract scalar values from output:**
1. [Extract a table](#extract-a-table)
2. Use the extracted table and [extract an array of column values](#extract-a-column-from-the-table)
or [extract a row as an object](#extract-a-row-from-the-table).
_The samples on this page use the [sample data provided below](#sample-data)._
{{% warn %}}
#### Current limitations
- The InfluxDB user interface (UI) does not support raw scalar output, but you can
use [`map()`](/v2.0/reference/flux/stdlib/built-in/transformations/map/) to add
scalar values to data.
- The [Flux REPL ](/v2.0/reference/cli/influx/repl) does not currently support
Flux stream and table functions (also known as "dynamic queries").
See [#15321](https://github.com/influxdata/influxdb/issues/15231).
{{% /warn %}}
## Extract a table
Flux formats query results as a stream of tables.
To extract a scalar value, first first use
[`tableFind()`](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/tablefind/)
to extract a single table from the stream of tables.
{{% note %}}
If query results include only one table, it is still formatted as a stream of tables.
{{% /note %}}
The `tableFind()` extracts the **first** table in a stream of tables whose
[group key](/v2.0/reference/glossary/#group-key) values match the
[predicate function](/v2.0/reference/glossary/#predicate-expression) defined by
the `fn` parameter.
The predicate function expects a `key` object, which represents the group key of
each table.
```js
sampleData
|> tableFind(fn: (key) =>
key._field == "temp" and
key.location == "sfo"
)
```
This returns a single table:
| _time | location | _field | _value |
|:----- |:--------:|:------:| ------:|
| 2019-11-01T12:00:00Z | sfo | temp | 65.1 |
| 2019-11-01T13:00:00Z | sfo | temp | 66.2 |
| 2019-11-01T14:00:00Z | sfo | temp | 66.3 |
| 2019-11-01T15:00:00Z | sfo | temp | 66.8 |
{{% note %}}
#### Minimize the number of input tables
Flux functions do not guarantee the order in which tables are returned.
`tableFind()` simply extracts the first table that matches the `fn` predicate.
To ensure you retrieve the table that includes the data you actually want, filter
and transform your data to minimize the number of tables piped-forward into `tableFind()`.
{{% /note %}}
## Extract a column from the table
Use the [`getColumn()` function](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/getcolumn/)
to output an array of values from a specific column in the extracted table.
```js
sampleData
|> tableFind(fn: (key) =>
key._field == "temp" and
key.location == "sfo"
)
|> getColumn(column: "_value")
// Returns [65.1, 66.2, 66.3, 66.8]
```
### Use extracted column values
Use a variable to store the array of values.
Reference a specific index (integer starting from `0`) in the array to return the
value at that index.
```js
SFOTemps = sampleData
|> tableFind(fn: (key) =>
key._field == "temp" and
key.location == "sfo"
)
|> getColumn(column: "_value")
SFOTemps
// Returns [65.1, 66.2, 66.3, 66.8]
SFOTemps[0]
// Returns 65.1
SFOTemps[2]
// Returns 66.3
```
## Extract a row from the table
Use the [`getRecord()` function](/v2.0/reference/flux/stdlib/built-in/transformations/stream-table/getrecord/)
to output data from a single row in the extracted table.
Specify the index of row to output using the `idx` parameter.
The function outputs an object with key-value pairs for each column.
```js
sampleData
|> tableFind(fn: (key) =>
key._field == "temp" and
key.location == "sfo"
)
|> getRecord(idx: 0)
// Returns {
// _time:2019-11-11T12:00:00Z,
// _field:"temp",
// location:"sfo",
// _value: 65.1
// }
```
### Use an extracted row object
Use a variable to store the extracted row object.
Use [dot notation](/v2.0/query-data/get-started/syntax-basics/#objects) to reference
keys in the object.
```js
tempInfo = sampleData
|> tableFind(fn: (key) =>
key._field == "temp" and
key.location == "sfo"
)
|> getRecord(idx: 0)
tempInfo
// Returns {
// _time:2019-11-11T12:00:00Z,
// _field:"temp",
// location:"sfo",
// _value: 65.1
// }
tempInfo._time
// Returns 2019-11-11T12:00:00Z
tempInfo.location
// Returns sfo
```
## Example helper functions
Use the process of extracting scalar values from query output to create custom
helper functions that do it for you.
##### Extract a scalar field value
```js
// Define a helper function to extract field values
getFieldValue = (tables=<-, field) => {
extract = tables
|> tableFind(fn: (key) => key._field == field)
@ -20,31 +178,69 @@ getFieldValue = (tables=<-, field) => {
return extract[0]
}
getRowObject = (tables=<-, field) => {
lastJFKTemp = sampleData
|> filter(fn: (r) => r.location == "kjfk")
|> last()
|> getFieldValue(field: "temp")
lastJFKTemp
// Returns 71.2
```
##### Extract scalar row data
```js
// Define a helper function to extract a row as an object
getRow = (tables=<-, field, idx=0) => {
extract = tables
|> tableFind(fn: (key) => key._field == field)
|> getRecord(idx: 0)
|> tableFind(fn: (key) => true)
|> getRecord(idx: idx)
return extract
}
n_cpus = from(bucket: "default")
|> range(start: -1h)
|> filter(fn: (r) => r._measurement == "system")
|> getFieldValue(field: "n_cpus")
lastReported = sampleData
|> last()
|> getRow(idx: 0)
row = from(bucket: "default")
|> range(start: -1h)
|> filter(fn: (r) => r._measurement == "system")
|> getRowObject(field: "n_cpus")
"The last location to report was ${lastReported.location}.
The temperature was ${string(v: lastReported._value)}."
from(bucket: "default")
|> range(start: -1h)
|> filter(fn: (r) => r._measurement == "mem" and r._field == "used_percent")
|> keep(columns: ["_time", "_field", "_value"])
|> map(fn: (r) => ({ r with
n_cpus: n_cpus,
rowTime: row._time,
rowCPUs: row._value,
rowField: row._field
}))
// Returns:
// The last location to report was kord.
// The temperature was 38.9.
```
---
## Sample data
The following sample data set represents fictional temperature metrics collected
from three locations.
It's formatted in [annotated CSV](/v2.0/reference/syntax/annotated-csv/) and imported
into the Flux query using the [`csv.from()` function](/v2.0/reference/flux/stdlib/csv/from/).
Place the following at the beginning of your query to use the sample data:
{{% truncate %}}
```js
import "csv"
sampleData = csv.from(csv: "
#datatype,string,long,dateTime:RFC3339,string,string,double
#group,false,true,false,true,true,false
#default,,,,,,
,result,table,_time,location,_field,_value
,,0,2019-11-01T12:00:00Z,sfo,temp,65.1
,,0,2019-11-01T13:00:00Z,sfo,temp,66.2
,,0,2019-11-01T14:00:00Z,sfo,temp,66.3
,,0,2019-11-01T15:00:00Z,sfo,temp,66.8
,,1,2019-11-01T12:00:00Z,kjfk,temp,69.4
,,1,2019-11-01T13:00:00Z,kjfk,temp,69.9
,,1,2019-11-01T14:00:00Z,kjfk,temp,71.0
,,1,2019-11-01T15:00:00Z,kjfk,temp,71.2
,,2,2019-11-01T12:00:00Z,kord,temp,46.4
,,2,2019-11-01T13:00:00Z,kord,temp,46.3
,,2,2019-11-01T14:00:00Z,kord,temp,42.7
,,2,2019-11-01T15:00:00Z,kord,temp,38.9
")
```
{{% /truncate %}}

View File

@ -383,6 +383,11 @@ Elements include a range, major and minor tick marks (within the range), and a p
A diagram that visually depicts the relation between variable quantities measured along specified axes.
### group key
Group keys determine the schema and contents of tables in Flux output.
A group key is a list of columns for which every row in the table has the same value.
Columns with unique values in each row are not part of the group key.
### gzip
gzip is a type of data compression that compress chunks of data, which is restored by unzipping compressed gzip files.