influxdb/docs/testing.md

# Testing

This document covers details that are only relevant if you are developing IOx and running the tests.


## Running the IOx server from source

### Starting the server
You can run IOx locally with a command like this (replacing `--data-dir` with your preferred location)

```shell
cargo run -- run -v --object-store=file --data-dir=$HOME/.influxdb_iox --server-id=42
```

### Loading data
In another terminal window, try loading some data. These commands will create a database called `parquet_db` and load the contents of `tests/fixtures/lineproto/metrics.lp` into it

```shell
cd influxdb_iox
./target/debug/influxdb_iox database create parquet_db
./target/debug/influxdb_iox database write parquet_db tests/fixtures/lineproto/metrics.lp
```

### Editing configuration
You can interactively edit the configuration of the IOx instance with a command like this:

```shell
./scripts/edit_db_rules  localhost:8082 parquet_db
```

Which will bring up your editor with a file that looks like this. Any changes you make to the file will be sent to IOx as its new config.

In this case, these settings will cause data to be persisted to parquet almost immediately

```json
{
  "rules": {
    "name": "parquet_db",
    "partitionTemplate": {
      "parts": [
        {
          "time": "%Y-%m-%d %H:00:00"
        }
      ]
    },
    "lifecycleRules": {
      "bufferSizeSoft": "52428800",
      "bufferSizeHard": "104857600",
      "dropNonPersisted": true,
      "immutable": false,
      "persist": true,
      "workerBackoffMillis": "1000",
      "catalogTransactionsUntilCheckpoint": "100",
      "lateArriveWindowSeconds": 1,
      "persistRowThreshold": "1",
      "persistAgeThresholdSeconds": 1,
      "mubRowThreshold": "1",
      "parquetCacheLimit": "0",
      "maxActiveCompactionsCpuFraction": 1
    },
    "workerCleanupAvgSleep": "500s"
  }
}
```

### Examining Parquet Files
You can use tools such as `parquet-tools` to examine the parquet files created by IOx. For example, the following command will show the contents of the `disk` table when persisted as parquet (note the actual filename will be different):

```shell
parquet-tools meta /Users/alamb/.influxdb_iox/42/parquet_db/data/disk/2020-06-11\ 16\:00\:00/1.4b1a7805-d6de-495e-844b-32fa452147c7.parquet
```


## Object storage

### To run the tests or not run the tests

If you are testing integration with some or all of the object storage options, you'll have more
setup to do.

By default, `cargo test -p object_store` does not run any tests that actually contact
any cloud services: tests that do contact the services will silently pass.

To run integration tests, use `TEST_INTEGRATION=1 cargo test -p object_store`, which will run the
tests that contact the cloud services and fail them if the required environment variables aren't
set.

### Configuration differences when running the tests

When running `influxdb_iox run database`, you can pick one object store to use. When running the tests,
you can run them against all the possible object stores. There's still only one
`INFLUXDB_IOX_BUCKET` variable, though, so that will set the bucket name for all configured object
stores. Use the same bucket name when setting up the different services.

Other than possibly configuring multiple object stores, configuring the tests to use the object
store services is the same as configuring the server to use an object store service. See the output
of `influxdb_iox run database --help` for instructions.

## InfluxDB 2 Client

The `influxdb2_client` crate may be used by people using InfluxDB 2.0 OSS, and should be compatible
with both that and IOx. If you want to run the integration tests for the client against InfluxDB
2.0 OSS, you will need to set `TEST_INTEGRATION=1`.

If you have `docker` in your path, the integration tests for the `influxdb2_client` crate will run
integration tests against `influxd` running in a Docker container.

If you do not want to use Docker locally, but you do have `influxd` for InfluxDB
2.0 locally, you can use that instead by running the tests with the environment variable
`INFLUXDB_IOX_INTEGRATION_LOCAL=1`.

## Kafka Write Buffer

By default, the integration tests for the Kafka-based write buffer are not run. 

In order to run them you must set two environment variables:

* `TEST_INTEGRATION=1`
* `KAFKA_CONNECT` to a host and port where the tests can connect to a running Kafka broker

### Running Kafka Locally

[Redpanda](https://vectorized.io/redpanda/) is a Kafka-compatible broker that can be used to run the tests, and is used
by the CI to test IOx.

Either follow the instructions on the website to install redpanda directly onto your system, or alternatively 
it can be run in a docker container with:

```
docker run -d --pull=always --name=redpanda-1 --rm \
    -p 9092:9092 \
    -p 9644:9644 \
    docker.vectorized.io/vectorized/redpanda:latest \
    redpanda start \
    --overprovisioned \
    --smp 1  \
    --memory 1G \
    --reserve-memory 0M \
    --node-id 0 \
    --check=false
```

It is then just a case of setting the environment variables and running the tests as normal

```
TEST_INTEGRATION=1 KAFKA_CONNECT=localhost:9093 cargo test
```

Or to just run the Kafka tests

```
TEST_INTEGRATION=1 KAFKA_CONNECT=localhost:9093 cargo test -p write_buffer kafka --nocapture
```
feat: Consolidate all bucket config into one option/env var Fixes #869. 2021-02-24 21:52:18 +00:00			`# Testing`

			`This document covers details that are only relevant if you are developing IOx and running the tests.`

docs: add writeup on how to use IOx server from source (#2220) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com> 2021-08-06 21:04:44 +00:00
			`## Running the IOx server from source`

			`### Starting the server`
			You can run IOx locally with a command like this (replacing `--data-dir` with your preferred location)

			```shell
			`cargo run -- run -v --object-store=file --data-dir=$HOME/.influxdb_iox --server-id=42`
			```

			`### Loading data`
			In another terminal window, try loading some data. These commands will create a database called `parquet_db` and load the contents of `tests/fixtures/lineproto/metrics.lp` into it

			```shell
			`cd influxdb_iox`
			`./target/debug/influxdb_iox database create parquet_db`
			`./target/debug/influxdb_iox database write parquet_db tests/fixtures/lineproto/metrics.lp`
			```

			`### Editing configuration`
			`You can interactively edit the configuration of the IOx instance with a command like this:`

			```shell
			`./scripts/edit_db_rules localhost:8082 parquet_db`
			```

			`Which will bring up your editor with a file that looks like this. Any changes you make to the file will be sent to IOx as its new config.`

			`In this case, these settings will cause data to be persisted to parquet almost immediately`

			```json
			`{`
			`"rules": {`
			`"name": "parquet_db",`
			`"partitionTemplate": {`
			`"parts": [`
			`{`
			`"time": "%Y-%m-%d %H:00:00"`
			`}`
			`]`
			`},`
			`"lifecycleRules": {`
			`"bufferSizeSoft": "52428800",`
			`"bufferSizeHard": "104857600",`
			`"dropNonPersisted": true,`
			`"immutable": false,`
			`"persist": true,`
			`"workerBackoffMillis": "1000",`
			`"catalogTransactionsUntilCheckpoint": "100",`
			`"lateArriveWindowSeconds": 1,`
			`"persistRowThreshold": "1",`
			`"persistAgeThresholdSeconds": 1,`
			`"mubRowThreshold": "1",`
			`"parquetCacheLimit": "0",`
			`"maxActiveCompactionsCpuFraction": 1`
			`},`
			`"workerCleanupAvgSleep": "500s"`
			`}`
			`}`
			```

			`### Examining Parquet Files`
			You can use tools such as `parquet-tools` to examine the parquet files created by IOx. For example, the following command will show the contents of the `disk` table when persisted as parquet (note the actual filename will be different):

			```shell
			`parquet-tools meta /Users/alamb/.influxdb_iox/42/parquet_db/data/disk/2020-06-11\ 16\:00\:00/1.4b1a7805-d6de-495e-844b-32fa452147c7.parquet`
			```


feat: Consolidate all bucket config into one option/env var Fixes #869. 2021-02-24 21:52:18 +00:00			`## Object storage`

			`### To run the tests or not run the tests`

			`If you are testing integration with some or all of the object storage options, you'll have more`
			`setup to do.`

docs: Clarify more about TEST_INTEGRATION in regards to both crates that use it 2021-02-25 20:52:52 +00:00			By default, `cargo test -p object_store` does not run any tests that actually contact
			`any cloud services: tests that do contact the services will silently pass.`
feat: Consolidate all bucket config into one option/env var Fixes #869. 2021-02-24 21:52:18 +00:00
test: Don't run any integration tests unless TEST_INTEGRATION is set 2021-04-22 18:20:27 +00:00			To run integration tests, use `TEST_INTEGRATION=1 cargo test -p object_store`, which will run the
			`tests that contact the cloud services and fail them if the required environment variables aren't`
			`set.`
feat: Consolidate all bucket config into one option/env var Fixes #869. 2021-02-24 21:52:18 +00:00
			`### Configuration differences when running the tests`

refactor: rename `query` mode to `database` 2021-10-27 07:31:41 +00:00			When running `influxdb_iox run database`, you can pick one object store to use. When running the tests,
feat: Consolidate all bucket config into one option/env var Fixes #869. 2021-02-24 21:52:18 +00:00			`you can run them against all the possible object stores. There's still only one`
			`INFLUXDB_IOX_BUCKET` variable, though, so that will set the bucket name for all configured object
			`stores. Use the same bucket name when setting up the different services.`
docs: Clarify more about TEST_INTEGRATION in regards to both crates that use it 2021-02-25 20:52:52 +00:00
			`Other than possibly configuring multiple object stores, configuring the tests to use the object`
			`store services is the same as configuring the server to use an object store service. See the output`
refactor: rename `query` mode to `database` 2021-10-27 07:31:41 +00:00			of `influxdb_iox run database --help` for instructions.
test: start of integration tests of influxdb2 client against influxdb 2.0 OSS 2021-03-25 19:22:51 +00:00
			`## InfluxDB 2 Client`

			The `influxdb2_client` crate may be used by people using InfluxDB 2.0 OSS, and should be compatible
test: Don't run any integration tests unless TEST_INTEGRATION is set 2021-04-22 18:20:27 +00:00			`with both that and IOx. If you want to run the integration tests for the client against InfluxDB`
			2.0 OSS, you will need to set `TEST_INTEGRATION=1`.

			If you have `docker` in your path, the integration tests for the `influxdb2_client` crate will run
			integration tests against `influxd` running in a Docker container.
test: Run influxd in docker by default for integration tests 2021-04-15 20:58:27 +00:00
docs: Make a sentence clearer 2021-04-19 19:54:37 +00:00			If you do not want to use Docker locally, but you do have `influxd` for InfluxDB
fix: Rename LOCAL env var to INFLUXDB_IOX_INTEGRATION_LOCAL 2021-04-21 14:32:07 +00:00			`2.0 locally, you can use that instead by running the tests with the environment variable`
			`INFLUXDB_IOX_INTEGRATION_LOCAL=1`.
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00
			`## Kafka Write Buffer`

refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`By default, the integration tests for the Kafka-based write buffer are not run.`
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`In order to run them you must set two environment variables:`
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			* `TEST_INTEGRATION=1`
			* `KAFKA_CONNECT` to a host and port where the tests can connect to a running Kafka broker
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`### Running Kafka Locally`
docs: Clarify the differences between the two ways to run the Kafka integration tests 2021-06-16 14:52:21 +00:00
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`[Redpanda](https://vectorized.io/redpanda/) is a Kafka-compatible broker that can be used to run the tests, and is used`
			`by the CI to test IOx.`
docs: Clarify the differences between the two ways to run the Kafka integration tests 2021-06-16 14:52:21 +00:00
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`Either follow the instructions on the website to install redpanda directly onto your system, or alternatively`
			`it can be run in a docker container with:`
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00
			```
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`docker run -d --pull=always --name=redpanda-1 --rm \`
			`-p 9092:9092 \`
			`-p 9644:9644 \`
			`docker.vectorized.io/vectorized/redpanda:latest \`
			`redpanda start \`
			`--overprovisioned \`
			`--smp 1 \`
			`--memory 1G \`
			`--reserve-memory 0M \`
			`--node-id 0 \`
			`--check=false`
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00			```

refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`It is then just a case of setting the environment variables and running the tests as normal`
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00
			```
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`TEST_INTEGRATION=1 KAFKA_CONNECT=localhost:9093 cargo test`
test: Start of Kafka Write Buffer integration tests 2021-06-09 17:35:59 +00:00			```

refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`Or to just run the Kafka tests`
docs: Recommend pulling latest docker image first 2021-06-16 15:12:26 +00:00
			```
refactor: merge CI integration tests (#3049) * refactor: merge CI integration tests * chore: update docs 2021-11-10 16:48:16 +00:00			`TEST_INTEGRATION=1 KAFKA_CONNECT=localhost:9093 cargo test -p write_buffer kafka --nocapture`
			```