---
title: Scraping and discovery
aliases:
  - /kapacitor/v1/pull_metrics/scraping-and-discovery/
  - /kapacitor/v1/pull_metrics/
menu:
  kapacitor_v1:
    name: Dynamic Data Scraping
    weight: 6
    parent: work-w-kapacitor
---

Data can be pulled from a dynamic list of remote targets with the discovery and scraping features of Kapacitor.
Use those features with [TICKscripts](/kapacitor/v1/reference/tick/) to monitor targets, process the data, and write data to [InfluxDB](/influxdb/v1/).
Currently, Kapacitor supports only Prometheus style targets.

{{% note %}}
**Note**: Scraping and discovery is currently under technical preview.
There may be changes to the configuration and behavior in subsequent releases.
{{% /note %}}

### Content

* [Overview](#overview)
* [Configuring Scrapers and Discoverers](#configuring-scrapers-and-discoverers)

## Overview

The diagram below outlines the infrastructure for discovering and scraping data with Kapacitor.

**Image 1 &ndash; Scrapping and Discovery work flow**

<img src="/img/kapacitor/1-4-pull-metrics.png" alt="configuration-open" style="max-width:100%;" />

1. First, Kapacitor implements the discovery process to identify the available targets in your infrastructure.
It requests that information at regular intervals and receives that information from an [authority](#available-discoverers).
In the diagram, the authority informs Kapacitor of three targets: `A`, `B`, and `C`.
1. Next, Kapacitor implements the scraping process to pull metrics data from the existing targets.
It runs the scraping process at regular intervals.
Here, Kapacitor requests metrics from targets `A`, `B`, and `C`.
The application running on `A`, `B`, and `C` exposes a `/metrics` endpoint on its HTTP API which returns application-specific statistics.
1. Finally, Kapacitor processes the data according to configured [TICKscripts](/kapacitor/v1/reference/tick/).
Use TICKscripts to filter, transform, and perform other tasks on the metrics data.
In addition, if the data should be stored, configure a TICKscript to send it to [InfluxDB](/influxdb/v1/).

### Pushing vs. Pulling Metrics

By combining discovery with scraping, Kapacitor enables a metrics gathering infrastructure to pull metrics off of targets instead of requiring them to push metrics out to InfluxDB.
Pulling metrics has several advantages in dynamic environments where a target may have a short lifecycle.

## Configuring Scrapers and Discoverers

A single scraper scrapes the targets from a single discoverer.
Configuring both scrapers and discoverers comes down to configuring each individually and then informing the scraper about the discoverer.

Below are all the configuration options for a scraper.

**Example 1 &ndash; Scrapper Configuration**

```
[[scraper]]
  enabled = false
  name = "myscraper"
  # ID of the discoverer to use
  discoverer-id = ""
  # The kind of discoverer to use
  discoverer-service = ""
  db = "mydb"
  rp = "myrp"
  type = "prometheus"
  scheme = "http"
  metrics-path = "/metrics"
  scrape-interval = "1m0s"
  scrape-timeout = "10s"
  username = ""
  password = ""
  bearer-token = ""
  ssl-ca = ""
  ssl-cert = ""
  ssl-key = ""
  ssl-server-name = ""
  insecure-skip-verify = false
```


### Available Discoverers

Kapacitor supports the following services for discovery:

| Name             | Description                                                                                                               |
| ----             | -----------                                                                                                               |
| azure            | Discover targets hosted in [Azure](https://azure.microsoft.com/).                                                         |
| consul           | Discover targets using [Consul](https://www.consul.io/) service discovery.                                                |
| dns              | Discover targets via DNS queries.                                                                                         |
| ec2              | Discover targets hosted in [AWS EC2](https://aws.amazon.com/ec2/).                                                        |
| file-discovery   | Discover targets listed in files.                                                                                         |
| gce              | Discover targets hosted in [GCE](https://cloud.google.com/compute/).                                                      |
| kubernetes       | Discover targets hosted in [Kubernetes](https://kubernetes.io/).                                                          |
| marathon         | Discover targets using [Marathon](https://mesosphere.github.io/marathon/) service discovery.                              |
| nerve            | Discover targets using [Nerve](https://github.com/airbnb/nerve) service discovery.                                        |
| serverset        | Discover targets using [Serversets](https://github.com/twitter/finagle/tree/master/finagle-serversets) service discovery. |
| static-discovery | Statically list targets.                                                                                                  |
| triton           | Discover targets using [Triton](https://github.com/joyent/triton) service discovery.                                      |


See the example [configuration file](https://github.com/influxdata/kapacitor/blob/master/etc/kapacitor/kapacitor.conf) for details on configuring each discoverer.