influxdb

Commit Graph

Author	SHA1	Message	Date
Jason Wilder	27ae2929fc	Add wal-fsync-delay to Diagnostics	2017-03-15 16:31:03 -06:00
Jason Wilder	e9eb925170	Coalesce multiple WAL fsyncs Fsyncs to the WAL can cause higher IO with lots of small writes or slower disks. This reworks the previous wal fsyncing to remove the extra goroutine and remove the hard-coded 100ms delay. Writes to the wal still maintain the invariant that they do not return to the caller until the write is fsync'd. This also adds a new config options wal-fsync-delay (default 0s) which can be increased if a delay is desired. This is somewhat useful for system with slower disks, but the current default works well as is.	2017-03-15 16:31:03 -06:00
Jason Wilder	7bd1bd8ab3	Only calculate disk size if shard has changed Calling DiskSize can be expensive with many shards. Since the stats collection runs this every 10s by default, it can be expensive and wasteful to calculate the stats when nothing has changed. This avoids re-calculating the shard size unless something has chagned.	2017-03-15 16:29:57 -06:00
Jason Wilder	65464ea0d1	Merge pull request #8131 from influxdata/jw-values-merge Use standard merge algorithm when merging Values	2017-03-15 09:51:21 -06:00
Jason Wilder	a4cfeacedb	Use standard merge algorithm for merging values The previous version was very innefficient due to the benchmarks used to optimize it having a bug. This version always allocates a new slice, but is O(n).	2017-03-15 08:59:41 -06:00
Jason Wilder	4d37c9dc9e	Fix broken Values.Merge benchmark Merge had the side effect of modifying the original values so the results are wrong because they always hit the fast path after the first run.	2017-03-14 14:20:24 -06:00
Mark Rushakoff	535cf597f1	Report subset of config values in SHOW DIAGNOSTICS This includes hand-selected config settings that are safe to expose and not expected to include any kind of secrets. Fixes #7821	2017-03-14 11:34:19 -07:00
Jason Wilder	ca9c67a877	Generate encode*Values funcs	2017-03-14 11:54:53 -06:00
Jason Wilder	2f7d4995b4	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-09 16:27:07 -07:00
Jason Wilder	78b7815c49	Add block type for BlockIterator	2017-03-09 09:16:59 -07:00
Jason Wilder	b9e5375043	Merge branch '1.2' into jw-merge-12	2017-03-08 13:16:50 -07:00
Jason Wilder	394bca3aad	Validate field type when creating new fields	2017-03-06 16:13:17 -07:00
Jason Wilder	37187cbe6d	Delete series under fields lock Still seeing the panic that switching this logic around was supposed to fix. We now delete the bulk of data outside of the fields lock and then again, under the write lock, to ensure that the field mapping is accurate. We don't do the full delete under the lock because it can block writes and queries that require a read lock.	2017-03-06 14:19:55 -07:00
Jason Wilder	675d7c9d65	Merge branch '1.2' into jw-merge12	2017-03-06 11:09:05 -07:00
Jason Wilder	eab012ef61	Fix points missing after compaction If blocks containing overlapping ranges of time where partially recombined, it was possible for the some points to get dropped during compactions. This occurred because the window of time of the points we need to merge did not account for the partial blocks created from a prior merge. Fixes #8084	2017-03-06 10:17:11 -07:00
Jason Wilder	3c70abf061	Delete series before remove from field index There is a race where the field type can be deleted while a new type is written and during a query. When this happens, an iterator for the new type is created but old data make still exist in the cache for TSM files causing a panic.	2017-03-06 09:38:27 -07:00
Jason Wilder	29f8d8de76	Fix race in WALEntry.Encode and Value.Deduplicate Under high query load, a race exists in the cache and the WAL. Since writes currently hit the cache first, they are availble for query before they hit the WAL. If the WAL is writing and accessign the Value slice at the same time that a query is run that needs to dedup the same slice, a race occurs. To fix this, the cache now just copies the values instead of storing the slice passed in. Another way to fix this might be to have the writes go to the wal before the cache. I think the latter would be better, but it introduces some larger write path issues that we'd need to also address. e.g. if the cache was full, writes to the WAL would need to be rejected to avoid filling the disk. Copying the slice in the cache is simpler for now and does not appear to dramatically affect performance.	2017-03-06 09:38:22 -07:00
Ben Johnson	4c202eea09	Re-check field type under write lock.	2017-03-03 09:47:43 -07:00
Jonathan A. Sternberg	1081785cb4	Treat non-reserved measurement names with underscores as normal measurements A measurement name that begins with an underscore and does not conflict with one of the reserved measurement names will now be passed untouched to the underlying shards rather than being intercepted as an empty measurement. A user still shouldn't rely on measurements that begin with underscores to always be accessible, but this will prevent the most common use case from causing unexpected behavior since we will very rarely, if ever, add additional system sources.	2017-02-27 16:49:02 -06:00
Jason Wilder	a024003f2c	Merge branch '1.2' into jw-merge-12	2017-02-22 12:13:29 -07:00
Ben Johnson	78a9bb2527	Remove Tags.shouldCopy, replace with forceCopy on series creation. Previously, tags had a `shouldCopy` flag to indicate if those tags referenced an underlying buffer and should be copied to allow GC. Unfortunately, this prevented tags from being copied that were created and referenced the mmap which caused segfaults. This change removes the `shouldCopy` flag and replaces it with a `forceCopy` argument in `CreateSeriesIfNotExists()`. This allows the write path to indicate that tags must be cloned on insert.	2017-02-21 11:13:35 -07:00
Mark Rushakoff	601cbcd084	Merge branch '1.2' into mr-merge-12	2017-02-17 16:14:22 -08:00
Jonathan A. Sternberg	2fe48d6781	Rename zap import back to github.com/uber-go/zap They rebased a revision we were previously relying upon that allowed us to use the vanity name so we are reverting back to an older version with the old import path.	2017-02-17 17:17:22 -06:00
Ben Johnson	8e79ca5d75	Fix tag dereferencing panic. Clones series tags under lock during var ref iterator creation.	2017-02-15 17:56:47 -07:00
Jonathan A. Sternberg	71f62d33e6	Map types correctly when using a regex and one of the measurements is empty	2017-02-13 18:14:29 -06:00
Jason Wilder	4b6289ce58	Merge pull request #7942 from influxdata/jw-cache-partitions Reduce write timeouts	2017-02-10 10:07:08 -07:00
Jason Wilder	2f74e3f3d5	Use simple8b.CountBytes to avoid allocations	2017-02-09 10:47:03 -07:00
Jason Wilder	1bc0f68490	Merge branch '1.2' into jw-merge-12	2017-02-07 12:48:36 -07:00
Jonathan A. Sternberg	e1fa48d0dd	Fix ORDER BY time DESC with ordering series keys The order of series keys is in ascending alphabetical order, not descending alphabetical order, when it is ordered by descending time. This fixes the ordering so points are returned in descending order. The emitter also had the conditions for choosing which iterator to use in the wrong direction (which only affects aggregates with `FILL(none)`).	2017-02-06 15:49:12 -06:00
Jonathan A. Sternberg	95831b3307	Fix LIMIT and OFFSET when they are used in a subquery This fixes LIMIT and OFFSET when they are used in a subquery where the grouping of the inner query is different than the grouping of the outer query. When organizing tag sets, the grouping of the outer query is used so the final result is in the correct order. But, unfortunately, the optimization incorrectly limited the number of points based on the grouping in the outer query rather than the grouping in the inner query. The ideal solution would be to use the outer grouping to further organize it by the grouping for the inner subquery, but that's more difficult to do at the moment. As an easier fix, the query engine now limits the output of each series. This may result in these types of queries being slower in some situations like this one: SELECT mean(value) FROM (SELECT value FROM cpu GROUP BY host LIMIT 1) This will be slower in a situation where the `cpu` measurement has a high cardinality and many different tags. This also fixes `last()` and `first()` when they are used in a subquery because those functions use `LIMIT 1` as an internal optimization.	2017-02-06 14:04:34 -06:00
Jason Wilder	93a9d01643	Increase default waiting WAL writes	2017-02-06 11:48:51 -07:00
Jason Wilder	38a649fc40	Batch multiple WAL fsyncs Every write to the WAL current runs and fsync before returning. When there are lot of concurrent writes, this can cause the WAL to bottleneck write throughput since fsyncs are very expensive. This changes the writeToLog to fsync on an interval to allow multiple fsyncs calls to be batched up into one. The writeToLog behavior is the same in that it won't return until an fsync has been performed.	2017-02-06 11:48:45 -07:00
Jason Wilder	2e95b4043c	Merge branch '1.2' into jw-merge-12	2017-02-02 16:40:36 -07:00
Ben Johnson	faef0a99c9	Perform series tag iteration under lock. Adds a `tsdb.Series.ForEachTag()` function for safely iterating over a series' tags within the context of a lock. This preverts tags from being dereferenced during iteration which can cause a seg fault.	2017-02-01 16:25:53 -07:00
Jason Wilder	54ab3a7a0a	Don't write lock file store when opening new files When replacing TSM files, the new files can be opened before the write lock is taken to reduce lock contention in this code path.	2017-02-01 11:11:26 -07:00
Jason Wilder	6eb46d2100	Remove unnecessary read lock on engine	2017-02-01 11:10:41 -07:00
Jason Wilder	784a851742	Release cpu during compactions	2017-01-31 17:04:36 -07:00
Jason Wilder	278c1449d6	Increase number of cache partitions	2017-01-31 16:49:57 -07:00
Edd Robinson	91ee34b111	Merge pull request #7837 from influxdata/er-tidy General tidy up and subtle bug fixes	2017-01-26 13:43:07 +00:00
Cory LaNou	d54a955068	allow partial writes on field conflicts	2017-01-23 11:54:46 -07:00
Cory LaNou	0103e44896	allow partial writes on field conflicts	2017-01-23 12:25:35 -06:00
Gunnar	3722fa383d	Merge pull request #7718 from influxdata/ga-drop-stats Add stats on dropped measurements and series; Fixes #7697	2017-01-20 15:54:06 -08:00
Edd Robinson	feb7a2842c	Use unbuffered error channels in tests	2017-01-17 10:53:15 -08:00
Edd Robinson	fb7388cdfc	Remove dead code from various pkgs	2017-01-17 09:47:34 -08:00
Edd Robinson	292b30b82b	Fix subtle bugs and remove dead code from tsdb	2017-01-17 09:47:34 -08:00
Edd Robinson	320c5981cb	Fixes racy locking on measurement	2017-01-17 09:44:56 -08:00
Edd Robinson	45324b3848	Fixes racy locking on measurement	2017-01-16 14:22:11 -08:00
Joe LeGasse	cd00085e9e	Adjust Tags cloning This change delays Tag cloning until a new series is found, and will only clone Tags acquired from `ParsePoints...` and not those referencing the mmap-ed files (TSM) that are created on startup.	2017-01-13 13:15:36 -05:00
Mark Rushakoff	cdbdd156f3	Fix memory leak of retained HTTP write payloads This leak seems to have been introduced in `8aa224b22d`, present in 1.1.0 and 1.1.1. When points were parsed from HTTP payloads, their tags and fields referred to subslices of the request body; if any tag set introduced a new series, then those tags then were stored in the in-memory series index objects, preventing the HTTP body from being garbage collected. If there were no new series in the payload, then the request body would be garbage collected as usual. Now, we clone the tags before we store them in the index. This is an imperfect fix because the Point still holds references to the original tags, and the Point's field iterator also refers to the payload buffer. However, the current write code path does not retain references to the Point or its fields; and this change will likely be obsoleted when TSI is introduced. This change likely fixes #7827, #7810, #7778, and perhaps others.	2017-01-12 16:16:54 -08:00
Joe LeGasse	2db0250b22	Add db/rp name validation This change adds some very basic name validation with the following plain-english description: names must be non-zero sequence of printable characters that do not contain slashes ('/' or '\') and are not equal to either "." or "..". The intent is that, since we currently just use database and retention policy names directly as path elements, these rules will hopefully leave us with names that should be at least close to valid directory names. Ideally, we would restrict names even further or not use them as path elements directly, but this should be a step towards the former without restricting names "too much"	2017-01-12 17:38:10 -05:00

1 2 3 4 5 ...

1436 Commits (d3caef66854042c8391eaacbec9af6a002967de8)