influxdb

Commit Graph

Author	SHA1	Message	Date
Edd Robinson	5e342a2ddd	Ensure shared index removed on database drop When using the inmem index, if one drops a database, and then creates it again, the previous index object will be reused. This includes the previous cardinality estimation sketches, leading to inaccurate cardinality estimations.	2017-03-30 13:05:31 +01:00
Edd Robinson	ddf7f0fd7b	Remove uncalled method	2017-03-30 12:48:22 +01:00
Edd Robinson	fddaff2cc8	Merge master in	2017-03-29 18:00:28 +01:00
Edd Robinson	116230b427	Use varint for tag count	2017-03-29 16:31:13 +01:00
Edd Robinson	45f843fc91	Don't unassign shards when system shutting down	2017-03-29 11:57:38 +01:00
Ben Johnson	2edfb1c92d	Ignore series limit on database load.	2017-03-24 16:27:16 -06:00
Jason Wilder	ee03fbb164	Fix series tombstone sketch not updated when dropping measurment	2017-03-24 15:49:00 -06:00
Ben Johnson	d2b396bff5	Fix database series limit, remove shard series limit.	2017-03-24 13:16:00 -06:00
Ben Johnson	9fb8f1ec1d	Fix database and tag limits.	2017-03-24 09:48:10 -06:00
Jason Wilder	631681796d	Remove tsl file committed by mistake	2017-03-23 16:18:27 -06:00
Jason Wilder	7119ef8f29	Merge pull request #8193 from influxdata/jw-123-backports 1.2.3 backports	2017-03-23 13:31:35 -06:00
Jason Wilder	ca1919e5de	Use standard merge algorithm for merging values The previous version was very innefficient due to the benchmarks used to optimize it having a bug. This version always allocates a new slice, but is O(n).	2017-03-23 12:53:59 -06:00
Jason Wilder	ba2571903d	Fix broken Values.Merge benchmark Merge had the side effect of modifying the original values so the results are wrong because they always hit the fast path after the first run.	2017-03-23 12:53:50 -06:00
Jason Wilder	890ffb4ce8	Generate encode*Values funcs	2017-03-23 12:53:29 -06:00
Jason Wilder	ced953ae89	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-23 12:53:17 -06:00
Jason Wilder	a1c84ae6f3	Add block type for BlockIterator	2017-03-23 12:49:17 -06:00
Jason Wilder	2972a3f223	Remove MMAP derefencing code This code was added to address some slow startup issues. It is believed to be the cause of some segfault panic's that occur at query time when the underlying MMAP array has been unmapped. The current structure of code makes this change unnecessary now.	2017-03-23 12:46:23 -06:00
Jason Wilder	61f80db1b9	Skip cardinaltiy dups on circle race test	2017-03-22 15:20:38 -06:00
Jason Wilder	c443e639b0	Fix 32bit alignment issue in wal.sync	2017-03-22 11:21:29 -06:00
Jason Wilder	06306bbd97	Run store tests in parallel	2017-03-22 10:55:38 -06:00
Edd Robinson	1c4ecb12c1	Don't panic on nil engine	2017-03-22 10:07:29 -06:00
Ben Johnson	85c1ae4dd4	Remove sort.Slice	2017-03-21 12:33:05 -06:00
Ben Johnson	afe41f1c80	Fix tsm1/tsi1 broken tests.	2017-03-21 12:21:48 -06:00
Ben Johnson	1e9fa7bc2c	Fix 32-bit rhh implementation.	2017-03-21 11:44:13 -06:00
Jason Wilder	58c8736ebc	Merge pull request #8172 from influxdata/jw-dropped-points Fix series not getting created	2017-03-21 09:44:31 -06:00
Jason Wilder	92c722913d	Unlock index if dropping non-existent series	2017-03-21 09:19:44 -06:00
Ben Johnson	5cf41adcb8	Optimize tsi1 write path. Removes sorted series list in log, uses a buffer for HasSeries(), and prepends a length for series key encoding.	2017-03-21 08:44:35 -06:00
Edd Robinson	47448646ed	Remove DropSeries on index	2017-03-21 11:35:31 +00:00
Edd Robinson	f89de550ed	Significantly speed up DROP DATABASE	2017-03-21 11:35:31 +00:00
Jason Wilder	7f78c6ad8e	Fix series not getting created	2017-03-20 17:19:22 -06:00
Jason Wilder	8f7b251afd	Merge branch 'master' into jw-tsi	2017-03-20 17:17:26 -06:00
Jason Wilder	8177df2dab	Simplify Measurement.TagSets signature	2017-03-17 16:19:10 -06:00
Jason Wilder	2d5d899ac2	Allow queries to be interrupted during planning If a bad query is run, kill query and limits would not kick in until after it started executing. Some bad queries that involve high cardinality can cause the server to OOM just from planning which defeats the purpose of the max-select-series limit. This change primarily fixes max-select-series limit so that the query is killed earlier and has the side effect that kill query now can kill a query while it's being planned.	2017-03-17 16:00:54 -06:00
Jason Wilder	bc4aeefbed	Check max-series-limit in shard iterator creation The limit waited until all the iterators had been created which still allows problem queries to be planned. This allows the queries to be aborted much earlier in some cases.	2017-03-17 16:00:25 -06:00
Ben Johnson	5d67c424bf	Refactor tsi1 write locking.	2017-03-17 11:20:50 -06:00
Ben Johnson	70efc70abe	Reduce lock contention, fix rhh lookup.	2017-03-17 09:44:11 -06:00
Jason Wilder	27ae2929fc	Add wal-fsync-delay to Diagnostics	2017-03-15 16:31:03 -06:00
Jason Wilder	e9eb925170	Coalesce multiple WAL fsyncs Fsyncs to the WAL can cause higher IO with lots of small writes or slower disks. This reworks the previous wal fsyncing to remove the extra goroutine and remove the hard-coded 100ms delay. Writes to the wal still maintain the invariant that they do not return to the caller until the write is fsync'd. This also adds a new config options wal-fsync-delay (default 0s) which can be increased if a delay is desired. This is somewhat useful for system with slower disks, but the current default works well as is.	2017-03-15 16:31:03 -06:00
Jason Wilder	7bd1bd8ab3	Only calculate disk size if shard has changed Calling DiskSize can be expensive with many shards. Since the stats collection runs this every 10s by default, it can be expensive and wasteful to calculate the stats when nothing has changed. This avoids re-calculating the shard size unless something has chagned.	2017-03-15 16:29:57 -06:00
Ben Johnson	1807772388	Fix tsi tests.	2017-03-15 11:23:58 -06:00
Ben Johnson	ee2e046853	Merge remote-tracking branch 'upstream/tsi-log-compact' into tsi	2017-03-15 10:22:32 -06:00
Ben Johnson	cf7ba96377	Merge branch 'tsi-log-compact' into tsi	2017-03-15 10:18:40 -06:00
Ben Johnson	358b1e0b05	Merge remote-tracking branch 'upstream/master' into tsi	2017-03-15 10:13:32 -06:00
Jason Wilder	65464ea0d1	Merge pull request #8131 from influxdata/jw-values-merge Use standard merge algorithm when merging Values	2017-03-15 09:51:21 -06:00
Jason Wilder	a4cfeacedb	Use standard merge algorithm for merging values The previous version was very innefficient due to the benchmarks used to optimize it having a bug. This version always allocates a new slice, but is O(n).	2017-03-15 08:59:41 -06:00
Edd Robinson	7d997d508a	Fixes #8138	2017-03-15 12:50:22 +00:00
Edd Robinson	ddcea1c322	WHY YOU SMITE ME BEN. B. JOHNSON?	2017-03-15 12:50:03 +00:00
Jason Wilder	4d37c9dc9e	Fix broken Values.Merge benchmark Merge had the side effect of modifying the original values so the results are wrong because they always hit the fast path after the first run.	2017-03-14 14:20:24 -06:00
Mark Rushakoff	535cf597f1	Report subset of config values in SHOW DIAGNOSTICS This includes hand-selected config settings that are safe to expose and not expected to include any kind of secrets. Fixes #7821	2017-03-14 11:34:19 -07:00
Jason Wilder	ca9c67a877	Generate encode*Values funcs	2017-03-14 11:54:53 -06:00
Ben Johnson	d23f2971c3	Refactor TagBlockEncoder.	2017-03-10 10:08:16 -07:00
Jason Wilder	2f7d4995b4	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-09 16:27:07 -07:00
Jason Wilder	78b7815c49	Add block type for BlockIterator	2017-03-09 09:16:59 -07:00
Jason Wilder	b9e5375043	Merge branch '1.2' into jw-merge-12	2017-03-08 13:16:50 -07:00
Jason Wilder	394bca3aad	Validate field type when creating new fields	2017-03-06 16:13:17 -07:00
Jason Wilder	37187cbe6d	Delete series under fields lock Still seeing the panic that switching this logic around was supposed to fix. We now delete the bulk of data outside of the fields lock and then again, under the write lock, to ensure that the field mapping is accurate. We don't do the full delete under the lock because it can block writes and queries that require a read lock.	2017-03-06 14:19:55 -07:00
Jason Wilder	675d7c9d65	Merge branch '1.2' into jw-merge12	2017-03-06 11:09:05 -07:00
Jason Wilder	eab012ef61	Fix points missing after compaction If blocks containing overlapping ranges of time where partially recombined, it was possible for the some points to get dropped during compactions. This occurred because the window of time of the points we need to merge did not account for the partial blocks created from a prior merge. Fixes #8084	2017-03-06 10:17:11 -07:00
Jason Wilder	3c70abf061	Delete series before remove from field index There is a race where the field type can be deleted while a new type is written and during a query. When this happens, an iterator for the new type is created but old data make still exist in the cache for TSM files causing a panic.	2017-03-06 09:38:27 -07:00
Jason Wilder	29f8d8de76	Fix race in WALEntry.Encode and Value.Deduplicate Under high query load, a race exists in the cache and the WAL. Since writes currently hit the cache first, they are availble for query before they hit the WAL. If the WAL is writing and accessign the Value slice at the same time that a query is run that needs to dedup the same slice, a race occurs. To fix this, the cache now just copies the values instead of storing the slice passed in. Another way to fix this might be to have the writes go to the wal before the cache. I think the latter would be better, but it introduces some larger write path issues that we'd need to also address. e.g. if the cache was full, writes to the WAL would need to be rejected to avoid filling the disk. Copying the slice in the cache is simpler for now and does not appear to dramatically affect performance.	2017-03-06 09:38:22 -07:00
Ben Johnson	4c202eea09	Re-check field type under write lock.	2017-03-03 09:47:43 -07:00
Jonathan A. Sternberg	1081785cb4	Treat non-reserved measurement names with underscores as normal measurements A measurement name that begins with an underscore and does not conflict with one of the reserved measurement names will now be passed untouched to the underlying shards rather than being intercepted as an empty measurement. A user still shouldn't rely on measurements that begin with underscores to always be accessible, but this will prevent the most common use case from causing unexpected behavior since we will very rarely, if ever, add additional system sources.	2017-02-27 16:49:02 -06:00
Jason Wilder	a024003f2c	Merge branch '1.2' into jw-merge-12	2017-02-22 12:13:29 -07:00
Ben Johnson	78a9bb2527	Remove Tags.shouldCopy, replace with forceCopy on series creation. Previously, tags had a `shouldCopy` flag to indicate if those tags referenced an underlying buffer and should be copied to allow GC. Unfortunately, this prevented tags from being copied that were created and referenced the mmap which caused segfaults. This change removes the `shouldCopy` flag and replaces it with a `forceCopy` argument in `CreateSeriesIfNotExists()`. This allows the write path to indicate that tags must be cloned on insert.	2017-02-21 11:13:35 -07:00
Mark Rushakoff	601cbcd084	Merge branch '1.2' into mr-merge-12	2017-02-17 16:14:22 -08:00
Jonathan A. Sternberg	2fe48d6781	Rename zap import back to github.com/uber-go/zap They rebased a revision we were previously relying upon that allowed us to use the vanity name so we are reverting back to an older version with the old import path.	2017-02-17 17:17:22 -06:00
Ben Johnson	7ee0b9bb28	Rewrite SeriesBlockEncoder.	2017-02-16 09:39:51 -07:00
Ben Johnson	8e79ca5d75	Fix tag dereferencing panic. Clones series tags under lock during var ref iterator creation.	2017-02-15 17:56:47 -07:00
Ben Johnson	7e6f33ef48	TSI1 Index file compaction.	2017-02-15 08:44:22 -07:00
Ben Johnson	71d13fed6d	Immediate, parallel tsl compactions.	2017-02-15 08:44:21 -07:00
Ben Johnson	673143a0ad	Remove .tsl file.	2017-02-15 08:44:01 -07:00
Jonathan A. Sternberg	71f62d33e6	Map types correctly when using a regex and one of the measurements is empty	2017-02-13 18:14:29 -06:00
Edd Robinson	7749bc1c67	Store sorted series in logFile	2017-02-13 17:03:42 +00:00
Jason Wilder	4b6289ce58	Merge pull request #7942 from influxdata/jw-cache-partitions Reduce write timeouts	2017-02-10 10:07:08 -07:00
Edd Robinson	38eb6d5994	Don't load meta data for tsi	2017-02-09 18:04:23 +00:00
Edd Robinson	a6a2f9d5f0	Don't load meta data for tsi	2017-02-09 17:59:14 +00:00
Jason Wilder	2f74e3f3d5	Use simple8b.CountBytes to avoid allocations	2017-02-09 10:47:03 -07:00
Edd Robinson	2d59fb788c	Modify arguments to reduce allocations	2017-02-09 15:01:32 +00:00
Edd Robinson	c2bbc18e4b	Reduce repeated string allocations	2017-02-09 11:23:23 +00:00
Edd Robinson	21e821e5f9	Reduce memory footprint by pre-allocating	2017-02-08 18:49:03 +00:00
Edd Robinson	4fbba8234e	Add Size to models.Tags	2017-02-08 18:44:48 +00:00
Edd Robinson	a9d58cc879	Add log file benchmarks	2017-02-08 16:50:46 +00:00
Ben Johnson	d6fedcacd5	Refactor tsi.LogFile compaction.	2017-02-08 09:00:08 -07:00
Jason Wilder	1bc0f68490	Merge branch '1.2' into jw-merge-12	2017-02-07 12:48:36 -07:00
Ben Johnson	c4e9430de3	Recover after partial tsi log file write.	2017-02-07 10:02:31 -07:00
Jonathan A. Sternberg	e1fa48d0dd	Fix ORDER BY time DESC with ordering series keys The order of series keys is in ascending alphabetical order, not descending alphabetical order, when it is ordered by descending time. This fixes the ordering so points are returned in descending order. The emitter also had the conditions for choosing which iterator to use in the wrong direction (which only affects aggregates with `FILL(none)`).	2017-02-06 15:49:12 -06:00
Jonathan A. Sternberg	95831b3307	Fix LIMIT and OFFSET when they are used in a subquery This fixes LIMIT and OFFSET when they are used in a subquery where the grouping of the inner query is different than the grouping of the outer query. When organizing tag sets, the grouping of the outer query is used so the final result is in the correct order. But, unfortunately, the optimization incorrectly limited the number of points based on the grouping in the outer query rather than the grouping in the inner query. The ideal solution would be to use the outer grouping to further organize it by the grouping for the inner subquery, but that's more difficult to do at the moment. As an easier fix, the query engine now limits the output of each series. This may result in these types of queries being slower in some situations like this one: SELECT mean(value) FROM (SELECT value FROM cpu GROUP BY host LIMIT 1) This will be slower in a situation where the `cpu` measurement has a high cardinality and many different tags. This also fixes `last()` and `first()` when they are used in a subquery because those functions use `LIMIT 1` as an internal optimization.	2017-02-06 14:04:34 -06:00
Jason Wilder	93a9d01643	Increase default waiting WAL writes	2017-02-06 11:48:51 -07:00
Jason Wilder	38a649fc40	Batch multiple WAL fsyncs Every write to the WAL current runs and fsync before returning. When there are lot of concurrent writes, this can cause the WAL to bottleneck write throughput since fsyncs are very expensive. This changes the writeToLog to fsync on an interval to allow multiple fsyncs calls to be batched up into one. The writeToLog behavior is the same in that it won't return until an fsync has been performed.	2017-02-06 11:48:45 -07:00
Ben Johnson	d91e6eabac	Add max-values-per-tag to inmem index.	2017-02-06 11:14:13 -07:00
Edd Robinson	54193e1131	Ensure config contains valid index	2017-02-03 11:57:41 +00:00
Edd Robinson	908959a78a	Fix vet	2017-02-03 11:57:35 +00:00
Jason Wilder	2e95b4043c	Merge branch '1.2' into jw-merge-12	2017-02-02 16:40:36 -07:00
Ben Johnson	76235f1e00	Use original index type for existing shards.	2017-02-02 10:43:48 -07:00
Ben Johnson	c246f3d9b0	Use inmem index on existing shards.	2017-02-02 10:04:25 -07:00
Ben Johnson	faef0a99c9	Perform series tag iteration under lock. Adds a `tsdb.Series.ForEachTag()` function for safely iterating over a series' tags within the context of a lock. This preverts tags from being dereferenced during iteration which can cause a seg fault.	2017-02-01 16:25:53 -07:00
Edd Robinson	5a37d51f02	Refactor DropSeries	2017-02-01 22:07:44 +00:00
Ben Johnson	57f44d5f0c	Include index in snapshot.	2017-02-01 14:19:42 -07:00
Jason Wilder	54ab3a7a0a	Don't write lock file store when opening new files When replacing TSM files, the new files can be opened before the write lock is taken to reduce lock contention in this code path.	2017-02-01 11:11:26 -07:00
Jason Wilder	6eb46d2100	Remove unnecessary read lock on engine	2017-02-01 11:10:41 -07:00
Edd Robinson	0a9fc910f6	Fix potential race on inmem	2017-02-01 16:29:52 +00:00
Edd Robinson	ffced9b538	Move sketch merging into File	2017-02-01 15:33:30 +00:00
Edd Robinson	ca530d478a	Ensure we're testing both indexes	2017-02-01 14:56:35 +00:00
Edd Robinson	ab238251f4	Refactor measurement sketches	2017-02-01 14:56:35 +00:00
Edd Robinson	d8a73a1954	Go vet	2017-02-01 14:51:15 +00:00
Ben Johnson	05bbda2c0a	Fix measurement block sketches.	2017-02-01 07:40:00 -07:00
Jason Wilder	784a851742	Release cpu during compactions	2017-01-31 17:04:36 -07:00
Jason Wilder	278c1449d6	Increase number of cache partitions	2017-01-31 16:49:57 -07:00
Ben Johnson	86e908c384	Remove debug println().	2017-01-31 09:24:58 -07:00
Ben Johnson	f4a4d1e196	Fix series sketches.	2017-01-31 08:47:18 -07:00
Edd Robinson	ab94c1b743	Fixes #7882	2017-01-30 19:12:24 +00:00
Ben Johnson	42959ead1f	Fix CreateSeriesListIfNotExists() data race.	2017-01-30 10:07:07 -07:00
Edd Robinson	91ee34b111	Merge pull request #7837 from influxdata/er-tidy General tidy up and subtle bug fixes	2017-01-26 13:43:07 +00:00
Jason Wilder	2ae9406c1d	Reincorporate memory leak fix 7832 got lost during a rebase/merge. This reincorporate the fix for the in-memory index as well as tsi.	2017-01-24 20:13:48 -07:00
Jason Wilder	173003bc38	Fix go vet	2017-01-24 20:13:37 -07:00
Ben Johnson	047c21f4d9	Merge remote-tracking branch 'upstream/master' into tsi	2017-01-24 09:28:58 -07:00
Cory LaNou	d54a955068	allow partial writes on field conflicts	2017-01-23 11:54:46 -07:00
Cory LaNou	0103e44896	allow partial writes on field conflicts	2017-01-23 12:25:35 -06:00
Ben Johnson	c459d24a60	Test coverage.	2017-01-23 09:38:27 -07:00
Gunnar	3722fa383d	Merge pull request #7718 from influxdata/ga-drop-stats Add stats on dropped measurements and series; Fixes #7697	2017-01-20 15:54:06 -08:00
Edd Robinson	feb7a2842c	Use unbuffered error channels in tests	2017-01-17 10:53:15 -08:00
Edd Robinson	fb7388cdfc	Remove dead code from various pkgs	2017-01-17 09:47:34 -08:00
Edd Robinson	292b30b82b	Fix subtle bugs and remove dead code from tsdb	2017-01-17 09:47:34 -08:00
Edd Robinson	320c5981cb	Fixes racy locking on measurement	2017-01-17 09:44:56 -08:00
Edd Robinson	45324b3848	Fixes racy locking on measurement	2017-01-16 14:22:11 -08:00
Joe LeGasse	cd00085e9e	Adjust Tags cloning This change delays Tag cloning until a new series is found, and will only clone Tags acquired from `ParsePoints...` and not those referencing the mmap-ed files (TSM) that are created on startup.	2017-01-13 13:15:36 -05:00
Mark Rushakoff	cdbdd156f3	Fix memory leak of retained HTTP write payloads This leak seems to have been introduced in `8aa224b22d`, present in 1.1.0 and 1.1.1. When points were parsed from HTTP payloads, their tags and fields referred to subslices of the request body; if any tag set introduced a new series, then those tags then were stored in the in-memory series index objects, preventing the HTTP body from being garbage collected. If there were no new series in the payload, then the request body would be garbage collected as usual. Now, we clone the tags before we store them in the index. This is an imperfect fix because the Point still holds references to the original tags, and the Point's field iterator also refers to the payload buffer. However, the current write code path does not retain references to the Point or its fields; and this change will likely be obsoleted when TSI is introduced. This change likely fixes #7827, #7810, #7778, and perhaps others.	2017-01-12 16:16:54 -08:00
Joe LeGasse	2db0250b22	Add db/rp name validation This change adds some very basic name validation with the following plain-english description: names must be non-zero sequence of printable characters that do not contain slashes ('/' or '\') and are not equal to either "." or "..". The intent is that, since we currently just use database and retention policy names directly as path elements, these rules will hopefully leave us with names that should be at least close to valid directory names. Ideally, we would restrict names even further or not use them as path elements directly, but this should be a step towards the former without restricting names "too much"	2017-01-12 17:38:10 -05:00
Joe LeGasse	b19260fb26	Add some checks before removing directories Fixes #7822 This change first ensures that databases and retention policies exist before attempting to remove them from the Store. It also adds some checks in the `DeleteDatabase` and `DeleteRetentionPolicy` to ensure that maliciously named entries won't remove anything outside of the configured data directory.	2017-01-12 17:38:10 -05:00
Joe LeGasse	bf58d9ffb7	Update backup to use ioutil.ReadDir	2017-01-12 16:28:01 -05:00
Jason Wilder	11f264563a	Fix 32bit alignment	2017-01-12 12:01:49 -07:00
Jason Wilder	06a8fd6ca2	Simplifications and cleanup	2017-01-12 09:55:38 -07:00
Ben Johnson	f43b0f7ec9	Fix series & measurement deletion.	2017-01-12 09:29:40 -07:00
Edd Robinson	73ed864e1d	Add cache tests	2017-01-12 16:27:16 +00:00
Jason Wilder	1e56b5416b	Fix compactions sometimes getting stuck I ran into an issue where the cache snapshotting seemed to stop completely causing the cache to fill up and never recover. I believe this is due to the the Timer being reused incorrectly. Instead, use a Ticker that will fire more regularly and not require the resetting logic (which was wrong).	2017-01-11 17:57:40 -07:00
Jason Wilder	40b017f4a4	Fix Cache stats size collection The memory stats as well as the size of the cache were not accurate. There was also a problem where the cache size would be increased optimisitically, but if the cache size limit was hit, it would not be decreased. This would cause the cache size to grow without bounds with every failed write.	2017-01-11 17:54:51 -07:00
Jason Wilder	c433ff331f	Encode snapshots concurrently The CacheKeyIterator (used for snapshot compactions), iterated over each key and serially encoded the values for that key as the TSM file is written. With many series, this can be slow and will only use 1 CPU core even if more are available. This changes it so that the key space is split amongst a number of goroutines that start encoding all keys in parallel to improve throughput.	2017-01-11 17:54:27 -07:00
Jason Wilder	ae838ef323	Simplify Cache.Snapshot This simplifies the cache.Snapshot func to swap the hot cache to the snapshot cache instead of copy and appending entries. This reduces the amount of time the cache is write locked which should reduce cache contention for the read only code paths.	2017-01-11 11:12:02 -07:00
Jonathan A. Sternberg	3ba950b029	Fix for subqueries to use the parallel iterator correctly Also, fix the `Iterators.Merge(IteratorOptions)` function so it consults the `Ordered` attribute to determine which iterator it should use to merge the input iterators.	2017-01-11 10:47:18 -06:00
Ben Johnson	352817e8c4	Convert 32-bit offsets to 64-bit.	2017-01-11 08:58:10 -07:00
Jonathan A. Sternberg	b58d1778e2	Remove improper newlines from logging statements	2017-01-10 11:20:09 -06:00
Mark Rushakoff	a135906b43	Merge pull request #7747 from influxdata/mr-lint-cleanup Miscellaneous lint cleanup	2017-01-10 08:22:00 -08:00
Mark Rushakoff	3b3604e362	Fix race in (*tsm1.Cache).values Without this read lock, this race would happen during a concurrent snapshot compaction and query.	2017-01-09 14:48:28 -08:00
Jonathan A. Sternberg	4a559c4620	Merge pull request #7646 from influxdata/js-4619-subqueries Support subquery execution in the query language	2017-01-09 14:14:01 -06:00
Jason Wilder	eb4d311c0a	Add retry/backup when backing up a shard fails The backup command can fail if a snapshot is running which silently closes the connection. This causes the backup shard command to continue on as if nothing failed.	2017-01-09 11:28:48 -07:00
Ben Johnson	64c7715243	Rebase fixes.	2017-01-09 10:10:12 -07:00
Jason Wilder	194c5adfaf	Fix race on t.refs Read at 0x00c42018f620 by goroutine 58: github.com/influxdata/influxdb/tsdb/engine/tsm1.(TSMReader).Close() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/reader.go:330 +0x94 github.com/influxdata/influxdb/tsdb/engine/tsm1.(FileStore).Close() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/file_store.go:464 +0x123 Previous write at 0x00c42018f620 by goroutine 63: sync/atomic.AddInt64() /usr/local/go/src/runtime/race_amd64.s:276 +0xb github.com/influxdata/influxdb/tsdb/engine/tsm1.(TSMReader).Unref() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/reader.go:352 +0x43 github.com/influxdata/influxdb/tsdb/engine/tsm1.(KeyCursor).Close()	2017-01-07 12:39:45 -07:00
Jonathan A. Sternberg	d7c8c7ca4f	Support subquery execution in the query language This adds query syntax support for subqueries and adds support to the query engine to execute queries on subqueries. Subqueries act as a source for another query. It is the equivalent of writing the results of a query to a temporary database, executing a query on that temporary database, and then deleting the database (except this is all performed in-memory). The syntax is like this: SELECT sum(derivative) FROM (SELECT derivative(mean(value)) FROM cpu GROUP BY *) This will execute derivative and then sum the result of those derivatives. Another example: SELECT max(min) FROM (SELECT min(value) FROM cpu GROUP BY host) This would let you find the maximum minimum value of each host. There is complete freedom to mix subqueries with auxiliary fields. The only caveat is that the following two queries: SELECT mean(value) FROM cpu SELECT mean(value) FROM (SELECT value FROM cpu) Have different performance characteristics. The first will calculate `mean(value)` at the shard level and will be faster, especially when it comes to clustered setups. The second will process the mean at the top level and will not include that optimization.	2017-01-07 13:00:48 -06:00
Mark Rushakoff	153277c01d	Merge pull request #7786 from influxdata/mr-cache-decrease-size Use one atomic operation in (*Cache).decreaseSize	2017-01-06 10:17:01 -08:00
Ben Johnson	2b3cd415e2	Fixing rebase.	2017-01-06 09:52:16 -07:00
Ben Johnson	d1f1e19591	Fixing rebase.	2017-01-06 09:31:25 -07:00
Ben Johnson	1003db0067	Add active log file tracking, time-based compaction.	2017-01-05 10:17:12 -07:00
Ben Johnson	c1c98223ec	Fix and optimize tsi1 FileSet.	2017-01-05 10:17:12 -07:00
Ben Johnson	31e74d809b	Add tsi FileSet.	2017-01-05 10:17:11 -07:00
Ben Johnson	dcd2a771b0	Optimizing tsi compaction.	2017-01-05 10:17:11 -07:00
Ben Johnson	1ce99e797f	Use series map in tsi1.LogFile.	2017-01-05 10:17:11 -07:00
Ben Johnson	9b1e8215e0	Remove dictionary encoding, add bulk series insertion.	2017-01-05 10:17:11 -07:00
Ben Johnson	5f7654173e	Add locking to sketch merge.	2017-01-05 10:17:11 -07:00
Ben Johnson	9bd19cdc69	Fix inmem DELETE SERIES.	2017-01-05 10:17:11 -07:00
Ben Johnson	f9efcb3365	Re-add shared in-memory index.	2017-01-05 10:17:09 -07:00
Edd Robinson	0f9b2bfe6a	Fix tests	2017-01-05 10:16:15 -07:00
Edd Robinson	4ccb8dbab1	Move series count check to shard	2017-01-05 10:16:13 -07:00
Edd Robinson	0cb74eedbf	Add log file (WAL) sketches	2017-01-05 10:15:38 -07:00
Edd Robinson	190c78c644	Add series sketches	2017-01-05 10:15:37 -07:00
Edd Robinson	695adafc00	Add measurement sketches	2017-01-05 10:15:37 -07:00
Ben Johnson	745b1973a8	tsi compaction	2017-01-05 10:15:37 -07:00
Ben Johnson	83e80f6d0b	Fix in-mem index integration tests.	2017-01-05 10:15:37 -07:00
Ben Johnson	183418dcbd	Fix tsi TAG KEYS iterator.	2017-01-05 10:15:36 -07:00
Ben Johnson	759ff4ab80	Add tsi1 term hash index.	2017-01-05 10:15:35 -07:00
Ben Johnson	75cfe244c4	Add series hash index.	2017-01-05 10:15:35 -07:00
Ben Johnson	9f8b206b51	Fix measurement system queries.	2017-01-05 10:15:34 -07:00
Ben Johnson	4aa78383d1	Fix tsi1 series deletion.	2017-01-05 10:14:48 -07:00
Ben Johnson	5965610de6	Refactoring tsi tombstoning.	2017-01-05 10:14:02 -07:00
Ben Johnson	e7940cc556	Add tsi1 series system iterator.	2017-01-05 10:14:00 -07:00
Ben Johnson	87f4e0ec0a	Add regex support in tsi1.	2017-01-05 10:12:29 -07:00
Ben Johnson	d13afa8f47	Iterator refactoring	2017-01-05 10:11:49 -07:00
Jason Wilder	1ba64f3610	Disable max-value-per-tag option temporarily This is too slow currently and causes all writes to timeout.	2017-01-05 10:11:47 -07:00
Jason Wilder	f0427d180e	Fix tsi index panics Hardcoded panics cause the server to crash in 10s due to stats collection.	2017-01-05 10:11:12 -07:00
Jason Wilder	4bf7b2bb19	Allow tsi to be enabled via config option	2017-01-05 10:11:12 -07:00
Jason Wilder	2b96c5d4d0	Set Tags on entry These were lost when reloading from the index. Fixes queries not returning any data.	2017-01-05 10:11:12 -07:00
Jason Wilder	a6490920fd	Fix reslicing indices The slicing was backwards causing the buffer to grow indefinitely and filling the disks on writes.	2017-01-05 10:11:12 -07:00
Jason Wilder	59864226b7	Add RWMutex to LogFile Fixes concurrent map access panic	2017-01-05 10:11:12 -07:00
Ben Johnson	fbe7f464ee	Improve insert performance.	2017-01-05 10:11:12 -07:00
Ben Johnson	bf89b94d17	Fix WalkTagKeys().	2017-01-05 10:11:11 -07:00
Ben Johnson	33412782ed	Fix go vet issue.	2017-01-05 10:11:10 -07:00
Ben Johnson	2b864c72c5	Refactor MeasurementBlockTrailer read/write.	2017-01-05 10:11:10 -07:00
Ben Johnson	cb93f10120	Remove per-shard in-memory index.	2017-01-05 10:11:09 -07:00
Ben Johnson	409b0165f5	shared in-memory index	2017-01-05 10:09:57 -07:00
Ben Johnson	a812502ea3	reintegrating in-memory index	2017-01-05 10:07:35 -07:00
Ben Johnson	1ac067e53b	intermediate	2017-01-05 10:03:09 -07:00
Ben Johnson	fda84955ea	Remove TODO	2017-01-05 10:02:42 -07:00
Ben Johnson	62d2b3ebe9	Series filtering.	2017-01-05 10:02:42 -07:00
Ben Johnson	62269c3cea	intermediate	2017-01-05 10:02:41 -07:00
Ben Johnson	5f5b02e052	intermediate	2017-01-05 10:01:49 -07:00
Ben Johnson	8863e3c0f3	Refactor tsi1 merge iterators, finish multi-file compaction.	2017-01-05 10:01:25 -07:00
Ben Johnson	e3af4b0dad	Refactor iterators.	2017-01-05 10:00:45 -07:00
Ben Johnson	ce9e3181a5	Refactor merge iterators.	2017-01-05 10:00:45 -07:00
Ben Johnson	0294e717a0	Add mm, tag key, tag value, & series iterators.	2017-01-05 10:00:44 -07:00
Ben Johnson	2bfafaed76	tsi1 log compaction	2017-01-05 10:00:44 -07:00
Ben Johnson	afce53e81b	Rebase fixes.	2017-01-05 10:00:44 -07:00
Ben Johnson	992e651588	Add tsi1.Log.	2017-01-05 10:00:44 -07:00
Ben Johnson	2a81351992	Implement tsdb.Index interface on tsi1.Index.	2017-01-05 10:00:43 -07:00
Edd Robinson	e2c3b52ca4	Adds a custom HyperLogLog++ implementation	2017-01-05 10:00:14 -07:00
Edd Robinson	da63b349a4	Fix bad rebase	2017-01-05 09:59:44 -07:00
Edd Robinson	ebc92ca04f	Fix overflow issues	2017-01-05 09:59:12 -07:00
Edd Robinson	149b1cef1d	Fix 32bit overflow; limit capacity	2017-01-05 09:59:10 -07:00
Edd Robinson	33623c1fa9	Revert back to original approach	2017-01-05 09:58:39 -07:00
Edd Robinson	9ed6040265	Tidy up	2017-01-05 09:58:37 -07:00
Edd Robinson	2a5c865b44	Use xxhash	2017-01-05 09:57:35 -07:00
Edd Robinson	2d9bd09784	Use []byte where possible in Index	2017-01-05 09:57:34 -07:00
Edd Robinson	3edbfb9197	Prevent panic when shard nil	2017-01-05 09:56:51 -07:00
Edd Robinson	3187cd4432	Cleanup series created stat	2017-01-05 09:56:49 -07:00
Edd Robinson	4b1ef68dc9	Move series and measurement stats to store	2017-01-05 09:54:05 -07:00
Edd Robinson	aaf85ae38d	Tombstoning with series cardinality part 1	2017-01-05 09:54:04 -07:00
Edd Robinson	bd8dd9a291	Sketches working	2017-01-05 09:54:04 -07:00
Edd Robinson	d19fbf5ab4	Wire in HLL estimator	2017-01-05 09:54:03 -07:00
Edd Robinson	2b8efefef4	Initial index interface	2017-01-05 09:51:43 -07:00
Edd Robinson	05bc4dec00	Refactor	2017-01-05 09:50:23 -07:00
Edd Robinson	c535e3899a	Remove in-memory index from Shard and Store	2017-01-05 09:47:09 -07:00
Edd Robinson	2171d9471b	Initialise index in shards	2017-01-05 09:42:48 -07:00
Ben Johnson	57d0556174	Fix 32-bit issues.	2017-01-05 09:34:37 -07:00
Ben Johnson	41f2babe66	Minor TSI index benchmark refactor	2017-01-05 09:34:37 -07:00
Ben Johnson	ac9c6a0207	Add TSI index benchmark.	2017-01-05 09:34:37 -07:00
Ben Johnson	8d40ceb00c	TSI1 Index	2017-01-05 09:34:36 -07:00
Ben Johnson	9b62df23d2	Add MeasurementBlock.	2017-01-05 09:34:36 -07:00
Ben Johnson	3240af07e0	Fix RHH packing.	2017-01-05 09:34:36 -07:00
Ben Johnson	e25d61e4bd	TagSet writer & reader.	2017-01-05 09:34:36 -07:00
Ben Johnson	4eeb81ef38	Add SeriesList tombstoning.	2017-01-05 09:34:36 -07:00
Ben Johnson	2c34b24f5c	Implemented SeriesList	2017-01-05 09:34:36 -07:00
Ben Johnson	6523675c20	Implemented RHH hash map.	2017-01-05 09:34:35 -07:00
Mark Rushakoff	6a94d200c8	Merge remote-tracking branch 'influx/master' into mr-godoc	2017-01-04 13:27:36 -08:00
Mark Rushakoff	89a587e865	Use one atomic operation in (Cache).decreaseSize The previous implementation was susceptible to a race condition (of correctness) since c.decreaseSize is called without a lock in (Cache).WriteMulti. There were already tests which asserted the correctness of the result of decreaseSize, so no tests were added or modified.	2017-01-04 13:13:31 -08:00
Cory LaNou	3c518f8927	panicing is bad -> error returns are good	2017-01-03 14:28:29 -06:00
Mark Rushakoff	07b87f2630	Miscellaneous lint cleanup	2017-01-03 09:47:32 -08:00
Mark Rushakoff	41415cf2fb	Update godoc for tsm1 package	2017-01-02 07:30:18 -08:00
Mark Rushakoff	4a774eb600	Update godoc for the tsdb package	2016-12-30 21:12:37 -08:00
Gustav Westling	26b33307ae	Resolved PR comments on test files	2016-12-30 11:42:38 +01:00
Gustav Westling	56d98325da	Removed ineffective assignments, and added checks for errors that previsouly was not checked	2016-12-29 20:26:15 +01:00
Jason Wilder	2468347ffb	Fix comment	2016-12-19 14:17:49 -07:00
Jason Wilder	326557e539	Fix race in partition.reset	2016-12-19 14:17:01 -07:00
Jason Wilder	e91e45d71c	Fix panic in cache benchmark	2016-12-19 14:17:01 -07:00
Jason Wilder	0b6b9ea1cb	Use atomics for cache.snapshotSize stat	2016-12-19 14:17:01 -07:00
Jason Wilder	637a67ea35	Reduce lock contention on measurementFields	2016-12-19 14:17:01 -07:00
Jason Wilder	b7c1e625b0	Move needSort tracking to Deduplicate This eliminates some *UnixNano() calls and also simplifies the cache logic so that it does not need to worry about whether entries are sorted.	2016-12-19 14:17:01 -07:00
Jason Wilder	dea87703cd	Reduce UnixNano pointer call	2016-12-19 14:17:01 -07:00
Mark Rushakoff	722b6345fe	Fix unchecked error in templated Read${TYPE}Block	2016-12-19 09:31:26 -08:00
Jonathan A. Sternberg	ec57108520	Use proper uber-go/zap import path It looks like the real import path to the project is go.uber.org/zap instead of github.com/uber-go/zap since the example in the project references that path.	2016-12-15 08:54:14 -06:00
Edd Robinson	ec27c57127	Further optimisations and a race fix	2016-12-14 18:23:36 +00:00
Edd Robinson	05ec6ad9ad	Add to index safely	2016-12-14 18:23:36 +00:00
Edd Robinson	d78ca1a0f3	Fix some races	2016-12-14 18:23:36 +00:00

... 3 4 5 6 7 ...

1798 Commits (1dfdc303771273bbc9ba39f325e1f79cce296aa0)