influxdb

Commit Graph

Author	SHA1	Message	Date
Jacob Marble	44c5da060b	buildtsi: Do not escape measurement names When `influx_inspect buildtsi` is used to create a new `tsi1` index, spaces in measurement names are escaped, so measurement "a b" is changed to "a\ b". This change modifies `models.ParseKeyBytes()` and `models.ParseName()` to unescape measurement names. `models.ParseKeyBytes()` returns unescaped tag keys, so this seems like the natural place to unescape measurement names. Also followed `scanMeasurement()` to see what other code could be problematic, and this should be everything (the result of one other use of `scanMeasurement()` is later escaped). Removed `tsdb.MeasurementFromSeriesKey()`. These methods are exported, so checked for side effects in other InfluxData repositories.	2018-05-30 15:20:56 -07:00
Ben Johnson	4cb36d5301	Add key sanitization to deletetsm This commit adds the `-sanitize` flag to `influx_inspect deletetsm` which will delete all keys that contain invalid, non-printable, or replacement character unicode. Usage: ```sh $ influx_inspect deletetsm -sanitize PATH ```	2018-04-30 13:06:33 -06:00
Edd Robinson	a446134d6f	Merge pull request #8034 from dsouzae/br_validate_points Validate points on input	2018-04-26 10:42:12 +01:00
Edd Robinson	6b350edf59	Ensure correct number of tags parsed This commit fixes a parsing bug that was causing extra tags to be generated when the incoming point contained escaped commas. As an optimisation, the slice of tags associated with a point was being pre-allocated using the number of commas in the series key as a hint to the appropriate size. The hinting did not consider literal comma values in the key though, and so it was possible for extra (empty) tag key and value pairs to be part of the tags structure associated with a parsed point.	2018-03-14 13:28:55 +00:00
Stuart Carnie	d4282317f3	Add ForEachTag API	2018-03-12 16:23:57 -07:00
Stuart Carnie	ceefa7f7b0	Improve MakeKey, HashKey performance	2018-03-12 16:23:57 -07:00
Stuart Carnie	4d93b04d33	MakeKey related benchmarks	2018-03-12 16:23:57 -07:00
Edd Robinson	67d1fa3972	Cleanup remaining packages	2018-01-21 12:08:25 -08:00
Stuart Carnie	5dfe3b2645	inmem startup improvments * only call ParseTags when necessary * remove dependency on inmem.Series in tsdb test package * Measurement and Series are no longer exported. Their use is restricted to the inmem package * improve Measurement and Series types by exporting immutable fields and removing unnecessary APIs and locks Reduced startup time from 28s to 17s. Overall improvement including #9162 reduces startup from 46s to 17s for 1MM series across 14 shards.	2017-12-29 07:58:52 -07:00
Jason Wilder	de5592cda9	Extract ParseKeyBytes from ParseKey Allows callers to use []byte and avoid a string allocation	2017-11-13 08:48:03 -07:00
Jason Wilder	ed246db55a	Fix panic: runtime error: slice bounds out of range Fixes #8538	2017-11-08 17:00:25 -07:00
Jonathan A. Sternberg	e415d0b10f	Merge pull request #8835 from influxdata/js-uint-write-protocol Add uint support into the write protocol	2017-10-09 10:55:16 -05:00
Jonathan A. Sternberg	2f47c3d28f	Add support for uint64 in the clients	2017-10-05 09:35:06 -05:00
Jonathan A. Sternberg	ff7b576389	Add uint support into the write protocol This is currently protected behind a conditional compilation flag. Use `-tags uint` or `-tags uint64` to enable this.	2017-09-19 10:44:26 -05:00
Jason Wilder	a8d9eeef36	Reduce lock contention when deleting high cardinality series Deleting high cardinality series could take a very long time, cause write timeouts as well as dead lock the process. This fixes these issue to by changing the approach for cleaning up the indexes and reducing lock contention. The prior approach delete each series and updated every index (inmem) during the delete. This was very slow and cause the index to be locked while it items in a slice were removed one by one. This has been changed to mark series as deleted and then rebuild the index asynchronously which speeds up the process. There was also a dead lock that could occur when deleing the field set. Deleting the field set held a write lock and the function it invoked under the lock could try to take a read lock on the field set. This would then deadlock. This approach was also very slow and caused time out for writes. It now uses faster approach that checks for the existing of the measurment in the cache and filestore which does not take write locks.	2017-09-07 11:36:02 -06:00
Ben Johnson	60ab1282ea	Refactor system iterators. Previously pseudo iterators could be created for meta data such as series, measurement, and tag data. These iterators were created at a higher level and lacked a lot of the power of the query engine. This commit moves system iterators down to the series level and supports the following: - _name - _seriesKey - _tagKey - _tagValue - _fieldKey These can be used as normal fields such as: SELECT _seriesKey FROM cpu This will return all the series keys for `cpu`.	2017-08-16 09:27:29 -06:00
Stuart Carnie	eec80692c4	Taught tsm1 storage engine how to read and write uint64 values * introduced UnsignedValue type * leveraged existing int64 compression algorithms (RLE, Simple 8B) * tsm and WAL can read and write UnsignedValue * compaction is aware of UnsignedValue * unsigned support to model, cursors and write points NOTE: there is no support to create unsigned points, as the line protocol has not been modified.	2017-07-24 09:03:22 -07:00
lrita	72fcf6283e	optimize point split, which reduce unnecessary allocate	2017-06-15 16:28:49 +08:00
Joe LeGasse	815f740f4c	initial fga work wip wip fix tests / build	2017-05-26 13:16:27 -07:00
Jason Wilder	5372db6327	Fix point validation to include field key length The series key stored in TSM files includes the field. We validated the series length using only the measurement and tag set which allowed very large field names to overflow. This now checks the series key as the measurement + tagset + field + the tsm field key separator size.	2017-05-24 14:39:54 -06:00
Jason Wilder	2cac46ebbc	Convert usage of strings to []byte Measurement name and field were converted between []byte and string repetively causing lots of garbage. This switches the code to use []byte in the write path.	2017-05-12 14:05:19 -06:00
Stuart Carnie	2770a17095	Merge pull request #8207 from stuartcarnie/master Fixes issue #8199	2017-04-26 15:32:09 -07:00
Jason Wilder	5c51ae7319	Merge branch '1.2' into jw-merge-123	2017-04-14 14:36:54 -06:00
Jason Wilder	ff1270dfeb	Fix dropping fields created data corruption The Point is intended to be immutable after being parsed since it is shared by several goroutines. When dropping a field (e.g. time), corrupted data can result if one goroutine is delete the field while another is marshaling the underlying byte slices. To avoid this, the shard will just skip invalid fields and series instead of trying to mutate them by deleting them.	2017-04-07 12:58:42 -06:00
Jason Wilder	c3e0748bd9	Optimize Point.NewPointFromBytes There was a check to ensure that fields exists when unmarshalBinary is called. This created a map and other garbage just to see if any fields exist. This changes it to use a FieldIterator that does not allocate as much as the other method.	2017-04-06 12:51:45 -06:00
Jason Wilder	1a4b1b3109	Fix delete time fields creating unparseable points If a field was named time was written and was subsequently dropped, it could leave a trailing comma in the series key causing it to fail to be parseable in other parts of the code.	2017-04-04 16:37:51 -06:00
Jason Wilder	abaf42fbab	Merge pull request #8251 from influxdata/jw-time-delete Fix delete time fields creating unparseable points	2017-04-04 18:37:00 -04:00
Jason Wilder	ca55fff12c	Fix delete time fields creating unparseable points If a field was named time was written and was subsequently dropped, it could leave a trailing comma in the series key causing it to fail to be parseable in other parts of the code.	2017-04-04 16:19:11 -06:00
Stuart Carnie	9e69bdbef5	Fixes issue #8199 Benchmarks ``` benchmark old ns/op new ns/op delta BenchmarkMarshal-8 1216 1007 -17.19% benchmark old allocs new allocs delta BenchmarkMarshal-8 4 2 -50.00% benchmark old bytes new bytes delta BenchmarkMarshal-8 416 256 -38.46% ```	2017-03-25 13:33:36 -07:00
Ben Johnson	358b1e0b05	Merge remote-tracking branch 'upstream/master' into tsi	2017-03-15 10:13:32 -06:00
Jason Wilder	675d7c9d65	Merge branch '1.2' into jw-merge12	2017-03-06 11:09:05 -07:00
Ben Johnson	dffd12319c	Add point.UnmarshalBinary() bounds checking.	2017-03-01 12:01:25 -07:00
Jason Wilder	a024003f2c	Merge branch '1.2' into jw-merge-12	2017-02-22 12:13:29 -07:00
Ben Johnson	78a9bb2527	Remove Tags.shouldCopy, replace with forceCopy on series creation. Previously, tags had a `shouldCopy` flag to indicate if those tags referenced an underlying buffer and should be copied to allow GC. Unfortunately, this prevented tags from being copied that were created and referenced the mmap which caused segfaults. This change removes the `shouldCopy` flag and replaces it with a `forceCopy` argument in `CreateSeriesIfNotExists()`. This allows the write path to indicate that tags must be cloned on insert.	2017-02-21 11:13:35 -07:00
Ellery D'Souza	95c5ddc940	Error on unsupported +/- Inf value, validate value client side in NewPoint. influxdb will give the following errors: value=+Inf - invalid boolean value=-Inf - invalid number	2017-02-20 13:00:43 -05:00
Edd Robinson	4fbba8234e	Add Size to models.Tags	2017-02-08 18:44:48 +00:00
Ben Johnson	047c21f4d9	Merge remote-tracking branch 'upstream/master' into tsi	2017-01-24 09:28:58 -07:00
Edd Robinson	fb7388cdfc	Remove dead code from various pkgs	2017-01-17 09:47:34 -08:00
Joe LeGasse	cd00085e9e	Adjust Tags cloning This change delays Tag cloning until a new series is found, and will only clone Tags acquired from `ParsePoints...` and not those referencing the mmap-ed files (TSM) that are created on startup.	2017-01-13 13:15:36 -05:00
Mark Rushakoff	cdbdd156f3	Fix memory leak of retained HTTP write payloads This leak seems to have been introduced in `8aa224b22d`, present in 1.1.0 and 1.1.1. When points were parsed from HTTP payloads, their tags and fields referred to subslices of the request body; if any tag set introduced a new series, then those tags then were stored in the in-memory series index objects, preventing the HTTP body from being garbage collected. If there were no new series in the payload, then the request body would be garbage collected as usual. Now, we clone the tags before we store them in the index. This is an imperfect fix because the Point still holds references to the original tags, and the Point's field iterator also refers to the payload buffer. However, the current write code path does not retain references to the Point or its fields; and this change will likely be obsoleted when TSI is introduced. This change likely fixes #7827, #7810, #7778, and perhaps others.	2017-01-12 16:16:54 -08:00
Mark Rushakoff	a135906b43	Merge pull request #7747 from influxdata/mr-lint-cleanup Miscellaneous lint cleanup	2017-01-10 08:22:00 -08:00
Ben Johnson	62d2b3ebe9	Series filtering.	2017-01-05 10:02:42 -07:00
Ben Johnson	62269c3cea	intermediate	2017-01-05 10:02:41 -07:00
Ben Johnson	8863e3c0f3	Refactor tsi1 merge iterators, finish multi-file compaction.	2017-01-05 10:01:25 -07:00
Ben Johnson	2a81351992	Implement tsdb.Index interface on tsi1.Index.	2017-01-05 10:00:43 -07:00
Ben Johnson	ac9c6a0207	Add TSI index benchmark.	2017-01-05 09:34:37 -07:00
Ben Johnson	8d40ceb00c	TSI1 Index	2017-01-05 09:34:36 -07:00
Mark Rushakoff	6a94d200c8	Merge remote-tracking branch 'influx/master' into mr-godoc	2017-01-04 13:27:36 -08:00
Mark Rushakoff	4aedd29b02	Merge pull request #7788 from influxdata/mr-point-data-methods Remove unused methods from Point: Data, SetData	2017-01-04 13:15:38 -08:00
Mark Rushakoff	3e473dd262	Remove unused methods from Point: Data, SetData These are not called anywhere in the TICK stack that I can see.	2017-01-03 16:11:40 -08:00

1 2 3

149 Commits (e4e087d1233a4212aa391c3f57914c0959629ce8)