influxdb

Commit Graph

Author	SHA1	Message	Date
David Norton	01d5805952	fix line protocol parsing panic	2018-08-09 10:56:34 -04:00
Jacob Marble	44c5da060b	buildtsi: Do not escape measurement names When `influx_inspect buildtsi` is used to create a new `tsi1` index, spaces in measurement names are escaped, so measurement "a b" is changed to "a\ b". This change modifies `models.ParseKeyBytes()` and `models.ParseName()` to unescape measurement names. `models.ParseKeyBytes()` returns unescaped tag keys, so this seems like the natural place to unescape measurement names. Also followed `scanMeasurement()` to see what other code could be problematic, and this should be everything (the result of one other use of `scanMeasurement()` is later escaped). Removed `tsdb.MeasurementFromSeriesKey()`. These methods are exported, so checked for side effects in other InfluxData repositories.	2018-05-30 15:20:56 -07:00
Edd Robinson	6b350edf59	Ensure correct number of tags parsed This commit fixes a parsing bug that was causing extra tags to be generated when the incoming point contained escaped commas. As an optimisation, the slice of tags associated with a point was being pre-allocated using the number of commas in the series key as a hint to the appropriate size. The hinting did not consider literal comma values in the key though, and so it was possible for extra (empty) tag key and value pairs to be part of the tags structure associated with a parsed point.	2018-03-14 13:28:55 +00:00
Stuart Carnie	4d93b04d33	MakeKey related benchmarks	2018-03-12 16:23:57 -07:00
Stuart Carnie	5dfe3b2645	inmem startup improvments * only call ParseTags when necessary * remove dependency on inmem.Series in tsdb test package * Measurement and Series are no longer exported. Their use is restricted to the inmem package * improve Measurement and Series types by exporting immutable fields and removing unnecessary APIs and locks Reduced startup time from 28s to 17s. Overall improvement including #9162 reduces startup from 46s to 17s for 1MM series across 14 shards.	2017-12-29 07:58:52 -07:00
Jason Wilder	ed246db55a	Fix panic: runtime error: slice bounds out of range Fixes #8538	2017-11-08 17:00:25 -07:00
Jonathan A. Sternberg	e415d0b10f	Merge pull request #8835 from influxdata/js-uint-write-protocol Add uint support into the write protocol	2017-10-09 10:55:16 -05:00
Jonathan A. Sternberg	2f47c3d28f	Add support for uint64 in the clients	2017-10-05 09:35:06 -05:00
Jonathan A. Sternberg	ff7b576389	Add uint support into the write protocol This is currently protected behind a conditional compilation flag. Use `-tags uint` or `-tags uint64` to enable this.	2017-09-19 10:44:26 -05:00
Joe LeGasse	815f740f4c	initial fga work wip wip fix tests / build	2017-05-26 13:16:27 -07:00
Jason Wilder	5372db6327	Fix point validation to include field key length The series key stored in TSM files includes the field. We validated the series length using only the measurement and tag set which allowed very large field names to overflow. This now checks the series key as the measurement + tagset + field + the tsm field key separator size.	2017-05-24 14:39:54 -06:00
Jason Wilder	2cac46ebbc	Convert usage of strings to []byte Measurement name and field were converted between []byte and string repetively causing lots of garbage. This switches the code to use []byte in the write path.	2017-05-12 14:05:19 -06:00
Stuart Carnie	2770a17095	Merge pull request #8207 from stuartcarnie/master Fixes issue #8199	2017-04-26 15:32:09 -07:00
Jason Wilder	5c51ae7319	Merge branch '1.2' into jw-merge-123	2017-04-14 14:36:54 -06:00
Jason Wilder	ff1270dfeb	Fix dropping fields created data corruption The Point is intended to be immutable after being parsed since it is shared by several goroutines. When dropping a field (e.g. time), corrupted data can result if one goroutine is delete the field while another is marshaling the underlying byte slices. To avoid this, the shard will just skip invalid fields and series instead of trying to mutate them by deleting them.	2017-04-07 12:58:42 -06:00
Jason Wilder	c3e0748bd9	Optimize Point.NewPointFromBytes There was a check to ensure that fields exists when unmarshalBinary is called. This created a map and other garbage just to see if any fields exist. This changes it to use a FieldIterator that does not allocate as much as the other method.	2017-04-06 12:51:45 -06:00
Jason Wilder	1a4b1b3109	Fix delete time fields creating unparseable points If a field was named time was written and was subsequently dropped, it could leave a trailing comma in the series key causing it to fail to be parseable in other parts of the code.	2017-04-04 16:37:51 -06:00
Jason Wilder	abaf42fbab	Merge pull request #8251 from influxdata/jw-time-delete Fix delete time fields creating unparseable points	2017-04-04 18:37:00 -04:00
Jason Wilder	ca55fff12c	Fix delete time fields creating unparseable points If a field was named time was written and was subsequently dropped, it could leave a trailing comma in the series key causing it to fail to be parseable in other parts of the code.	2017-04-04 16:19:11 -06:00
Stuart Carnie	9e69bdbef5	Fixes issue #8199 Benchmarks ``` benchmark old ns/op new ns/op delta BenchmarkMarshal-8 1216 1007 -17.19% benchmark old allocs new allocs delta BenchmarkMarshal-8 4 2 -50.00% benchmark old bytes new bytes delta BenchmarkMarshal-8 416 256 -38.46% ```	2017-03-25 13:33:36 -07:00
Ben Johnson	358b1e0b05	Merge remote-tracking branch 'upstream/master' into tsi	2017-03-15 10:13:32 -06:00
Ben Johnson	dffd12319c	Add point.UnmarshalBinary() bounds checking.	2017-03-01 12:01:25 -07:00
Edd Robinson	4fbba8234e	Add Size to models.Tags	2017-02-08 18:44:48 +00:00
Mark Rushakoff	cdbdd156f3	Fix memory leak of retained HTTP write payloads This leak seems to have been introduced in `8aa224b22d`, present in 1.1.0 and 1.1.1. When points were parsed from HTTP payloads, their tags and fields referred to subslices of the request body; if any tag set introduced a new series, then those tags then were stored in the in-memory series index objects, preventing the HTTP body from being garbage collected. If there were no new series in the payload, then the request body would be garbage collected as usual. Now, we clone the tags before we store them in the index. This is an imperfect fix because the Point still holds references to the original tags, and the Point's field iterator also refers to the payload buffer. However, the current write code path does not retain references to the Point or its fields; and this change will likely be obsoleted when TSI is introduced. This change likely fixes #7827, #7810, #7778, and perhaps others.	2017-01-12 16:16:54 -08:00
Cory LaNou	3c518f8927	panicing is bad -> error returns are good	2017-01-03 14:28:29 -06:00
kun	de4436e9d9	fix scan tag value panic	2016-12-20 08:39:18 +08:00
Mark Rushakoff	15ba7958f8	Use strings.Replacer to escape string field benchmark old ns/op new ns/op delta BenchmarkEscapeStringField_Plain-4 167 65.3 -60.90% BenchmarkEscapeString_Quotes-4 167 165 -1.20% BenchmarkEscapeString_Backslashes-4 211 184 -12.80% BenchmarkEscapeString_QuotesAndBackslashes-4 413 397 -3.87% BenchmarkExportTSMStrings_100s_250vps-4 33833611 27381442 -19.07% BenchmarkExportWALStrings_100s_250vps-4 34977761 29222717 -16.45% benchmark old allocs new allocs delta BenchmarkEscapeStringField_Plain-4 4 1 -75.00% BenchmarkEscapeString_Quotes-4 4 3 -25.00% BenchmarkEscapeString_Backslashes-4 5 3 -40.00% BenchmarkEscapeString_QuotesAndBackslashes-4 9 5 -44.44% BenchmarkExportTSMStrings_100s_250vps-4 201605 76938 -61.84% BenchmarkExportWALStrings_100s_250vps-4 225371 100728 -55.31% benchmark old bytes new bytes delta BenchmarkEscapeStringField_Plain-4 56 16 -71.43% BenchmarkEscapeString_Quotes-4 56 48 -14.29% BenchmarkEscapeString_Backslashes-4 104 80 -23.08% BenchmarkEscapeString_QuotesAndBackslashes-4 208 160 -23.08% BenchmarkExportTSMStrings_100s_250vps-4 10872629 6062048 -44.24% BenchmarkExportWALStrings_100s_250vps-4 10094933 5269980 -47.80%	2016-12-17 23:46:58 -08:00
Jason Wilder	2f776ea9e1	Fix string fields w/ trailing slashes A string field w/ a trailing slash before the quote would parse incorrectly because the quote would be seen as escaped. We have to treat \\ as an escape sequence within strings in order to handle this.	2016-12-01 15:24:11 -07:00
Joe LeGasse	743946fafb	models: Add FieldIterator type The FieldIterator is used to scan over the fields of a point, providing information, and delaying parsing/decoding the value until it is needed. This change uses this new type to avoid the allocation of a map for the fields which is then thrown away as soon as the points get converted into columns within the datastore.	2016-10-03 16:30:21 -06:00
Jason Wilder	ac9a7d520b	Pre-allocated Points slice when parsing points Over a longer period of writes, this allocation shows up quite a bit in profiles since the slice needs to be resized frequently. This scans the slice to count how many lines are going to be parsed in order to pre-allocate the slice capacity. It's slightly slower, but creates less garbage in the long run.	2016-09-26 12:19:15 -06:00
Cory LaNou	4a2d23da4a	fix failing vet issue in test	2016-09-26 11:29:02 -05:00
Joe LeGasse	0d2b339d7c	models: Added AppendString, PointSize, and Round to Point This change also updates the UDP client to take advantage of these improvements, as well as some code review changes.	2016-09-23 13:22:30 -04:00
Joe LeGasse	ee6816756a	udp client: large points will now be split, if possible The v2 UDP client will attempt to split points that exceed the configured payload size. It will only do this for points that have a timestamp specified.	2016-09-23 13:22:30 -04:00
Jonathan A. Sternberg	dc2527ce86	Merge branch '1.0'	2016-08-31 14:45:57 -05:00
Jonathan A. Sternberg	0d63889847	Allow blank lines in the line protocol input	2016-08-30 09:25:55 -05:00
Jonathan A. Sternberg	8b234546a8	Merge pull request #7204 from influxdata/1.0 Merge 1.0 branch to master	2016-08-25 15:20:30 -05:00
Jonathan A. Sternberg	10029caf2f	Support negative timestamps in the query engine Negative timestamps are now supported. We also now refuse two nanoseconds that are at the edge of the minimum time window. One of the nanoseconds we do not accept is because we need MinInt64 to be used for some internal comparisons in the TSM engine and it was causing an underflow when we subtracted one from the minimum time. The second is so we can have one minimum time that signifies the default minimum that nobody can write to (so we can implicitly rewrite the timestamp on aggregate queries) but still use the explicit timestamp if it is given to us by the user. We aren't able to tell the difference between if the user provided it or if it was implicit without those values being different. If the default minimum time is used with an aggregate query, we rewrite the time to be the epoch for backwards compatibility since we believe that's more important than supporting that extra nanosecond.	2016-08-25 12:52:41 -05:00
Ben Johnson	8aa224b22d	reduce memory allocations in index This commit changes the index to point to index data in the shards instead of keeping it in-memory on the heap.	2016-08-16 14:09:00 -06:00
Jason Wilder	d432aaa84d	Fix panic with parsing empty key Fixes #6990	2016-07-28 18:38:17 -06:00
Jonathan A. Sternberg	8e1b036b0a	Modify the max nanosecond time to be one nanosecond less The highest time represented by a nanosecond needs to be used for an exclusive range, so the maximum time needs to be one less than the possible maximum number of nanoseconds representable by an int64 so that we don't lose a point at that one time. Previously worked in the open source version because the timestamp used for finding a shard would be truncated by the retention policy so the lookup time didn't run into this edge case because it didn't rest on the truncation boundary. Since that point didn't really belong in that shard group and was placed there by mistake, it's best to fix this bug since the timestamp used to create the shard group should be capable of retrieving it.	2016-06-16 12:15:41 -05:00
Jonathan A. Sternberg	3bd9425edb	Fix the point validation parser to identify and sort tags correctly Fixes #6771.	2016-06-13 09:45:10 -05:00
Jason Wilder	97ad5fd2e6	Add ParseKey benchmark	2016-05-27 10:30:08 -06:00
Edd Robinson	f7c7b89b65	Merge pull request #6675 from influxdata/er-future-meta Ensure meta SHOW queries include future points	2016-05-27 13:02:23 +01:00
Edd Robinson	f4fc905fa9	Reject timestamps too far in future	2016-05-27 11:07:48 +01:00
Edd Robinson	46489a5195	Fix vet issue	2016-05-27 10:31:20 +01:00
Edd Robinson	39f3480f28	Ensure points with trailing whitespace are accepted	2016-05-26 19:00:24 +01:00
Jason Wilder	aa842fd38f	Return error if creating a point would exceed max key length	2016-03-30 23:57:41 -06:00
Joe LeGasse	24bcf46213	Update number scanning edge cases This should fix #5965, and other issues that result from submitting malformed numbers with points	2016-03-14 16:48:39 -04:00
Jon Seymour	d46e0407a0	Merge #5716 RHS merges cleanly with 0.10.0 maintenance branch. Signed-off-by: Jon Seymour <jon@wildducktheories.com>	2016-02-20 22:24:03 +11:00
Jon Seymour	9491846047	models: improve handling of points with empty field names or with no fields Influx does not support fields with empty names or points with no fields. NewPoint is changed to validate that all field names are non-empty. AddField is removed because we now require that all fields are specified on construction. NewPointFromByte is changed to return an error if a unmarshaled binary point does not have any fields. newFieldsFromBinary is changed to prevent an infinite loop that can arise while attempting to parse corrupt binary point data. TestNewPointsWithBytesWithCorruptData is changed to reflect the change in the behaviour of NewPointFromByte. Signed-off-by: Jon Seymour <jon@wildducktheories.com>	2016-02-20 22:22:26 +11:00

1 2

78 Commits (ca07a38402a9b49f89bd2de4b2b09fed595eb5ee)