Previously, pseudo iterators could be created for metadata such as
series, measurement, and tag data. These iterators were created at a
higher level and lacked much of the power of the query engine.
This commit moves system iterators down to the series level and
supports the following:
- _name
- _seriesKey
- _tagKey
- _tagValue
- _fieldKey
These can be used as normal fields such as:
SELECT _seriesKey FROM cpu
This will return all the series keys for `cpu`.
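Following the same pattern, the other system fields can be queried the
same way (illustrative examples, not an exhaustive list):
    SELECT _tagKey FROM cpu
    SELECT _fieldKey FROM cpu
These would return the tag keys and field keys for `cpu`, respectively.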
* introduced UnsignedValue type
* leveraged existing int64 compression algorithms (RLE, Simple 8B)
* tsm and WAL can read and write UnsignedValue
* compaction is aware of UnsignedValue
* unsigned support added to models, cursors, and write points
NOTE: there is no support for creating unsigned points, as the line
protocol has not been modified.
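A minimal sketch of the reuse idea, assuming unsigned values are stored by
reinterpreting their bit pattern as int64 so the existing integer encoders
can be applied unchanged (illustrative, not the actual tsm1 encoder API):

    package sketch

    // Reinterpret a uint64 as an int64 so it can flow through the existing
    // int64 compression path (RLE / Simple 8B), and reverse the cast on
    // decode. The conversion preserves the bit pattern in both directions.
    func unsignedToInt64(u uint64) int64 { return int64(u) }

    func int64ToUnsigned(i int64) uint64 { return uint64(i) }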
The series key stored in TSM files includes the field. We validated
the series key length using only the measurement and tag set, which
allowed very large field names to overflow. The check now computes the
series key length as measurement + tag set + field + the TSM field key
separator size.
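A sketch of the stricter check (names are illustrative, not the actual
tsdb code):

    package sketch

    // seriesKeyTooLong reports whether the key as stored in a TSM file,
    // i.e. measurement + tag set + separator + field key, exceeds the
    // maximum key length, rather than checking measurement + tag set alone.
    func seriesKeyTooLong(measurementAndTags, fieldKey []byte, sepLen, maxKeyLen int) bool {
        return len(measurementAndTags)+sepLen+len(fieldKey) > maxKeyLen
    }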
The measurement name and field were converted between []byte and string
repeatedly, causing lots of garbage. This switches the code to use
[]byte in the write path.
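A small sketch of the pattern (hypothetical names): keep []byte through
the write path and only form strings where Go can do so without copying,
such as map lookups.

    package sketch

    // fieldRef keeps the measurement name and field key as []byte slices of
    // the parsed point instead of converting them to strings along the way.
    type fieldRef struct {
        Measurement []byte
        FieldKey    []byte
    }

    // lookup indexes a map with string(b) directly; the compiler recognizes
    // this pattern and performs the lookup without allocating a new string.
    func lookup(m map[string]int, b []byte) (int, bool) {
        v, ok := m[string(b)]
        return v, ok
    }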
The Point is intended to be immutable after being parsed since it
is shared by several goroutines. When dropping a field (e.g. time),
corrupted data can result if one goroutine is deleting the field
while another is marshaling the underlying byte slices.
To avoid this, the shard will just skip invalid fields and series
instead of trying to mutate them by deleting them.
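A sketch of the skip-instead-of-mutate approach (the iterator shape is an
assumption, not the exact models API):

    package sketch

    // pointFields is a stand-in for the point's field iterator.
    type pointFields interface {
        Next() bool
        FieldKey() []byte
    }

    // writeFields treats the shared Point as read-only: fields named "time"
    // are skipped during iteration instead of being deleted from the Point,
    // so concurrent marshaling never observes a half-mutated byte slice.
    func writeFields(it pointFields, store func(key []byte)) {
        for it.Next() {
            if string(it.FieldKey()) == "time" {
                continue
            }
            store(it.FieldKey())
        }
    }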
There was a check to ensure that fields exist when unmarshalBinary
is called. This created a map and other garbage just to see if any
fields exist.
This changes it to use a FieldIterator that does not allocate as
much as the previous approach.
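The idea in miniature (assumed iterator shape): advancing the iterator a
single step answers "does this point have any fields?" without building a
throwaway map.

    package sketch

    // hasFields reports whether at least one field is present by advancing
    // the field iterator once.
    func hasFields(it interface{ Next() bool }) bool {
        return it.Next()
    }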
If a field named time was written and subsequently dropped, it could
leave a trailing comma in the series key, causing it to be unparseable
in other parts of the code.
Previously, tags had a `shouldCopy` flag to indicate if those tags
referenced an underlying buffer and should be copied to allow GC.
Unfortunately, this prevented tags that referenced the mmap from being
copied, which caused segfaults.
This change removes the `shouldCopy` flag and replaces it with a
`forceCopy` argument in `CreateSeriesIfNotExists()`. This allows
the write path to indicate that tags must be cloned on insert.
This change delays Tag cloning until a new series is found, and will
only clone Tags acquired from `ParsePoints...` and not those referencing
the mmap-ed files (TSM) that are created on startup.
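A sketch of the forceCopy flow (simplified types and signature, not the
actual tsdb index API):

    package sketch

    type tags map[string]string

    func (t tags) clone() tags {
        c := make(tags, len(t))
        for k, v := range t {
            c[k] = v
        }
        return c
    }

    type index struct{ series map[string]tags }

    // createSeriesIfNotExists clones the tags only when the caller asks for
    // it (the write path passes forceCopy=true for tags parsed from request
    // bodies) and only when a new series is actually being created. Tags
    // that reference mmap-ed TSM data loaded at startup pass false and are
    // stored as-is.
    func (i *index) createSeriesIfNotExists(key string, t tags, forceCopy bool) {
        if _, ok := i.series[key]; ok {
            return
        }
        if forceCopy {
            t = t.clone()
        }
        i.series[key] = t
    }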
This leak appears to have been introduced in 8aa224b22d and is
present in 1.1.0 and 1.1.1.
When points were parsed from HTTP payloads, their tags and fields
referred to subslices of the request body; if any tag set introduced a
new series, those tags were then stored in the in-memory series
index objects, preventing the HTTP body from being garbage collected. If
there were no new series in the payload, then the request body would be
garbage collected as usual.
Now, we clone the tags before we store them in the index. This is an
imperfect fix because the Point still holds references to the original
tags, and the Point's field iterator also refers to the payload buffer.
However, the current write code path does not retain references to the
Point or its fields; and this change will likely be obsoleted when TSI
is introduced.
This change likely fixes #7827, #7810, #7778, and perhaps others.
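A sketch of why the subslices pinned the payload and how cloning detaches
them (simplified tag type, not the models API):

    package sketch

    // tag's Key and Value are slices into the HTTP request body's backing
    // array, so storing a tag in the series index keeps the entire body
    // reachable. Cloning copies only the bytes that are actually needed,
    // letting the body be garbage collected.
    type tag struct{ Key, Value []byte }

    func cloneTag(t tag) tag {
        return tag{
            Key:   append([]byte(nil), t.Key...),
            Value: append([]byte(nil), t.Value...),
        }
    }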
I haven't been able to reproduce creating a point without any fields,
but we've seen points in the wild that have been marshalled with no
fields - that is, the length header for fields is uint32(0) and a
well-formed encoded time follows.
Attempting to unmarshal points via NewPointFromBytes returns
ErrPointMustHaveAField, so it seems better to fail earlier with the same
error, rather than allowing those points to be serialized in the first
place.
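A sketch of failing earlier (the error variable here is illustrative; the
real code returns ErrPointMustHaveAField):

    package sketch

    import "errors"

    // errPointMustHaveAField stands in for the error mentioned above.
    var errPointMustHaveAField = errors.New("point must have at least one field")

    // checkFieldCount rejects a zero field count on the write/serialize side
    // instead of letting NewPointFromBytes discover it on the read side.
    func checkFieldCount(n uint32) error {
        if n == 0 {
            return errPointMustHaveAField
        }
        return nil
    }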
A string field with a trailing backslash before the closing quote would
parse incorrectly because the quote was seen as escaped. We have to treat
\\ as an escape sequence within strings in order to handle this.
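A sketch of the scanning rule (not the actual parser): both \" and \\ must
be consumed as escape sequences so a trailing backslash cannot swallow the
closing quote.

    package sketch

    // scanQuoted returns the index just past the closing quote of a quoted
    // string starting at buf[start] (which must be '"'). Because '\\' is
    // treated as an escape, a value such as "dir\\" closes correctly rather
    // than the final backslash appearing to escape the closing quote.
    func scanQuoted(buf []byte, start int) int {
        for i := start + 1; i < len(buf); i++ {
            switch buf[i] {
            case '\\':
                i++ // skip the escaped character, whatever it is
            case '"':
                return i + 1
            }
        }
        return len(buf) // unterminated string
    }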
The FieldIterator is used to scan over the fields of a point, exposing
information about each field while delaying parsing/decoding of the value
until it is needed.
This change uses the new type to avoid allocating a map of the fields
that is thrown away as soon as the points are converted into columns
within the datastore.
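A sketch of the map-free conversion (the iterator shape and field types
are assumptions, not the exact models.FieldIterator API):

    package sketch

    type fieldType int

    const (
        integerType fieldType = iota
        floatType
    )

    type fieldIterator interface {
        Next() bool
        FieldKey() []byte
        Type() fieldType
        IntegerValue() int64
        FloatValue() float64
    }

    // appendColumns reads each field lazily and appends it to the matching
    // column, instead of first building a map[string]interface{} that is
    // discarded as soon as the columns are built.
    func appendColumns(it fieldIterator, ints map[string][]int64, floats map[string][]float64) {
        for it.Next() {
            key := string(it.FieldKey())
            switch it.Type() {
            case integerType:
                ints[key] = append(ints[key], it.IntegerValue())
            case floatType:
                floats[key] = append(floats[key], it.FloatValue())
            }
        }
    }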
+ Remove a heap alloc in (Point).HashID() and (Row).tagsHash()
(According to `-gcflags -m`).
+ Direct port from the stdlib.
+ Fuzz test for equivalence to stdlib version.
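Presumably the hash in question is 64-bit FNV-1a (what `hash/fnv`
provides); a minimal allocation-free port looks roughly like this, using a
plain uint64 accumulator instead of a heap-allocated hash.Hash64:

    package sketch

    const (
        fnvOffset64 = 14695981039346656037
        fnvPrime64  = 1099511628211
    )

    // fnv64a hashes data with 64-bit FNV-1a without allocating.
    func fnv64a(data []byte) uint64 {
        h := uint64(fnvOffset64)
        for _, b := range data {
            h ^= uint64(b)
            h *= fnvPrime64
        }
        return h
    }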
+ Save one alloc per line when writing with the bulk protocol.
Over a longer period of writes, this allocation shows up quite
a bit in profiles since the slice needs to be resized frequently.
This change scans the input to count how many lines will be parsed in
order to pre-allocate the slice's capacity. It's slightly slower, but
creates less garbage in the long run.
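A sketch of the pre-allocation (simplified: the real code parses points,
not raw lines):

    package sketch

    import "bytes"

    // parseLines sizes the result slice up front by counting newlines, so
    // appending while parsing a large bulk-write body never has to grow and
    // re-copy the slice.
    func parseLines(buf []byte) [][]byte {
        lines := make([][]byte, 0, bytes.Count(buf, []byte{'\n'})+1)
        for len(buf) > 0 {
            nl := bytes.IndexByte(buf, '\n')
            var line []byte
            if nl < 0 {
                line, buf = buf, nil
            } else {
                line, buf = buf[:nl], buf[nl+1:]
            }
            if len(line) > 0 {
                lines = append(lines, line)
            }
        }
        return lines
    }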
The v2 UDP client will attempt to split points that exceed the
configured payload size. It will only do this for points that have a
timestamp specified.
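A sketch of the splitting rule (simplified, not the v2 client's actual
code): each split line keeps the measurement, tags, and the point's
explicit timestamp and carries a subset of the fields; without an explicit
timestamp the split halves could be stamped with different server times,
which is presumably why such points are not split.

    package sketch

    // splitFields breaks an oversized point into several line-protocol
    // lines, each under maxPayload bytes, all sharing the same prefix
    // (measurement and tags) and timestamp.
    func splitFields(prefix string, fields []string, timestamp string, maxPayload int) []string {
        var lines []string
        cur := ""
        for _, f := range fields {
            candidate := cur + "," + f
            if cur == "" {
                candidate = prefix + " " + f
            }
            if cur != "" && len(candidate)+1+len(timestamp) > maxPayload {
                lines = append(lines, cur+" "+timestamp)
                cur = prefix + " " + f
            } else {
                cur = candidate
            }
        }
        if cur != "" {
            lines = append(lines, cur+" "+timestamp)
        }
        return lines
    }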