influxdb

Commit Graph

Author	SHA1	Message	Date
Edd Robinson	aa7095be5a	Use a merge-based approach for TagValues	2017-08-02 14:10:52 +01:00
Jason Wilder	94a48774b7	Pull in new index filter	2017-08-02 14:10:52 +01:00
Stuart Carnie	46796d932f	add database to index, engine and shard; call AuthorizeSeriesRead	2017-05-26 13:21:50 -07:00
Jason Wilder	2cac46ebbc	Convert usage of strings to []byte Measurement name and field were converted between []byte and string repetively causing lots of garbage. This switches the code to use []byte in the write path.	2017-05-12 14:05:19 -06:00
Jason Wilder	88848a9426	Remove per shard monitor goroutine The monitor goroutine ran for each shard and updated disk stats as well as logged cardinality warnings. This goroutine has been removed by making the disks stats more lightweight and callable direclty from Statisics and move the logging to the tsdb.Store. The latter allows one goroutine to handle all shards.	2017-05-03 16:31:57 -06:00
Jason Wilder	f87fd7c7ed	Stop background compaction goroutines when shard is cold Each shard has a number of goroutines for compacting different levels of TSM files. When a shard goes cold and is fully compacted, these goroutines are still running. This change will stop background shard goroutines when the shard goes cold and start them back up if new writes arrive.	2017-05-03 16:31:57 -06:00
Jason Wilder	8fc9853ed8	Add max-concurrent-compactions limit This limit allows the number of concurrent level and full compactions to be throttled. Snapshot compactions are not affected by this limit as then need to run continously. This limit can be used to control how much CPU is consumed by compactions. The default is to limit to the number of CPU available.	2017-05-03 16:31:57 -06:00
Jason Wilder	a76146e34a	Add Store.Import capability This allows the contents of a backup to be imported into a shard without requiring the whole shard to be replaced.	2017-04-28 13:30:46 -06:00
Ben Johnson	9fb8f1ec1d	Fix database and tag limits.	2017-03-24 09:48:10 -06:00
Ben Johnson	358b1e0b05	Merge remote-tracking branch 'upstream/master' into tsi	2017-03-15 10:13:32 -06:00
Jonathan A. Sternberg	2fe48d6781	Rename zap import back to github.com/uber-go/zap They rebased a revision we were previously relying upon that allowed us to use the vanity name so we are reverting back to an older version with the old import path.	2017-02-17 17:17:22 -06:00
Ben Johnson	047c21f4d9	Merge remote-tracking branch 'upstream/master' into tsi	2017-01-24 09:28:58 -07:00
Jonathan A. Sternberg	d7c8c7ca4f	Support subquery execution in the query language This adds query syntax support for subqueries and adds support to the query engine to execute queries on subqueries. Subqueries act as a source for another query. It is the equivalent of writing the results of a query to a temporary database, executing a query on that temporary database, and then deleting the database (except this is all performed in-memory). The syntax is like this: SELECT sum(derivative) FROM (SELECT derivative(mean(value)) FROM cpu GROUP BY *) This will execute derivative and then sum the result of those derivatives. Another example: SELECT max(min) FROM (SELECT min(value) FROM cpu GROUP BY host) This would let you find the maximum minimum value of each host. There is complete freedom to mix subqueries with auxiliary fields. The only caveat is that the following two queries: SELECT mean(value) FROM cpu SELECT mean(value) FROM (SELECT value FROM cpu) Have different performance characteristics. The first will calculate `mean(value)` at the shard level and will be faster, especially when it comes to clustered setups. The second will process the mean at the top level and will not include that optimization.	2017-01-07 13:00:48 -06:00
Ben Johnson	c1c98223ec	Fix and optimize tsi1 FileSet.	2017-01-05 10:17:12 -07:00
Ben Johnson	9b1e8215e0	Remove dictionary encoding, add bulk series insertion.	2017-01-05 10:17:11 -07:00
Ben Johnson	f9efcb3365	Re-add shared in-memory index.	2017-01-05 10:17:09 -07:00
Edd Robinson	0f9b2bfe6a	Fix tests	2017-01-05 10:16:15 -07:00
Edd Robinson	4ccb8dbab1	Move series count check to shard	2017-01-05 10:16:13 -07:00
Ben Johnson	745b1973a8	tsi compaction	2017-01-05 10:15:37 -07:00
Ben Johnson	183418dcbd	Fix tsi TAG KEYS iterator.	2017-01-05 10:15:36 -07:00
Ben Johnson	9f8b206b51	Fix measurement system queries.	2017-01-05 10:15:34 -07:00
Ben Johnson	4aa78383d1	Fix tsi1 series deletion.	2017-01-05 10:14:48 -07:00
Ben Johnson	e7940cc556	Add tsi1 series system iterator.	2017-01-05 10:14:00 -07:00
Ben Johnson	fbe7f464ee	Improve insert performance.	2017-01-05 10:11:12 -07:00
Ben Johnson	409b0165f5	shared in-memory index	2017-01-05 10:09:57 -07:00
Ben Johnson	a812502ea3	reintegrating in-memory index	2017-01-05 10:07:35 -07:00
Ben Johnson	62d2b3ebe9	Series filtering.	2017-01-05 10:02:42 -07:00
Ben Johnson	62269c3cea	intermediate	2017-01-05 10:02:41 -07:00
Edd Robinson	9ed6040265	Tidy up	2017-01-05 09:58:37 -07:00
Edd Robinson	2d9bd09784	Use []byte where possible in Index	2017-01-05 09:57:34 -07:00
Edd Robinson	bd8dd9a291	Sketches working	2017-01-05 09:54:04 -07:00
Edd Robinson	d19fbf5ab4	Wire in HLL estimator	2017-01-05 09:54:03 -07:00
Edd Robinson	05bc4dec00	Refactor	2017-01-05 09:50:23 -07:00
Edd Robinson	c535e3899a	Remove in-memory index from Shard and Store	2017-01-05 09:47:09 -07:00
Jonathan A. Sternberg	ec57108520	Use proper uber-go/zap import path It looks like the real import path to the project is go.uber.org/zap instead of github.com/uber-go/zap since the example in the project references that path.	2016-12-15 08:54:14 -06:00
Jonathan A. Sternberg	21502a39e8	Switch logging to use structured logging everywhere The logging library has been switched to use uber-go/zap. While the logging has been changed to use structured logging, this commit does not change any of the logging statements to take advantage of the new structured log or new log levels. Those changes will come in future commits.	2016-12-14 10:45:15 -06:00
Jason Wilder	e8a28cfbab	Expose Shard.LastModified This returns the LastModified time of the shard. The LastModified time is the wall time when a change to the shards state occurred. It uses the WAL or FileStore to determine the max mod time.	2016-11-23 10:04:07 -07:00
Jonathan A. Sternberg	3681bc8a43	Filter out series within shards that do not have data for that series Previously, we would return a full tag set for every shard and the tag set would include all series that existed in the database index including series that didn't physically exist within that shard. This led to the tag sets returned being incredibly huge when we had high cardinality but sparse data. Since the data was sparse, it was unexpected that it would cause such a large strain on the system by most people. Now we filter out the series ids that are not assigned to the current shard when computing a tag set for that shard. This lowers the memory usage for high cardinality sparse data drastically and allows queries on those to complete successfully. This does not resolve issues for high cardinality data in every shard that is also spread out over a long series of time. That situation isn't nearly as common as the above situation though.	2016-10-20 14:15:34 -05:00
Jonathan A. Sternberg	837a9804cf	Refactoring the monitor service to avoid expvar Truncate the time interval output of the monitor service to be on even time intervals rather than on every minute based on the start time. This normalizes the output from the monitor service.	2016-07-07 11:13:58 -05:00
Jonathan A. Sternberg	497db2a6d3	Removing dead code from every package except influxql The tsdb package had a substantial amount of dead code related to the old query engine still in there. It is no longer used, so it was removed since it was left unmaintained. There is likely still more code that is the same, but wasn't found as part of this code cleanup. influxql has dead code show up because of the code generation so it is not included in this pruning.	2016-06-20 22:41:07 -05:00
Jason Wilder	1ff8ecf4fb	Add ability to disable shards Disabling a shard causes all writes and queries to a shard to return an error. This also disables compactions for the shard.	2016-05-31 10:51:54 -06:00
Jonathan A. Sternberg	23f6a706bb	Support cast syntax for selecting a specific type Casting syntax is done with the PostgreSQL syntax `field1::float` to specify which type should be used when selecting a field. You can also do `field1::field` or `tag1::tag` to specify that a field or tag should be selected. This makes it possible to select a tag when a field key and a tag key conflict with each other in a measurement. It also means it's possible to choose a field with a specific type if multiple shards disagree. If no types are given, the same ordering for how a type is chosen is used to determine which type to return. The FieldDimensions method has been updated to return the data type for the fields that get returned. The SeriesKeys function has also been removed since it is no longer needed. SeriesKeys was originally used for the fill iterator, but then expanded to be used by auxiliary iterators for determining the channel iterator types. The fill iterator doesn't need it anymore and the auxiliary types are better served by FieldDimensions implementing that functionality, so SeriesKeys is no longer needed. Fixes #6519.	2016-05-16 12:08:29 -04:00
Cory LaNou	f415cf89ad	wip	2016-05-10 11:01:03 -05:00
Cory LaNou	a3bf3e2ef1	added baseline backup/restore plumbing	2016-05-10 08:14:51 -05:00
Jason Wilder	e0304ae3d5	Fix shards not getting assigned to series on restart Also, simplifies the LoadMetaDataIndex func to not require a *Shard	2016-05-02 11:36:05 -06:00
Jason Wilder	abcb559b09	Remove index meta data when series and measurements are gone This remove the dropMeta param from the tsdb.Store.DeleteSeries and lets the shard determine when to remove the meta data from the index based on what series still have data in the shard. This uncovered a nasty bug in compactions where a fully deleted series would prematurely end the compactions and not carry forward the rest of the data in the TSM file. This is now fixed as well.	2016-04-29 16:31:57 -06:00
Jason Wilder	aefd2ad08b	Add DeleteSeries and DeleteSeriesRange	2016-04-27 13:09:53 -06:00
Jason Wilder	d13d01b516	Allow deleting series by time on a shard	2016-04-27 13:09:53 -06:00
Jason Wilder	3f4c5a5585	Fix race on measurementFields Both Shard and Engine had the same reference to the measurementField map, but they each protected it with their own locks. This causes a race when write and queries are occurring because writes can add new fields to the map while queries are reading from it. The fix moves the ownership to the Engine and provides protected accessors to that Shard now users. For the most parts, the access on shard were old dead code. Fixing the measurementFields map race created a new race on the internal fields map. This is now unexported and protected via MeasurementFields exported funcs. Fixes #6188	2016-04-01 18:57:01 -06:00
Mark Rushakoff	7a2adfcc5d	Remove unused WAL configuration variables/fields These were all b1/bz1 settings that no longer have any effect: - {Default,}MaxWALSize - {Default,}WALFlushInterval - {Default,}WALPartitionFlushDelay - {Default,WAL}ReadySeriesSize - {Default,WAL}CompactionThreshold - {Default,WAL}MaxSeriesSize - {Default,WAL}FlushColdInterval - {Default,WAL}PartitionSizeThreshold	2016-03-20 13:16:52 -07:00
Jason Wilder	992c78ee22	Remove period shard maintenance goroutine This is no longer used in tsm and just peridocially locks everything for no reason now.	2016-03-09 17:31:02 -07:00
Edd Robinson	16995b6c23	Add ShardError to provide context about shard that errored	2016-02-24 13:33:07 +00:00
Justin Nuß	82c276756a	Lint tsdb and tsdb/engine package	2016-02-10 21:33:46 +01:00
Ben Johnson	5a0d1ab7c1	rename influxdb/influxdb to influxdata/influxdb This commit changes all the import and URL references from: github.com/influxdb/influxdb to: github.com/influxdata/influxdb	2016-02-10 10:26:18 -07:00
Jonathan A. Sternberg	c2d1206177	Implement the fill iterator Fill requires an additional function for IteratorCreator to retrieve the series that will be returned from the iterator. When fill is required for an aggregate, the IteratorCreator will be asked what series will be returned by the created iterator.	2016-02-10 09:40:29 -07:00
Ben Johnson	00806de9b8	refactor query engine	2016-02-10 09:40:25 -07:00
Ben Johnson	57336bd6ee	fix conditionals	2016-02-10 09:40:24 -07:00
Ben Johnson	036382ee20	SLIMIT/SOFFSET	2016-02-10 09:40:24 -07:00
Ben Johnson	cde973f409	refactor query engine	2016-02-10 09:40:24 -07:00
Paul Dix	59fbd371fc	Implement backup/restore for TSM. This changes backup and restore to work for TSM. It breaks it for b1 and bz1, but since those are getting removed it's ok. The backup runs against any host that is specified and can backup either the metasstore, a database, specific retention policy, or a specific shard. It can also take incremental backups with the `since` flag, which will only backup TSM files that have been created since that timestamp. The backup is safe to run online. However, for shards that are still hot for writes, they won't be able to create new TSM files while the backup for that single shard runs. If the backup isn't too large and the write throughput isn't too high this shouldn't be a problem since the writes will just go into the WAL cache.	2015-12-30 18:06:50 -05:00
Paul Dix	1bee7d1512	Update TSM, remove old version, add config * remove rolloverTSMFileSize constant that is no longer used * remove the maxGenerationFileCount since it is no longer a limitation that's necessary with the new compaction scheme. We no longer read WAL segments as part of the compaction so memory is only used as we read in each individual key * remove minFileCount and switch to a user configurable variable * remove the mutex from WALSegmentWriter. There's never more than one open in the WAL at one time and it's not exported through any function so the lock on the WAL should be used. This simplified keeping track of the last write time and removed a bunch of unnecessary locks. * update WALSegmentWriter.Write to take the compressed bytes so that encoding and compression can occur before the call to write (while we don't hold the WAL lock) * remove a bunch of unnecessary locking in WAL.writeToLog * Add check for TSM file magic number and vesion * Remove old tsm, log, and unused cursor code * Remove references to tsm1dev everywhere except in the inspector * Clean up config options for compaction and snapshotting * Remove old TSM configuration options * Update the config.sample.toml with TSM options * Update WAL compact to force if it has been cold for writes for a configurable period of time (1h by default)	2015-12-06 18:50:39 -05:00
Jason Wilder	4a03469662	Integrate TSM compaction into dev engine	2015-12-02 09:45:23 -07:00
Jason Wilder	25206c729c	Add compactor type	2015-11-24 08:50:07 -07:00
Philip O'Toole	00b2454c53	Exit if invalid engine is selected Fix #4584, related to #4583	2015-10-27 17:29:18 -07:00
Paul Dix	267f34b94e	Updates based on PR feedback	2015-10-05 20:09:56 -04:00
Paul Dix	d47ddb5454	Cleanup after pd1 -> tsm1 name change.	2015-10-05 20:09:55 -04:00
Paul Dix	1c8eac1523	Add PerformMaintenance to store for flushes and compactions. Also fixed shard to work again with b1 and bz1 engines.	2015-10-05 20:06:22 -04:00
Paul Dix	982c28b947	Update to work with new cursor definitiono and Point in models	2015-10-05 20:06:21 -04:00
Paul Dix	2ba032b7a8	WIP: finish basics of PD1. IT WORKS! (kind of)	2015-10-05 20:06:21 -04:00
Paul Dix	7555ccbd70	WIP: engine work	2015-10-05 20:06:21 -04:00
Ben Johnson	b213ddad78	refactor cursor	2015-09-22 13:10:12 -06:00
Ben Johnson	a5269e9cc7	rename direction to ascending.	2015-09-22 13:09:26 -06:00
Cory LaNou	72f6f7d268	Merge pull request #4134 from influxdb/issue-3447 Refactor Points and Rows to dedicated packages	2015-09-17 15:27:48 -05:00
Philip O'Toole	e4fde993f1	Make engine configurable	2015-09-16 19:09:25 -07:00
Cory LaNou	d19a510ad2	refactor Points and Rows to dedicated packages	2015-09-16 15:33:08 -05:00
Jason Wilder	5a6b0afc4b	Replace cursor direction with a type	2015-09-03 22:31:48 -06:00
Jason Wilder	266bdc1c2b	Support sort by time DESC in wal and bz1 engines	2015-09-03 22:28:36 -06:00
Ben Johnson	deff06f850	add copier service This commit adds the copier service which allows one server to copy shards from another server. This will be used for moving shards in the cluster.	2015-09-03 13:07:35 -06:00
Paul Dix	73f3dc1e14	Update store to properly manage WAL create/delete. * Update the store to remove the WAL directories associated with a shard or database when they are deleted. * Fix the Store so that it creates separate WAL directories for databases and retention policies.	2015-08-21 11:22:04 -04:00
Paul Dix	9df3b7d828	Add WAL configuration options	2015-08-18 16:59:54 -04:00
Paul Dix	3348dab4e0	Fix bug with new shards not getting series data persisted.	2015-08-16 15:45:09 -04:00
Paul Dix	b583b896ce	Integrate WAL and BZ1 and make BZ1 the default engine.	2015-08-16 12:46:50 -04:00
Ben Johnson	a9cbf6c857	Rename v1 engine to b1 This commit changes the 'v1' engine to 'b1' to represent "bolt v1".	2015-07-29 08:55:07 -06:00
Ben Johnson	2a9f1d0704	remove Engine.DB	2015-07-22 11:08:10 -06:00
Ben Johnson	cc0607a5cf	remove Engine.Flush()	2015-07-22 11:08:10 -06:00
Ben Johnson	a7f50ae03c	refactor storage to engine	2015-07-22 11:08:10 -06:00
Ben Johnson	4dc15a833e	rename engine.go to executor.go	2015-07-22 11:07:06 -06:00
Ben Johnson	de1f9a3736	refactor tsdb tests into test package	2015-07-22 11:07:06 -06:00
Philip O'Toole	425a65fca1	RemoteShard mapping now performed over TCP With this change remote mapping no longer uses HTTP, as the HTTP ports exposed by nodes on the cluster are not known cluster wide. The TCP ports exposed by the cluster service are, so this change uses that functionality. Each RemoteMapper has its own dedicated connection pool for each node, and remote mapping TCP connections are in no way coupled with query TCP connections.	2015-07-20 10:44:38 -07:00
Philip O'Toole	5016caabb1	One Query Executor to rule them all This change significantly simplifies query executor code. Before this change there were two types of executors -- RawExecutor and AggregateExecutor. These two types only differed in one function Execute(). Otherwise all other methods on the Executors were common and duplicated between executors This change merges the two executors into a single type called, wait for it, Executor and simply switches execute functions depending on the statement type.	2015-07-18 11:27:17 -07:00
Philip O'Toole	b5984a7032	There is now a single StatefulMapper	2015-07-17 08:27:53 -07:00
Philip O'Toole	5f357020c6	It's not raw or aggregate, it's just "mapper"	2015-07-17 08:27:49 -07:00
Philip O'Toole	56b61beff9	Remove aggMapperOutput type It's identical to rawMapperOutput type.	2015-07-17 08:23:36 -07:00
Philip O'Toole	dc0aadf3b0	aggMapperValue is the same as rawMapperValue	2015-07-17 08:23:36 -07:00
Philip O'Toole	134ab87a49	Store a []interface{} in an interface{} This is really pushing the type system, but needs to be done to cleanly combine the raw and aggregate output mapper types.	2015-07-17 08:23:36 -07:00
Philip O'Toole	c468a65bd2	Actually check tagset when looking for lowest time	2015-07-16 11:33:09 -07:00
Philip O'Toole	2d162acb53	Rename query_engine.go to engine.go The functionality in this file is more like the older file, so a rename makes sense.	2015-07-15 22:06:08 -07:00

1 2 3

147 Commits (15ae0bd98de8371569be52731aa027aa0ffe90dc)