Commit Graph

436 Commits (5cd52c7a3c2595756e9b5fc90cca8af18010efb7)

Author SHA1 Message Date
Philip O'Toole beb58eeb18 Refactor CQ code to avoid race conditions.
Fixes race detected by CircleCI:

https://circle-artifacts.com/gh/influxdb/influxdb/2332/artifacts/1/tmp/circle-artifacts.8UTeJCy/test_logs_race.txt
2015-05-12 15:50:46 -07:00
Philip O'Toole 4446274577 Run CQ passed into goroutine, not from outer loop 2015-05-12 15:00:37 -07:00
Philip O'Toole 2d1af26d1a Update messaging client connection index
With this change heartbeating to the brokers will always communicate the
index of the most recently replicated message. This will allow broker
truncation and broker diagnostics to operate correctly.
2015-05-11 15:29:12 -07:00
Todd Persen 730e8cdfd4 Merge pull request #2405 from neonstalwart/time
change timestamp to time
2015-05-11 12:38:00 -07:00
ben hockey 9a3c28748c change timestamp to time 2015-05-11 12:28:47 -05:00
Gabriel Taylor-Russ 44c0017fd8 Remove unused variable
Correct typo
2015-05-08 12:04:13 -07:00
Philip O'Toole b93554e39b RLock server when checking shard groups
This fixes a race detected by the race detector. "Create shard groups"
commands must be broadcast across the cluster without holding the server
lock so the commands are created under lock, and then processed after
releasing the lock.
2015-05-01 12:03:58 -07:00
Philip O'Toole 6cbc80fa50 Only access shard stats if shard is local
Fixes issue #2452
2015-04-29 15:05:45 -07:00
ben hockey cb2fe96ba0 count each executed query in queriesExecuted stat 2015-04-28 15:03:56 -05:00
Jason Wilder 6e2a0f8310 Use a read lock for reading data node URL 2015-04-27 19:11:22 -06:00
Jason Wilder 75e638f95b Use a read lock for running continuous queries 2015-04-27 19:10:22 -06:00
Jason Wilder 1efbdd1631 Use a read lock to look up a retention policy 2015-04-27 19:04:00 -06:00
Jason Wilder f624537cf8 Use a read lock to authenticate users 2015-04-27 19:02:01 -06:00
Jason Wilder 105637fc88 Use a read lock for admin user lookup 2015-04-27 18:59:28 -06:00
Jason Wilder 03ed70e6a3 Use read lock for user lookup 2015-04-27 18:48:20 -06:00
Philip O'Toole 76c29053cf Move drop series lock control into local function
This change means that lock control can use the defer call, which means
there is no chance the RLock will be left locked at function exit
Previously this code was more complex as it managed locks manually,
since the RLock must be released to allow the "drop series" broadcast
message go through.
2015-04-27 14:19:24 -07:00
Philip O'Toole 257aa4d6bd Always append shard path in diags
This code is clearer -- simply append an empty path if the shard is not
local.

Fixes issue #2430
2015-04-26 11:01:12 -07:00
Philip O'Toole 0994c8d9d5 Ensure no field value is null
Fix issue #2420.
2015-04-26 10:59:18 -07:00
Cory LaNou 022cb62fdb remove shards from server memory 2015-04-23 16:33:18 -06:00
Cory LaNou c27953a764 put shard responsibilities in shard.go 2015-04-23 16:33:18 -06:00
Cory LaNou b922cdcb76 shard stat created vs opened 2015-04-22 17:20:16 -06:00
Cory LaNou 9fee13ce41 format show stats properly and add more shard stats 2015-04-22 17:14:37 -06:00
Philip O'Toole b2b60532f1 Merge pull request #2383 from influxdb/serve_shard
Add HTTP endpoint that serves a requested shard
2015-04-22 09:33:35 -07:00
Philip O'Toole 68ba7ba005 Merge pull request #2387 from influxdb/no_local_shard_stats
There are no stats for non-local shards
2015-04-22 09:30:49 -07:00
Philip O'Toole e75e6a9526 Add HTTP endpoint that serves a requested shard
With this change a datanode can stream the requested shard to the
client. An error is returned if the shard does not exist or the the
shard is not local to that node.

1 data node can hit this endpoint to request data for a given shard if
the data no longer resides on the broker.
2015-04-22 09:29:19 -07:00
Philip O'Toole 52f968fbc4 There are no stats for non-local shards 2015-04-22 08:47:08 -07:00
Jason Wilder efa87633fa Fix shard datanodes stats getting appended too many times 2015-04-21 23:48:12 -06:00
Philip O'Toole c855549973 Add shard path to first diag value
Fix issue #2369
2015-04-21 19:19:08 -07:00
Jason Wilder 38628e540b Make drop database close and release resources
Drop database did not close any open shard files or close
any topic reader/heartbeats.  In the tests, we create and drop new
databases during each test run so these were open files and connection
slowed things down and consumed a lot of RAM as the tests progressed.
2015-04-21 13:39:58 -06:00
Jason Wilder 90e3059a8b Fix processRawQuery from returning duplicate data 2015-04-21 13:39:58 -06:00
Philip O'Toole ec57f8c84f RLock shard during diagnostics 2015-04-20 14:03:55 -07:00
Philip O'Toole 16befaa834 SHOW DIAGNOSTICS must check if shards are local
Fix issue #2323.
2015-04-18 11:39:08 -07:00
David Norton a1790f2d0c fix #2337: panic if tag key isn't double quoted 2015-04-18 13:05:41 -04:00
Jason Wilder 8aa0d32b6f Add failover to other data nodes for distributed queries
Fixes #2190
2015-04-17 11:28:47 -06:00
Jason Wilder c52dfce897 Load balance distributed queries across data nodes
Adds a Balancer interface to allow RemoteMappers to send data node
requests to multiple nodes.  It also provides the ability to failed
requests to mark the data node as offline using exponential
backoff with a 5 min max wait time.

Fixes #2242
2015-04-17 11:28:47 -06:00
ben hockey dde380832a wire up drop CQ statements
fixes #2141
2015-04-15 11:14:30 -05:00
Philip O'Toole 37c42c9dd2 RLock server for SHOW RETENTION POLICIES 2015-04-14 17:13:35 -07:00
Ben Johnson c5bdb5af86 Fix cluster-wide restart issue. 2015-04-14 13:43:25 -06:00
Todd Persen eed570ea1b Fix merge conflict in CHANGELOG.md 2015-04-13 15:55:38 -07:00
Cory LaNou 3ab660fe28 fix error message from having invalid format \n 2015-04-13 15:56:49 -06:00
Jason Wilder e47ee66b07 Make top-level handler less brittle
Move the data node specific routes under a common /data prefix so
add new handler does require updates to the top level handler as well.
2015-04-13 15:38:42 -06:00
David Norton c94785780d fix #2251: fix panic when changing default RP 2015-04-12 13:04:10 -04:00
Philip O'Toole 9282a8ae6d Fix compilation errors after parser merge 2015-04-10 16:11:34 -07:00
Philip O'Toole bf1a8aa1e4 Use uint64 for Series IDs
Fixes issue #1649
2015-04-10 16:11:34 -07:00
Paul Dix 7661546a47 Finish up distributed queries. 2015-04-10 16:11:34 -07:00
Paul Dix d41b85a715 Remove the interval setting from NextInterval to make remote mappers work. 2015-04-10 16:11:34 -07:00
Paul Dix 113995032e WIP: Initial implementation of remote mapper for distributed queries. 2015-04-10 16:11:34 -07:00
Ben Johnson 3404386a02 Merge pull request #2236 from influxdb/term-signal
Term signal
2015-04-10 17:02:13 -06:00
Ben Johnson eaf4bfca0a Fix term signal.
This commit changes raft so that term changes are made immediately and
term change signals are made afterward. Previously, election timeouts
were invalidated by incoming term changes which caused an election loop.

Stale term was also fixed and http/pprof was added too.
2015-04-10 13:52:20 -06:00
Jason Wilder a5e180ca31 Merge pull request #2229 from influxdb/jw-run
Close resources when stopping a node
2015-04-09 22:10:50 -06:00