mirror of https://github.com/milvus-io/milvus.git
issue: #43828 Implement robust rewatch mechanism to handle etcd connection failures and node reconnection scenarios in DataCoord and QueryCoord, along with heartbeat lag monitoring capabilities. Changes include: - Implement rewatchDataNodes/rewatchQueryNodes callbacks for etcd reconnection scenarios - Add idempotent rewatchNodes method to handle etcd session recovery gracefully - Add QueryCoordLastHeartbeatTimeStamp metric for monitoring node heartbeat lag - Clean up heartbeat metrics when nodes go down to prevent metric leaks --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com> |
||
---|---|---|
.. | ||
OWNERS | ||
README.md | ||
cluster.go | ||
cluster_test.go | ||
common.go | ||
datanode_manager.go | ||
datanode_manager_test.go | ||
mock_cluster.go | ||
mock_data_node_manager.go | ||
mock_node_manager.go | ||
node_manager.go | ||
node_manager_test.go | ||
session.go |
README.md
Session Package
session
package contains the worker manager/nodes abstraction for datanodes and indexnodes.