milvus

Commit Graph

Author	SHA1	Message	Date
wei liu	3e9e830074	enhance: Implement rewatch mechanism for etcd failure scenarios (#43829) issue: #43828 Implement robust rewatch mechanism to handle etcd connection failures and node reconnection scenarios in DataCoord and QueryCoord, along with heartbeat lag monitoring capabilities. Changes include: - Implement rewatchDataNodes/rewatchQueryNodes callbacks for etcd reconnection scenarios - Add idempotent rewatchNodes method to handle etcd session recovery gracefully - Add QueryCoordLastHeartbeatTimeStamp metric for monitoring node heartbeat lag - Clean up heartbeat metrics when nodes go down to prevent metric leaks --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-08-14 10:31:44 +08:00
cai.zhang	6dbe5d475e	enhance: Refine task meta with key lock (#40613) issue: #39101 2.5 pr: #40146 #40353 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-03-14 15:44:22 +08:00
congqixia	cb7f2fa6fd	enhance: Use v2 package name for pkg module (#39990) Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-22 23:15:58 +08:00
yihao.dai	657550cf06	fix: Fix slow dist handle and slow observe (#38566) 1. Provide partition&channel level indexing in the collection target. 2. Make `SegmentAction` not wait for distribution. 3. Remove scheduler and target manager mutex. 4. Optimize logging to reduce CPU overhead. issue: https://github.com/milvus-io/milvus/issues/37630 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-01-15 20:17:00 +08:00
Zhen Ye	833c74aa66	enhance: add detail, replica count for resource group (#38314) issue: #30647 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-12-13 14:14:50 +08:00
jaime	0d99db23b8	fix: metrics leak on the coord nodes (#33075) issue: #32980 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-20 22:03:39 +08:00
congqixia	72c172a7d7	enhance: Remove duplicated collectionID label for task latency (#32308) `CollectionID` already exists in channel name, so remove it to save metrics traffic. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-16 18:55:19 +08:00
wei liu	7c7375031d	enhance: Add metrics for task latency in querycoord scheduler (#31405) This PR add metrics for task latency in querycoord scheduler, so if any kind of task stuck, it's easy to figure out by metrics --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-20 19:29:06 +08:00
congqixia	194a611814	enhance: Add metrics for querycoord current target cp lag (#31391) See also #31390 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-19 14:07:05 +08:00
MrPresent-Han	b09e7aeaf7	support detailed task metrics (#23414) (#24507) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-05-30 14:59:28 +08:00
jaime	c9d0c157ec	Move some modules from internal to public package (#22572) Signed-off-by: jaime <yun.zhang@zilliz.com>	2023-04-06 19:14:32 +08:00

11 Commits (master)