milvus

Commit Graph

Author	SHA1	Message	Date
congqixia	447ff342fb	fix: Direct forward delta exclude l0 segments (#36899 ) Related to #36887 Forward delete to L0 segment will return error and mark l0 segment offline causing delegator unserviceable Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-16 14:05:23 +08:00
congqixia	caeab0cc1f	enhance: Fill start pos & level for growing segment (#36888 ) Start position & level info is missing for growing segment loaded in watch dml channel operation. Level is important for metrics and start position is crucial for growing exclude logic. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-16 14:03:31 +08:00
aoiasd	72dc07ba48	fix: bm25 search failed when nq > 1 and remove idf oracle when no bm25 field exist. (#36886 ) relate: https://github.com/milvus-io/milvus/issues/35853 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-16 12:51:23 +08:00
yihao.dai	f3b6792a25	enhance: Enhance segment log (#36848 ) /kind improvement Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-15 20:43:30 +08:00
congqixia	ba25320aea	fix: Unify loaded partition check to delegator (#36879 ) Related to #36370 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-15 19:15:23 +08:00
aoiasd	5ec4163d0f	feat: support bm25 logs mixcompaction (#36072 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-14 16:57:22 +08:00
Buqian Zheng	383350c120	feat: added more checks for function creation check (#36766 ) issue: https://github.com/milvus-io/milvus/issues/35853 * BM25 Function now takes no params, k1, b should be passed via index params * support BM25 full text search when metric type is not present in search request * add more strict validation with functions at collection creation time Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-13 17:43:22 +08:00
Buqian Zheng	16b533cbf0	feat: Restful support for BM25 function (#36713 ) issue: https://github.com/milvus-io/milvus/issues/35853 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-13 17:41:21 +08:00
Bingyi Sun	a75bb85f3a	feat: support chunked column for sealed segment (#35764 ) This PR splits sealed segment to chunked data to avoid unnecessary memory copy and save memory usage when loading segments so that loading can be accelerated. To support rollback to previous version, we add an option `multipleChunkedEnable` which is false by default. Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-12 15:04:52 +08:00
aoiasd	db34572c56	feat: support load and query with bm25 metric (#36071 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-11 10:23:20 +08:00
yihao.dai	3685edb264	enhance: Use common gc config (#36668 ) Use the GC config from `common` and remove the GC config from `queryNode`. issue: https://github.com/milvus-io/milvus/issues/36667 related pr: https://github.com/milvus-io/milvus/pull/34949 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-09 19:47:19 +08:00
SimFG	130a923dec	enhance: the estimate method when loading the collection (#36307 ) - issue: #36530 --------- Signed-off-by: SimFG <bang.fu@zilliz.com> Signed-off-by: xianliang.li <xianliang.li@zilliz.com> Co-authored-by: xianliang.li <xianliang.li@zilliz.com>	2024-10-09 17:35:19 +08:00
congqixia	ddc3e76803	fix: Add defer Unpin when error happens (#36620 ) Resolves: #36619 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-30 19:49:17 +08:00
wei liu	470bb0cc3f	enhance: Enable balance on querynode with different mem capacity (#36466 ) issue: #36464 This PR enables balancing across query nodes with different memory capacities: a query node with more memory capacity is assigned more records, and the query node with the largest difference between assignedScore and currentScore has a higher priority to carry the new segment. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-30 16:15:17 +08:00
cai.zhang	ecb2b242e2	enhance: Add sorted for segment info (#36469 ) issue: #33744 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-30 10:01:16 +08:00
congqixia	4fd9b0a8e3	enhance: Return segment id hint in QueryStream response (#36487 ) Related to #36482 This PR reuses `SealedSegmentIDsRetrieved` field in `RetrieveResults` struct to store segment id hint. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-26 10:13:14 +08:00
congqixia	ed95568a05	enhance: Fix PR conflict in reduce unit test (#36470 ) Related to #36433 #36180 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-24 18:01:13 +08:00
congqixia	98a917c5d4	enhance: [skip e2e] Add unittest for reducing duplicated pk from multi segments (#36433 ) Related to #35505 #36362 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-24 14:11:13 +08:00
Chun Han	df7ae08851	fix: iterator cursor progress too fast(#36179 ) (#36180 ) related: #36179 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-09-24 11:45:13 +08:00
congqixia	1833913f44	enhance: Add streaming forward policy switch for delegator (#36330 ) Related to #35303 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-23 18:01:12 +08:00
yihao.dai	763fd0dfc5	enhance: Use a separate mmap config for chunk cache (#36276 ) issue: https://github.com/milvus-io/milvus/issues/35273 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-09-15 16:23:09 +08:00
wei liu	329fb421cd	fix: fix search/query/count may access same growing and sealed segment (#36258 ) issue: #36257 During syncTargetVersion, sealed segments should be excluded so that their growing segments are not consumed from the stream again. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-14 14:21:07 +08:00
congqixia	3bc7d63be9	fix: overwrite correct selection when pk duplicated (#35826 ) Related to #35505 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-14 10:27:08 +08:00
aoiasd	c22a2cebb6	fix: split stream query result to avoid grpc response too large error (#36090 ) relate: https://github.com/milvus-io/milvus/issues/36089 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-13 15:07:09 +08:00
congqixia	11dbe1e755	enhance: Add L0 forward policy to support remote load (#36189 ) Related to #35303 This PR adds a param item to support changing the l0 forward behavior from bf filtering and forward to remote load. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-12 12:01:08 +08:00
Jiquan Long	89bf226f0b	feat: support keyword text match (#35923 ) fix: #35922 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-09-10 15:11:08 +08:00
Chun Han	9d0aa5c202	fix: empty result when having only one subReq(#36098 ) (#36128 ) related: #36098 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-09-10 14:25:07 +08:00
zhagnlu	208c8a2328	fix:support config index offsetcache and fix create same index again (#35985 ) #35971 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-09-08 18:23:05 +08:00
Chun Han	e480b103bd	feat: supporting hybrid search group_by (#35982 ) related: #35096 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-09-08 17:09:04 +08:00
XuanYang-cn	7859faf8ea	fix: Change deltalog memory estimation factor to one (#36033 ) See also: #36031 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-09-06 18:21:05 +08:00
XuanYang-cn	5e3f700e5d	enhance: Remove too frequent logs in Delete (#35980 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-09-05 14:25:03 +08:00
congqixia	8593c4580a	enhance: Add delete buffer related quota logic (#35918 ) See also #35303 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-05 11:39:03 +08:00
jaime	24fb10114b	enhance: remove cooling off in rate limiter for read requests (#35935 ) issue: #35934 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-09-04 14:39:10 +08:00
cai.zhang	2c9bb4dfa3	feat: Support stats task to sort segment by PK (#35054 ) issue: #33744 This PR includes the following changes: 1. Added a new task type to the task scheduler in datacoord: stats task, which sorts segments by primary key. 2. Implemented segment sorting in indexnode. 3. Added a new field `FieldStatsLog` to SegmentInfo to store token index information. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-02 14:19:03 +08:00
Zhen Ye	99dff06391	enhance: using streaming service in insert/upsert/flush/delete/querynode (#35406 ) issue: #33285 - using streaming service in insert/upsert/flush/delete/querynode - fixup flusher bugs and refactor the flush operation - enable streaming service for dml and ddl - pass the e2e when enabling streaming service - pass the integration test when enabling streaming service --------- Signed-off-by: chyezh <chyezh@outlook.com> Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-08-29 10:03:08 +08:00
Chun Han	bfd9d86fe9	feat: support groupby size on go-layer(#33544 ) (#33845 ) related: #33544 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-08-27 14:21:00 +08:00
aoiasd	fe83805d56	fix: loss data bug for deprecated querynode DoubleBuffer (#35128 ) relate: https://github.com/milvus-io/milvus/issues/31548 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-08-27 14:10:59 +08:00
yihao.dai	f2b83d316b	enhance: Support memory mode chunk cache (#35347 ) Chunk cache supports loading raw vectors into memory. issue: https://github.com/milvus-io/milvus/issues/35273 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-08-25 15:42:58 +08:00
Gao	e8e3544a11	enhance: add hit segment num metrics for queryHook (#35577 ) issue: #35576 Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-08-23 12:49:02 +08:00
Zhen Ye	a773836b89	enhance: optimize milvus core building (#35610 ) issue: #35549,#35611,#35633 - remove milvus_segcore milvus_indexbuilder..., add libmilvus_core - core building only link once - move opendal compilation into cmake - fix odr --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-08-23 12:35:02 +08:00
SimFG	731d45abbe	enhance: provide more general configuration to control mmap behavior (#35359 ) - issue: #35273 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-08-21 00:22:54 +08:00
Ted Xu	41646c8439	feat: integrate new deltalog format (#35522 ) See #34123 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-08-20 19:06:56 +08:00
congqixia	2fbc628994	feat: Support field partial load collection (#35416 ) Related to #35415 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-20 16:49:02 +08:00
congqixia	f87af9bc54	enhance: Exclude L0 segment from readable snapshot (#35507 ) L0 segments now do not contain insert data and may cause confusion for the query hook optimizer if counted in the sealed segment number. This PR adds a segment level flag in the segment entry and excludes L0 segments when getting the readable segment snapshot. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-16 15:28:53 +08:00
congqixia	6ff238e88a	fix: Set corresponding DataScope for `loadStreamDelete` (#35312 ) Related to #35311 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-06 22:32:23 +08:00
zhagnlu	4b553b0333	enhance: revert remove duplicated pk function (#35103 ) issue: #34778 Revert "fix: fix query count(*) concurrently" Revert "enhance: mark duplicated pk as deleted " Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-08-05 10:48:17 +08:00
Chun Han	3faef63a25	enhance: add log for partition stats( #30376 ) (#35219 ) related: #30376 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-08-02 19:34:22 +08:00
Bingyi Sun	3641ae6611	fix: Fix index memory estimation (#35225 ) issue: https://github.com/milvus-io/milvus/issues/35229 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-08-02 16:24:15 +08:00
congqixia	c64a078458	enhance: Support proxy/delegator qn client pooling (#35194 ) See also #35196 Add param item for proxy/delegator query node client pooling and implement pooling logic --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-02 11:24:19 +08:00
wei liu	27b6d58981	fix: Set legacy level to l0 segment after qc restart (#35197 ) issue: #35087 After qc restarts and while the target is not ready yet, if dist_handler tries to update the segment dist, it will set the legacy level on the l0 segment, which may cause the l0 segment to be moved to another node and cause search/query to fail. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-02 10:18:13 +08:00
zhenshan.cao	aa247f192d	enhance: remove unused code for StorageV2 (#35132 ) issue: https://github.com/milvus-io/milvus/issues/34168 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-08-01 12:08:13 +08:00
Gao	6695c6d0a3	enhance: add channel num for queryHook optimization (#35104 ) In most cases, data in each channel is almost evenly distributed, so we can utilize the channel num info to optimize search params in queryHook. Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-07-31 18:59:49 +08:00
congqixia	f7f9a729c9	enhance: Pre-allocate space for reduce data structure (#35118 ) Grow slice & map.growWork may cost a lot when the segment number is large for big K queries. This PR pre-allocates space for reduce methods to avoid this cost. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-31 10:35:49 +08:00
congqixia	de8a266d8a	enhance: Enable linux code checker (#35084 ) See also #34483 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-30 15:53:51 +08:00
wei liu	c45f38aa61	enhance: Update protobuf-go to protobuf-go v2 (#34394 ) issue: #34252 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-29 11:31:51 +08:00
Chun Han	e2e38e98df	fix: nil part stats without l2 compaction(#34923 ) (#34992 ) related: #34923 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-29 11:07:48 +08:00
cai.zhang	2372452fac	enhance: Optimized the GC logic to ensure that memory is released in time (#34949 ) issue: #34703 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-28 23:53:47 +08:00
Chun Han	c46c401112	fix: refine handling type for segment pruner(#34923 ) (#34925 ) related: #34923 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-25 13:57:45 +08:00
congqixia	2ac7164c39	enhance: Remove useless ops when there is no write (#34767 ) Related to #33235 The querynode pipeline will make a map & call ProcessInsert even when there are no write messages, so querynodes have high CPU usage even when there is no workload. This PR checks the msg length before composing the data struct and calling the method. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-19 14:31:42 +08:00
zhagnlu	804dd5409a	enhance: mark duplicated pk as deleted (#34586 ) fix #34247 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-07-16 14:25:39 +08:00
congqixia	531092c031	enhance: Add lint rule to forbid gogo protobuf (#34594 ) github.com/gogo/protobuf is deprecated and could be error prone after upgrading protobuf messages to v2. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-12 10:19:35 +08:00
jaime	3b62138c5c	fix: unstable UT for level0 deletion (#34524 ) issue: #34533 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-11 10:02:56 +08:00
congqixia	d60e628aed	enhance: Avoid use concrete segment type in segments interfaces (#34521 ) See also #34519 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-10 10:18:12 +08:00
wei liu	eeb03a0d6a	fix: Query may return deleted records (#34501 ) issue: #34500 The sort in `GetLevel0Deletions` breaks the correspondence between pks and tss, and the pks and tss are then sorted again in the segment.Delete() interface. This PR removes this unnecessary and incorrect sort step so that queries no longer return deleted records. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-09 10:46:11 +08:00
Chun Han	8af187f673	fix: lose partitionIDs when scalar pruning and refine segment prune ratio metrics(#30376 ) (#34477 ) related: #30376 fix: partitionIDs lost when no partitions are set enhance: refine metrics for segment prune Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-07-08 19:54:15 +08:00
Chun Han	fcafdb6d5f	enhance: reconstruct scalar part's code for segment-pruner(#30376 ) (#34346 ) related: #30376 1. support more complex expr 2. add more ut test for unrelated fields Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-04 16:36:09 +08:00
Chun Han	34bec2ea5e	enhance: add metrics for segment prune latency(#30376 ) (#34094 ) related: #30376 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-03 10:04:07 +08:00
wei liu	b49862d4f3	enhance: Optimize grow slice cost during query (#34253 ) issue: #32252 This PR try to pre-allocate FieldData for Reduce operations in the Query chain using typeutil.PrepareResultFieldData to avoid the overhead of dynamically growing the slice during appendFieldData process. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-01 15:18:11 +08:00
wei liu	45203425fd	enhance: Avoid search querynode return nil status in response (#34100 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-26 11:50:11 +08:00
jaime	9630974fbb	enhance: move rocksmq from internal to pkg module (#33881 ) issue: #33956 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-25 21:18:15 +08:00
wayblink	f9a0f7bb25	Add an option to enable/disable vector field clustering key (#34097 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-25 18:52:04 +08:00
congqixia	fd922d921a	enhance: Add nilness linter and fix some small issues (#34049 ) Add `nilness` for govet linter and fixed some detected issues Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-24 14:52:03 +08:00
Chun Han	ca7ef26e4b	fix: sync part stats task cannot be finished(#30376 ) (#34027 ) related: #30376 also: refine log output for query_coord task by rephrasing action string Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-06-24 10:16:02 +08:00
chyezh	259a682673	enhance: async search and retrieve in cgo (#33228 ) issue: #30926, #33132 related pr: #33133 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-22 09:38:02 +08:00
smellthemoon	2a1356985d	enhance: support null in go payload (#32296 ) #31728 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-06-19 17:08:00 +08:00
Gao	a789c60380	enhance: autoindex for multi data type (#33868 ) issue: #22837 contain https://github.com/milvus-io/milvus/pull/33625 https://github.com/milvus-io/milvus/pull/33867 https://github.com/milvus-io/milvus/pull/33911 which already merged to 2.4 branch Signed-off-by: chasingegg <chao.gao@zilliz.com> Co-authored-by: foxspy <xianliang.li@zilliz.com>	2024-06-18 21:34:01 +08:00
congqixia	3fdaae8792	fix: Return record with largest timestamp for entries with same PK (#33936 ) See also #33883 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-18 15:55:59 +08:00
cqy123456	32f685ff12	enhance: growing segment support mmap (#32633 ) issue: https://github.com/milvus-io/milvus/issues/32984 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-06-18 14:42:00 +08:00
congqixia	ec64499536	fix: Check nodeID wildcard when removing pkOracle (#33895 ) See also #33894 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-18 14:11:58 +08:00
congqixia	2a04b0929a	fix: Prevent use captured iteration variable partitionID (#33906 ) See also #33902 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-17 19:11:59 +08:00
chyezh	9b69601dfb	fix: load operation when segment is on releasing (#31340 ) issue: #30857 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-14 15:35:56 +08:00
wei liu	4987067375	enhance: Execute bloom filter apply in parallel to speed up segment predict (#33792 ) issue: #33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-14 11:37:56 +08:00
wei liu	ab93d9c23d	enhance: Use BatchPkExist to reduce bloom filter func call cost (#33611 ) issue:#33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-13 17:57:56 +08:00
chyezh	8ca5ced821	fix: async warmup will be blocked by state lock (#33686 ) issue: #33685 Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-10 21:59:53 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
jaime	8858fcb40a	fix: fix loaded entity num is inaccurate (#33521 ) issue: #33520 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-04 20:09:54 +08:00
wei liu	34c6a989ab	enhance: Avoid load bf in delegator when qn worker has no more memory (#33557 ) Query coord sends the load request to the delegator; the delegator loads the bf first, then forwards the load request to the qn worker. But when the qn worker has no more memory, it returns load failed immediately, and the delegator rolls back the loaded bf. Query coord will retry the load request, so the delegator will load and roll back the bf again and again. This PR delays the bf loading step until the segment load succeeds in the worker. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-03 19:23:45 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom filter impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with the old version bf impl, but falling back to an old milvus version may cause bloom filter deserialization to fail. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
Jiquan Long	0c5d8660aa	feat: support inverted index for array (#33452 ) issue: https://github.com/milvus-io/milvus/issues/27704 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-31 09:47:47 +08:00
Chun Han	416a2cf507	fix: query iterator lack results(#33137 ) (#33422 ) related: #33137 adding has_more_result_tag for various level's reduce to rectify reduce_stop_for_best Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-05-30 17:51:44 +08:00
jaime	0d3272ed6d	enhance: refine logs of cgo pool (#33373 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-27 19:06:11 +08:00
aoiasd	59a7a46904	enhance: Merge query stream result for reduce delete task (#32855 ) relate: https://github.com/milvus-io/milvus/issues/32854 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-05-27 18:15:43 +08:00
SimFG	cb99e3db34	enhance: add the includeCurrentMsg param for the Seek method (#33326 ) /kind improvement - issue: #33325 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-27 10:31:41 +08:00
jaime	58ee613fea	enhance: remove repeated stats of loaded entity (#33255 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-27 01:49:41 +08:00
yihao.dai	760223f80a	fix: use separate warmup pool and disable warmup by default (#33348 ) 1. use a small warmup pool to reduce the impact of warmup 2. change the warmup pool to nonblocking mode 3. disable warmup by default 4. remove the maximum size limit of 16 for the load pool issue: https://github.com/milvus-io/milvus/issues/32772 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-27 01:25:40 +08:00
Bingyi Sun	370562b4ec	fix: fix partition loaded num metric (#33316 ) issue: https://github.com/milvus-io/milvus/issues/32108 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-24 15:31:42 +08:00
wei liu	39f56678a0	enhance: Reduce bloom filter lock contention between insert and delete in query coord (#32643 ) issue: #32530 ProcessDelete needs to check whether a pk exists in the bloom filter, and ProcessInsert needs to add pks to the bloom filter, so executing ProcessInsert and ProcessDelete in parallel causes a race condition on the segment's bloom filter. This PR executes ProcessInsert and ProcessDelete serially so that they do not block each other. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-22 19:11:40 +08:00
Xiaofan	3d105fcb4d	enhance: Remove l0 delete cache (#32990 ) fix #32979 Remove the l0 cache and build the delete pk and ts every time. This reduces memory usage and also improves code readability. Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-21 22:53:40 +08:00
Bingyi Sun	0f8c6f49ff	enhance: mmap load raw data if scalar index does not have raw data (#33175 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-21 11:53:39 +08:00
wei liu	f1c9986974	enhance: Skip return data distribution if no change happen (#32814 ) issue: #32813 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-17 10:11:37 +08:00
Jiquan Long	dd9919a7dc	fix: two-phase retrieval on lru-segment (#32945 ) issue: #31822 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-15 17:53:34 +08:00
cai.zhang	6ea7633bd5	enhance: Add memory size for binlog (#33025 ) issue: #33005 1. add `MemorySize` field for insert binlog. 2. `LogSize` means the file size in the storage object. 3. `MemorySize` means the size of the data in the memory. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2024-05-15 12:59:34 +08:00
SimFG	1d48d0aeb2	enhance: use different value to get related data size according to segment type (#33017 ) issue: #30436 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-14 14:59:33 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00
chyezh	96489b814d	fix: remove busy log (#33042 ) issue: #32963 Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-14 14:20:32 +08:00
foxspy	f6777267e3	enhance: add score compute consistency config for knowhere (#32997 ) issue: https://github.com/milvus-io/milvus/issues/32583 related: #32584 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-05-13 14:21:31 +08:00
chyezh	1c84a1c9b6	fix: lru related issue fixup patch (#32916 ) issue: #32206, #32801 - search failure with some assertion, segment not loaded and resource insufficient. - segment leak when query segments --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-10 19:17:30 +08:00
wei liu	25689859a1	fix: Load index metric use wrong time unit (#32935 ) issue:#32899 This PR fixes the wrong metric value of load index, introduced by pr#32567, which used the wrong time unit for load index metrics. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-10 18:07:30 +08:00
Jiquan Long	0783582e2e	fix: temporarily disable two-phase retrieval when lru is enabled (#32927 ) issue: #31822 Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-10 14:19:45 +08:00
Bingyi Sun	5cbf081111	fix: fix index resource estimation (#32842 ) issue: #32820 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-10 11:53:30 +08:00
Bingyi Sun	17a79f4ca9	enhance: The LRU cache evicts items and retries loading if the disk limit is reached. (#32819 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-08 14:45:30 +08:00
wei liu	5038036ece	enhance: Reuse hash locations during access bloom filter (#32642 ) issue: #32530 When matching segment bloom filters against a pk, we can reuse the hash locations. This PR maintains the max hash Func and computes the hash locations once for all segments; reusing hash locations can speed up bf access. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-07 06:13:47 -07:00
congqixia	40728ce83d	enhance: Add `metautil.Channel` to convert string compare to int (#32749 ) See also #32748 This PR: - Add `metautil.Channel` utility which converts virtual name to physical channel name, collectionID and shard idx - Add channel mapper interface & implementation to convert limited physical channel name into int index - Apply `metautil.Channel` filter in querynode segment manager logic --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-07 19:13:35 +08:00
yihao.dai	9db3aa18bc	enhance: Remove deprecated EnableIndex (#32704 ) /kind improvement Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-07 17:11:30 +08:00
chyezh	641f702f64	fix: add request resource timeout for lazy load, refactor context usage in cache (#32709 ) issue: #32663 - Use new param to control request resource timeout for lazy load. - Remove the timeout parameter of `Do`, remove `DoWait`. use `context` to control the timeout. - Use `VersionedNotifier` to avoid notify event lost and broadcast, remove the redundant goroutine in cache. related dev pr: #32684 Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-07 16:33:30 +08:00
congqixia	efa0c73c62	fix: Unify querynode metrics cleanup in collection release (#32805 ) Related to #32803 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-07 15:41:29 +08:00
aoiasd	31dca3249e	enhance: add type info for payload writer error message and add log when querynode find new collection (#32522 ) relate: https://github.com/milvus-io/milvus/issues/32668 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-05-07 14:45:29 +08:00
Jiquan Long	1f58cda957	enhance: add more trace for search & query (#32734 ) issue: https://github.com/milvus-io/milvus/issues/32728 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-07 13:03:29 +08:00
yihao.dai	cf4db3ff4e	enhance: Fix compilation error (#32797 ) Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-06 19:31:49 -07:00
congqixia	7102403a6b	fix: Add Wrapper and Keepalive for CTraceContext ids (#32746 ) See also #32742 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-07 10:05:35 +08:00
congqixia	53b5f1be17	enhance: Remove legacy L0 segment if watch failed (#32725 ) Like growing segments, legacy l0 segments shall be removed if watch dml channel execution fails as well. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-07 10:03:42 +08:00
Bingyi Sun	fecd9c21ba	feat: LRU cache implementation (#32567 ) issue: https://github.com/milvus-io/milvus/issues/32783 This pr is the implementation of lru cache on branch lru-dev. Signed-off-by: sunby <sunbingyi1992@gmail.com> Co-authored-by: chyezh <chyezh@outlook.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com> Co-authored-by: Ted Xu <ted.xu@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com> Co-authored-by: wayblink <anyang.wang@zilliz.com>	2024-05-06 20:29:30 +08:00
Chun Han	ac82cef04d	enhance: disable reload partstats by config (#32702 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-04-29 19:11:26 +08:00
wei liu	c0555d4b45	fix: Remove read only node from replica immedaitely after node down (#32666 ) issue: #32665 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-28 20:25:25 +08:00
chyezh	2586c2f1b3	enhance: use WalkWithPrefix api for oss, enable piplined file gc (#31740 ) issue: #19095,#29655,#31718 - Change `ListWithPrefix` to `WalkWithPrefix` of OSS into a pipeline mode. - File garbage collection is performed in another goroutine. - Segment Index Recycle cleans index files too. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-25 20:41:27 +08:00
Jiquan Long	c002745902	enhance: retrieve output fields after local reduce (#32346 ) issue: #31822 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-04-25 09:49:26 +08:00
congqixia	faa559592d	enhance: Make applyDelete work in paralell in segment level (#32291 ) `applyDelete` used to be serial for delete entries on each segments. This PR make it work in parallel with errgroup to improve performance --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-24 17:01:24 +08:00
yihao.dai	281a583eda	fix: Correct the negative queryable num entities metric (#32361 ) issue: https://github.com/milvus-io/milvus/issues/32281 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-24 15:55:24 +08:00
Cai Yudong	16b8b7b35d	enhance: Add get_vector unittest for float16 & bfloat16 (#32153 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-04-23 16:15:23 +08:00
chyezh	e19d17076f	fix: delete may lost when enable lru cache, some field should be reset when ReleaseData (#32012 ) issue: #30361 - Delete may be lost when segment is not data-loaded status in lru cache. skip filtering to fix it. - `stats_` and `variable_fields_avg_size_` should be reset when `ReleaseData` - Remove repeat load delta log operation in lru. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-16 11:17:20 +08:00
Gao	55d894bd5e	enhance: support disable search optimization (#32141 ) Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-04-16 10:51:20 +08:00
SimFG	c012e6786f	feat: support rate limiter based on db and partition levels (#31070 ) issue: https://github.com/milvus-io/milvus/issues/30577 co-author: @jaime0815 --------- Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com> Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-04-12 16:01:19 +08:00
wei liu	68dec7dcd4	fix: Use correct ts to avoid exclude segment list leak (#31991 ) issue: #31990 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-12 10:39:19 +08:00
wei liu	1a98ce39f5	enhance: Remove useless logic about FromShardLeader (#32029 ) issue: #32047 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-10 20:11:19 +08:00
Xiaofan	dbab9c5096	fix: reduce didn't handle offset without limit and reduceStopForBest correctly (#32089 ) fix https://github.com/milvus-io/milvus/issues/32059 This PR fixes two issues: (1) offset is not handled correctly when no limit is specified; (2) reduceStopForBest doesn't guarantee to return limit results, even if there are more results, when there is a small segment. Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-04-10 16:01:18 +08:00
wei liu	df208d538c	fix: Check exclude segment before add new growing segment (#31803 ) issue: #31479 #31797 Milvus adds released segments to the excluded info and filters out their stream data in filter_node. But if data buffered in insert_node's channel belongs to a growing segment that has already been released, it will add the growing segment back again. This PR maintains `excluded segments` in the delegator and checks excluded segments before adding a new growing segment. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-10 15:29:17 +08:00
Chun Han	f3f2a5a7e9	fix: evicted segments in the serverlss mode(#31959 ) (#31961 ) related: #31959 1. reset segment index status after evicting to lazyload=true 2. reset num_rows to null_opt Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-04-10 15:15:19 +08:00
SimFG	90bed1caf9	enhance: add the related data size for the read apis (#31816 ) issue: #30436 origin pr: #30438 related pr: #31772 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-10 15:07:17 +08:00
chyezh	c9faa6d936	enhance: add more metrics for cache and search (#31777 ) issue: #30931 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-10 10:55:17 +08:00
aoiasd	5b693c466d	fix: delegator filter out all partition's delete msg when loading segment (#31585 ) May cause deleted data queryable a period of time. relate: https://github.com/milvus-io/milvus/issues/31484 https://github.com/milvus-io/milvus/issues/31548 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-04-09 15:21:24 +08:00
zhenshan.cao	089c805e0a	enhance:Refactor hybrid search (#32020 ) issue: https://github.com/milvus-io/milvus/issues/25639 https://github.com/milvus-io/milvus/issues/31368 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-04-09 14:21:18 +08:00
congqixia	1f7f3993a1	fix: Validate PlaceholderGroups before combine them (#32016 ) See also #32015 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-09 11:33:17 +08:00
chyezh	73adf2a5cc	fix: use stateful lock to avoid load and release on LocalSegment concurrently (#31606 ) issue: #31605 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-08 17:09:16 +08:00
chyezh	7b400252ff	fix: add configuration disk capacity config for lru and fix some bug (#31977 ) issue: #30361 - Add configurable disk capacity limit - fix bitset reset logic - make insert record reinsert after clear Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-08 15:55:16 +08:00
congqixia	0feee53631	enhance: Add back unit test for compactor and fix some TODOs (#31829 ) This PR adds back compactor "Unhandled" data type unit test and fixes some TODO behavior Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-02 20:35:14 +08:00
jaime	bd853be8c7	enhance: Add db label for some usual metrics (#30956 ) issue: #31782 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-04-02 14:27:13 +08:00
wei liu	bb500d66c7	fix: Remove segment from leader view can't be executed (#31663 ) issue: #31664 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-01 10:39:12 +08:00
wei liu	c311932d5f	fix: Update segment's version in leader task (#31643 ) issue: #31468 1. when segment's version in leader view doesn't match segment's version in dist, should update leader view 2. after call loadDeltalog, should update segment's load version with latest ts 3. change leader task's priority from high to low, to avoid leader task replace segment task and balance task --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-01 10:37:21 +08:00
chyezh	1ad5ccc50f	enhance: add rg and db interface for segment and db/rg metric label (#31715 ) issue: #30931 Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-01 10:21:21 +08:00

661 Commits (eb046863485fdf3e130fc60484485c901b81276b)