milvus

Commit Graph

Author	SHA1	Message	Date
Ted Xu	e47bf21305	fix: parse error given duplicated plan cache key (#37334 ) See: #37016 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-11-07 15:14:25 +08:00
wei liu	00f6d0ec51	fix: watch channel stuck due to misuse of timer.Reset (#37433 ) issue: #37166 cause the misuse of timer.Reset, which cause dispatcher failed to send msg to virtual channel buffer, and dispatcher do splitting again and again, which hold the dispatcher manager's lock, block watching channel progress. This PR fix the misuse of timer.Reset Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-07 14:34:24 +08:00
smellthemoon	86fd3200be	enhance: refactor createIndex in RESTful API (#37235 ) Make the parameter input method consistent with miluvs-client. Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-11-07 14:18:30 +08:00
Buqian Zheng	40b770cb7b	fix: add rescorer activation function for BM25 (#37481 ) issue: #35853 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-11-07 12:08:25 +08:00
jaime	f348bd9441	feat: add segment,pipeline, replica and resourcegroup api for WebUI (#37344 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-11-07 11:52:25 +08:00
smellthemoon	9b6dd23f8e	fix: wrong path spelling when use rootpath in segcore (#37453 ) #36532 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-11-07 11:22:25 +08:00
aoiasd	b4c749dcd5	fix: merge sort segment loss data (#37400 ) relate: https://github.com/milvus-io/milvus/issues/37238 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-11-07 11:18:26 +08:00
XuanYang-cn	51ed2a61c8	fix: Correct dropped segment num metrics (#37410 ) See also: #31891 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-11-07 11:16:24 +08:00
cai.zhang	aed3b94b5d	enhance: Refine error message for contains array (#37383 ) issue: #36221 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-07 10:44:25 +08:00
Zhen Ye	cae9e1c732	fix: drop collection failed if enable streaming service (#37444 ) issue: #36858 - Start channel manager on datacoord, but with empty assign policy in streaming service. - Make collection at dropping state can be recovered by flusher to make sure that milvus consume the dropCollection message. - Add backoff for flusher lifetime. - remove the proxy watcher from timetick at rootcoord in streaming service. Also see the better fixup: #37176 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-11-07 10:26:26 +08:00
congqixia	ee54a98578	enhance: Add cgo call metrics for load/write API (#37405 ) Cgo API cost is not observerable since not metrics is related to them. This PR add metrics for some sync cgo call related to load & write --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-07 10:06:25 +08:00
congqixia	92028b7ff7	enhance: Use cancel label for ctx canceled storage op (#37468 ) Previously failed label is used for canceled storage op, which may cause wrong alarm when user cancel load operation or etc. This PR utilizes cancel label when such case happens. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-06 19:38:24 +08:00
aoiasd	d67853fa89	feat: Tokenizer support build with params and clone for concurrency (#37048 ) relate: https://github.com/milvus-io/milvus/issues/35853 https://github.com/milvus-io/milvus/issues/36751 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-11-06 17:48:24 +08:00
wei liu	8714774305	fix: search/query failed due to segment not loaded (#37403 ) issue: #36970 cause release segment and balance channel may happen at same time, and before new delegator become serviceable, if release segment exeuctes on new delegator, and search/query comes on old delegator, then release segment and query segment happens in parallel, if release segment execute first in worker, then search/query will got a SegmentNodeLoaded error. This PR add serviceable filter on delegator, then all load/release segment operation will happens on serviceable delegator. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-06 15:10:25 +08:00
SimFG	f1dd55e0c0	enhance: improve rootcoord task scheduling policy (#37352 ) - issue: #30301 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-11-06 14:48:26 +08:00
cai.zhang	625b6176cd	fix: Search for pk using raw data to reduce the overhead caused by views (#37202 ) issue: #37152 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-05 20:36:24 +08:00
congqixia	9a9de3df5c	enhance: Pass rpc stats via gin.Context (#37439 ) Related #37223 RPC stats worked in middleware but faild to get method & collection info Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-05 18:26:23 +08:00
Bingyi Sun	bd04cac4b3	fix: fix group by on chunked segment (#37292 ) https://github.com/milvus-io/milvus/issues/37244 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-11-05 17:12:23 +08:00
Zhen Ye	9a0e1c82bc	fix: repeated error code in milvus and segcore (#37359 ) issue: #37357 Signed-off-by: chyezh <chyezh@outlook.com>	2024-11-05 16:28:23 +08:00
wei liu	b83b376cfc	fix: Search/Query may failed during updating delegator cache. (#37116 ) issue: #37115 casue init query node client is too heavy, so we remove updateShardClient from leader mutex, which cause much more concurrent cornor cases. This PR delay query node client's init operation until `getClient` is called, then use leader mutex to protect updating shard client progress to avoid concurrent issues. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-05 10:52:23 +08:00
Zhen Ye	0c4321cf57	fix: crash when startup if the milvus volume is on-operation concurrently (#37312 ) issue: #37311 Signed-off-by: chyezh <chyezh@outlook.com>	2024-11-04 14:50:23 +08:00
foxspy	c27f477b6c	enhance: Update Knowhere version (#37333 ) issue: #37269 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-11-04 11:56:31 +08:00
foxspy	eaf86f7649	fix: optimize invalid datatype error msg (#37376 ) issue: #37151 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-11-04 11:46:28 +08:00
congqixia	f54cf41830	enhance: Move forward l0 logic out of delta lock (#37337 ) Related to #35303 `deleteMut` shall be protecting streaming delete buffer, forward l0 could be move out of the rlock section to reduce tsafe impact from loading segments. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-04 10:42:22 +08:00
smellthemoon	51cb2fbf97	fix: parse fail in empty json (#37294 ) #37200 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-11-03 19:00:21 +08:00
cai.zhang	50de122dc7	enhance: Rename textmatch to text_match (#37290 ) issue: #36672 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-03 18:40:27 +08:00
cai.zhang	0449c74d44	fix: Fix the bug where some expressions do not correctly parse the value (#37341 ) issue: #37274 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-02 13:24:22 +08:00
Ted Xu	b792b199d7	enhance: load deltalogs on demand when doing compactions (#37310 ) See #37234 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-11-01 16:40:21 +08:00
wayblink	d119a2541a	fix: fix hasCollection response has no status (#37254 ) issue: #37257 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-11-01 15:40:21 +08:00
congqixia	be71b98146	enhance: Fix case error msg after knowhere upgrade (#37339 ) Previous PR: #37315 Error message updated after knowhere behavior change, this PR update corresponding test error message. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-01 14:00:22 +08:00
Yinzuo Jiang	5aad000a93	enhance: [skip ci] remove tools/core_gen after #35251 (#36306 ) `tools/core_gen` python scripts are useless after https://github.com/milvus-io/milvus/pull/35251 fixes: #36305 Signed-off-by: Yinzuo Jiang <yinzuo.jiang@zilliz.com> Signed-off-by: Yinzuo Jiang <jiangyinzuo@foxmail.com>	2024-11-01 13:08:21 +08:00
foxspy	642a651f60	enhance: remove useless vector index param checker (#37329 ) issue: #34298 because all vector index config checker has been moved into vector_index_checker, then the useless checkers can be removed. Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-11-01 06:20:21 +08:00
foxspy	3224e58c5b	enhance: add unify vector index config management (#36846 ) issue: #34298 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-11-01 06:18:21 +08:00
Bingyi Sun	cd2655c861	fix: fix wrong method is called to fetch variable valid data (#37304 ) issue: https://github.com/milvus-io/milvus/issues/37147 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-11-01 01:52:20 +08:00
zhenshan.cao	63843dce33	fix: Fix conan gdal building problem (#37338 ) issue:https://github.com/milvus-io/milvus/issues/27576 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-10-31 21:04:16 +08:00
Hao Tan	67c4340565	feat: Geospatial Data Type and GIS Function Support for milvus server (#35990 ) issue:https://github.com/milvus-io/milvus/issues/27576 # Main Goals 1. Create and describe collections with geospatial fields, enabling both client and server to recognize and process geo fields. 2. Insert geospatial data as payload values in the insert binlog, and print the values for verification. 3. Load segments containing geospatial data into memory. 4. Ensure query outputs can display geospatial data. 5. Support filtering on GIS functions for geospatial columns. # Solution 1. Add Type: Modify the Milvus core by adding a Geospatial type in both the C++ and Go code layers, defining the Geospatial data structure and the corresponding interfaces. 2. Dependency Libraries: Introduce necessary geospatial data processing libraries. In the C++ source code, use Conan package management to include the GDAL library. In the Go source code, add the go-geom library to the go.mod file. 3. Protocol Interface: Revise the Milvus protocol to provide mechanisms for Geospatial message serialization and deserialization. 4. Data Pipeline: Facilitate interaction between the client and proxy using the WKT format for geospatial data. The proxy will convert all data into WKB format for downstream processing, providing column data interfaces, segment encapsulation, segment loading, payload writing, and cache block management. 5. Query Operators: Implement simple display and support for filter queries. Initially, focus on filtering based on spatial relationships for a single column of geospatial literal values, providing parsing and execution for query expressions. 6. Client Modification: Enable the client to handle user input for geospatial data and facilitate end-to-end testing.Check the modification in pymilvus. --------- Signed-off-by: tasty-gumi <1021989072@qq.com>	2024-10-31 20:58:20 +08:00
liliu-z	4bac2eb13e	enhance: Update Knowhere version (#37315 ) Signed-off-by: Li Liu <li.liu@zilliz.com>	2024-10-31 17:24:20 +08:00
Zhen Ye	448cc08960	fix: flowgraph crash when channel releasing (#37285 ) issue: #37284 Signed-off-by: chyezh <chyezh@outlook.com>	2024-10-31 16:30:21 +08:00
Xiaofan	f13faa37aa	fix: make sure alias is cached (#36807 ) fix #36806 Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-10-31 01:05:03 -07:00
cai.zhang	2ef6cbbf59	feat: The expression supports filling elements through templates (#37033 ) issue: #36672 The expression supports filling elements through templates, which helps to reduce the overhead of parsing the elements. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-10-31 14:20:22 +08:00
cai.zhang	4d98833bc3	fix: Set current partition stats version to 0 by default when not present (#37299 ) issue: #37156 1. Still need to record the current stats version. 2. Set it to 0 when the current stats version is not found. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-10-31 12:48:21 +08:00
smellthemoon	b8492498ac	fix: mask with valid data when preCheckOverflow (#37221 ) #37175 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-10-31 10:44:26 +08:00
Gao	2092dc0ba1	enhance: reserve vector space to reduce reallocate cost in Views() and StringViews() (#37182 ) issue: #37152 Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-10-31 10:02:21 +08:00
congqixia	7961568223	fix: Rectify `OffsetOrderedArray` contain logic (#37305 ) Related to #36887 Remove non-hit pk delete record logic does not work since `insert_record_.contain` does not work due to logic problem. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-30 21:26:19 +08:00
Patrick Weizhi Xu	43ad9af529	fix: use max MvccTs for iterator (#37247 ) issue: #37158 Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-10-30 13:58:20 +08:00
Bingyi Sun	90948e9444	fix: add SearchOnSealed unit test and fix a bug (#37241 ) issue: https://github.com/milvus-io/milvus/issues/37244 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-30 10:26:19 +08:00
Ted Xu	262a994d6d	enhance: generally improve the performance of mix compactions (#37163 ) See #37234 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-10-29 18:12:20 +08:00
congqixia	9539739781	enhance: Release compacted growing segment if in dropped list (#37245 ) See also #37205 Previously releasing growing segments could be triggered by two conditions: - Sealed Segment with same id is loaded - Segment start position is before target checkpoint ts Which has a worst case that the corresponding sealed segment is compacted and the checkpoint is pinned by a growing l0 segment. This PR introduces a new rule that: a growing segment could be released if the segment id appeared in current target dropped segment id list. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 18:04:21 +08:00
smellthemoon	86b9c3ef4a	fix: to just check null in group by field only (#37191 ) #37187 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-10-29 15:38:30 +08:00
congqixia	3106384fc4	enhance: Return deltadata for `DeleteCodec.Deserialize` (#37214 ) Related to #35303 #30404 This PR change return type of `DeleteCodec.Deserialize` from `storage.DeleteData` to `DeltaData`, which reduces the memory usage of interface header. Also refine `storage.DeltaData` methods to make it easier to usage. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 12:04:24 +08:00
congqixia	0f59bfdf30	enhance: Use middleware to observe restful v2 in/out rpc stats (#37223 ) Related to #36102 Previous PR #36107 add grpc inteceptor to observe rpc stats. Using same strategy, this pr add gin middleware to observer restful v2 rpc stats. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 11:22:24 +08:00
congqixia	5a0135727d	fix: Check resource when loading deltalogs (#37195 ) Related to #36887 `LoadDeltaLogs` API did not check memory usage. When system is under high delete load pressure, this could result into OOM quit. This PR add resource check for `LoadDeltaLogs` actions and separate internal deltalog loading function with public one. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 10:04:25 +08:00
congqixia	224d797f94	fix: Use singleton delete pool and avoid goroutine leakage (#37220 ) Related to #36887 Previously using newly create pool per request shall cause goroutine leakage. This PR change this behavior by using singleton delete pool. This change could also provide better concurrency control over delete memory usage. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 10:02:24 +08:00
XuanYang-cn	26028f4137	fix: Exlude L0 compaction when clustering is executing (#37141 ) Also remove conflit check when executing L0. The exclusive is already guarenteed in scheduler See also: #37140 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-29 06:28:24 +08:00
congqixia	d8c1bd24f2	enhance: Utilize proxy metacache for `HasCollection` (#37185 ) Related to #37183 Utilize proxy metacache for `HasCollection` request, if collection exists in metacache, it could be deducted that collection must exist in system. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-28 18:54:23 +08:00
Patrick Weizhi Xu	fc69df44a1	fix: set guarantee ts for seach/query iterator (#37180 ) issue: #37158 Return the GuaranteeTS so that the subsequent requests following the correct TS. BeginTS is the current timestamp when the task is created. The GuaranteeTS is the one parsed based on both consistency level and beginTS, in PreExecute of the task on Proxy. The delegator will wait until GuaranteeTS is met. In PostExecute of the task on Proxy, the TS of the first iterator request will be returned to the SDK and add it to the subsequent requests. Hence, if the default consistency level is Eventually or Bounded, the order of TS will be > Guarantee TS < BeginTS If it returns the BeginTS, the second request will need to catch up and result in extra 200ms max of latency, which results in something like \| Call \| Latency \| \| --- \| --- \| \| first call on `Next()` \| 30ms \| \| second call on `Next()` \| 210ms \| \| third call on `Next()` \| 10ms \| \| fourth call on `Next()` \| 11 ms \| \| ... \| ... \| where we expect \| Call \| Latency \| \| --- \| --- \| \| first call on `Next()` \| 30ms \| \| second call on `Next()` \| 10ms \| \| third call on `Next()` \| 10ms \| \| fourth call on `Next()` \| 11 ms \| \| ... \| ... \| Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-10-28 15:57:35 +08:00
congqixia	f87acdf2a2	fix: Ref collection meta when load l0 segment meta only (#37178 ) Related to #37177 Previous PR #37160 Collection meta is not ref-ed when loading l0 segment in `RemoteLoad` policy, which cause collection meta release when lots of l0 segment released. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-28 15:49:38 +08:00
jaime	33b0b8df80	fix: may exceed max tnx in etcd operations (#36775 ) issue: #36772 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-28 15:37:30 +08:00
cai.zhang	86687bd8ed	enhance: Refine code for get_deleted_bitmap (#36819 ) issue: #33744 Check whether the PK is truly sorted in the debug model. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-10-28 15:19:30 +08:00
XuanYang-cn	4926021c02	fix: Skip mark compaction timeout for mix and l0 compaction (#37118 ) Timeout is a bad design for long running tasks, especially using a static timeout config. We should monitor execution progress and fail the task if the progress has been stale for a long time. This pr is a small patch to stop DC from marking compaction tasks timeout, while still waiting for DN to finish. The design is self-conflicted. After this pr, mix and L0 compaction are no longer controlled by DC timeout, but clustering is still under timeout control. The compaction queue capacity grows larger for priority calc, hence timeout compactions appears more often, and when timeout, the queuing tasks will be timeout too, no compaction will success after. See also: #37108, #37015 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-28 14:33:29 +08:00
Bingyi Sun	b81f162f6a	fix: fix several bugs and refactor some codes related with chunked segment (#37168 ) issue: https://github.com/milvus-io/milvus/issues/37147 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-28 14:17:30 +08:00
congqixia	7774b7275e	enhance: Replace PrimaryKey slice with PrimaryKeys saving memory (#37127 ) Related to #35303 Slice of `storage.PrimaryKey` will have extra interface cost for each element, which may cause notable memory usage when delta row count number is large. This PR replaces PrimaryKey slice with PrimaryKeys interface saving the extra interface cost. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-28 10:29:30 +08:00
jaime	9d16b972ea	feat: add tasks page into management WebUI (#37002 ) issue: #36621 1. Add API to access task runtime metrics, including: - build index task - compaction task - import task - balance (including load/release of segments/channels and some leader tasks on querycoord) - sync task 2. Add a debug model to the webpage by using debug=true or debug=false in the URL query parameters to enable or disable debug mode. Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-28 10:13:29 +08:00
foxspy	d7b2ffe5aa	enhance: add an unify vector index config checker (#36844 ) issue: #34298 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-10-28 10:11:37 +08:00
zhagnlu	eeb67a3845	fix:reset default auto index type for scalar (#37086 ) #32900 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-10-27 16:19:29 +08:00
Bingyi Sun	a2f0092e39	fix: check sparse float before calling get_dim (#37145 ) https://github.com/milvus-io/milvus/issues/37146 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-26 16:25:29 +08:00
aoiasd	fd72151037	fix: merge datanode bm25 error after reload growing segment with no data (#37154 ) Segment with numrow 0 don't init bm25 stats, cause flush with bm25 stats failed. relate: https://github.com/milvus-io/milvus/issues/37150 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-26 07:43:28 +08:00
congqixia	05f880708d	enhance: Make skip load work for all branches (#37160 ) Related to #37112 Skip load logic used to work only when there is multiple segment load info entires in load request. In continous delete case, delegator still loads l0 segment, which occupies lot of memory. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-25 23:37:29 +08:00
yihao.dai	ed37c27bda	fix: Fix collection leak in querynode (#37061 ) Unref the removed L0 segment count. issue: https://github.com/milvus-io/milvus/issues/36918 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 19:59:29 +08:00
smellthemoon	44ddcb5a63	fix: not check has_value before get value in JSON (#37128 ) https://github.com/milvus-io/milvus/issues/36236 also: https://github.com/milvus-io/milvus/issues/37113 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-10-25 17:19:28 +08:00
yihao.dai	d7b2906318	enhance: Make dataNode.import.maxConcurrentTaskNum dynamic (#37102 ) Resize import execution pool when config `dataNode.import.maxConcurrentTaskNum` update. issue: https://github.com/milvus-io/milvus/issues/37095 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 16:51:29 +08:00
SimFG	1cc9cb49ad	enhance: allow to delete data when disk quota exhausted (#37134 ) - issue: #37133 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-10-25 16:47:29 +08:00
cqy123456	ff0b7ea0ef	enhance: build interim index for mmapped vector in ChunkedSealedSegment (#36993 ) issue:https://github.com/milvus-io/milvus/issues/36392 related pr: https://github.com/milvus-io/milvus/pull/36391 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-10-25 15:55:28 +08:00
Yinzuo Jiang	3628593d20	feat: Implement custom function module in milvus expr (#36560 ) OSPP 2024 project: https://summer-ospp.ac.cn/org/prodetail/247410235?list=org&navpage=org Solutions: - parser (planparserv2) - add CallExpr in planparserv2/Plan.g4 - update parser_visitor and show_visitor - grpc protobuf - add CallExpr in plan.proto - execution (`core/src/exec`) - add `CallExpr` `ValueExpr` and `ColumnExpr` (both logical and physical) for function call and function parameters - function factory (`core/src/exec/expression/function`) - create a global hashmap when starting milvus (see server.go) - the global hashmap stores function signatures and their function pointers, the CallExpr in execution engine can get the function pointer by function signature. - custom functions - empty(string) - starts_with(string, string) - add cpp/go unittests and E2E tests closes: #36559 Signed-off-by: Yinzuo Jiang <jiangyinzuo@foxmail.com>	2024-10-25 15:25:30 +08:00
yihao.dai	b45cf2d49f	enhance: Add max length check for csv import (#37077 ) 1. Add max length check for csv import. 2. Tidy import options. 3. Tidy common import util functions. issue: https://github.com/milvus-io/milvus/issues/34150 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 14:37:29 +08:00
Buqian Zheng	088d5d7d76	fix: optimize BM25 err message (#37074 ) issue: https://github.com/milvus-io/milvus/issues/37022 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-25 14:35:45 +08:00
smellthemoon	84d48b498b	enhance: support upsert autoid==true in Restful API (#37072 ) Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-10-25 14:33:39 +08:00
yihao.dai	6e90f9e8d9	enhance: Support db for bulkinsert (#37012 ) issue: https://github.com/milvus-io/milvus/issues/31273 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 14:31:39 +08:00
aoiasd	22b917a1e6	enhance: Add collection name label for some metric (#36951 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-25 14:29:47 +08:00
smellthemoon	6ef014d931	fix: get correct size when sealed segment chunked (#37062 ) #37019 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-10-25 12:01:31 +08:00
Gao	ad2df904c6	fix: correctly set ExecTermArrayVariableInField bitset result (#37111 ) issue: https://github.com/milvus-io/milvus/issues/37110 Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-10-24 18:52:02 -07:00
Bingyi Sun	bf956a3ec2	fix: fix string field has invalid utf-8 (#37104 ) issue: https://github.com/milvus-io/milvus/issues/37083 We use vector of string_view to save data temporally but real string data will be released after record batch is deconstructed. Change it to vector of string to avoid memory corruption. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-24 18:33:47 -07:00
smellthemoon	2b3f5bec07	fix: panic when create index on all none data (#37046 ) #37045 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-10-24 17:09:28 +08:00
congqixia	b086ef6b19	enhance: Skip load delta data in delegater when using RemoteLoad (#37082 ) Related to #35303 Delta data is not needed when using `RemoteLoad` l0 forward policy. By skipping load delta data, memory pressure could be eased if l0 segment size/number is large. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-24 16:21:37 +08:00
wei liu	39a91eb100	fix: Delegator may becomes unserviceable after querycoord restart (#37055 ) issue: #37054 after querycoord restart, segment_checker may release segment by mistake due to next target isn't ready yet. This PR requires release segment must happens after next target is ready. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-10-24 12:21:28 +08:00
congqixia	d8db3e8761	enhance: Add metrics for querynode delete buffer info (#37081 ) Related to #35303 This PR add metrics for querynode delegator delete buffer information, which is related to dml quota logic. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-24 10:47:28 +08:00
congqixia	f43527ef6f	enhance: Batch forward delete when using DirectForward (#37076 ) Relatedt #36887 DirectFoward streaming delete will cause memory usage explode if the segments number was large. This PR add batching delete API and using it for direct forward implementation. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-24 10:39:28 +08:00
wayblink	49b562207c	fix: Refine compactionTask to avoid data race (#36936 ) issue: #36897 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-10-24 09:55:28 +08:00
wei liu	f029314e20	fix: Dynamic release parition may fail search/query. (#37049 ) issue: #33550 cause wrong impl of UpdateCollectionNextTarget, if ReleaseCollection and UpdateCollectionNextTarget happens at same time, the the released partition's segment list may be add to target again, and delegator will be marked as unserviceable due to lack of segment. This PR fix the impl of UpdateCollectionNextTarget Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-10-24 01:03:28 +08:00
Gao	1d61b604e1	enhance: support retry search when topk is reduced and result not enough (#35645 ) issue: #35576 This pr is to cover those cases when queryHook optimize search params and make the result size insufficient, add retry search mechanism and add related metrics for alarming. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-10-23 19:19:30 +08:00
yellow-shine	8902e2220e	enhance: enable asan for cpp unittest (#37041 ) https://github.com/milvus-io/milvus/issues/35854 Signed-off-by: chyezh <chyezh@outlook.com> Co-authored-by: chyezh <chyezh@outlook.com>	2024-10-23 17:21:27 +08:00
cai.zhang	ac8c5fcd5d	enhance: Remove pre-marking segments as L2 during clustering compaction (#36799 ) issue: #36686 This pr will remove pre-marking segments as L2 during clustering compaction in version 2.5, and ensure compatibility with version 2.4. The core of this change is to ensure that the many-to-many lineage derivation logic is correct, making sure that both the parent and child cannot simultaneously exist in the target segment view. feature: - Clustering compaction no longer marks the input segments as L2. - Add a new field `is_invisible` to `segmentInfo`, and mark segments that have completed clustering but have not yet built indexes as `is_invisible` to prevent them from being loaded prematurely." - Do not mark the input segment as `Dropped` before the clustering compaction is completed. - After compaction fails, only the result segment needs to be marked as Dropped. compatibility: - If the upgraded task has not failed, there are no compatibility issues. - If the status after the upgrade is `MetaSaved`, then skip the stats task based on whether TmpSegments is empty. - If the failure occurs before `MetaSaved`: - there are no ResultSegments, and InputSegments have not been marked as dropped yet. - the level of input segments need to revert to LastLevel - If the failure occurs after `MetaSaved`: - ResultSegments have already been generated, and InputSegments have been marked as Dropped. At this point, simply make the ResultSegments visible. - the level of ResultSegments needs to be set to L1（in order to participate in mixCompaction） --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-10-23 17:15:28 +08:00
yihao.dai	f0b3942a08	enhance: Limit import job number (#36891 ) issue: https://github.com/milvus-io/milvus/issues/36890 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-23 16:01:28 +08:00
Zhen Ye	f3d9d05a28	fix: use binlog counter to trigger flush but not stats log (#37037 ) issue: #36804 Signed-off-by: chyezh <chyezh@outlook.com>	2024-10-23 15:07:29 +08:00
jaime	4746f47282	feat: management WebUI homepage (#36822 ) issue: #36784 1. Implement an embedded web server for WebUI access. 2. Complete the homepage development. Home page demo: <img width="2177" alt="iShot_2024-10-10_17 57 34" src="https://github.com/user-attachments/assets/38539917-ce09-4e54-a5b5-7f4f7eaac353"> Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-23 11:29:28 +08:00
congqixia	30121a5a0d	fix: Rectify delete buffer row count quota value (#37060 ) Related to #37057 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-23 10:15:29 +08:00
Bingyi Sun	90b3907a92	fix: fix missing return value in chunked column (#37064 ) issue: https://github.com/milvus-io/milvus/issues/36834 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-22 10:29:19 -07:00
Chun Han	e2f2fd55a5	enhance: avoid limiting ddl operations repeatedly(#37006 ) (#37010 ) related: #37006 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-10-22 20:11:27 +08:00
congqixia	5dd3f44cc1	enhance: Preallocate delete data slice to avoid growslice (#37043 ) Related to #36887 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-22 19:07:29 +08:00
congqixia	0d8f20f7ce	fix: Pass full field list when partial load enabled (#37053 ) Related to #37038 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-22 18:43:27 +08:00

1 2 3 4 5 ...

9705 Commits (1b4f7e3ac118b812f03c4cb84a281d3d5a3fa574)