milvus

Commit Graph

Author	SHA1	Message	Date
sparknack	bdd65871ea	enhance: tiered storage: estimate segment loading resource usage while considering eviction (#43323 ) issue: #41435 After introducing the caching layer's lazy loading and eviction mechanisms, most parts of a segment won't be loaded into memory or disk immediately, even if the segment is marked as LOADED. This means physical resource usage may be very low. However, we still need to reserve enough resources for the segments marked as LOADED. Thus, the logic of resource usage estimation during segment loading, which based on physcial resource usage only for now, should be changed. To address this issue, we introduced the concept of logical resource usage in this patch. This can be thought of as the base reserved resource for each LOADED segment. A segment’s logical resource usage is derived from its final evictable and inevictable resource usage and calculated as follows: ``` SLR = SFPIER + evitable_cache_ratio * SFPER ``` it also equals to ``` SLR = (SFPIER + SFPER) - (1.0 - evitable_cache_ratio) * SFPER ``` `SLR`: The logical resource usage of a segment. `SFPIER`: The final physical inevictable resource usage of a segment. `SFPER`: The final physical evictable resource usage of a segment. `evitable_cache_ratio`: The ratio of a segment's evictable resources that can be cached locally. The higher the ratio, the more physical memory is reserved for evictable memory. When loading a segment, two types of resource usage are taken into account. First is the estimated maximum physical resource usage: ``` PPR = HPR + CPR + SMPR - SFPER ``` `PPR`: The predicted physical resource usage after the current segment is allowed to load. `HPR`: The physical resource usage obtained from hardware information. `CPR`: The total physical resource usage of segments that have been committed but not yet loaded. When one new segment is allow to load, `CPR' = CPR + (SMR - SER)`. When one of the committed segments is loaded, `CPR' = CPR - (SMR - SER)`. `SMPR`: The maximum physical resource usage of the current segment. `SFPER`: The final physical evictable resource usage of the current segment. Second is the estimated logical resource usage, this check is only valid when eviction is enabled: ``` PLR = LLR + CLR + SLR ``` `PLR`: The predicted logical resource usage after the current segment is allowed to load. `LLR`: The total logical resource usage of all loaded segments. When a new segment is loaded, `LLR` should be updated to `LLR' = LLR + SLR`. `CLR`: The total logical resource usage of segments that have been committed but not yet loaded. When one new segment is allow to load, `CLR' = CLR + SLR`. When one of the committed segments is loaded, `CLR' = CLR - SLR`. `SLR`: The logical resource usage of the current segment. Only when `PPR < PRL && PLR < PRL` (`PRL`: Physical resource limit of the querynode), the segment is allowed to be loaded. --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-08-01 21:31:37 +08:00
Buqian Zheng	b0226ef47c	fix: added more comprehensive container limit detection (#43693 ) issue: #41435 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-08-01 20:37:37 +08:00
wei liu	ecc2ac0426	fix: apply load config changes failed after restart (#43554 ) issue: #43107 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-08-01 20:13:37 +08:00
Xianhui Lin	0f0edff7f0	fix: increment offset for null data rows in JsonKeyStats (#43679 ) fix: increment offset for null data rows in JsonKeyStatsInvertedIndex issue: https://github.com/milvus-io/milvus/issues/43151 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-08-01 15:53:37 +08:00
Buqian Zheng	21cec95fe8	fix: fix disk path sent to cachinglayer (#43685 ) `localDataRootPath` is used to init local chunk manager and has `querynode` appended to it, thus is incorrect #41435 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-08-01 13:19:36 +08:00
zhagnlu	2594250906	fix: fix miss loading index for storagev2 (#43674 ) #43653 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-08-01 13:07:36 +08:00
congqixia	5f2f4eb3d6	enhance: Ignore entry with same ts when DeleteRecord search pks (#43669 ) Related to #43660 This patch reduces the unwanted offset&ts entries having same timestamp of delete record. Under large amount of upsert, this false hit could increase large amount of memory usage while applying delete. The next step could be passing a callback to `search_pk_func_` to handle hit entry streamingly. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-08-01 10:15:36 +08:00
Ted Xu	e37cd19da2	enhance: enable storage v2 by default (#43652 ) Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-08-01 08:59:36 +08:00
zhagnlu	239f743a18	fix: add enable_mmap key to load config (#43672 ) #43670 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-31 21:35:37 +08:00
sparknack	4aabe23a45	enhance: update flat_hash_map.hpp to a modified version (#43506 ) issue: #41435 Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-07-31 20:09:36 +08:00
Chun Han	d72c0357ff	fix: empty hybridsearch result due to one-sub-search empty(#43537 ) (#43647 ) related: #43537 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-07-31 19:47:37 +08:00
congqixia	f29964bd17	fix: Add padding for sorted index preventing 0 length mmap (#43663 ) Related to #43655 This patch add a padding when writing mmap file for ScalarSortedIndex in case of mmap falure due to 0 mmap length. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-31 18:53:36 +08:00
zhagnlu	708e426bb3	enhance: using set element for string term type (#43049 ) issue: #43048 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-31 10:35:37 +08:00
zhagnlu	31801f5937	fix: fix pk in [..] skip next batch when using multi-chunk segment (#43618 ) #43494 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-31 10:15:37 +08:00
congqixia	089f02bcca	fix: [StorageV2] Align null bitmap offset for fixed-length datatype (#43654 ) Related to #43626 Similar to previous pr #43321, null bitmap could be dislocated if the bitset ptr does not count the offset of array Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-31 09:55:36 +08:00
congqixia	6a74a7de66	enhance: Make DeleteRecord search pks by batch and PinAll (#43640 ) Related to #43592 When delete records are large, search pk one by one will result into many `Pincells` call which creates lots of futures. This patch make search pk execute in batch to reduce this cost. Also add `GetAllChunks` API to utilize `PinAllCells` to reduce pins. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-30 19:15:36 +08:00
SimFG	9ffcc55b55	fix: Clean privilege cache after loading policy in InitPolicyInfo (#43642 ) - issue: #43641 Signed-off-by: SimFG <bang.fu@zilliz.com>	2025-07-30 16:57:37 +08:00
wei liu	1fae8f5ae3	enhance: Optimize FlushAll performance for multi-table scenarios (#43339 ) Replace multiple per-table flush RPC calls with single FlushAll RPC to improve performance in multi-table scenarios. issue: #43338 - Implement server-side FlushAll request processing in DataCoord/MixCoord - Add flushAllTask to handle unified flush operations across all tables - Replace proxy-side per-table flush iteration with single RPC call - Support both streaming and non-streaming service execution paths - Add comprehensive unit tests for new FlushAll implementation --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-30 15:37:37 +08:00
sthuang	a2c7ed2780	fix: [StorageV2] sort field binlogs paths for packed reader and writer (#43585 ) key changes: * fix unstable storage v2 compaction unit test by guaranteeing the order of paths during sync. * bump milvus-storage version, include https://github.com/milvus-io/milvus-storage/pull/222 https://github.com/milvus-io/milvus-storage/pull/223 https://github.com/milvus-io/milvus-storage/pull/224 https://github.com/milvus-io/milvus-storage/pull/225 https://github.com/milvus-io/milvus-storage/pull/226 * Also fix the below related oom issue. related: https://github.com/milvus-io/milvus/issues/43310 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-30 08:09:36 +08:00
congqixia	4fe55e3008	fix: [StorageV2] Use separate channel for `get_cells` (#43632 ) Related to #43584 There might be concurrent calls on `translator.get_cells`. The channel cannot be shared among these calls, otherwise the logic will break. This patch create new channel for each `get_cells` invocation in case of data race. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-29 20:59:38 +08:00
Zhen Ye	3e3775fb81	fix: panics when describe collection internal failure (#43630 ) issue: #43629 - also fix the scanner_switchable panic underlying wal scanner return context error. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-29 20:33:36 +08:00
Zhen Ye	cd38d65417	fix: make savebinlogpath idompotent at binlog level (#43615 ) issue: #43574 - update all binlog every time when calling udpate savebinlogpath. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-29 19:47:36 +08:00
foxspy	d57890449f	enhance: update knowhere version (#43528 ) issue: #42937 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-07-29 17:21:36 +08:00
XuanYang-cn	0ccb95303e	feat: [CMEK] Add utils to load plugins (#42986 ) See also: #40321 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-07-29 17:17:36 +08:00
Buqian Zheng	052fb6c562	feat: add time based eviction to data managed by cachinglayer (#43490 ) issue: https://github.com/milvus-io/milvus/issues/41435 also added disk capacity protection --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-29 16:17:35 +08:00
Bingyi Sun	a765cd1eaa	enhance: unlink mmap file when chunk and index are destructed (#43524 ) issue: https://github.com/milvus-io/milvus/issues/41636 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-29 16:05:36 +08:00
congqixia	268f1cdace	fix: Hold field shared_ptr in case of being released (#43614 ) Related to #43584 Directly accessing `fields_` in `get_raw_data` may have race if load vec index happens concurrently during getting raw data. This PR make `bulk_subscript` hold shared_ptr of field column prevent field column being release during reading it. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-29 12:15:36 +08:00
Chun Han	4ee9f63f72	fix: return id by default(#43595 ) (#43601 ) related: #43595 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-07-29 12:07:36 +08:00
aoiasd	c9412434c8	enhance: add char group tokenizer (#42793 ) relate: https://github.com/milvus-io/milvus/issues/42792 Add char group tokenizer which support use costum char group or use some build-in char group as delimiters. Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-07-29 11:11:35 +08:00
congqixia	f666d89919	fix: [StorageV2] Access future result to get exception if any (#43613 ) Related to #43584 When `LoadWithStrategy` throw exception, the ex was wrapped in the returned future. If the future is not handled, this exception would be ignored. This patch add `future.get()` to get exception if any. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-28 22:33:35 +08:00
Xiaofan	bd31b32167	fix: hybridsearch should support offset param in restful api (#43586 ) Add support of offset param for reqeustful. api and refine some constant usage related #43556 Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2025-07-28 22:15:36 +08:00
yihao.dai	a29b3272b0	fix: Improve import memory management to prevent OOM (#43568 ) 1. Use blocking memory allocation to wait until memory becomes available 2. Perform memory allocation at the file level instead of per task 3. Limit Parquet file reader batch size to prevent excessive memory consumption 4. Limit import buffer size from 20% to 10% of total memory issue: https://github.com/milvus-io/milvus/issues/43387, https://github.com/milvus-io/milvus/issues/43131 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-28 21:25:35 +08:00
Zhen Ye	5b9b895cb0	fix: get schema panics when recover from channel checkpoint (#43605 ) issue: #43597, #43598 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-28 16:42:56 +08:00
Spade A	864d1b93b1	enhance: enable stlsort with mmap support (#43359 ) issue: https://github.com/milvus-io/milvus/issues/43358 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-28 15:32:55 +08:00
zhagnlu	9bf1cb02d5	fix: add array_contains_all int to float converter (#43593 ) #43334 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-28 14:14:55 +08:00
yihao.dai	192521c6bd	enhance: Fix unbalanced task scheduling (#43581 ) Make scheduler always pick the node with the most available slots. issue: https://github.com/milvus-io/milvus/issues/43580 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-28 12:58:55 +08:00
congqixia	34d3f0c0f8	enhance: Reserve builder space for ValueSerializer (#43570 ) Add `arrowBuild.Reserve` call for `ValueSerializer` to reduce repeated resizing buffer when write size is large Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-28 11:02:55 +08:00
wei liu	7b8bf6393b	enhance: Improve partial result evaluation with row count based strategy (#43361 ) issue: #43360 Enhance the partial result evaluation mechanism in delegator to use row count based data ratio instead of simple segment count ratio for better accuracy. Key improvements: - Introduce PartialResultEvaluator interface for flexible evaluation strategy - Implement NewRowCountBasedEvaluator using sealed segment row count data - Replace segment count based ratio with row count based data ratio calculation - Update PinReadableSegments to return sealedRowCount information - Modify executeSubTasks to use configurable evaluator for partial result decisions - Add comprehensive unit tests for the new row count based evaluation logic This change provides more accurate partial result evaluation by considering the actual data volume rather than just segment quantity, leading to better query performance and consistency when some segments are unavailable. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-28 10:18:55 +08:00
Zhen Ye	7877aaa96c	fix: dirty cp metrics after drop (#43567 ) issue: #42688 - The channel cp is dropped by garbage collector - The channel is dropped and the cp is marked as math.Uint64 - If we drop it here, the update channel checkpoints will write the dirty cp back. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-27 23:22:55 +08:00
Zhen Ye	feb5db60f2	fix: make flush save binlog paths idempotent (#43579 ) issue: #43574 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-27 23:14:55 +08:00
Spade A	faeb7fd410	feat: impl StructArray -- create schema, insert, and retrieve data (#42855 ) Ref https://github.com/milvus-io/milvus/issues/42148 https://github.com/milvus-io/milvus/pull/42406 impls the segcore part of storage for handling with VectorArray. This PR: 1. impls the go part of storage for VectorArray 2. impls the collection creation with StructArrayField and VectorArray 3. insert and retrieve data from the collection. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <u6748471@anu.edu.au>	2025-07-27 01:30:55 +08:00
Buqian Zheng	b497d3d7a4	fix: call promise->setValue only after released the ListNode mtx (#43547 ) issue: #43261 `promise->setValue(folly::Unit());` may run callbacks inline and some of them may attempt to grab `mtx_`. So we should not call `promise->setValue(folly::Unit());` while holding the lock. --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-26 18:34:55 +08:00
Ted Xu	9041bf1b9a	fix: including shouldCopy parameter in file readers (#43578 ) This parameter determines whether the returned value should be a copy or a reference from the arrow array. The updates enhance memory management and provide more control over data handling during deserialization. See #43186 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-07-26 17:30:55 +08:00
Bingyi Sun	742d72a6c2	fix: Fix wrong null offsets for json path index (#43390 ) issue: https://github.com/milvus-io/milvus/issues/43315 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-26 17:26:54 +08:00
Bingyi Sun	a89e579485	fix: use tantivy version to make json index compatible with milvus 2.5 (#43563 ) issue: https://github.com/milvus-io/milvus/issues/43562 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-26 17:18:55 +08:00
congqixia	0b860b4aec	fix: Revert "enhance: DataCodec to release ownership of input_data after initialization (#43542 )" (#43571 )	2025-07-25 20:48:16 +08:00
Zhen Ye	070aabd27e	enhance: fix remove flushing state of segment (#43560 ) issue: #43559, #42884 - also fix the data lost when streaming resuming from old arch message. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-25 18:08:54 +08:00
congqixia	2a7b7a811a	fix: [StorageV2] Throw exception when read rg fails (#43561 ) Related to #43261 Read error with catched in `LoadWithStrategy`. Caller could not detect read failure when some error occurred. This patch make `LoadWithStrategy` throw ex instead of swallowing it. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-25 17:40:55 +08:00
yihao.dai	0e1f367164	enhance: Fail compaction task to prevent data loss (#43545 ) We’ve frequently observed data loss caused by broken mutual exclusion in compaction tasks. This PR introduces a post-check: before modifying metadata upon compaction task completion, it verifies the state of the input segments. If any input segment has been dropped, the compaction task will be marked as failed. issue: https://github.com/milvus-io/milvus/issues/43513 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-25 16:24:54 +08:00
Ted Xu	078ccf5e08	fix: the underlying record got released in clustering compaction (#43551 ) See: #43186 In this PR: 1. Flush renamed to FlushChunk, while a new Flush primitive is introduced to serialize values to records. 2. Segment mapping in clustering compaction now process data by records instead of values, it calls flush to all buffers after each record is processed. Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-07-25 15:04:54 +08:00
Buqian Zheng	d23205b718	enhance: DataCodec to release ownership of input_data after initialization (#43542 ) issue: https://github.com/milvus-io/milvus/issues/43088 issue: https://github.com/milvus-io/milvus/issues/43038 see also https://github.com/milvus-io/milvus/pull/43533. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-25 14:24:54 +08:00
wei liu	369a811ae1	fix: only clear exclude node list after refresh shard leader cache (#43553 ) issue: #43511 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-25 14:18:54 +08:00
sthuang	5cebc9f7f6	fix: [StorageV2] handle correct cid with multiple files and add storage v2 prefix logs (#43539 ) related: #43372 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-25 11:22:54 +08:00
Shuyoou	87326a5a64	fix: [skip e2e] webui collection filter params error (#42969 ) Fix Issue: #40929 Signed-off-by: Shuyoou <shuyoou@outlook.com>	2025-07-25 10:40:53 +08:00
Spade A	10fe53ff59	feat: support json for ngram (#43170 ) Ref https://github.com/milvus-io/milvus/issues/42053 This PR enable ngram to support json data type. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-25 10:28:54 +08:00
sthuang	a0c9f499ee	fix: [StorageV2] sync panic with nullable add field (#43142 ) related: https://github.com/milvus-io/milvus/pull/42932 fix: https://github.com/milvus-io/milvus/issues/43072 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-25 10:08:53 +08:00
zhagnlu	c86307aef0	enhance: forbid two column comparison with json type in parser stage (#43382 ) #43381 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-24 19:42:54 +08:00
yihao.dai	804a7692a6	fix: Fix delete loss caused by missing mutual exclusion in sort compaction (#43540 ) issue: https://github.com/milvus-io/milvus/issues/43513 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-24 14:53:34 +08:00
Buqian Zheng	d367770649	enhance: greatly reduce the loading memory overhead - by up to 25% (#43533 ) issue: #43088 issue: #43038 The current loading process: * When loading an index, we first download the index files into a list of buffers, say A * then constructing(copying) them into a vector of FieldDatas(each file is a FieldData), say B * assembles them together as a huge BinarySet, say C * lastly, copy into the actual index data structure, say D The problem: * We can see that, after each step, we don't need the data in previous step. * But currently, we release the memory of A, B, C only after we have finished constructing D * This leads to a up to 4x peak memory usage comparing with the raw index size, during the loading process * This PR allows timely releasing of B after we assembled C. So after this PR, the peak memory usage during loading will be up to 3x of the raw index size. I will create another PR to release A after we created B, that seems more complicated and need more work. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-24 11:26:54 +08:00
congqixia	4bdb5ccafa	fix: Close segment writer when reader returns error (#43531 ) Realted #43520 Datanode may have memory leakage when reader returns error. In previously mention issue, datanodes got OOM killed due to continueous error in read path. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-24 11:18:54 +08:00
Jean-Francois Weber-Marx	1bd66b09e3	enhance: allow '.' and '-' characters in usernames (#42417 ) (#42588 ) related: #42417 - update the isValidUsername function to accept dots and hyphens in addition to letters, digits, and underscores - this change improves compatibility with common username formats and addresses feedback in issue #42417 Signed-off-by: Jean-Francois Weber-Marx <jfwm@hotmail.com> Signed-off-by: Jean-Francois Weber-Marx <jf.webermarx@criteo.com>	2025-07-24 09:54:54 +08:00
wei liu	990a25e51a	fix: Prevent delete records loss during slow segment loading [QueryNodeV2] (#43527 ) issue: #42884 Fixes an issue where delete records for a segment are lost from the delete buffer if `load segment` execution on the delegator is too slow, causing `syncTargetVersion` or other cleanup operations to clear them prematurely. Changes include: - Introduced `Pin` and `Unpin` methods in `DeleteBuffer` interface and its implementations (`doubleCacheBuffer`, `listDeleteBuffer`). - Added a `pinnedTimestamps` map to track timestamps protected from cleanup by specific segments. - Modified `LoadSegments` in `shardDelegator` to `Pin` relevant segment delete records before loading and `Unpin` them afterwards. - Added `isPinned` check in `UnRegister` and `TryDiscard` methods of `listDeleteBuffer` to skip cleanup if corresponding timestamps are pinned. - Added comprehensive unit tests for `Pin`, `Unpin`, and `isPinned` functionality, covering basic, multiple pins, concurrent, and edge cases. This ensures the integrity of delete records by preventing their premature removal from the delete buffer during segment loading. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-24 01:00:54 +08:00
congqixia	1cf8ed505f	fix: Implement `NeededFields` feature in `RecordReader` (#43523 ) Related to #43522 Currently, passing partial schema to storage v2 packed reader may trigger SEGV during clustering compaction unit test. This patch implement `NeededFields` differently in each `RecordReader` imlementation. For now, v2 will implemented as no-op. This will be supported after packed reader support this API. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-24 00:22:54 +08:00
Zhen Ye	e9ab73e93d	enhance: add schema version at recovery storage (#43500 ) issue: #43072, #43289 - manage the schema version at recovery storage. - update the schema when creating collection or alter schema. - get schema at write buffer based on version. - recover the schema when upgrading from 2.5. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-23 21:38:54 +08:00
yihao.dai	9fbd41a97d	fix: Adjust binlog and parquet reader buffer size for import (#43495 ) 1. Modify the binlog reader to stop reading a fixed 4096 rows and instead use the calculated bufferSize to avoid generating small binlogs. 2. Use a fixed bufferSize (32MB) for the Parquet reader to prevent OOM. issue: https://github.com/milvus-io/milvus/issues/43387 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-23 21:28:54 +08:00
foxspy	ed57650b52	fix: remove invalid restrictions on dim for int8 vector (#43469 ) issue: #43466 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-07-23 20:22:54 +08:00
cai.zhang	74c08069ef	fix: Set result storage version for sort compaction (#43521 ) issue: #43520 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-23 19:04:53 +08:00
zhagnlu	d64dceea47	fix:add convert int to float function to array_contains related expr (#43468 ) #43281 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-23 15:20:53 +08:00
junjiejiangjjj	4db877f76c	fix: Fix weighted rerank (#43503 ) #43478 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-07-23 14:54:53 +08:00
Buqian Zheng	7ced9fc5d9	fix: fix loading resource estimation (#43509 ) currently we multiplied the requesting size when adding to loading, but did not do so when estimating projected usage. issue: #43088 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-23 10:36:53 +08:00
congqixia	cc1034fe96	fix: [AddField] Resolve FieldIndexing dangling reference (#43499 ) Related to #43113 This PR: - Change member of FieldIndex from `FieldMeta &` to needed `DataType` and dim member resolving dangling reference after schema change - Add double check after acquiring lock to reduce multiple assignment - Change `auto schema` to `auto& schema` to reduce schema copy Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-23 00:14:52 +08:00
sthuang	59bbdd93f5	fix: [StorageV2] fill the correct group chunk into cell (#43486 ) The root cause of the issue lies in the fact that when a sealed segment contains multiple row groups, the get_cells function may receive unordered cids. This can result in row groups being written into incorrect cells during data retrieval. Previously, this issue was hard to reproduce because the old Storage V2 writer had a bug that caused it to write row groups larger than 1MB. These large row groups could lead to uncontrolled memory usage and eventually an OOM (Out of Memory) error. Additionally, compaction typically produced a single large row group, which avoided the incorrect cell-filling issue during query execution. related: https://github.com/milvus-io/milvus/issues/43388, https://github.com/milvus-io/milvus/issues/43372, https://github.com/milvus-io/milvus/issues/43464, #43446, #43453 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-22 22:22:53 +08:00
XuanYang-cn	92f4fc0e8b	fix: Set status when err is not empty (#43403 ) See also: #43341 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-07-22 17:48:53 +08:00
cai.zhang	f19e0ef6e4	fix: Ensure task execution order by using a priority queue (#43271 ) issue: #43260 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-22 17:42:53 +08:00
cai.zhang	e26a532504	enhance: Only download necessary fields during clustering analyze phase (#43322 ) issue: #43310 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-22 16:40:52 +08:00
Zhen Ye	df7e507c49	fix: balance may not trigger at balance checker when upgrading (#43462 ) issue: #43416 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-22 16:02:53 +08:00
Buqian Zheng	0599113a4b	enhance: add timeout to resource reservation (#43441 ) issue: https://github.com/milvus-io/milvus/issues/41435 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-22 15:24:53 +08:00
yihao.dai	a839017e81	fix: Handle retry state in import task (#43474 ) issue: https://github.com/milvus-io/milvus/issues/43473 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-22 14:52:53 +08:00
Chun Han	5a1092304c	fix: refine judgement for batch views(#38736 ) (#43481 ) related: #38736 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-07-22 14:20:53 +08:00
congqixia	5c0f0ee765	enhance: [StorageV2] Return EOF when packedReader closed (#43465 ) This patch makes `PackedReader` return EOF when try to calling `ReadNext` after closing it. This behavior make importv2.binlog reader could retry after EOF reached and act normally. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-22 14:04:52 +08:00
yihao.dai	5124ed9758	fix: Fix import fileStats incorrectly set to nil (#43463 ) 1. Ensure that tasks in the InProgress state return valid fileStats. 2. Enhance import logs. issue: https://github.com/milvus-io/milvus/issues/43387 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-22 12:37:01 +08:00
congqixia	563e2935c5	enhance: [StorageV2] Fill ts range default values for `PackedBinlogRecordWriter` (#43454 ) This PR fill default value for `PackedBinlogRecordWriter` timestamp range so target segment meta will contains correct timestamp range Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-22 12:04:53 +08:00
sthuang	f77571d5c1	fix: [StorageV2] file writer write row group split to default size (#43471 ) Bumped milvus storage version. related: https://github.com/milvus-io/milvus/issues/43310 * https://github.com/milvus-io/milvus-storage/pull/213 * https://github.com/milvus-io/milvus-storage/pull/217 * https://github.com/milvus-io/milvus-storage/pull/220 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-22 09:52:52 +08:00
sthuang	6c5f5f1e32	enhance: [StorageV2] refactor group chunk translator (#43406 ) related: #43372 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-21 19:46:53 +08:00
sparknack	81694739ef	fix: revert ska::flat_hash_set to std::unordered_set to address an un… (#43428 ) issue: #43388 Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-07-21 17:39:40 +08:00
aoiasd	e9fc140eaf	fix: jieba tokenizer cause panic when dict word was empty string (#43337 ) relate: https://github.com/milvus-io/milvus/issues/42779 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-07-21 16:34:53 +08:00
aoiasd	c7b53ed43b	enhance: run rust format (#43447 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-07-21 14:12:53 +08:00
junjiejiangjjj	77f3a1f213	enhance: Add search post pipeline (#43065 ) https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: junjiejiangjjj <junjie.jiang@zilliz.com>	2025-07-21 11:10:52 +08:00
Bingyi Sun	21e71f6eb2	fix: Check json nested path before validating data type (#43329 ) issue: #43279 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-21 10:30:54 +08:00
Zhen Ye	69c8c2660b	fix: create nil start position segment if sync start position before insert (#43435 ) issue: #43434 - the segment start position can be carried by other segment sync operation. so the sync start position operation can happens before insert. - TODO: It's a wired design should be removed. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-21 09:50:52 +08:00
Bingyi Sun	09b6407e63	enhance: optimize error msg for json index inconsistent parameters (#43345 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-21 00:32:52 +08:00
Xianhui Lin	c13393418c	fix: invalid string error when enabled json stats (#43380 ) fix: invalid string error when enabled json stats issue: https://github.com/milvus-io/milvus/issues/43151 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-07-20 23:38:53 +08:00
aoiasd	f7e1f1c382	enhance: support download lindera system dictionary online (#43121 ) relate: https://github.com/milvus-io/milvus/issues/43120 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-07-20 23:24:52 +08:00
Zhen Ye	25b76e1fde	fix: cannot auto balance the channel from old arch to streamingnode (#43424 ) issue: #43416, #43413 - also fix the panic on streamingnode when concurrent sync Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-20 23:00:52 +08:00
Buqian Zheng	389104d200	enhance: rename PanicInfo to ThrowInfo (#43384 ) issue: #41435 this is to prevent AI from thinking of our exception throwing as a dangerous PANIC operation that terminates the program. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-19 20:22:52 +08:00
Buqian Zheng	f7b262a702	feat: make storagev1 to support eviction (#43219 ) issue: https://github.com/milvus-io/milvus/issues/41435 turns out we have per file binlog size in golang code, by passing it into segcore we can support eviction in storage v1 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-19 02:02:52 +08:00
congqixia	672a83f66b	enhance: Skip remove op if key in save set (#43425 ) Related to #43407 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-18 17:37:39 +08:00
cai.zhang	2adc6ce0bc	fix: Call AlterCollection when only rename collection (#43420 ) issue: #43407 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-18 15:46:56 +08:00
Spade A	42ad786f75	fix: update tantivy for fixing dir removing race condition (#43399 ) fix: https://github.com/milvus-io/milvus/issues/43258 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-18 15:44:56 +08:00
congqixia	8fc7069e1a	fix: Make MultiSaveAndRemove execute removal first (#43408 ) Realted to #43407 When `MultiSaveAndRemove` like ops contains same key in saves and removal keys it may cause data lost if the execution order is save first than removal. This PR make all the kv execute removal first then save the new values. Even when same key appeared in both saves and removals, the new value shall stay. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-18 15:41:40 +08:00
Zhen Ye	b142589942	enhance: support all partitions in shard manager for L0 segment (#43385 ) issue: #42416 - change the key from partitionID into PartitionUniqueKey to support AllPartitionsID Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-18 11:40:51 +08:00
Zhen Ye	5aa7a116d2	fix: change maxTimeTickDelay from 5m into 20m (#43377 ) issue: #43266 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-18 11:29:42 +08:00
Buqian Zheng	d793def47c	feat: impose a physical memory limit when loading cells (#43222 ) issue: #41435 issue: https://github.com/milvus-io/milvus/issues/43038 This PR also: 1. removed ERROR state from ListNode 2. CacheSlot will do reserveMemory once for all requested cells after updating the state to LOADING, so now we transit a cell to LOADING before its resource reservation 3. reject resource reservation directly if size >= max_size --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-18 11:18:52 +08:00
Zhen Ye	07fa2cbdd3	enhance: wal balance consider the wal status on streamingnode (#43265 ) issue: #42995 - don't balance the wal if the producing-consuming lag is too long. - don't balance if the rebalance is set as false. - don't balance if the wal is balanced recently. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-18 11:10:51 +08:00
Zhen Ye	3aacd179f7	fix: balance channel before balance segment when upgrading (#43346 ) issue: #43117, #42966, #43373 - also fix channel balance may not work at 2.6. - fix error lost at delete path - add mvcc into s/q log - change the log level for TestCoordDownSearch Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-17 20:16:52 +08:00
Spade A	8612a2c946	enhance: optimize in by batch-in (#43268 ) fix: https://github.com/milvus-io/milvus/issues/43267 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-17 19:40:52 +08:00
sparknack	9b4081e110	enhance: cachinglayer: some performance optimization (#42858 ) issue: #41435 We compared the performance using the modified test_sealed.cpp, which randomly accesses all rows in all chunks and counts the number of runs within 3s. ## performance data comparison (ops/second) chunk config: 1x1000 \| Field Type \| w/o cachinglayer (commit `640f526301`) \| w/ cachinglayer \| w/ cachinglayer + opt \| \|---\|---\|---\|---\| \| Bool field \| 82428 \| -63.6% (29983) \| +2.7% (84675) \| \| Int8 field \| 82228 \| -63.3% (30166) \| +2.4% (84163) \| \| Int16 field \| 82572 \| -63.8% (29867) \| +1.8% (84036) \| \| Int32 field \| 82797 \| -63.7% (30031) \| +1.5% (84043) \| \| Int64 field \| 81077 \| -62.9% (30107) \| +0.6% (81604) \| \| Float field \| 82678 \| -63.4% (30266) \| +1.8% (84146) \| \| Double field \| 81925 \| -63.4% (29974) \| +0.2% (82097) \| \| Varchar field \| 19933 \| -19.6% (16027) \| +18.9% (23690) \| \| JSON field \| 16519 \| -96.8% (533) \| +2.5% (16927) \| \| Int array field \| 7325 \| -13.7% (6321) \| -1.4% (7220) \| \| Long array field \| 6347 \| -8.9% (5781) \| -0.1% (6344) \| \| Bool array field \| 8275 \| -14.0% (7116) \| +0.4% (8311) \| \| String array field \| 2281 \| -5.0% (2168) \| +0.2% (2287) \| \| Double array field \| 6427 \| -13.3% (5574) \| -2.0% (6301) \| \| Float array field \| 7291 \| -13.0% (6346) \| -1.5% (7183) \| \| Vector field \| 27487 \| -40.4% (16371) \| -4.7% (26192) \| \| Float16 vector field \| 49773 \| -54.6% (22601) \| -5.9% (46834) \| \| BFloat16 vector field \| 49783 \| -53.1% (23350) \| -5.7% (46934) \| \| Int8 vector field \| 63871 \| -59.0% (26179) \| -6.2% (59926) \| --- chunk config: 10x1000 \| Field Type \| w/o cachinglayer (commit `640f526301`) \| w/ cachinglayer \| w/ cachinglayer + opt \| \|---\|---\|---\|---\| \| Bool field \| 3659 \| -48.6% (1879) \| +110.1% (7686) \| \| Int8 field \| 3410 \| -45.3% (1864) \| +123.9% (7636) \| \| Int16 field \| 3647 \| -48.6% (1874) \| +110.1% (7661) \| \| Int32 field \| 3647 \| -48.8% (1866) \| +109.6% (7645) \| \| Int64 field \| 3645 \| -48.9% (1863) \| +107.8% (7573) \| \| Float field \| 3647 \| -49.0% (1861) \| +109.5% (7639) \| \| Double field \| 3640 \| -45.1% (1998) \| +108.4% (7586) \| \| Varchar field \| 1594 \| -23.9% (1213) \| +20.6% (1922) \| \| JSON field \| 1202 \| -26.5% (884) \| +16.1% (1396) \| \| Int array field \| 602 \| -12.3% (528) \| +12.7% (678) \| \| Long array field \| 529 \| -12.2% (465) \| +7.5% (569) \| \| Double array field \| 537 \| -13.0% (467) \| +6.4% (571) \| \| Vector field \| 1520 \| -37.9% (943) \| -5.5% (1437) \| \| Float16 vector field \| 2607 \| -47.0% (1382) \| +6.4% (2774) \| \| BFloat16 vector field \| 2586 \| -46.5% (1383) \| +8.8% (2813) \| \| Int8 vector field \| 3101 \| -47.3% (1633) \| +41.9% (4400) \| --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-07-17 11:20:51 +08:00
zhagnlu	ee43954534	fix:fix text_match bug because of not adapting to multi-chunk model (#43303 ) https://github.com/milvus-io/milvus/issues/43296 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-17 10:32:51 +08:00
yihao.dai	df8ceb123b	enhance: Support parallel execution of L0 import tasks (#43213 ) issue: https://github.com/milvus-io/milvus/issues/43212 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-17 10:14:50 +08:00
XuanYang-cn	4dcaa97682	fix: Use diskSegmentMaxSize for coll with sparse and dense vectors (#43194 ) Previous code uses diskSegmentMaxSize if and only if all of the collection's vector fields are indexed with DiskANN index. When introducing sparse vectors, since sparse vector cannot be indexed with DiskANN index, collections with both dense and sparse vectors will use maxSize instead. This PR changes the requirments of using diskSegmentMaxSize to all dense vectors are indexed with DiskANN indexs, ignoring sparse vector fields. See also: #43193 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-07-16 18:04:52 +08:00
Spade A	d750816ba0	fix: remove std::string support for stlsort index (#43355 ) fix: https://github.com/milvus-io/milvus/issues/43354 The current implementation of stdsort index is not supported for std::string. Remove the code. Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-16 17:46:51 +08:00
congqixia	5d90b65342	enhance: [StorageV2] Add storage version in Data/Query view resp (#43348 ) Related to #39173 Add `storage_version` in data/query view segment info response --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-16 15:52:51 +08:00
foxspy	58a9e49066	enhance: update knowhere version (#43331 ) issue: #42937 #43294 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-07-16 15:04:50 +08:00
yihao.dai	b69e601fe1	fix: [StorageV2] Correct read and write buffer size (#43335 ) Correct read and buffer size to 64MB to prevent OOM during clustering compaction. issue: https://github.com/milvus-io/milvus/issues/43310 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-16 14:28:52 +08:00
Bingyi Sun	1b8c958cff	enhance: fix tantivy wrapper is freed after json flat executor is destructed (#43233 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-16 10:58:50 +08:00
congqixia	fe8de016d5	fix: [StorageV2] Align null bitmap offset when loading multi-chunk (#43321 ) Related to #43262 This patch fixes following logic bug: - When multiple chunks are loaded and size cannot be divided by 8, just appending uint8_t as bitmap will cause null bitmap dislocation - `null_bitmap_data()` points to start of whole row group, which may not stand for current `arrow::Array` The current solutions is: - Reorganize the null_bitmap with currect size & offset - Pass `array->offset()` in tuple to info the current offset Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-15 19:22:50 +08:00
Bingyi Sun	ccfaa7bee8	fix: Fix the bug when offsets is nullptr in bulk api (#43127 ) issue: https://github.com/milvus-io/milvus/issues/42978 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-15 17:54:50 +08:00
Zhen Ye	ffc8c0730c	fix: wrong metric for sn timetick (#43312 ) issue: #43266 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-14 20:40:50 +08:00
Spade A	db91d85dbc	feat: more types of matches for ngram (#43081 ) Ref https://github.com/milvus-io/milvus/issues/42053 This PR enable ngram to support more kinds of matches such as prefix and postfix match. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-14 20:34:50 +08:00
Spade A	e14a52721e	enhance: use stl sort with high cardinality for data_type int (#43305 ) fix: https://github.com/milvus-io/milvus/issues/43304 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-14 18:40:50 +08:00
congqixia	ae48f0e484	fix: [StorageV2] Handle missing column creating index (#43292 ) Related to #43250 Use FieldIDList to check missing field. If column is missing, return empty resultset Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-14 17:06:50 +08:00
wei liu	039564199c	fix: Prevent duplicate segment results in count queries (#43173 ) issue: #41570 Fix issue where growing and sealed segments could be searched simultaneously, causing inflated count() results. This was caused by logic introduced in PR #42009 that made sealed segments readable before target version advancement. Changes include: - Fix conditional filtering logic in PinReadableSegments to prevent sealed segments from becoming readable prematurely - Use target version filter for full results (ratio=1.0) to ensure sealed segments only become readable after target advancement - Use query view segment list filter for partial results (ratio<1.0) to maintain backward compatibility - Simplify target version setting in AddDistributions to prevent premature segment readability - Add logging for redundant growing segments during sync - Add comprehensive unit tests covering the duplicate segment scenario This fix ensures count() queries return accurate results by preventing the same segment from being counted in both growing and sealed states. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-14 11:10:49 +08:00
foxspy	8171a2a0b5	enhance: update knowhere version (#43246 ) issue: #42937 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-07-14 11:06:49 +08:00
Ted Xu	07894b37b6	enhance: returning collection metadata from cache (#42823 ) See #43187 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-07-14 10:54:50 +08:00
Bingyi Sun	21a96bc903	enhance: Save meta with txn limit (#43263 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-14 10:32:49 +08:00
yihao.dai	1984be646c	fix: Fix storagev2 binlog import (#43221 ) issue: https://github.com/milvus-io/milvus/issues/43218 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-13 22:52:49 +08:00
Alexander Guzhva	a848c4a8c5	fix: fix incorrect bitset for the division comparison when the right is < 0 (#43179 ) issue: https://github.com/milvus-io/milvus/issues/42900 @sunby Unfortunately, it is not that easy to fix as it was thought in #43177 Upd: also handles `Inf` and `NaN` values, and the division by zero case for `fp32` and `fp64` Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>	2025-07-11 19:04:49 +08:00
Zhen Ye	15a6631147	enhance: add quota limit based on sn consuming lag (#43105 ) issue: #42995 - The consuming lag at streaming node will be reported to coordinator. - The consuming lag will trigger the write limit and deny by quota center. - Set the ttProtection by default. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-11 14:10:49 +08:00
cai.zhang	c54a04c71c	fix: L2 segments remain as L2 even after sort compaction (#43237 ) issue: #43186 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-11 11:30:48 +08:00
Zhen Ye	f598ca2b4e	fix: block at msgpack adaptor and wrong metrics (#43235 ) issue: #43018 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-11 10:14:49 +08:00
congqixia	5a9efb3f81	enhance: [StorageV2] Refine storage rw option usage & validation (#43175 ) Related to #39173 This PR: - Make all datanode task passes storage config via storage config option - Remove legacy comments, rootPath & bucketName parameters - Fix clustering compaction option behavior - Add validation logic for `rwOptions` - Use correct storageType from storageConfig - Add storage config in sync task --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-11 01:14:48 +08:00
congqixia	6bbed3b019	fix: [AddField] Add shared_lock for insert prevent race (#43229 ) Related to #43113 When schema change happens, insert shall not happen, otherwise: - Data race may happen causing insertion failure - Inconsistent data schema This PR add shared_lock prevent this data race. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-10 21:26:48 +08:00
PjJinchen	a90694165b	feat: Supports tracing services that require header-based authentication. (#43211 ) issue: https://github.com/milvus-io/milvus/issues/43082 support tracing services that require header-based authentication. for example: aliyun SLS, volcengine LogService etc... [aliyun SLS](https://help.aliyun.com/zh/sls/import-trace-data-from-golang-applications-to-log-service-by-using-opentelemetry-sdk-for-golang?spm=a2c4g.11186623.help-menu-search-28958.d_1#section-ktk-xxz-8om) Add a headers config in trace config ``` trace: exporter: otlp sampleFraction: 1 otlp: endpoint: milvus-cn-beijing-pre.cn-beijing.log.aliyuncs.com:10010 method: # otlp export method, acceptable values: ["grpc", "http"], using "grpc" by default secure: true headers: # base64 initTimeoutSeconds: 10 ``` it is encoded as base64, raw data is json ``` { "x-sls-otel-project": "milvus-cn-beijing-pre", "x-sls-otel-instance-id": "milvus-cn-beijing-pre", "x-sls-otel-ak-id": "xxx", "x-sls-otel-ak-secret": "xxx" } ``` [volcengine tls](https://www.volcengine.com/docs/6470/812322#grpc-%E5%8D%8F%E8%AE%AE%E5%88%9D%E5%A7%8B%E5%8C%96%E7%A4%BA%E4%BE%8B) Add a headers config in trace config ``` trace: exporter: otlp sampleFraction: 1 otlp: endpoint: xxx method: # otlp export method, acceptable values: ["grpc", "http"], using "grpc" by default secure: true headers: # base64 initTimeoutSeconds: 10 ``` it is encoded as base64, raw data is json ``` { "x-tls-otel-region": "cn-beijing", "x-tls-otel-tracetopic": "milvus-cn-beijing-pre", "x-tls-otel-ak": "xxx", "x-tls-otel-sk": "xxx" } ``` Signed-off-by: PjJinchen <6268414+pj1987111@users.noreply.github.com>	2025-07-10 17:32:48 +08:00
wei liu	b2597c6329	enhance: apply load config changes after QueryCoord restart (#43108 ) issue: #43107 - Add checkLoadConfigChanges() to apply load config during startup - Call config check in startQueryCoord() after restart - Skip auto-updates for collections with user-specified replica numbers - Add is_user_specified_replica_mode field to preserve user settings - Add comprehensive unit tests with mockey Ensures existing collections use latest cluster-level config after restart. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-10 14:28:48 +08:00
cai.zhang	3ffd44f302	fix: Fix remaining issues with Datanode pooling and StorageV2 (#43147 ) issue: #43146 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-10 14:26:48 +08:00
yihao.dai	ee9a95189a	enhance: Print segments info after import done (#43200 ) issue: https://github.com/milvus-io/milvus/issues/42488 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-10 12:38:47 +08:00
Chun Han	07745439b5	fix: empty search groupby result causing crash(#43137 ) (#43214 ) related: #43137 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-07-10 12:04:48 +08:00
cai.zhang	47144429bf	fix: Fix regeneratePartitionStats failed after restore clusteringCompactionTask (#43205 ) issue: #43186 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-10 10:40:47 +08:00
Zhen Ye	490c5d5088	fix: lost message version after compatible message modification (#43217 ) issue: #43018 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-10 10:36:48 +08:00
Bingyi Sun	13f6e2130b	fix: Fix hybrid search return back empty set if one result is emtpy (#43209 ) issue: https://github.com/milvus-io/milvus/issues/43160 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-10 10:34:55 +08:00
congqixia	f027eea545	enhance: [AddField] Add log for segcore segment schema change (#43215 ) Related to #39178 This PR add logs for segment schema change operations. Also fixes the nit comments from PR #42490 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-10 10:22:47 +08:00
aoiasd	97b1c3ed96	enhance: add warn log if some segment's bm25 stats lacks (#43111 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-07-09 23:22:47 +08:00
cai.zhang	95e767611a	fix: Fix merge sort loss data when last row in a record is deleted (#43216 ) issue: #43207 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-09 22:18:48 +08:00
zhagnlu	21d1fb2aa3	fix: fix move cursor bug for chunk segment with index (#43095 ) #42974 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-07-09 17:38:47 +08:00
cai.zhang	41d1c8d6b3	fix: Handle error for invalid function params and prevent panic (#43189 ) issue: #43188 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-09 12:46:46 +08:00
tinswzy	c4634d861e	fix: v2.6 WebUI metrics response schema change bug (#42957 ) #42919 fix metrics response schema incompatibility with WebUI v2.6 Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2025-07-08 22:56:47 +08:00
cai.zhang	6989e18599	enhance: Move sort stats task to sort compaction (#42562 ) issue: #42560 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-08 20:22:47 +08:00
aoiasd	54cc0b60f2	fix: dropped segment in excluded segment use wrong excluded ts (#43115 ) cause some excluded growing data insert again relate: https://github.com/milvus-io/milvus/issues/43114 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-07-08 18:04:46 +08:00
Spade A	d41eec6f10	fix: void copy when getting json chunk (#43183 ) fix: https://github.com/milvus-io/milvus/issues/43182 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-08 15:28:46 +08:00
cai.zhang	8720feeb79	fix: Fix enqueuing when current batch is fully deleted (#43174 ) issue: #43045 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-08 12:20:46 +08:00
Ted Xu	6153272d4b	enhance: disabling max entry limit by default (#43166 ) See: #43055 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-07-08 10:10:46 +08:00
yihao.dai	9cbd194c6b	fix: Prevent import from generating small binlogs (#43132 ) - Introduce dynamic buffer sizing to avoid generating small binlogs during import - Refactor import slot calculation based on CPU and memory constraints - Implement dynamic pool sizing for sync manager and import tasks according to CPU core count issue: https://github.com/milvus-io/milvus/issues/43131 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-07 21:32:47 +08:00
sthuang	a0ae5bccc9	fix: [StorageV2] load growing segment get dim datatype check (#43168 ) related: https://github.com/milvus-io/milvus/issues/43072 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-07 15:46:47 +08:00
congqixia	ab818dcbca	fix: [StorageV2] Pass storage config for compaction rw (#43167 ) Related to #43148 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-07 15:32:46 +08:00
sthuang	276c52490d	fix: [StorageV2] missing arrow fs when building index (#43162 ) fix: https://github.com/milvus-io/milvus/issues/43150, https://github.com/milvus-io/milvus/issues/43149 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-07 15:26:46 +08:00
sthuang	9f361a228e	enhance: storage v2 chunked column memory size from meta (#43130 ) use meta to get chunked column memory size to avoid getting cells actually from storage. related: #39173 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-07 14:24:46 +08:00
congqixia	d09764508a	fix: [Storagev2] Close segment readers in mergeSort (#43116 ) Related to #43062 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-04 23:56:44 +08:00
junjiejiangjjj	fafd5db43f	fix: rank params bug (#43112 ) https://github.com/milvus-io/milvus/issues/42985 Signed-off-by: junjiejiangjjj <junjie.jiang@zilliz.com>	2025-07-04 18:28:44 +08:00
Zhen Ye	46b6f1b9e2	fix: panic when logging a old message should be skipped (#43076 ) issue: #43074 - fix: panic when logging a old message should be skipped, #43074 - fix: make the ack of broadcaster idompotent, #43026 - fix: lost dropping collection when upgrading, #43092 - fix: panic when DropPartition happen after DropCollection, #43027, #43078 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-04 16:04:44 +08:00
groot	1ee8cea35b	enhance: bulkinsert handle nullable/defaultValue/functionOutput fields (#42956 ) issue: https://github.com/milvus-io/milvus/issues/42173 Signed-off-by: yhmo <yihua.mo@zilliz.com>	2025-07-04 14:20:44 +08:00
congqixia	684f027496	fix: Remove trimming space logic when validating collection name (#43064 ) Related to #43031 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-04 11:00:45 +08:00
cai.zhang	4133e3b8fd	fix: Enable merge sort and fix sort bug (#43080 ) issue: #42980, #43034 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-04 10:18:44 +08:00
Spade A	fce0bbe2ae	fix: remove redundant locks for null_offset (#43103 ) Ref: https://github.com/milvus-io/milvus/issues/40308 https://github.com/milvus-io/milvus/pull/40363 add lock for protecting concurrent read/write for null offset. But we don't need this for sealed segment. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-07-04 10:10:45 +08:00
Zhen Ye	e97e44d56e	enhance: limit the gc concurrency when cpu is high (#43059 ) issue: #42833 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-04 09:22:43 +08:00
congqixia	1d9a9a993d	fix: [StorageV2] Use correct template typename for `cache_raw_data_to_disk_common` (#43104 ) Related to #43099 Previously `cache_raw_data_to_disk_common` used `milvus::DataType` template typename, which shall be `knowhere::bf16` or other actual datatype. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-03 18:50:46 +08:00
Zhen Ye	bbbc7d4517	enhance: collect all cgo calling into metric and log slow cgo call (#43035 ) issue: #42833 - also fix the error metric for async cgo. - also make sure the roles can be seen when node startup, #43041. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-03 15:00:44 +08:00
cai.zhang	f6b2a71c95	enhance: Remove chunkmanager-related dependencies from datanode (#43021 ) issue: #41611 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-03 14:44:45 +08:00
congqixia	1fae5230fe	fix: Check field mmap property before apply collection level one (#43090 ) Related to #43089 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-03 14:30:44 +08:00
Bingyi Sun	6e38e9d18f	fix: Add json cast type for flat index (#42970 ) issue: #42916 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-07-03 14:14:44 +08:00
sparknack	7e855f1046	enhance: add disk file writer with Direct IO support (#42665 ) issue: #43040 This patch introduces a disk file writer that supports Direct IO. Currently, it is exclusively utilized during the QueryNode load process. Below is its parameters: 1. `common.diskWriteMode` This parameter controls the write mode of the local disk, which is used to write temporary data downloaded from remote storage. Currently, only QueryNode uses 'common.diskWrite*' parameters. Support for other components will be added in the future. The options include 'direct' and 'buffered'. The default value is 'buffered'. 2. `common.diskWriteBufferSizeKb` Disk write buffer size in KB, only used when disk write mode is 'direct', default is 64KB. Current valid range is [4, 65536]. If the value is not aligned to 4KB, it will be rounded up to the nearest multiple of 4KB. 3. `common.diskWriteNumThreads` This parameter controls the number of writer threads used for disk write operations. The valid range is [0, hardware_concurrency]. It is designed to limit the maximum concurrency of disk write operations to reduce the impact on disk read performance. For example, if you want to limit the maximum concurrency of disk write operations to 1, you can set this parameter to 1. The default value is 0, which means the caller will perform write operations directly without using an additional writer thread pool. In this case, the maximum concurrency of disk write operations is determined by the caller's thread pool size. Both parameters can be updated during runtime. --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-07-02 22:18:44 +08:00
congqixia	7bc7b18ed5	fix: [AddField] Prevent concurrent load during UpdateSchema (#43043 ) Related to #43028 This PR: - Add mutex prevent concurrent load segment & schema change - Add schema verison field in load meta - Update schema in PutOrRef if schema verison is larger --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-02 17:38:44 +08:00
congqixia	8962b0058d	fix: [StorageV2] Check writer nil when closing not written one (#43056 ) Related to #43047 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-02 14:22:43 +08:00
Zhen Ye	09c6df62d8	fix: use impl and remove the close method of broadcast service (#42992 ) issue: #38399 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-02 10:30:44 +08:00
wei liu	c381bf3e41	enhance: add logs for count(*) (#43001 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-01 19:36:43 +08:00
Zhen Ye	08fff353af	fix: Revert "enhance: Enable mergeSort by default starting from version 2.6.0 (#42981 )" (#43046 ) issue: #43034 - implementation of mergeSortMultipleSegments is wrong. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-01 17:30:29 +08:00
Spade A	26ec841feb	feat: optimize `Like` query with n-gram (#41803 ) Ref #42053 This is the first PR for optimizing `LIKE` with ngram inverted index. Now, only VARCHAR data type is supported and only InnerMatch LIKE (%xxx%) query is supported. How to use it: ``` milvus_client = MilvusClient("http://localhost:19530") schema = milvus_client.create_schema() ... schema.add_field("content_ngram", DataType.VARCHAR, max_length=10000) ... index_params = milvus_client.prepare_index_params() index_params.add_index(field_name="content_ngram", index_type="NGRAM", index_name="ngram_index", min_gram=2, max_gram=3) milvus_client.create_collection(COLLECTION_NAME, ...) ``` min_gram and max_gram controls how we tokenize the documents. For example, for min_gram=2 and max_gram=4, we will tokenize each document with 2-gram, 3-gram and 4-gram. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-07-01 10:08:44 +08:00
wei liu	396120ade5	enhance: Improve delegator serviceable check with coordinator sync state (#42975 ) issue: #42404 Add syncedByCoord field to ensure delegator only becomes serviceable after coordinator sync, preventing unreliable service state when memory is insufficient. Issue: When memory is low, delegator may become serviceable before current target is ready, but segments can be released at any time, making the serviceable state unreliable. Changes include: - Add syncedByCoord field to track coordinator sync status - Update Serviceable() to require both data readiness and coord sync - Set syncedByCoord=true in SyncTargetVersion - Add comprehensive test coverage Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-01 10:00:43 +08:00
Zhen Ye	ecb24e7232	enhance: use multi-process framework in integration test (#42976 ) issue: #41609 - add env `MILVUS_NODE_ID_FOR_TESTING` to set up a node id for milvus process. - add env `MILVUS_CONFIG_REFRESH_INTERVAL` to set up the refresh interval of paramtable. - Init paramtable when calling `paramtable.Get()`. - add new multi process framework for integration test. - change all integration test into multi process. - merge some test case into one suite to speed up it. - modify some test, which need to wait for issue #42966, #42685. - remove the waittssync for delete collection to fix issue: #42989 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-30 14:22:43 +08:00
wei liu	c919340763	enhance: Optimize channel node balancing for uneven QN distribution (#42786 ) issue: #42860 Fix channel node allocation when QueryNode count is not a multiple of channel count. The previous algorithm used simple division which caused uneven distribution with remainders. Key improvements: - Implement smart remainder distribution algorithm - Refactor large function into focused helper functions - Support two-phase rebalancing (release then allocate) - Handle edge cases like insufficient nodes gracefully --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-30 12:14:42 +08:00
rhys	48661655d6	fix: streamingcoord and streamingnode client support internal tls (#42685 ) https://github.com/milvus-io/milvus/issues/42680 streamingnode/streamingcoord support internal tls Signed-off-by: rhys <sdbwlr@163.com>	2025-06-27 17:50:42 +08:00
Zhen Ye	8367e4ec6a	fix: set 72h for wal retention (#42910 ) issue: #42706 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-27 17:36:43 +08:00
Bingyi Sun	23c784cf69	fix: Fix querynode crash caused by json index (#42982 ) issue: https://github.com/milvus-io/milvus/issues/42978 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-27 16:44:41 +08:00
XuanYang-cn	17f1ab71bb	enhance: Remove not inused BuildIndexInfo (#42926 ) 1. removed not inuse cgo methods in index_c.h/cpp 2. removed indexcogowrapper/build_index_info.go See also: #39242 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-06-27 15:00:42 +08:00
congqixia	9b06ecb72f	enhance: [StorageV2] Release record and close reader (#42983 ) Related to #39173 This PR - Close packed reader after sort - Release arrow.Record preventing memory leakage - Invoke `pack_reader->Close()` for CloseReader --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-27 14:46:43 +08:00
sthuang	238bd30f42	fix: [StorageV2] end to end minor issues for sync, stats, and load (#42948 ) Fix issues in end-to-end tests: 1. Split column groups based on schema, rather than estimating by average chunk row size. Ensure column group consistency within a segment, to avoid errors caused by loading multiple column group chunks simultaneously. 2. Use sorted segmentId when generating the stats binlog path, to ensure consistent and correct file path resolution. 3. Determine field IDs as follows: For multi-column column groups, retrieve the field ID list from metadata. For single-column column groups, use the column group ID directly as the field ID. related: #39173 fix: #42862 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-27 14:44:42 +08:00
Zhen Ye	2d73e6eaa8	fix: mixcoord will not handle timetick anymore (#42965 ) issue: #42954 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-26 19:14:42 +08:00
Zhen Ye	3602817c53	fix: dynamic log level for streaming node (#42964 ) issue: #42963 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-26 19:12:50 +08:00
congqixia	5dd1f841d2	enhance: [AddField] Add Restful API for addfield (#42972 ) Related to #39718 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-26 18:46:41 +08:00
Bingyi Sun	289b8b85d3	enhance: remove name check for alter index task (#42953 ) issue: https://github.com/milvus-io/milvus/issues/42952 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-26 16:32:41 +08:00
foxspy	be05b653c1	enhance: update knowhere version (#42938 ) issue: #42937 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-06-26 01:22:41 +08:00
yihao.dai	d7c9914eff	fix: Consider fields number when preallocating ids for import (#42810 ) In corner cases where there are many fields but only a small number of rows to import, the default preallocated IDs may be insufficient. To address this, consider the number of fields when preallocating IDs. issue: https://github.com/milvus-io/milvus/issues/42518 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-25 23:38:41 +08:00
wei liu	be492c2939	fix: Add missing keylocks in ReleasePartition operation (#42940 ) issue: #42098 Fix concurrent access issue by adding proper locking around ReleasePartition operation to prevent race conditions when releasing partitions on the same collection. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-25 21:48:42 +08:00
congqixia	336e743b55	fix: [AddField] Respect growing mmap setting adding empty field (#42933 ) Related to #42856 Data under mmapped growing segment shall be treated respecting growingMmap setting. Otherwise, varchar datatype could be treated with logic error. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-25 21:10:42 +08:00
congqixia	942055fa7d	fix: Use task timestamp to calculate TTL timestamp (#42920 ) Related to #42918 Previously the `CollectionTtlTimestamp` could be overflowed when the guarantee_ts==1, which means using `Eventually` consistency level. This patch use task timestamp, allocated by scheduler, to generate ttl timestamp ignore the potential very small timestamp being used. Also add overflow check for ttl timestamp calculated. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-25 20:48:42 +08:00
zhagnlu	69872f45ad	fix: fix is_not_in for trie index (#42716 ) #42604 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-25 16:52:42 +08:00
cai.zhang	ebe1c95bb1	enhance: Add Size interface to FileReader to eliminate the StatObject call during Read (#42908 ) issue: #42907 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-25 14:36:41 +08:00
aoiasd	e2566c0e92	enhance: bm25 stats local cache use local storage path (#42923 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-25 13:44:46 +08:00
XuanYang-cn	0dfe5308e1	enhance: Tidy Download and decode in segcore storage (#42902 ) 1. Unify calling from GetObjectData 2. Move SetData inside Deserialize See also: #40013 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-06-25 11:10:43 +08:00
sthuang	0d57acb13a	enhance: [StorageV2] field id as meta path for wide column when load (#42863 ) related: #42862 #39173 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-25 11:08:48 +08:00
sthuang	d4260b47fa	fix: [StorageV2] sync panic with add field (#42932 ) related: #39663 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-25 10:08:40 +08:00
sthuang	ad6d620e9f	fix: [StorageV2] Compiling debug mode throw DCHECK s3 initialize error (#42922 ) related: https://github.com/milvus-io/milvus/issues/42844 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-24 19:30:41 +08:00
Spade A	50f7579d8f	fix: fix some bugs discovered by chaos tests (#42906 ) fix: https://github.com/milvus-io/milvus/issues/42870 This PR fixes: 1. SetBitset fn shuold consider growing segments with concurrent write 2. avoid using from_raw_parts directly --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-06-24 16:32:42 +08:00
XuanYang-cn	0adf44e6f8	enhance: Check if segment has too many deletions together (#42668 ) This PR moves the deltalog file count check inside hasTooManyDeletions check. Unifies the logic on checking if a segment has too many deletions including: delta log count, deleted rows ratio and deltalog size. This change removes several uncessary traverse through segment's binlogs and deltalogs. And add more clear trigger logs Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-06-24 16:30:49 +08:00
Bingyi Sun	669ea51ce5	enhance: Make json index compatible with caching layer (#42484 ) issue: https://github.com/milvus-io/milvus/issues/42483 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-24 15:16:41 +08:00
congqixia	718cd203c6	fix: OR binary expr is prunable only when both children are prunable (#42912 ) Related to #42903 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-24 09:38:24 +08:00
zhagnlu	1024121ad9	fix:fix incorrect use of class member (#42885 ) #39173 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-23 20:36:46 +08:00
congqixia	0a0a6b3471	enhance: Fill dbName for `OperatePrivilegeV2Request` in interceptor (#42898 ) Related to #40340 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-23 18:04:41 +08:00
cai.zhang	59b003adac	enhance: Skip modify field meta when rename collection or rename dbName (#42875 ) issue: #42873 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-23 17:04:41 +08:00
congqixia	ee056f0bff	fix: [AddField] Fill default value in serde logic when field missing (#42891 ) Related to #42856 Default value will be missing after segment get sorted/compacted. This PR is a temp workaround since in long term default value shall be filled with storage engine instead. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-23 14:20:41 +08:00
Bingyi Sun	24e24caf14	fix: Remove cached null expr result (#42818 ) issue: #42698 cached result may be changed in caller so there is no need to cache it Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-23 10:44:40 +08:00
Zhen Ye	a081906fb4	enhance: smaller backoff configuration for wal balancer to make faster recovery (#42869 ) issue: #42835 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-23 10:32:40 +08:00
Xianhui Lin	b902960057	fix: revert remote jsonstats path (#42882 ) fix: revert remote jsonstats path relate-pr:https://github.com/milvus-io/milvus/pull/42676 issue:https://github.com/milvus-io/milvus/issues/42872 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-06-21 13:24:39 +08:00
cai.zhang	8f8ffe9989	fix: Reduce task slot for standalone to 1/4 of normal datanode (#42808 ) issue: #42129 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-20 16:38:46 +08:00
Spade A	e15926b40c	enhance: optimize tantivy cargo config (#42880 ) fix: https://github.com/milvus-io/milvus/issues/42879 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-06-20 16:17:49 +08:00
aoiasd	43a9f7a79e	enhance: Add and run rust format command in makefile (#42807 ) relate: https://github.com/milvus-io/milvus/issues/42806 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-20 10:22:39 +08:00
Zhen Ye	6798fdc3b3	fix: rocksmq cannot graceful stop (#42841 ) issue: #40532 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 19:38:39 +08:00
congqixia	74ea57bac1	enhance: Remove unused load field check from proxy (#42816 ) Related to #42489 Since load list works as hint after cachelayer implemented, the related check logic could be removed to keep code logic clean. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-19 19:34:47 +08:00
Zhen Ye	fadc053d7a	fix: filter new proxy when initializing proxy session at timeticksync (#42831 ) issue: #40532 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 16:44:40 +08:00
Zhen Ye	2fd8f910b0	fix: data duplicated when msgdispatcher make splitting (#42827 ) issue: #41570 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 16:32:39 +08:00
junjiejiangjjj	9865d672f7	fix: Model rerank supports Truncate (#42643 ) https://github.com/milvus-io/milvus/issues/42632 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-06-19 15:02:41 +08:00
sthuang	4a0a2441f2	enhance: [StorageV2] field id as meta path for wide column (#42787 ) related: #39173 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-19 15:00:38 +08:00
congqixia	4ba177cd2c	enhance: [StorageV2] Handle narrow column group resource estimation (#42842 ) Related to #39173 In storage v2, "narrow" column group could have group id not mapped schema, which causing loading fails or resource estimation result inaccurate. This PR handles this case by mapping binlog from index instead of vice versa. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-19 14:44:39 +08:00
wei liu	bf5fde1431	fix: Prevent delegator unserviceable due to shard leader change (#42689 ) issue: #42098 #42404 Fix critical issue where concurrent balance segment and balance channel operations cause delegator view inconsistency. When shard leader switches between load and release phases of segment balance, it results in loading segments on old delegator but releasing on new delegator, making the new delegator unserviceable. The root cause is that balance segment modifies delegator views, and if these modifications happen on different delegators due to leader change, it corrupts the delegator state and affects query availability. Changes include: - Add shardLeaderID field to SegmentTask to track delegator for load - Record shard leader ID during segment loading in move operations - Skip release if shard leader changed from the one used for loading - Add comprehensive unit tests for leader change scenarios This ensures balance segment operations are atomic on single delegator, preventing view corruption and maintaining delegator serviceability. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-19 12:10:38 +08:00
Spade A	e2c85eec81	fix: load stats index based on mmap config (#42788 ) ref https://github.com/milvus-io/milvus/issues/42626 This PR makes text match index and json key stats index be loaded based on mmap config. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-06-19 10:10:39 +08:00
aoiasd	d49989345b	enhance: forbid regex filter clone regex for each streamer (#42781 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-18 16:10:39 +08:00
cai.zhang	d122e6d1e2	enhance: Make Web UI toggleable via config (#42814 ) issue: #42813 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-18 13:02:38 +08:00
Bingyi Sun	6bebb68727	fix: Return all targets segments in ListLoadedSegments (#42728 ) issue: https://github.com/milvus-io/milvus/issues/42412 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-18 11:20:38 +08:00
Spade A	80f1d707f7	fix: tidy up path for scalar index (#42676 ) Ref #42626 This path tidy up path for scalar index including path for loading index from remote storage and temporary path for buliding index. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-06-18 00:42:38 +08:00
congqixia	f9caad95b9	fix: [AddField] Check field empty instead of existence (#42789 ) Related to #42773 Growing segment fills all known meta into `InsertRecord` data, which cause even the field is missing, the field data will still exists. This PR update the logic while finish loading growing segment to check field empty or not instead of existence. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-17 17:22:39 +08:00
cai.zhang	a9dcd4a380	enhance: ChunkManager is no longer created during datanode initialization (#42791 ) issue: #41611 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-17 17:06:38 +08:00
Chun Han	001619aef9	feat: supporing load priority for loading (#42413 ) related: #40781 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-06-17 15:22:38 +08:00
aoiasd	4e68c6d222	enhance: add test for new parameters in access log (#42546 ) relate: https://github.com/milvus-io/milvus/issues/41801 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-17 11:14:38 +08:00
wei liu	679930bb93	enhance: refine delegator state checking error msg (#42673 ) issue: #42661 Add NotStopped() and IsWorking() methods to shardDelegator for better state management and error handling. Changes include: - Add instance state checking methods with proper error messages - Replace lifetime package calls with delegator instance methods - Add comprehensive unit tests for state transitions and error cases - Improve error reporting with channel name for better debugging Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-17 10:40:38 +08:00
congqixia	880915e08b	enhance: Print out-of-date schema ts when returning ErrSchemaMismatch (#42790 ) Related to #41858 This PR add log while debugging schema mismatch between pymilvus cache and proxy schema. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-17 10:38:37 +08:00
zhagnlu	9c31a47c0f	fix:fix arith mod bug for big int (#42699 ) #42624 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-17 10:36:38 +08:00
zhagnlu	a887d81716	fix:reject div or mod by zero for binaryarith expr (#42691 ) #42538 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-17 10:34:46 +08:00
zhagnlu	2025a2a53c	fix:fix wrong use return error for parse unsupported arith (#42729 ) #42061 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-17 10:20:37 +08:00
congqixia	f01ff57f3f	fix: [StorageV2] Use correct offset filling null bitmap (#42774 ) Related to #39173 `null_bitmap_data()` returns raw pointer of null bitmap of Array. While after slicing, this bitmap is not rewritten due to zero copy implementation, so the current start pos maybe non-zero while FillFieldData generating column `valid_data` array. This PR add `offset` param for `FillFieldData` method, and force all invocation pass correct offset of `null_bitmap_data` ptr. Also update milvus-storage commit fixing reader failed to return data when buffer size smaller than row group size problem. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-17 10:08:38 +08:00
congqixia	9653ec8d8c	fix: [AddField] Remove load list check on querycoord (#42736 ) Related to #42735 Load field list shall work as hint after tiered storage impl, so the load list compare is meaningless and block load with empty list after adding a new field. This PR totally moves the check logic. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-17 09:50:37 +08:00
wei liu	0b4a17c22b	fix: Fix exclude nodes clearing logic position in load balancer retry (#42577 ) issue: #42561 Move the exclude nodes clearing logic from ExecuteWithRetry to selectNode after shard leader cache refresh to ensure proper retry behavior: - Remove premature exclude clearing in ExecuteWithRetry that happened before shard leader cache update - Add exclude clearing logic in selectNode after refreshing shard leader cache when all replicas are excluded - Ensure multiple retries can properly update shard leader cache and clear exclude list when needed - Add comprehensive tests for edge cases including empty shard leaders and mixed serviceable node scenarios --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-17 08:15:24 +08:00
sthuang	ed5dbf3eaa	enhance: [StorageV2] sync separate vector datatype into its own column group (#42638 ) related: #39173 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-16 11:48:37 +08:00
zhagnlu	d35c33da9f	fix: fix wrong assgin to chunk object (#42672 ) #39173 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-15 21:18:37 +08:00
aoiasd	201e980d3d	fix: flow graph should free function resource after all node close (#42731 ) relate: https://github.com/milvus-io/milvus/issues/42730 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-13 22:14:37 +08:00
yihao.dai	9acba25fad	enhance: Replace pointer-based map key with id in garbage collector (#42647 ) issue: https://github.com/milvus-io/milvus/issues/42592 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-13 20:50:36 +08:00
congqixia	ef8829c5bc	fix: [AddField] Skip missing nullable field in insertCodec (#42724 ) Related to #42723 Previous PR #42684 permit insert msg transformation but insertCodec did not adapt the same skip logic, whic causes panicking. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-13 19:56:36 +08:00
Bingyi Sun	1bf960b1a8	enhance: Check loaded segments before gc (#42639 ) issue: https://github.com/milvus-io/milvus/issues/42412 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-13 17:44:38 +08:00
Zhen Ye	1f66b650e9	fix: pulsar cannot work properly if backlog exceed (#42653 ) issue: #42649 - the sync operation of different pchannel is concurrent now. - add a option to notify the backlog clear automatically. - make pulsar walimpls can be recovered from backlog exceed. Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-13 14:28:37 +08:00
wei liu	78c39edbce	fix: Fix potential panic when DeleteCheckpoint is nil (#42664 ) issue: #42663 Fix panic issue when processing VchannelInfo messages from older coordinator versions that don't have DeleteCheckpoint field. Changes: - Add null safety check for DeleteCheckpoint before accessing methods - Maintain backward compatibility with legacy message formats - Improve seek position selection logic for both old and new versions --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-13 14:26:36 +08:00
congqixia	cbed31933a	fix: [AddField] Permit missing new nullable field in InsertMsg (#42684 ) Related to #41858 #41951 #42084 When insert msg consumer (pipeline/flowgraph) have newer schema than insertMsg, it have to adapter the insert msg used old schema(missing newly added field) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-13 13:52:35 +08:00
congqixia	d59002d45e	fix: Make controller wait checker worker quit and add nil protection (#42704 ) Related to #42702 This patch add wait logic for `CheckerController` and nil check for channel checker in case of panicking during server/testcase stop procedure Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-13 13:20:35 +08:00

... 3 4 5 6 7 ...

11128 Commits (master)