milvus

Commit Graph

Author	SHA1	Message	Date
cai.zhang	6989e18599	enhance: Move sort stats task to sort compaction (#42562 ) issue: #42560 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-08 20:22:47 +08:00
yihao.dai	9cbd194c6b	fix: Prevent import from generating small binlogs (#43132 ) - Introduce dynamic buffer sizing to avoid generating small binlogs during import - Refactor import slot calculation based on CPU and memory constraints - Implement dynamic pool sizing for sync manager and import tasks according to CPU core count issue: https://github.com/milvus-io/milvus/issues/43131 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-07-07 21:32:47 +08:00
congqixia	ab818dcbca	fix: [StorageV2] Pass storage config for compaction rw (#43167 ) Related to #43148 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-07 15:32:46 +08:00
congqixia	d09764508a	fix: [Storagev2] Close segment readers in mergeSort (#43116 ) Related to #43062 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-04 23:56:44 +08:00
groot	1ee8cea35b	enhance: bulkinsert handle nullable/defaultValue/functionOutput fields (#42956 ) issue: https://github.com/milvus-io/milvus/issues/42173 Signed-off-by: yhmo <yihua.mo@zilliz.com>	2025-07-04 14:20:44 +08:00
cai.zhang	f6b2a71c95	enhance: Remove chunkmanager-related dependencies from datanode (#43021 ) issue: #41611 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-03 14:44:45 +08:00
Zhen Ye	08fff353af	fix: Revert "enhance: Enable mergeSort by default starting from version 2.6.0 (#42981 )" (#43046 ) issue: #43034 - implementation of mergeSortMultipleSegments is wrong. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-01 17:30:29 +08:00
congqixia	9b06ecb72f	enhance: [StorageV2] Release record and close reader (#42983 ) Related to #39173 This PR - Close packed reader after sort - Release arrow.Record preventing memory leakage - Invoke `pack_reader->Close()` for CloseReader --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-27 14:46:43 +08:00
sthuang	238bd30f42	fix: [StorageV2] end to end minor issues for sync, stats, and load (#42948 ) Fix issues in end-to-end tests: 1. Split column groups based on schema, rather than estimating by average chunk row size. Ensure column group consistency within a segment, to avoid errors caused by loading multiple column group chunks simultaneously. 2. Use sorted segmentId when generating the stats binlog path, to ensure consistent and correct file path resolution. 3. Determine field IDs as follows: For multi-column column groups, retrieve the field ID list from metadata. For single-column column groups, use the column group ID directly as the field ID. related: #39173 fix: #42862 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-27 14:44:42 +08:00
cai.zhang	ebe1c95bb1	enhance: Add Size interface to FileReader to eliminate the StatObject call during Read (#42908 ) issue: #42907 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-25 14:36:41 +08:00
cai.zhang	8f8ffe9989	fix: Reduce task slot for standalone to 1/4 of normal datanode (#42808 ) issue: #42129 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-20 16:38:46 +08:00
Zhen Ye	2fd8f910b0	fix: data duplicated when msgdispatcher make splitting (#42827 ) issue: #41570 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 16:32:39 +08:00
cai.zhang	a9dcd4a380	enhance: ChunkManager is no longer created during datanode initialization (#42791 ) issue: #41611 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-17 17:06:38 +08:00
sthuang	ed5dbf3eaa	enhance: [StorageV2] sync separate vector datatype into its own column group (#42638 ) related: #39173 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-16 11:48:37 +08:00
yihao.dai	86876682da	enhance: Enhance import integration tests and logs (#42612 ) 1. Optimize the import process: skip subsequent steps and mark the task as complete if the number of imported rows is 0. 2. Improve import integration tests: a. Add a test to verify that autoIDs are not duplicated b. Add a test for the corner case where all data is deleted c. Shorten test execution time 3. Enhance import logging: a. Print imported segment information upon completion b. Include file name in failure logs issue: https://github.com/milvus-io/milvus/issues/42488, https://github.com/milvus-io/milvus/issues/42518 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-12 20:02:35 +08:00
yihao.dai	e6da4a64b5	fix: Pre-check import message to prevent pipeline block indefinitely (#42415 ) Pre-check import message to prevent pipeline block indefinitely. issue: https://github.com/milvus-io/milvus/issues/42414 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: chyezh <chyezh@outlook.com>	2025-06-11 13:40:38 +08:00
sthuang	89c3afb12e	fix: [StorageV2] index/stats task level storage v2 fs (#42191 ) related: #39173 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-10 11:06:35 +08:00
congqixia	a9aaa86193	enhance: [StorageV2] Pass bucket name for compaction readers (#42607 ) Related to #39173 Like logic in #41919, storage v2 fs shall use complete paths with bucketName prefix to be compatible with its definition. This PR fills bucket name from config when creating reader for compaction tasks. NOTE: the bucket name shall be read from task params config for compaction task pooling. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-10 10:20:35 +08:00
yihao.dai	837349dead	enhance: Adjust default import buffer size (#42541 ) Increase insert buffer size from 16MB to 64MB, while keeping delete buffer size at 16MB. issue: https://github.com/milvus-io/milvus/issues/42518 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-09 13:02:33 +08:00
aoiasd	2eb24fbe7c	fix: analyzer memory leak because function runner not close (#41839 ) relate: https://github.com/milvus-io/milvus/issues/41213 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-05 14:24:40 +08:00
congqixia	373deba0bd	fix: Pass cluster id tranforming drop task to drop job request (#42531 ) Related to #42530 The cluster id is missing when drop worker drop causing redoing task on report duplicated task error. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-05 13:20:32 +08:00
yihao.dai	6fda1f69c8	fix: Fix duplicate autoID between import and insert (#42519 ) Remove the unlimited logID mechanism and switch to redundantly allocating a large number of IDs. issue: https://github.com/milvus-io/milvus/issues/42518 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-04 19:58:31 +08:00
yihao.dai	e0113b375e	fix: Fix sort stats generates large binlogs (#42456 ) Remove the hardcoded batchSize of 100,000 and instead trigger a write every 64MB based on actual data size. This prevents sort stats from generating excessively large binlog files. issue: https://github.com/milvus-io/milvus/issues/42400 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-04 09:56:39 +08:00
yihao.dai	297331b2cc	enhance: Add slot and tasks num metrics (#42141 ) issue: https://github.com/milvus-io/milvus/issues/41123 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-30 21:52:30 +08:00
Chun Han	ed0df38605	enhance: resize high priority wqthreadpool dynamically(#40838 ) (#41549 ) (#41929 ) related: #40838 pr: https://github.com/milvus-io/milvus/pull/41549 Signed-off-by: MrPresent-Han <chun.han@gmail.com>	2025-05-30 10:18:36 +08:00
Zhen Ye	66cc194ab2	enhance: add partition gc at streaming arch (#42179 ) issue: #41976 - make drop partition message as a broadcast message. - add gc when drop partition message is acked. - add a call back to handle the broadcast message when ack. - the ack operation of broadcast message will retry until success. Signed-off-by: chyezh <chyezh@outlook.com>	2025-05-29 23:20:30 +08:00
hckex	020d36624c	enhance: Fix typo 'dimesion' to 'dimension' in PreExecute method (#42160 ) This PR fixes a minor typo in a log message in the `PreExecute` method of `internal/datanode/index/task_index.go`. Corrected "dimesion" to "dimension". Signed-off-by: hckex <33862757+hckex@users.noreply.github.com>	2025-05-29 12:24:30 +08:00
groot	14563ad2b3	enhance: bulkinsert handles nullable/default (#42127 ) issue: https://github.com/milvus-io/milvus/issues/42096, https://github.com/milvus-io/milvus/issues/42130 Signed-off-by: yhmo <yihua.mo@zilliz.com>	2025-05-28 18:02:28 +08:00
hckex	a20500b3ed	doc: Fix small typo in internal/datanode/README.md (#42089 ) This PR fixes a minor typo in the README file of the datanode module. Corrected "imformation" to "information". Signed-off-by: hckex <33862757+hckex@users.noreply.github.com>	2025-05-27 10:16:27 +08:00
cai.zhang	80fe573c76	enhance: Pass the compaction configuration through request parameters (#41979 ) issue: #41123 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-05-26 11:52:27 +08:00
XuanYang-cn	252d49d01e	fix: ChannelManager double assignment (#41837 ) See also: #41876 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-05-23 14:16:29 +08:00
yihao.dai	83c9527e70	enhance: Use QuerySlot interface for tasks (#41989 ) Use `QuerySlot` rpc instead of `QueryTask` for querying slot. issue: https://github.com/milvus-io/milvus/issues/41123 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-23 10:30:28 +08:00
yihao.dai	142bd2fc05	enhance: Pooling for data tasks (#41256 ) 1. Add global scheduler for datacoord. 2. Define and implement new CreateTask, QueryTask, DropTask interfaces. 3. Refine Import, Compaction, Stats, Index task. issue: https://github.com/milvus-io/milvus/issues/41123 Co-authored-by: Cai Zhang <cai.zhang@zilliz.com>	2025-05-20 21:06:24 +08:00
congqixia	a22088a380	enhance: [StorageV2] Make packed reader use correct path (#41919 ) Related to #39173 This PR - Use updated path with bucketName for packedReader - Update milvus-storage commit to report reader/writer initialization failure, see also milvus-io/milvus-storage#192 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-05-20 10:36:23 +08:00
yihao.dai	65dd3982d8	fix: Fix ants.Pool goroutine leak (#41892 ) 1. Release the pool after it is no longer in use. 2. Upgrade ants.Pool to fix the goroutine leak issue (see [PR #287](https://github.com/panjf2000/ants/pull/287)). issue: https://github.com/milvus-io/milvus/issues/41838 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-19 17:56:22 +08:00
congqixia	b8d7045539	enhance: [Add Field] Use consistent schema for single buffer (#41891 ) Related to #41873 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-05-17 19:46:22 +08:00
yihao.dai	6c1a37fca1	fix: Fix import reader goroutine leak (#41869 ) Close the chunk manager's reader after the import completes to prevent goroutine leaks. issues: https://github.com/milvus-io/milvus/issues/41868 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-16 10:18:35 +08:00
congqixia	a6d09ff4cd	enhance: [StorageV2] fix issues integrating basic RW operations (#41834 ) Related to #39173 This PR: - Upgrade milvus-storage commit to fix filesystem finalized issue - Add bucket-name as prefix for all fs style access io - Initial arrow fs on querynodes startup - Fix timestamp access when loading sealed segment --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-05-15 09:52:23 +08:00
yihao.dai	36e9e41627	fix: Fix no candidate segments error for small import (#41771 ) When autoID is enabled, the preimport task estimates row distribution by evenly dividing the total row count (numRows) across all vchannels: `estimatedCount = numRows / vchannelNum`. However, the actual import task hashes real auto-generated IDs to determine the target vchannel. This mismatch can lead to inaccurate row distribution estimation in such corner cases: - Importing 1 row into 2 vchannels: • Preimport: 1 / 2 = 0 → both v0 and v1 are estimated to have 0 rows • Import: real autoID (e.g., 457975852966809057) hashes to v1 → actual result: v0 = 0, v1 = 1 To resolve such corner case, we now allocate at least one segment for each vchannel when autoID is enabled, ensuring all vchannels are prepared to receive data even if no rows are estimated for them. issue: https://github.com/milvus-io/milvus/issues/41759 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-14 15:30:21 +08:00
foxspy	358bc150df	enhance: add force rebuild index configuration (#41473 ) issue: #41431 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-05-14 10:52:21 +08:00
aoiasd	9166c77a72	fix: bulk insert should use function runner's input field list instead schema's (#41560 ) relate: https://github.com/milvus-io/milvus/issues/41213 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-05-12 19:14:56 +08:00
Zhen Ye	e675da76e4	enhance: simplify the proto message, make segment assignment code more clean (#41671 ) issue: #41544 - simplify the proto message for flush and create segment. - simplify the msg handler for flowgraph. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-05-11 20:49:00 +08:00
cai.zhang	15ffd28643	fix: Set worker totalSlot in standalone mode is half of cluster mode (#41730 ) issue: #41616, #41732 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-05-09 18:30:58 +08:00
sthuang	6c377b6e86	feat: Storage v2 index and stats raw data (#41534 ) related: #39173 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-04-30 08:48:54 +08:00
cai.zhang	640f526301	fix: Update current scalar index version to compatible tantivy different versions (#41141 ) issue: #40823 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-04-27 20:44:39 +08:00
junjiejiangjjj	e56adc121b	enhance: refactor embedding credentials manager (#41442 ) https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-04-24 14:34:38 +08:00
SimFG	91d40fa558	fix: Update logging context and upgrade dependencies (#41318 ) - issue: #41291 --------- Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2025-04-23 10:52:38 +08:00
congqixia	b36c88f3c8	enhance: [AddField] Broadcast schema change via WAL (#41373 ) Related to #39718 Add Broadcast logic for collection schema change and notifies: - Streamnode - Delegator - Streamnode - Flush component - QueryNodes via grpc --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-04-22 16:28:37 +08:00
yihao.dai	dccfc69660	enhance: Get compaction params from request (#41125 ) Make DataNode use compaction parameters from request instead of configuration. issue: https://github.com/milvus-io/milvus/issues/41123 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-04-15 10:28:53 +08:00
Xianhui Lin	3963fc818f	fix:Add debug memory freeing in sortStats (#41284 ) issue: https://github.com/milvus-io/milvus/issues/41218 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-04-15 09:56:29 +08:00

1226 Commits (nico301)