Commit Graph

18889 Commits (08c6ab8cfb8c08da68dfd3df900808095bc4ccd3)

Author SHA1 Message Date
sammy.huang a2475d4afc
enhance: [skip e2e]change cache path (#31445) (#31948)
pr: #31445

Signed-off-by: Liang Huang <sammy.huang@zilliz.com>
2024-04-07 15:17:17 +08:00
sre-ci-robot c94ec3edc0
[automated] Bump milvus version to v2.3.13 (#31949)
Bump milvus version to v2.3.13
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-04-07 14:53:18 +08:00
nico 835862df22
enhance: [skip e2e] update milvus helm repo in nightly test (#31815)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-04-07 10:25:16 +08:00
yihao.dai fa447532d7
fix: add some check fieldata dim (#31564) (#31899)
issue: https://github.com/milvus-io/milvus/issues/30138
issue: https://github.com/milvus-io/milvus/issues/31724

pr: https://github.com/milvus-io/milvus/pull/31564

---------

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Co-authored-by: cqy123456 <39671710+cqy123456@users.noreply.github.com>
2024-04-05 10:04:16 +08:00
congqixia ba36f66a5c
fix: [2.3] Use server ctx instead of loopCtx for datacoord LivenessCheck (#31691) (#31747)
Cherry-pick from master
pr: #31691
See also #31689

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-04-02 23:30:33 -07:00
aoiasd 4752c0291b
fix: channel meta mergeFlushSegment not idempotent cause data loss (#31837)
fix channel meta mergeFlushSegment not idempotent may cause data loss
when update compacted segment buffer, because may update buffer to
segment which has been covered.
relate: https://github.com/milvus-io/milvus/issues/31548

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-04-02 23:25:50 -07:00
aoiasd cf02c623ab
fix: fix injection invalid bug by add inject task to handler inject when queue was empty (#31819)
relate: https://github.com/milvus-io/milvus/issues/31548

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-04-03 14:05:14 +08:00
PowderLi a992334d0e
fix: [cherry-pick] use milvus-io/vcpkg (#31808)
issue: #31769
master pr: #31770
3.4 pr: #31799

GitHub Disables The XZ Repository because of
https://github.com/advisories/GHSA-rxwq-x6h5-x525

Signed-off-by: PowderLi <min.li@zilliz.com>
2024-04-02 11:09:14 +08:00
cqy123456 47f767cf32
enhance: remove float16 in 2.3 branch (#31720)
issue: https://github.com/milvus-io/milvus/issues/31696

Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>
2024-03-30 10:49:13 +08:00
foxspy 7394c5156c
enhance: Update Knowhere Cloud version (#31729)
/kind branch-feature

Signed-off-by: xianliang.li <xianliang.li@zilliz.com>
2024-03-29 18:15:17 +08:00
nico c1ca0b2b0d
test: update chart version and some cases (#31686)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-29 14:09:18 +08:00
XuanYang-cn 055dd7ea1d
fix: Clear compaction tasks when release channel (#31694)
See also: #31648
pr: #31666

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-03-29 10:53:12 +08:00
cai.zhang 4a4a4b1061
fix: [Pick]Check whether the DataType of the column is legal (#31699)
issue: #31696 
master pr: #31697 
2.4 pr: #31698

---------

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-03-29 09:31:12 +08:00
XuanYang-cn 69931a6e7f
fix: Skip changing meta if nodeID not match with channel (#31665)
See also: #31648
pr: #31666

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-03-28 16:05:11 +08:00
congqixia 4021f4478f
enhance: [Cherry-pick] Save collection targets by batches (#31616) (#31655)
Cherry-pick from master
pr: #31616 
See also #28491 #31240

When colleciton number is large, querycoord saves collection target one
by one, which is slow and may block querycoord exits.

In local run, 500 collections scenario may lead to about 40 seconds
saving collection targets.

This PR changes the `SaveCollectionTarget` interface into batch one and
organizes the collection in 16 per bundle batches to accelerate this
procedure.

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-28 10:25:11 +08:00
congqixia 35100ba2e5
fix: [2.3] Make target observer auto/manual task mutual exclusive (#31584) (#31603)
Cherry-pick from master
pr: #31584
See also #30867

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-27 19:51:14 +08:00
sammy.huang 9d9f2cdf4d
enhance: enable docker buildkit (#31599)
pr:#31600

Signed-off-by: Liang Huang <sammy.huang@zilliz.com>
2024-03-27 16:47:11 +08:00
groot 91cdada12a
fix: minio ssl compatible issue (#31619)
issue: https://github.com/milvus-io/milvus/issues/30709
pr: https://github.com/milvus-io/milvus/pull/31607

Signed-off-by: yhmo <yihua.mo@zilliz.com>
2024-03-27 14:41:20 +08:00
jaime 8713765f4d
fix: proxy ctx cancel before revoking session (#31595)
issue: #31219
pr: #31596

Signed-off-by: jaime <yun.zhang@zilliz.com>
2024-03-26 11:45:09 +08:00
zhuwenxing a6fe96f20a
test: [cherry-pick][2.3] add restful v2 testcase (#31583)
pr: https://github.com/milvus-io/milvus/pull/31404

---------

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2024-03-26 10:49:09 +08:00
aoiasd 50315282bb
fix: [Cherry-Pick] delegator filter out all partition's delete msg when loading segment (#31587)
May cause deleted data queryable a period of time.
issue : #31484 
pr: https://github.com/milvus-io/milvus/pull/31585

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-03-25 20:33:09 +08:00
congqixia f9c8cd8e7b
enhance: Bump milvus & proto version to v2.3.13 (#31579)
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-25 18:03:10 +08:00
sammy.huang f98e9c9100
enhance: to reduce container image size (#31570)
related pr:#31576

Signed-off-by: Liang Huang <sammy.huang@zilliz.com>
2024-03-25 18:01:17 +08:00
sre-ci-robot 2f0fa8488c
[automated] Update Pytest image changes (#31559)
Update Pytest image changes
See changes:
6b9004c728
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-25 16:43:08 +08:00
nico 6b9004c728
enhance: update pymilvus version (#31524)
pr: #31529

Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-25 10:47:08 +08:00
PowderLi f2f0d44a5d
feat: [cherry-pick] restful phase two (#30430)
issue: #28348 #29732

Support to trace the grpc request, pr: #28349
Support to trace restful request and request error, pr: #28685

restful phase two, pr: #29728 #30343
include: collections, entities, partitions, users, roles, indexes,
aliases, import jobs

---------

Signed-off-by: SimFG <bang.fu@zilliz.com>
Signed-off-by: PowderLi <min.li@zilliz.com>
Co-authored-by: SimFG <bang.fu@zilliz.com>
2024-03-25 10:39:09 +08:00
aoiasd 7c234f23c3
fix: double buffer was invalid when put entry which size larger than max size (#31549)
relate: https://github.com/milvus-io/milvus/issues/31548

---------

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-03-23 21:09:07 +08:00
congqixia 368180bce4
fix: [2.3] Check nodeID before update channel checkpoint (#31473) (#31508)
Cherry-pick from master
pr: #31473
See also #31470 #31506

This PR adds nodeID assignment verification before updating channel
checkpoints.

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-23 07:07:07 +08:00
Jiquan Long ab059bb064
enhance: add more metrics (#31271) (#31511)
/kind improvement
pr: #31271 
fix: https://github.com/milvus-io/milvus/issues/31272

This pr add more metrics, which are:

Slow query count, which the duration considered as slow can be
configurable;
Number of deleted entities;
Number of entities per collection;
Number of loaded entities per collection;
Number of indexed entities;
Number of indexed entities, per collection, per index and whether it's a
vetor index;
Quota states (LongTimeTickDelay, MemoryExhuasted, DiskQuotaExhuasted)
per database;

---------

Signed-off-by: longjiquan <jiquan.long@zilliz.com>
2024-03-22 16:11:07 +08:00
wei liu ef523bfef3
fix: Unstable ut TestGetClientFailed (#31296) (#31472)
issue: #31295
pr: #31296

This PR fix unstable ut TestGetClientFailed

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-22 11:01:07 +08:00
wei liu 0bf595a513
enhance: Speed up target recovery after query coord restart (#31240) (#31449)
issue: #28491
pr: #31240

after querycoord restart, it will pull a new target, which include
channel and segment list. when segments loaded on querynode has reached
the target, the collection could provide search/query. but if segment
list changes by time, ater querycoord pull a new target, it will takes a
few minutes to catch up the target's segment distribution. and before
that, query/search will fail due to lack of segments.

This PR save the current loaded target to meta storein querycoord's stop
progress, and recover it when query coord starts, to speed up the target
recovery time.

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-22 10:27:17 +08:00
wei liu f8496dbc73
fix: Balance channel stuck forever due to logic dead lock (#31202) (#31455)
issue: #30816
pr: #31202

cause balance channel will stuck until leader view catch up the current
target, then start to unsub the old delegator. which make sure that the
new delegator can provide search before release old delegator. but
another logic in segment_checker skip loading segment during balance
channel. so during balance channel, if query node crash, new delegator
can't catch up target forever, then stuck forever.

This PR remove the rule that skip loading segment during balance channel
to avoid the logic dead lock here.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 18:11:07 +08:00
wei liu c8658d17f8
fix: Grpcclient return unrecoverable error (#31256) (#31452)
issue: #31222
pr: #31256

grpcclient's `call` func return a unrecoverable error, then the caller's
retry policy also breaks due to this unrecoverable error.

This PR introduce `retry.Handle`, the new func use `func() (bool,
error)` as input parameters, which return `shouldRetry` directly, to
avoid grpcclient return a unrecoverable error

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 11:59:12 +08:00
wei liu 6b761204ce
fix: Set node unreachable when get shard client failed (#31277) (#31451)
issue: #30531
pr: #31277

cause get client from `shardClientMgr`, doesn't means query node is
unavailable. because of the ref counter policy in `shardClientMgr`,
which will clean the client, if no collection use qn as shard leader.

This PR fix that set node unreachable when get shard client failed.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 11:57:08 +08:00
wei liu 5994b6a7b0
fix: Search doesn't expire shard leader cache (#31380) (#31450)
issue: #31351
pr: #31380
This PR fixed that search doesn't expire shard leader cache when send
request to query node failed, which make every request keep trying to
connect a offline query node

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 11:55:07 +08:00
groot 1ca7cba222
enhance: Support MinIO TLS connection (#31292)
issue: https://github.com/milvus-io/milvus/issues/30709
master pr: #31311

Signed-off-by: yhmo <yihua.mo@zilliz.com>
Co-authored-by: Chen Rao <chenrao317328@163.com>
2024-03-21 11:15:20 +08:00
congqixia 94f3aec80a
enhance: [Cherry-pick] Add metrics for querycoord current target cp lag (#31391) (#31463)
Cherry-pick from master
pr: #31391 #31399
See also #31390

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-21 10:17:07 +08:00
wei liu fef430daed
fix: Wrong behavior of CurrentTargetFirst/NextTargetFirst in target manager(#31379) (#31419)
issue: #31162
pr: #31379

when give scope CurrentTargetFirst/NextTargetFirst, it's expected to
scan both current and next target.

This PR fixed wrong behavior of CurrentTargetFirst/NextTargetFirst in
target manager, which may cause unexpected task generated, and load
collection may stuck forever due to dirty leader view.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-20 23:39:07 +08:00
cai.zhang 52a7eb9548
fix: Fix bug for get segment index state (#31429)
issue: #31361 
master pr: #31427 
2.4 pr: #31428

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-03-20 15:05:06 +08:00
congqixia 86e347a1a4
enhance: [2.3] Cache formatted key for param item (#31388) (#31402)
Cherry-pick from master
pr: #31388 
See also #30806

`formatKey` may cost lots of CPU on string processing under high QPS
scenario, this PR adds a formattedKeys cache preventing string operation
in each param get value.

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-19 19:25:10 +08:00
cai.zhang ef530a2324
enhance: When describing an index, fetch the index info in batches (#31239)
issue: #29313 
master pr: #31238

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-03-15 16:37:09 +08:00
sre-ci-robot e77afcb5d5
[automated] Bump milvus version to v2.3.12 (#31303)
Bump milvus version to v2.3.12
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-15 16:19:05 +08:00
nico 75a86bc2d3
test: update test cases (#31253)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-15 15:23:10 +08:00
Jiquan Long 50bfde92f2
fix: wrong num_entities used when mmap variable length data (#30848) (#31274)
https://github.com/milvus-io/milvus/issues/30728
pr: #30848

Signed-off-by: longjiquan <jiquan.long@zilliz.com>
2024-03-14 20:33:03 +08:00
congqixia 4e48a4de0e
enhance: Bump milvus & proto version to v2.3.12 (#31193)
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-14 19:09:04 +08:00
jaime 5ddb0b435f
fix: revoke session may be ignored due to server context cancellation in advance (#31213)
issue: #31219
pr: #31220

Signed-off-by: jaime <yun.zhang@zilliz.com>
2024-03-14 19:05:04 +08:00
sre-ci-robot a33751a2d7
[automated] Update Pytest image changes (#31235)
Update Pytest image changes
See changes:
645cc0bdc3
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-14 09:59:11 +08:00
nico 645cc0bdc3
test: update test cases (#31161)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-13 19:05:11 +08:00
sre-ci-robot 5386a2c43e
[automated] Update Pytest image changes (#31108)
Update Pytest image changes
See changes:
005dbf2b24
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-13 11:21:19 +08:00
chyezh 7105e0b261
fix: lost dbname when only passing collection id to describeCollection (#31177)
issue: #30931
pr: #31167

Signed-off-by: chyezh <chyezh@outlook.com>
2024-03-11 19:51:03 +08:00