milvus

Commit Graph

Author	SHA1	Message	Date
congqixia	2b285e5573	fix: Wrap init segcore tracing with golang timeout (#33494 ) See also #33483 Wrap `C.InitTrace` & `C.SetTrace` with timeout preventing otlp initializtion hangs forever when endpoint is not set correctly --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-03 19:25:51 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
congqixia	a26d6cdf23	fix: Remove group checker when closing qn pipeline (#33443 ) See also #33442 This fix shall prevent group checker keep printing "some node(s) haven't received input" err message after collection released Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-29 14:07:44 +08:00
aoiasd	59a7a46904	enhance: Merge query stream result for reduce delete task (#32855 ) relate: https://github.com/milvus-io/milvus/issues/32854 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-05-27 18:15:43 +08:00
Xiaofan	36cbce4def	enhance: optimize datanode cpu usage under large collection number (#33267 ) fix #33266 try to improve cpu usage by refactoring the ttchecker logic and caching string Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-25 04:43:41 +08:00
Cai Yudong	4004e4c545	enhance: Optimize bulk insert unittest (#33224 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-24 10:23:41 +08:00
cai.zhang	be77ceba84	enhance: Use proto for passing info in cgo (#33184 ) issue: #33183 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-05-23 10:31:40 +08:00
wei liu	39f56678a0	enhance: Reduce bloom filter lock contention between insert and delete in query coord (#32643 ) issue: #32530 cause ProcessDelete need to check whether pk exist in bloom filter, and ProcessInsert need to update pk to bloom filter, when execute ProcessInsert and ProcessDelete in parallel, it will cause race condition in segment's bloom filter This PR execute ProcessInsert and ProcessDelete in serial to avoid block each other Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-22 19:11:40 +08:00
Cai Yudong	cb480d17c8	fix: Fix SparseFloatVector data parse error for parquet (#33187 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-21 15:09:39 +08:00
smellthemoon	89ad3eb0ca	enhance: reduce memory when read field (#33195 ) Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-05-20 22:25:38 +08:00
yihao.dai	32560263fa	enhance: Query slot for compaction task (#32881 ) Query slot of compaction in datanode, and transfer the control logic for limiting compaction tasks from datacoord to the datanode. issue: https://github.com/milvus-io/milvus/issues/32809 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-17 18:19:38 +08:00
Cai Yudong	b560602885	enhance: Store SparseFloatVector into parquet as JSON string (#33101 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-17 15:01:37 +08:00
Cai Yudong	4ef163fb70	enhance: Support readable JSON file import for Float16/BFloat16/SparseFloat (#33064 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-16 14:47:35 +08:00
cai.zhang	6ea7633bd5	enhance: Add memory size for binlog (#33025 ) issue: #33005 1. add `MemorySize` field for insert binlog. 2. `LogSize` means the file size in the storage object. 3. `MemorySize` means the size of the data in the memory. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2024-05-15 12:59:34 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00
Cai Yudong	dc89c6f810	enhance: remove duplicated data generation APIs for bulk insert test (#32889 ) Issue: #22837 including following changes: 1. Add API CreateInsertData() and BuildArrayData() in internal/util/testutil 2. Remove duplicated test APIs from importutilv2 unittest and bulk insert integration test Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-10 15:27:31 +08:00
aoiasd	54a51b1236	enhance: Support dynamic config for opentelemetry trace (#32169 ) relate: https://github.com/milvus-io/milvus/issues/31940 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-05-09 17:43:30 +08:00
Cai Yudong	8bb58d0460	enhance: optimize vector offsets handling for parquet (#32822 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-09 14:43:30 +08:00
wei liu	ba02d54a30	enhance: update shard leader cache when leader location changed (#32470 ) issue: #32466 this PR enhance that when shard location changed, update proxy's shard leader cache. in case of query node failover case, proxy can find replica recover --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-08 10:05:29 +08:00
Cai Yudong	bcdbd1966e	feat: Support sparse float vector bulk insert for binlog/json/parquet (#32649 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-07 18:43:30 +08:00
yihao.dai	4de063ae14	fix: Make the dynamic column optional in parquet import (#32738 ) issue: https://github.com/milvus-io/milvus/issues/32729 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-07 11:21:29 +08:00
SimFG	0ea08b008a	enhance: add the config to control the way when fail to init plugin (#32680 ) issue: #32679 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-07 11:01:31 +08:00
yihao.dai	1594122c0a	enhance: Make the dynamic field file optional during numpy import (#32596 ) 1. Make the dynamic field file optional during numpy import 2. Add integration importing test with dynamic 3. Disallow file of pk when autoID=true during numpy import issue: https://github.com/milvus-io/milvus/issues/32542 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-28 19:39:25 +08:00
chyezh	2586c2f1b3	enhance: use WalkWithPrefix api for oss, enable piplined file gc (#31740 ) issue: #19095,#29655,#31718 - Change `ListWithPrefix` to `WalkWithPrefix` of OOS into a pipeline mode. - File garbage collection is performed in other goroutine. - Segment Index Recycle clean index file too. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-25 20:41:27 +08:00
wei liu	04f355a802	enhance: Enable alter database props in rootcoord (#32458 ) issue: #30040 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-25 10:53:25 +08:00
Cai Yudong	5fc439c600	feat: Bulk insert support fp16/bf16 (#32157 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-04-22 10:05:22 +08:00
chyezh	48fe977a9d	enhance: declarative resource group api (#31930 ) issue: #30647 - Add declarative resource group api - Add config for resource group management - Resource group recovery enhancement --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-15 08:13:19 +08:00
yihao.dai	aa96843d31	fix: Fix import hanging and improve logging output (#32166 ) Fix import hanging when the previous import task failed, and improve parquet import logging outout. issue: https://github.com/milvus-io/milvus/issues/31834 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-13 22:03:23 +08:00
SimFG	c012e6786f	feat: support rate limiter based on db and partition levels (#31070 ) issue: https://github.com/milvus-io/milvus/issues/30577 co-author: @jaime0815 --------- Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com> Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-04-12 16:01:19 +08:00
yihao.dai	273df98e20	enhance: Add binlog import intergration test (#32112 ) issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-11 10:31:18 +08:00
congqixia	25a1c9ecf0	fix: Make coordinator `Register` not blocked on ProcessActiveStandby (#32069 ) See also #32066 This PR make coordinator register successful and let `ProcessActiveStandBy` run async. And roles may receive stop signal and notify servers. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-10 18:49:18 +08:00
yihao.dai	1b5554c8cb	enhance: Support $meta key for json import (#32013 ) During JSON import: 1. Allow the specification of the $meta key 2. Prohibit duplicated keys within the $meta field, for instance, `{"id": 1, "vector": [], "x": 6, "$meta": {"x": 8}}` issue: https://github.com/milvus-io/milvus/issues/31835 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-10 17:27:17 +08:00
SimFG	90bed1caf9	enhance: add the related data size for the read apis (#31816 ) issue: #30436 origin pr: #30438 related pr: #31772 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-10 15:07:17 +08:00
zhenshan.cao	089c805e0a	enhance:Refactor hybrid search (#32020 ) issue: https://github.com/milvus-io/milvus/issues/25639 https://github.com/milvus-io/milvus/issues/31368 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-04-09 14:21:18 +08:00
cai.zhang	1b767669a4	enhance: Throw error instead of crash when index cannot be built (#31844 ) issue: #27589 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-04-09 11:51:18 +08:00
Cai Yudong	00438f408f	enhance: Unify data type check APIs for go (#31887 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-04-07 14:27:22 +08:00
SimFG	ac26908cc4	enhance: Remove the storage info report (#31772 ) issue: #30436 origin pr: #30438 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-01 20:50:59 -07:00
yihao.dai	4e264003bf	enhance: Ensure ImportV2 waits for the index to be built and refine some logic (#31629 ) Feature Introduced: 1. Ensure ImportV2 waits for the index to be built Enhancements Introduced: 1. Utilization of local time for timeout ts instead of allocating ts from rootcoord. 3. Enhanced input file length check for binlog import. 4. Removal of duplicated manager in datanode. 5. Renaming of executor to scheduler in datanode. 6. Utilization of a thread pool in the scheduler in datanode. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-01 20:09:13 +08:00
congqixia	357fe814ce	fix: Remove unnecessary deleteSession operation (#31647 ) See also #31628 The `Revoke` operation shall delete all keys related to the lease attaching to. This `deleteSession` operation may also remove the session key in next epoch by mistake and cause chaos session status Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-29 13:57:11 +08:00
cqy123456	976928ecd1	fix: fix fp16/bf16 some code missing and add more fp16/bf16 test (#31612 ) issue: #31534 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-03-28 14:11:10 +08:00
SimFG	b1a1cca10b	feat: add more operation detail info for better allocation (#30438 ) issue: #30436 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-03-28 06:33:11 +08:00
wei liu	92971707de	enhance: Add restful api for devops to execute rolling upgrade (#29998 ) issue: #29261 This PR Add restful api for devops to execute rolling upgrade, including suspend/resume balance and manual transfer segments/channels. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-27 16:15:19 +08:00
yihao.dai	31cf849f68	enhance: Support retriving file size from importutilv2.Reader (#31533 ) To reduce the overhead caused by listing the S3 objects, add an interface to importutil.Reader to retrieve file sizes. issue: https://github.com/milvus-io/milvus/issues/31532, https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-25 20:29:07 +08:00
yihao.dai	9a13b9822f	enhance: Return more fields in import progress response (#31539 ) Return more fields in import progress response, include importedRows and totalRows. Additionally, ensure compatibility with the old import progress response by retaining fields of create timestamp and row count. issue: https://github.com/milvus-io/milvus/issues/31448 https://github.com/milvus-io/milvus/issues/31237 https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-24 21:57:06 +08:00
yihao.dai	0fe5e90e8b	enhance: Remove import v1 (#31403 ) Remove all code and logic related to import v1. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-22 15:29:09 +08:00
Chun Han	c3264ca3e3	feat: support segment pruner (#31003 ) related: #30376	2024-03-22 13:57:06 +08:00
groot	c81909bfab	enhance: Support MinIO TLS connection (#31311 ) issue: https://github.com/milvus-io/milvus/issues/30709 pr: #31292 Signed-off-by: yhmo <yihua.mo@zilliz.com> Co-authored-by: Chen Rao <chenrao317328@163.com>	2024-03-21 11:15:20 +08:00
jaime	db79be3ae0	fix: ctx cancel should be the last step while stopping server (#31220 ) issue: #31219 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-03-15 10:33:05 +08:00
wei liu	147a3b8bdc	fix: Grpcclient return unrecoverable error (#31256 ) issue: #31222 grpcclient's `call` func return a unrecoverable error, then the caller's retry policy also breaks due to this unrecoverable error. This PR introduce `retry.Handle`, the new func use `func() (bool, error)` as input parameters, which return `shouldRetry` directly, to avoid grpcclient return a unrecoverable error Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-15 10:03:05 +08:00
Buqian Zheng	3c80083f51	feat: [Sparse Float Vector] add sparse vector support to milvus components (#30630 ) add sparse float vector support to different milvus components, including proxy, data node to receive and write sparse float vectors to binlog, query node to handle search requests, index node to build index for sparse float column, etc. https://github.com/milvus-io/milvus/issues/29419 --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-13 14:32:54 -07:00
yihao.dai	87b3c25b15	fix: Fix binlog import (#31205 ) 1. File type validation is omitted during binlog import. 2. System fields are appended to the schema during binlog import. issue: https://github.com/milvus-io/milvus/issues/28521 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-13 10:35:04 +08:00
cai.zhang	de2c95d00c	enhance: Constraint dynamic field as key-value format (#31183 ) issue: #31051 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-03-12 12:45:03 +08:00
aoiasd	d19a82e3f6	enhance: support stream call for grpc client (#30013 ) relate: https://github.com/milvus-io/milvus/issues/29719 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-03-07 17:45:01 +08:00
wei liu	fa73520b57	fix: Make datacoord client retry on index api (#30654 ) issue: #20553 This PR add retry on all interface which belong to indexcoord in milvus 2.2 and. move to data coord in milvus 2.3, to prevent meet `unimplemented` error during rolling upgrade from milvus 2.2 to 2.3. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-04 15:32:59 +08:00
yihao.dai	3775464b7c	enhance: Support varchar autoid for bulkinsertV1 (#30896 ) This PR is a supplement to PR https://github.com/milvus-io/milvus/pull/30377. Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-02-28 21:20:59 +08:00
chyezh	77477d6340	fix: wrong context passing into NewClient, error handling lost in session_util (#30817 ) issue: #30799 Signed-off-by: chyezh <chyezh@outlook.com>	2024-02-28 10:40:09 +08:00
yiwangdr	32cff25f97	enhance: decrease coordinator init time (#29822 ) This PR mainly improve two items: 1. Target observer should refresh loading status during init time. An uninitialized loading status blocks search/query. Currently, the target observer refreshes every 10 seconds, i.e. we'd need to wait for 10s for no reason. That's also the reason why we constantly see false log "collection unloaded" upon mixcoord restarts. 2. Delete session when service is stopped. So that the new service doesn't need to wait for the previous session to expire (~10s). Item 1 is the major improvement of this PR, which should speed up init time by 10s. Item 2 is not a big concern in most cases as coordinators usually shut down after stop(). In those cases, coordinator restart triggers serverID change which further triggers an existing logic that deletes expired session. This PR only fixes rare cases where serverID doesn't change. integration test: `go test -tags dynamic -v -coverprofile=profile.out -covermode=atomic tests/integration/coordrecovery/coord_recovery_test.go -timeout=20m` Performance after the change: Average init time of coordinators: 10s Hardware: M2 Pro Test setup: 1000 collections with 1000 rows (dim=128) per collection. issue: #29409 Signed-off-by: yiwangdr <yiwangdr@gmail.com>	2024-02-05 14:00:12 +08:00
smellthemoon	6bc10f9fdd	enhance: support varchar autoid when bulkinsert (#30377 ) support varchar autoid when bulkinsert Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-02-01 19:45:09 +08:00
xige-16	060c8603a3	fix: Support mvcc with hybrid serach (#30114 ) issue: https://github.com/milvus-io/milvus/issues/29656 /kind bug Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-02-01 16:03:03 +08:00
yihao.dai	c5918290e6	feat: Add import executor and manager for datanode (#29438 ) This PR introduces novel importv2 roles for datanode: 1. Executor: To execute tasks, a import task will be divided into the following steps: read data -> hash data -> sync data; 2. Manager: To manage all the tasks; issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-31 20:45:04 +08:00
cai.zhang	f619d792c0	enhance: Break down the granularity of collection info cache expired (#29977 ) issue: #29772 1. `DropPartition` only invalidates the cache related to the partition. 2. `CreateAlias` does not invalidate the cache. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-01-30 16:45:02 +08:00
chyezh	211143c5e6	enhance: add basic information of milvus into metrics (#29665 ) add basic build information and runtime component dependency into metrics. issue: #29664 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2024-01-29 15:47:02 +08:00
smellthemoon	9512af357b	enhance: reduce memory when read data (#30284 ) Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-01-26 20:49:00 +08:00
SimFG	aa7014a360	enhance: move the cgo code in the pkg dir to interal dir (#30261 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-01-25 15:15:01 +08:00
PowderLi	08ca0a2ca5	feat: support etcd authentication (#30226 ) issue: #28895 add 3 configuration for ETCD config Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-24 11:35:00 +08:00
Patrick Weizhi Xu	0907d76253	enhance: pass partition key scalar info if enabled when build vector index (#29931 ) issue: #29892 Pass optional scalar IVF offsets to Cardinal Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-01-24 00:04:55 +08:00
chyezh	5ee9f734c1	fix: Use determined order to lock in BlockAll to avoid deadlock (#29246 ) issue: #29104 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2024-01-22 14:50:56 +08:00
yihao.dai	ddd741a5d4	fix: Fix closing closed chan in proxy watcher (#30143 ) issue: https://github.com/milvus-io/milvus/issues/30142 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-19 23:02:54 +08:00
congqixia	10acdbbe8e	enhance: free CString in InitTraceConfig (#30055 ) `C.CString` result needs to be freed after usage Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-17 15:15:03 +08:00
congqixia	c0f0548702	fix: use SafeChan preventing close channel multiple times (#30022 ) See also #29935 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-16 17:34:54 +08:00
Bingyi Sun	e1258b8cad	feat: integrate storagev2 into loading segment (#29336 ) issue: #29335 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-12 18:10:51 +08:00
wayblink	1df3f90696	feat: Implement DescribeAlias and ListAliases interfaces (#29641 ) #22882 /kind feature Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-01-11 19:12:51 +08:00
Xu Tong	e429965f32	Add float16 approve for multi-type part (#28427 ) issue：https://github.com/milvus-io/milvus/issues/22837 Add bfloat16 vector, add the index part of float16 vector. Signed-off-by: Writer-X <1256866856@qq.com>	2024-01-11 15:48:51 +08:00
Cai Yudong	cb9d9ec0f0	enhance: Correct sampleFraction's type to float (#29810 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-01-10 13:18:50 +08:00
yihao.dai	3d07b6682c	feat: Add import reader for numpy (#29253 ) This PR implements a new numpy reader for import. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-08 19:42:49 +08:00
yihao.dai	156a0dd450	feat: Add import reader for Parquet (#29618 ) This PR implements a Parquet reader for import. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-07 19:38:49 +08:00
yihao.dai	23183ffb0f	feat: Add import reader for json (#29252 ) This PR implements a new json reader for import. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-05 18:12:48 +08:00
smellthemoon	1c1f2a1371	enhance:change some logs (#29579 ) related #29588 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-01-05 16:12:48 +08:00
yihao.dai	3561586edf	feat: Add import reader for binlog (#28910 ) This PR defines the new import reader interfaces and implement a binlog reader for import. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-05 11:48:47 +08:00
Jiquan Long	3f46c6d459	feat: support inverted index (#28783 ) issue: https://github.com/milvus-io/milvus/issues/27704 Add inverted index for some data types in Milvus. This index type can save a lot of memory compared to loading all data into RAM and speed up the term query and range query. Supported: `INT8`, `INT16`, `INT32`, `INT64`, `FLOAT`, `DOUBLE`, `BOOL` and `VARCHAR`. Not supported: `ARRAY` and `JSON`. Note: - The inverted index for `VARCHAR` is not designed to serve full-text search now. We will treat every row as a whole keyword instead of tokenizing it into multiple terms. - The inverted index don't support retrieval well, so if you create inverted index for field, those operations which depend on the raw data will fallback to use chunk storage, which will bring some performance loss. For example, comparisons between two columns and retrieval of output fields. The inverted index is very easy to be used. Taking below collection as an example: ```python fields = [ FieldSchema(name="pk", dtype=DataType.VARCHAR, is_primary=True, auto_id=False, max_length=100), FieldSchema(name="int8", dtype=DataType.INT8), FieldSchema(name="int16", dtype=DataType.INT16), FieldSchema(name="int32", dtype=DataType.INT32), FieldSchema(name="int64", dtype=DataType.INT64), FieldSchema(name="float", dtype=DataType.FLOAT), FieldSchema(name="double", dtype=DataType.DOUBLE), FieldSchema(name="bool", dtype=DataType.BOOL), FieldSchema(name="varchar", dtype=DataType.VARCHAR, max_length=1000), FieldSchema(name="random", dtype=DataType.DOUBLE), FieldSchema(name="embeddings", dtype=DataType.FLOAT_VECTOR, dim=dim), ] schema = CollectionSchema(fields) collection = Collection("demo", schema) ``` Then we can simply create inverted index for field via: ```python index_type = "INVERTED" collection.create_index("int8", {"index_type": index_type}) collection.create_index("int16", {"index_type": index_type}) collection.create_index("int32", {"index_type": index_type}) collection.create_index("int64", {"index_type": index_type}) collection.create_index("float", {"index_type": index_type}) collection.create_index("double", {"index_type": index_type}) collection.create_index("bool", {"index_type": index_type}) collection.create_index("varchar", {"index_type": index_type}) ``` Then, term query and range query on the field can be speed up automatically by the inverted index: ```python result = collection.query(expr='int64 in [1, 2, 3]', output_fields=["pk"]) result = collection.query(expr='int64 < 5', output_fields=["pk"]) result = collection.query(expr='int64 > 2997', output_fields=["pk"]) result = collection.query(expr='1 < int64 < 5', output_fields=["pk"]) ``` --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-12-31 19:50:47 +08:00
cai.zhang	c45f8a2946	fix: Import data from parquet file in streaming way (#29514 ) issue: #29292 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-12-27 15:30:46 +08:00
XuanYang-cn	7a6aa8552a	fix: add back existing datanode metrics (#29360 ) See also: #29204 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2023-12-22 14:20:43 +08:00
congqixia	f699be79f7	fix: grpc client check session skipped due to role not match (#29356 ) Related to #28815 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-21 10:12:51 +08:00
wei liu	e41fd6fbde	enhance: Move proxy client manager to util package (#28955 ) issue: #28898 This PR move the `ProxyClientManager` to util package, in case of reusing it's implementation in querycoord Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2023-12-20 19:22:42 +08:00
wayblink	2274aa3b50	fix: bulkinsert binlog didn't consider ts order when processing delta data (#29163 ) #29162 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2023-12-14 14:36:40 +08:00
Bingyi Sun	ad866d2889	feat: integrate storagev2 into index build process (#28995 ) issue: https://github.com/milvus-io/milvus/issues/28994 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-12-13 17:24:38 +08:00
wei liu	fe1eeae2aa	enhance: Use mockery to replace manual mock code (#29074 ) issue: #29043 This PR remove mannul mock code for proxy and data coord --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2023-12-13 10:46:44 +08:00
cai.zhang	49b8657f95	enhance: Support implicit type conversion for parquet (#29046 ) issue: #29019 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-12-12 16:14:44 +08:00
congqixia	1fe5f12bd5	enhance: Add client connect wrapper to keep connection alive (#29058 ) See also #29057 Add wrapper to maintain client&connection When reset operation is needed, `Close` method shall wait until all on-going request return --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-11 17:20:38 +08:00
cai.zhang	2b05460ef9	enhance: Make import-related error message clearer (#28978 ) issue: #28976 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-12-08 10:12:38 +08:00
wayblink	6736f65345	feat: skip some empty ttMsg in Datanode flowgraph (#28756 ) /kind feature Signed-off-by: wayblink <anyang.wang@zilliz.com>	2023-12-07 01:00:37 +08:00
yihao.dai	d26b563a8b	feat: Define import API and metadata (#28731 ) Define the new rpc and metadata for ImportV2. see also: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2023-12-04 19:56:35 +08:00
Bingyi Sun	45e6801ce4	feat: Add checker activation service interfaces (#28850 ) issue: #28610 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-12-04 17:38:37 +08:00
cai.zhang	f5f4f0872e	enhance: Support importing data with parquet file (#28608 ) issue: #28272 Numpy does not support array type import. Array type data is imported through parquet. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-11-29 20:52:27 +08:00
cai.zhang	1b7a503f89	enhance: Revert import support csv format (#28760 ) Revert import support csv format. issue: #28778 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-11-28 14:32:27 +08:00
cai.zhang	c29b60e18e	enhance: Support Array DataType for bulk_insert (#28341 ) issue: #28272 Support array DataType for bulk_insert with json, binlog files. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-11-27 13:50:27 +08:00
MrPresent-Han	fc30d291be	fix createCollection failed occasionally (#28592 ) (#28712 ) fix: create collection seldom failure #28592 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-11-27 11:10:25 +08:00
wayblink	da339535d5	enhance: Merge flowgraph goroutines into 1 (#28654 ) /kind enhancement #24826 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2023-11-23 19:52:25 +08:00
smellthemoon	73f2bab454	enhance:add some log when create client and get component states (#28160 ) /kind improvement Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2023-11-22 09:12:22 +08:00
PowderLi	c238bff9fb	fix: symbol 'GetStorageMetrics' and 'enableDynamicField' (#28580 ) /kind bug to #28579 #28504 1. replace enableDynamic with enableDynamicField 2. cgo directly link to milvus_storage Signed-off-by: PowderLi <min.li@zilliz.com>	2023-11-21 10:20:22 +08:00

1 2 3 4 5 ...

1654 Commits (b71e058bc534dfac506d2fffba0a2a80d60936d4)