milvus

Commit Graph

Author	SHA1	Message	Date
Alexander Guzhva	3addc68c66	enhance: [Cherry-pick] Custom bitset and bitset_view implementations (#31592 ) Issue: https://github.com/milvus-io/milvus/issues/31285 pr: https://github.com/milvus-io/milvus/pull/30454 Basically, I've replaced FixedVector<bool> and boost::dynamic_bitset with custom bitset and bitsetview in order to reduce the memory bandwidth & increase performance for the filtering. (cherry picked from commit `5dcecc882d`)	2024-03-26 10:05:09 +08:00
groot	a0535edb67	enhance: Support MinIO TLS connection (#31396 ) issue: https://github.com/milvus-io/milvus/issues/30709 pr: https://github.com/milvus-io/milvus/pull/31292 Signed-off-by: yhmo <yihua.mo@zilliz.com> Co-authored-by: Chen Rao <chenrao317328@163.com>	2024-03-21 11:15:20 +08:00
zhagnlu	6856ba1e69	fix: fix mmap failed when string field all value is empty (#31418 ) pr: #31406 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-03-20 14:45:10 +08:00
sammy.huang	b773581fde	enhance: fetch simdjson directly in the format of targz (#31370 ) pr: #31369 Signed-off-by: Liang Huang <sammy.huang@zilliz.com>	2024-03-18 18:55:10 +08:00
liliu-z	fdb3231151	enhance: Upgrade Knowhere (#31308 ) /kind improvement Signed-off-by: Li Liu <li.liu@zilliz.com>	2024-03-18 14:21:04 +08:00
Gao	038c570ef3	enhance: upgrade folly to run on arm (#31284 ) Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-03-15 15:39:03 +08:00
Chun Han	6939ad15f2	fix:possible out-of-bound due to groupby when reduing(#30711 ) (#31200 ) related: #30711 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-03-14 13:07:03 +08:00
Buqian Zheng	7fc3094a42	fix: fix growing index data race and properly handle build error (#31170 ) issue: https://github.com/milvus-io/milvus/issues/31169 also properly handling index build error by re-create a new index so that nothing will be left in the previous failed index build attempt. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-13 20:19:04 +08:00
Buqian Zheng	96cfae55a5	feat: [Sparse Float Vector] segcore to support sparse vector search and get raw vector by id (#30629 ) This PR adds the ability to search/get sparse float vectors in segcore, and added unit tests by modifying lots of existing tests into parameterized ones. https://github.com/milvus-io/milvus/issues/29419 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-12 09:16:30 -07:00
zhagnlu	c8b54f321a	fix:restrict pk in [...] optimization situations (#31184 ) #31154 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-03-12 14:49:03 +08:00
cai.zhang	6a83f16871	feat: Support for multiple forms of JSON (#31052 ) issue: #31051 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-03-11 19:55:02 +08:00
Buqian Zheng	070dfc77bf	feat: [Sparse Float Vector] segcore basics and index building (#30357 ) This commit adds sparse float vector support to segcore with the following: 1. data type enum declarations 2. Adds corresponding data structures for handling sparse float vectors in various scenarios, including: * FieldData as a bridge between the binlog and the in memory data structures * mmap::Column as the in memory representation of a sparse float vector column of a sealed segment; * ConcurrentVector as the in memory representation of a sparse float vector of a growing segment which supports inserts. 3. Adds logic in payload reader/writer to serialize/deserialize from/to binlog 4. Adds the ability to allow the index node to build sparse float vector index 5. Adds the ability to allow the query node to build growing index for growing segment and temp index for sealed segment without index built This commit also includes some code cleanness, comment improvement, and some unit tests for sparse vector. https://github.com/milvus-io/milvus/issues/29419 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-11 14:45:02 +08:00
Cai Yudong	a99143dd52	fix: Save traceID and spanID as hex string into search config (#31071 ) Issue: #30961 Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-03-11 14:21:01 +08:00
sre-ci-robot	53af6d8c59	[automated] Update Knowhere Commit (#31151 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-03-09 01:55:02 +08:00
Cai Yudong	122981aeb9	fix: Disable knowhere trace as a quick fix (#31055 ) Issue: #30961 Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-03-08 15:27:01 +08:00
Chun Han	3574bdf858	enhance: ban range-search iteration for search-group-by (#30824 ) related: #30033 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-03-08 14:17:00 +08:00
presburger	19c64067af	enhance: jemalloc aarch64 platform use 64k pagesize. (#29522 ) enhance: jemalloc aarch64 platform use 64k pagesize. issue: #28843 Signed-off-by: Yusheng.Ma <Yusheng.Ma@zilliz.com>	2024-03-07 21:01:01 +08:00
sre-ci-robot	2d9de233fc	[automated] Update Knowhere Commit (#31089 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-03-07 12:05:02 +08:00
sre-ci-robot	c047f09110	[automated] Update Knowhere Commit (#31015 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-03-05 16:31:00 +08:00
Xiaofan	4bda6c33ad	fix: binary vector should not limit dimension to 32768 (#30676 ) all the vector dimension check should happen on collection creation but not index build fix #30285 Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-03-05 14:21:00 +08:00
sre-ci-robot	3dc5e38240	[automated] Update Knowhere Commit (#30989 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-03-04 16:34:59 +08:00
MrPresent-Han	29f44f840a	enhance: refine groupBy error msg(#29968 ) (#30920 ) related: #29968 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-03-01 18:53:03 +08:00
cai.zhang	1aa97a5c21	enhance: Support more relational operators for binary expressions (#30902 ) issue: #30677 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-03-01 16:57:00 +08:00
Jiquan Long	e2f35954d4	enhance: support pattern matching on json field (#30779 ) issue: https://github.com/milvus-io/milvus/issues/30714 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-28 18:31:00 +08:00
Jiquan Long	16b785e149	enhance: optimize the memory usage and speed up loading variable length data (#30787 ) /kind improvement this removes the 1x copying while loading variable length data, also avoids constructing std::string, which could lead to memory fragmentation --------- Signed-off-by: yah01 <yah2er0ne@outlook.com> Signed-off-by: longjiquan <jiquan.long@zilliz.com> Co-authored-by: yah01 <yah2er0ne@outlook.com>	2024-02-28 16:45:00 +08:00
Jiquan Long	4459078e0b	fix: wrong num_entities used when mmap variable length data (#30848 ) https://github.com/milvus-io/milvus/issues/30728 Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-28 16:38:56 +08:00
congqixia	a115b731ed	enhance: fix old pr cpp format issue (#30894 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-28 16:28:20 +08:00
Buqian Zheng	f658dd5faa	enhance: update knowhere version to 60a5c9c (#30788 ) /kind improvement Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-02-28 14:18:55 +08:00
Cai Yudong	8a219e0102	feat: Support knowhere trace using OpenTelemetry (#30750 ) Issue: #21508 Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-02-28 12:29:00 +08:00
sre-ci-robot	6e9f3ea531	[automated] Update Knowhere Commit (#30744 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-02-28 10:50:57 +08:00
yah01	57397b1307	enhance: add new LRU cache impl (#30360 ) - remove the unused LRU cache - add new LRU cache impl which wraps github.com/karlseguin/ccache related #30361 --------- Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-02-27 20:58:40 +08:00
Jiquan Long	3e82d21ca1	enhance: reduce 1x memory copy when loading json (#30753 ) /kind improvement --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-27 10:18:55 +08:00
Jiquan Long	e2330f02f8	fix: pattern match use incorrect raw data (#30764 ) issue: https://github.com/milvus-io/milvus/issues/30687 We store all the varchar datas in an continuous address and use string_view to quickly find them. In this case, using string_view.data() directly will point to all rest varchar datas. --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-22 19:56:52 +08:00
MrPresent-Han	77eb6defb1	feat: support groupby on growing and non-indexed sealed egment(#30307 ) (#30644 ) related: #30308 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-02-21 14:02:53 +08:00
zhagnlu	18aac076de	fix: move test from NEON to X86 (#30324 ) #26137 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-21 11:58:53 +08:00
zhagnlu	0118bef2a2	fix: replace sse2 simd interface with older version (#30668 ) #30667 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-21 10:04:54 +08:00
zhagnlu	976b6fc0e4	enhance: change opendal as compile configurable (#30384 ) #30373 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-20 19:16:52 +08:00
yah01	b74673c147	enhance: calculate the accuracy memory usage while loading segment (#30473 ) the old version Knowhere would copy the index data while loading, we need to consider this to avoid OOM. Knowhere provides a util function to indicate whether it will load the index with disk, if not, we need to double the memory usage prediction for index data Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-02-20 14:52:51 +08:00
foxspy	43e8cd531d	enhance: Update Knowhere version (#30675 ) issue: #30669 Signed-off-by: xianliang <xianliang.li@zilliz.com>	2024-02-19 22:04:51 +08:00
congqixia	18c351efa6	fix: Prevent ChunkCache use absolute path in All-in-one mode (#30666 ) See also #30651 Append operator of `std::filesystem::path` will replace whole path when the param of "/" operation is an absolute path. In "All-in-one" mode, this shall cause ChunkCache removing the original vector data file when building chunk cache during/after load procedure. This PR changes the ChunkCache path generation logic to a separate function in which will check whether the file path is absolute or not. If the file path is absolute, it removes the root path prefix and return concatenated file path. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-19 20:58:51 +08:00
Cai Yudong	5bb28a9ea4	enhance: Print out range_filter and radius when range search param check fail (#30623 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-02-18 15:40:48 +08:00
Alexander Guzhva	a297baae9d	enhance: remove unused code (#30601 ) Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>	2024-02-13 10:26:47 +08:00
zhagnlu	e8a6f1ea2b	fix: erase pk empty check when pk index replace raw data (#30432 ) #30350 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-07 14:56:47 +08:00
MrPresent-Han	92d1d744ae	fix: groupby results lack good results(#29883 ) (#30428 ) related: #29883 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-02-06 17:08:34 +08:00
cqy123456	5449e862d5	fix: safety access unordered_map and remove some useless code excute (#30504 ) issue: https://github.com/milvus-io/milvus/issues/30358 and https://github.com/milvus-io/milvus/issues/30491 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-02-05 22:03:09 +08:00
sre-ci-robot	ebbe32df9a	[automated] Update Knowhere Commit (#30515 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-02-05 01:32:44 +08:00
Jiquan Long	a587450e56	enhance: [skip-e2e] disable asan (#30498 ) fix: #30511 /kind improvement --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-04 21:25:05 +08:00
sre-ci-robot	20c9cfc587	[automated] Update Knowhere Commit (#30487 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-02-04 01:23:04 +08:00
Jiquan Long	e549148a19	enhance: full-support for wildcard pattern matching (#30288 ) issue: #29988 This pr adds full-support for wildcard pattern matching from end to end. Before this pr, the users can only use prefix match in their expression, for example, "like 'prefix%'". With this pr, more flexible syntax can be combined. To do so, this pr makes these changes: - 1. support regex query both on index and raw data; - 2. translate the pattern matching to regex query, so that it can be handled by the regex query logic; - 3. loose the limit of the expression parsing, which allows general pattern matching syntax; With the support of regex query in segcore backend, we can also add mysql-like `REGEXP` syntax later easily. --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-01 12:37:04 +08:00
PowderLi	5cf9bb236e	enhance: restful support import jobs (#30343 ) issue: #28521 #29732 include 1. list collection's import jobs 2. create a new import job 3. get the progress of an import job fix: 1. mix the order of dbName & collectionName #29728 2. trace log keep the same as v1 3. support traceID 4. azure precheck, blob name cannot end with / #29703 --------- Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-31 17:57:04 +08:00

1 2 3 4 5 ...

1402 Commits (cdc_test)