From 0d4daa22e00adeeccb31b6999d5ab7ffa7013b50 Mon Sep 17 00:00:00 2001 From: "zhenshan.cao" Date: Tue, 20 Apr 2021 13:53:56 +0800 Subject: [PATCH] Update README --- CHANGELOG.md | 979 +++++++++++++++++++++++++++++++++++++++++++++ CODE_OF_CONDUCT.md | 48 +++ COMMUNITY.md | 99 +++++ CONTRIBUTING.md | 128 ++++++ DESIGN.md | 13 + INSTALL.md | 317 +++++++++++++++ LICENSE | 201 ++++++++++ NOTICE.md | 24 ++ README.md | 90 ++++- README_CN.md | 95 +++++ RELEASE.md | 17 + SECURITY.md | 15 + SUPPORT.md | 14 + 13 files changed, 2039 insertions(+), 1 deletion(-) create mode 100644 CHANGELOG.md create mode 100644 CODE_OF_CONDUCT.md create mode 100644 COMMUNITY.md create mode 100644 CONTRIBUTING.md create mode 100644 DESIGN.md create mode 100644 INSTALL.md create mode 100644 LICENSE create mode 100644 NOTICE.md create mode 100644 README_CN.md create mode 100644 RELEASE.md create mode 100644 SECURITY.md create mode 100644 SUPPORT.md diff --git a/CHANGELOG.md b/CHANGELOG.md new file mode 100644 index 0000000000..eb19eadcb1 --- /dev/null +++ b/CHANGELOG.md @@ -0,0 +1,979 @@ +# Changelog + +Please mark all change in change log and use the issue from GitHub + +# Milvus 1.1.0 (TBD) +## Bug +- \#4778 Fix failure to open vector index in mishards +- \#4797 Fix wrong results of merged different 'topk' queries +- \#4838 When an index created on the empty collection, the server will respond immediately +- \#4858 Fix the crash when query for a large topk on GPU Flat +- \#4862 Read-only nodes not to merge segments +- \#4894 The capacity of bloom filter should be determined by the row count of its segment +- \#4908 GPU cache will not be cleaned up in time after the collection is dropped +- \#4933 Milvus waste time during building index for small segment + +## Feature +- \#1434 Storage: enabling s3 storage support (implemented by Unisinsight) + +## Improvement +- \#4756 Improve the performance of get_vectors_by_id +- \#4856 Upgrade hnswlib to v0.5.0 + +# Milvus 1.0.0 (2021-03-09) +## Bug +- \#4739 Fix mishards probe test problem +- \#4749 Fix minor memory leak when building IVF_SQ8 on GPU +- \#4757 Fix minor memory leak when querying by IVF_SQ8H +- \#4765 Fix minor memory leak when building NSG +- \#4766 Fix minor memory leak when querying by BinaryFlat + +## Feature +- \#3977 Support logging to stdout + +## Improvement +- \#4754 Reduce the package download size of grpc-milvus for C++ SDK + +## Task + +# Milvus 0.10.6 (2021-02-23) +## Bug +- \#4683 A negative zero may be returned if the metric_type is Tanimoto +- \#4678 Server crash on BinaryFlat if dimension is not a power of 2 +- \#4719 The GPU cache holds much more data than the configured value + +## Feature + +## Improvement +- \#1970 Improve the performance of BinaryFlat by AVX2 +- \#3920 Add an optional parameter 'nbtis' for IVF_PQ +- \#4676,#4614 Support configurable metric labels cluster and instance for Prometheus (implemented by IQIYI) + +## Task + +# Milvus 0.10.5 (2021-01-07) +## Bug +- \#4296,#4554 Fix mishards add name-mismatched pod to read-only group +- \#4307 Specify partition to load for load_collection() +- \#4378 Multi-threads to call load_collection() and search() cause Milvus hang +- \#4484 Milvus only search default partition if search parameter 'partition_tags' contains '_default' +- \#4492 while share file system servcie became abnormal, the milvus can not supply query service + +## Feature +- \#4504 Add a metric to display the number of files opened by Milvus + +## Improvement +- \#4454 Optimize the process of indexing and querying + +## Task + +# Milvus 0.10.4 (2020-12-02) +## Bug +- \#3626 Fix server crash when searching with IVF_PQ on GPU +- \#3903 The performance of IVF_SQ8H in 0.10.3 is degraded +- \#3906 Fix the delete task state to avoid server crash +- \#4012 Milvus hangs when continually creating and dropping partitions +- \#4174 Fix out of memory caused by too many data loaded to GPU +- \#4318 Fix memory leak in IVF indexes + +## Feature +- \#3773 Support IVF_PQ to run on FPGA (implemented by Montage-tech) + +## Improvement +- \#3775 Improve search performance in the case that no item deleted +- \#4075 Improve performance for create large amount of partitions + +## Task + +# Milvus 0.10.3 (2020-9-21) +## Bug +- \#3536 Release search task in time to avoid excessive memory usage +- \#3656 Fix to check search params 'nprobe' of BIN_IVF_FLAT +- \#3742 If the GPU cache is too small, IVF_SQ8H using multiple GPUs will cause to crash +- \#3760 After IVF_SQ8H querying, an CUDA error will occur when Milvus exits + +## Feature + +## Improvement +- \#3213 Allow users to specify a distance type at runtime for Flat index +- \#3254 Allow more choices for the parameter ‘m’ of IVF_PQ +- \#3606 The supported parameter 'nprobe' of IVF is expanded to [1, 16384] +- \#3639 The supported parameter 'topk' of searching is expanded to [1, 16384] +- \#3784 Upgrade mishards up to v0.10.3 + +## Task + +# Milvus 0.10.2 (2020-08-15) + +## Bug +- \#2890 Fix the index size caculation in cache +- \#2952 Fix the result merging of IVF_PQ IP +- \#2975 Fix config UT failed +- \#3012 If the cache is too small, queries using multiple GPUs will cause to crash +- \#3133 Reverse query result in mishards if metric type is IP + +## Feature + +## Improvement +- \#2653 Improve IVF search performance when NQ and nProbe are both large +- \#2828 Let Faiss not to compile half float by default + +## Task + +# Milvus 0.10.1 (2020-07-20) + +## Bug +- \#2487 Enlarge timeout value for creating collection +- \#2487 HotFix release lock failed on NAS +- \#2557 Fix random crash of INSERT_DUPLICATE_ID case +- \#2578 Result count doesn't match target vectors count +- \#2585 Support IVF_PQ IP on GPU +- \#2598 Fix Milvus docker image report illegal instruction +- \#2617 Fix HNSW and RNSG index files size +- \#2637 Suit the range of HNSW parameters +- \#2642 Create index failed and server crashed +- \#2649 Search parameter of annoy has conflict with document +- \#2690 Remove body parser in show-partitions endpoints +- \#2692 Milvus hangs during multi-thread concurrent search +- \#2739 Fix mishards start failed +- \#2752 Milvus formats vectors data to double-precision and return to http client +- \#2767 Fix a bug of getting wrong nprobe limitation in knowhere on GPU version +- \#2768 After building the index, the number of vectors increases +- \#2774 Server down during loading data +- \#2776 Fix too many data copies during creating IVF index +- \#2813 To implemente RNSG IP + +## Feature + +## Improvement +- \#2932 Upgrade mishards for milvus 0.10.1 + +## Task + +# Milvus 0.10.0 (2020-06-15) + +## Bug +- \#2367 Fix inconsistent reading and writing when using mishards +- \#2368 Make read node detect delete behavior +- \#2394 Drop collection timeout if too many partitions created on collection +- \#2549 Launch server fail using demo config +- \#2564 cache.cache_size range check error + +## Feature +- \#2363 Update branch version +- \#2510 Upgrade Milvus config + +## Improvement +- \#2381 Upgrade FAISS to 1.6.3 +- \#2429 Fix Milvus 0.9.1 performance degrade issue +- \#2441 Improve Knowhere code coverage +- \#2466 Optimize k-selection implemention of faiss gpu version +- \#2489 Add exception throw on mysql meta error +- \#2495 Add creating lock file failure reason. +- \#2516 Improve unit test coverage +- \#2548 Upgrade mishards for milvus v0.10.0 + +## Task + +# Milvus 0.9.1 (2020-05-29) + +## Bug +- \#2366 Reduce SQL execution times for collection contains lot of partitions +- \#2378 Duplicate data after server restart +- \#2395 Fix large nq cudaMalloc error +- \#2399 The nlist set by the user may not take effect +- \#2403 MySQL max_idle_time is 10 by default +- \#2450 The deleted vectors may be found on GPU +- \#2456 openblas library install failed + +## Feature + +## Improvement +- \#2353 Remove log_config from code and scripts +- \#2370 Clean compile warning +- \#2410 Logging build index progress +- \#2461 Upgrade mishards for milvus 0.9.1 + +# Milvus 0.9.0 (2020-05-15) + +## Bug +- \#1705 Limit the insert data batch size +- \#1776 Error out when index SQ8H run in CPU mode +- \#1925 To flush all collections, flush cannot work +- \#1929 Skip MySQL meta schema field width check +- \#1946 Fix load index file CPU2GPU fail during searching +- \#1955 Switch create_index operation to background once client break connection +- \#1997 Index file missed after compact +- \#2002 Remove log error msg `Attributes is null` +- \#2073 Fix CheckDBConfigBackendUrl error message +- \#2076 CheckMetricConfigAddress error message +- \#2120 Fix Search expected failed if search params set invalid +- \#2121 Allow regex match partition tag when search +- \#2128 Check has_partition params +- \#2131 Distance/ID returned is not correct if searching with duplicate ids +- \#2141 Fix server start failed if wal directory exist +- \#2169 Fix SingleIndexTest.IVFSQHybrid unittest +- \#2194 Fix get collection info failed +- \#2196 Fix server start failed if wal is disabled +- \#2203 0.8.0 id=-1 is returned when total count < topk +- \#2228 Fix show partitions failed in http module +- \#2231 Use server_config to define hard-delete delay time for segment files +- \#2261 Re-define result returned by has_collection if collection in delete state +- \#2264 Milvus opened too many files when the metric_config.enable_monitor=true +- \#2266 Server hangs when using multi-clients to query different collections +- \#2280 has_partition should return true for `_default` + +## Feature +- \#1751 Add api SearchByID +- \#1752 Add api GetVectorsByID +- \#1962 Add api HasPartition +- \#1965 FAISS/NSG/HNSW/ANNOY use unified distance calculation algorithm +- \#2054 Check if CPU instruction sets are illegal +- \#2057 Add a config parameter to switch off http server +- \#2059 Add lock file avoid multiple instances modifying data at the same time +- \#2064 Warn when use SQLite as metadata management +- \#2111 Check GPU environment before start server +- \#2206 Log file rotating +- \#2240 Obtain running rpc requests information +- \#2268 Intelligently detect openblas library in system to avoid installing from source code every time +- \#2283 Suspend the building tasks when any query comand arrives. + +## Improvement +- \#221 Refactor LOG macro +- \#833 Catch exception in RolloutHandler and output in stderr +- \#1796 Compile Openblas with source code to improve the performance +- \#1942 Background merge file strategy +- \#2039 Support Milvus run on SSE CPUs +- \#2149 Merge server_cpu_config.template and server_gpu_config.template +- \#2153 Upgrade thirdparty oatpp to v1.0.0 +- \#2167 Merge log_config.conf with server_config.yaml +- \#2173 Check storage permission +- \#2178 Using elkan K-Means to improve IVF +- \#2185 Change id to string format in http module +- \#2186 Update endpoints in http module +- \#2190 Fix memory usage is twice of index size when using GPU searching +- \#2248 Use hostname and port as instance label of metrics +- \#2252 Upgrade mishards APIs and requirements +- \#2256 k-means clustering algorithm use only Euclidean distance metric +- \#2300 Upgrade mishrads configuration to version 0.4 +- \#2311 Update mishards methods +- \#2330 Change url for behavior 'get_entities_by_id' +- \#2347 Update http document for v0.9.0 +- \#2358 Upgrade mishards for v0.9.0 + +## Task + +# Milvus 0.8.0 (2020-04-15) + +## Bug +- \#1276 SQLite throw exception after create 50000+ partitions in a table +- \#1762 Server is not forbidden to create new partition which tag is `_default` +- \#1789 Fix multi-client search cause server crash +- \#1832 Fix crash in tracing module +- \#1873 Fix index file serialize to incorrect path +- \#1881 Fix bad alloc when index files lost +- \#1883 Fix inserted vectors becomes all zero when index_file_size >= 2GB +- \#1901 Search failed with flat index +- \#1903 Fix invalid annoy result +- \#1910 C++ SDK GetIDsInSegment could not work for large dataset + +## Feature +- \#261 Integrate ANNOY into Milvus +- \#1655 GPU index support delete vectors +- \#1660 IVF PQ CPU support deleted vectors searching +- \#1661 HNSW support deleted vectors searching +- \#1825 Add annoy index type in C++ sdk +- \#1849 NSG support deleted vectors searching +- \#1893 Log config information and device information + +## Improvement +- \#1627 Move read/write index APIs into codec +- \#1784 Add Substructure and Superstructure in http module +- \#1858 Disable S3 build +- \#1882 Add index annoy into http module +- \#1885 Optimize knowhere unittest +- \#1886 Refactor log on search and insert request +- \#1897 Heap pop and push can be realized by heap_swap_top +- \#1921 Use TimeRecorder instead of chrono +- \#1928 Fix too many data and uid copies when loading files +- \#1930 Upgrade mishards to v0.8.0 + +## Task + +# Milvus 0.7.1 (2020-03-29) + +## Bug +- \#1301 Data in WAL may be accidentally inserted into a new table with the same name. +- \#1634 Fix search demo bug in HTTP doc +- \#1635 Vectors can be returned by searching after vectors deleted if `cache_insert_data` set true +- \#1648 The cache cannot be used all when the vector type is binary +- \#1651 Check validity of dimension when collection metric type is binary one +- \#1663 PQ index parameter 'm' validation +- \#1686 API search_in_files cannot work correctly when vectors is stored in certain non-default partition +- \#1689 Fix SQ8H search fail on SIFT-1B dataset +- \#1667 Create index failed with type: rnsg if metric_type is IP +- \#1708 NSG search crashed +- \#1724 Remove unused unittests +- \#1728 Optimize request handler to combine similar query +- \#1734 Opentracing for combined search request +- \#1735 Fix search out of memory with ivf_flat +- \#1747 Expected error status if search with partition_tag not existed +- \#1756 Fix memory exhausted during searching +- \#1781 Fix search hang with SQ8H +- \#1812 Fix incorrect request method in search example in http readme +- \#1818 Duplicate data generated after restart milvus server + +## Feature +- \#1603 BinaryFlat add 2 Metric: Substructure and Superstructure + +## Improvement +- \#267 Improve search performance: reduce delay +- \#342 Knowhere and Wrapper refactor +- \#1537 Optimize raw vector and uids read/write +- \#1546 Move Config.cpp to config directory +- \#1547 Rename storage/file to storage/disk and rename classes +- \#1548 Move store/Directory to storage/Operation and add FSHandler +- \#1572 Optimize config cpu/gpu cache_capacity setter +- \#1619 Improve compact performance +- \#1649 Fix Milvus crash on old CPU +- \#1653 IndexFlat (SSE) and IndexBinaryFlat performance improvement for small NQ +- \#1678 Remove CUSTOMIZATION macro +- \#1698 Upgrade mishards to v0.7.0 +- \#1719 Improve Milvus log +- \#1754 Optimize behavior to get file ids from metadata in mishards +- \#1799 Update docker images to 0.7.1 in mishards + +## Task + +# Milvus 0.7.0 (2020-03-11) + +## Bug +- \#715 Milvus crash when searching and building index simultaneously using SQ8H +- \#744 Don't return partition table for show_tables +- \#770 Server unittest run failed on low-end server +- \#805 IVFTest.gpu_seal_test unittest failed +- \#831 Judge branch error in CommonUtil.cpp +- \#977 Server crash when create tables concurrently +- \#990 Check gpu resources setting when assign repeated value +- \#995 Table count set to 0 if no tables found +- \#1010 Improve error message when offset or page_size is equal 0 +- \#1022 Check if partition name is valid +- \#1028 Check if table exists when show partitions +- \#1029 Check if table exists when try to delete partition +- \#1066 Optimize http insert and search speed +- \#1022 Check if partition name is legal +- \#1028 Check if table exists when show partitions +- \#1029 Check if table exists when try to delete partition +- \#1066 Optimize http insert and search speed +- \#1067 Add binary vectors support in http server +- \#1075 Improve error message when page size or offset is illegal +- \#1082 Check page_size or offset value to avoid float +- \#1115 Http server support load table into memory +- \#1152 Error log output continuously after server start +- \#1211 Server down caused by searching with index_type: HNSW +- \#1240 Update license declaration +- \#1298 Unit test failed when on CPU2GPU case +- \#1359 Negative distance value returned when searching with HNSW index type +- \#1429 Server crashed when searching vectors with GPU +- \#1476 Fix vectors results bug when getting vectors from segments +- \#1484 Index type changed to IDMAP after compacted +- \#1491 Server crashed during adding vectors +- \#1499 Fix duplicated ID number issue +- \#1504 Avoid possible race condition between delete and search +- \#1507 set_config for insert_buffer_size is wrong +- \#1510 Add set interfaces for WAL configurations +- \#1511 Fix big integer cannot pass to server correctly +- \#1517 Result is not correct when search vectors in multi partition, index type is RNSG +- \#1518 Table count did not match after deleting vectors and compact +- \#1521 Make cache_insert_data take effect in-service +- \#1525 Add setter API for config preload_table +- \#1529 Fix server crash when cache_insert_data enabled +- \#1530 Set table file with correct engine type in meta +- \#1532 Search with ivf_flat failed with open-dataset: sift-256-hamming +- \#1535 Degradation searching performance with metric_type: binary_idmap +- \#1549 Fix server/wal config setting bug +- \#1556 Index file not created after table and index created +- \#1560 Search crashed with Super-high dimensional binary vector +- \#1564 Too low recall for glove-200-angular, ivf_pq index +- \#1571 Meta engine type become IDMAP after dropping index for BINARY table +- \#1574 Set all existing bitset in cache when applying deletes +- \#1577 Row count incorrect if delete vectors then create index +- \#1580 Old segment folder not removed after merge/compact if create_index is called before adding data +- \#1590 Server down caused by failure to write file during concurrent mixed operations +- \#1598 Server down during mixed operations +- \#1601 External link bug in HTTP doc +- \#1609 Refine Compact function +- \#1808 Building index params check for Annoy +- \#1852 Search index type failed with reason `failed to load index file` + +## Feature +- \#216 Add CLI to get server info +- \#343 Add Opentracing +- \#665 Support get/set config via CLI +- \#759 Put C++ sdk out of milvus/core +- \#766 If partition tag is similar, wrong partition is searched +- \#771 Add server build commit info interface +- \#788 Add web server into server module +- \#813 Add push mode for prometheus monitor +- \#815 Support MinIO storage +- \#823 Support binary vector tanimoto/jaccard/hamming metric +- \#830 Support WAL(write-ahead logging) +- \#853 Support HNSW +- \#861 Support DeleteById / SearchByID / GetVectorById / Flush +- \#910 Change Milvus c++ standard to c++17 +- \#1122 Support AVX-512 in FAISS +- \#1204 Add api to get table data information +- \#1250 Support CPU profiling +- \#1302 Get all record IDs in a segment by given a segment id +- \#1461 Add crud APIs and segments APIs into http module +- \#1463 Update config version to 0.2 +- \#1531 Remove S3 related config + +## Improvement +- \#738 Use Openblas / lapack from apt install +- \#758 Enhance config description +- \#791 Remove Arrow +- \#834 Add cpu mode for built-in Faiss +- \#848 Add ready-to-use config files to the Milvus repo for enhanced user experince +- \#860 Remove redundant checks in CacheMgr's constructor +- \#908 Move "primary_path" and "secondary_path" to storage config +- \#931 Remove "collector" from config +- \#966 Update NOTICE.md +- \#1002 Rename minio to s3 in Storage Config section +- \#1078 Move 'insert_buffer_size' to Cache Config section +- \#1105 Error message is not clear when creating IVFSQ8H index without gpu resources +- \#740, #849, #878, #972, #1033, #1161, #1173, #1199, #1190, #1223, #1222, #1257, #1264, #1269, #1164, #1303, #1304, #1324, #1388, #1459 Various fixes and improvements for Milvus documentation. +- \#1297 Hide partition_name parameter, avid user directly access partition table +- \#1234 Do S3 server validation check when Milvus startup +- \#1263 Allow system conf modifiable and some take effect directly +- \#1310 Add default partition tag for a table +- \#1320 Remove debug logging from faiss +- \#1426 Support to configure whether to enabled autoflush and the autoflush interval +- \#1444 Improve delete +- \#1448 General proto api for NNS libraries +- \#1480 Add return code for AVX512 selection +- \#1524 Update config "preload_table" description +- \#1544 Update resources name in HTTP module +- \#1567 Update yaml config description + +## Task +- \#1327 Exclude third-party code from codebeat +- \#1331 Exclude third-party code from codacy + +# Milvus 0.6.0 (2019-12-07) + +## Bug +- \#228 Memory usage increased slowly during searching vectors +- \#246 Exclude src/external folder from code coverage for jenkin ci +- \#248 Reside src/external in thirdparty +- \#316 Some files not merged after vectors added +- \#327 Search does not use GPU when index type is FLAT +- \#331 Add exception handle when search fail +- \#340 Test cases run failed on 0.6.0 +- \#353 Rename config.h.in to version.h.in +- \#374 sdk_simple return empty result +- \#377 Create partition success if tag name only contains spaces +- \#397 sdk_simple return incorrect result +- \#399 Create partition should be failed if partition tag existed +- \#412 Message returned is confused when partition created with null partition name +- \#416 Drop the same partition success repeatally +- \#440 Query API in customization still uses old version +- \#440 Server cannot startup with gpu_resource_config.enable=false in GPU version +- \#458 Index data is not compatible between 0.5 and 0.6 +- \#465 Server hang caused by searching with nsg index +- \#485 Increase code coverage rate +- \#486 gpu no usage during index building +- \#497 CPU-version search performance decreased +- \#504 The code coverage rate of core/src/scheduler/optimizer is too low +- \#509 IVF_PQ index build trapped into dead loop caused by invalid params +- \#513 Unittest DELETE_BY_RANGE sometimes failed +- \#523 Erase file data from cache once the file is marked as deleted +- \#527 faiss benchmark not compatible with faiss 1.6.0 +- \#530 BuildIndex stop when do build index and search simultaneously +- \#532 Assigin value to `table_name` from confest shell +- \#533 NSG build failed with MetricType Inner Product +- \#543 client raise exception in shards when search results is empty +- \#545 Avoid dead circle of build index thread when error occurs +- \#547 NSG build failed using GPU-edition if set gpu_enable false +- \#548 NSG search accuracy is too low +- \#552 Server down during building index_type: IVF_PQ using GPU-edition +- \#561 Milvus server should report exception/error message or terminate on mysql metadata backend error +- \#579 Build index hang in GPU version when gpu_resources disabled +- \#596 Frequently insert operation cost too much disk space +- \#599 Build index log is incorrect +- \#602 Optimizer specify wrong gpu_id +- \#606 No log generated during building index with CPU +- \#616 IP search metric_type is not supported by IVF_PQ index +- \#631 FAISS isn't compiled with O3 option +- \#636 (CPU) Create index PQ should be failed if table metric type set Inner Product +- \#649 Typo "partiton" should be "partition" +- \#654 Random crash when frequently insert vector one by one +- \#658 Milvus error out when building SQ8H index without GPU resources +- \#668 Update badge of README +- \#670 Random failure of unittest db_test::SEARCH_TEST +- \#674 Server down in stability test +- \#696 Metric_type changed from IP to L2 +- \#705 Fix search SQ8H crash without GPU resource + +## Feature +- \#12 Pure CPU version for Milvus +- \#77 Support table partition +- \#127 Support new Index type IVFPQ +- \#226 Experimental shards middleware for Milvus +- \#227 Support new index types SPTAG-KDT and SPTAG-BKT +- \#346 Support build index with multiple gpu +- \#420 Update shards merge part to match v0.5.3 +- \#488 Add log in scheduler/optimizer +- \#502 C++ SDK support IVFPQ and SPTAG +- \#560 Add version in server config file +- \#605 Print more messages when server start +- \#644 Add a new rpc command to get milvus build version whether cpu or gpu +- \#709 Show last commit id when server start + +## Improvement +- \#255 Add ivfsq8 test report detailed version +- \#260 C++ SDK README +- \#266 RPC request source code refactor +- \#274 Logger the time cost during preloading data +- \#275 Rename C++ SDK IndexType +- \#284 Change C++ SDK to shared library +- \#306 Use int64 for all config integer +- \#310 Add Q&A for 'protocol https not supported or disable in libcurl' issue +- \#314 add Find FAISS in CMake +- \#322 Add option to enable / disable prometheus +- \#354 Build migration scripts into milvus docker image +- \#358 Add more information in build.sh and install.md +- \#404 Add virtual method Init() in Pass abstract class +- \#409 Add a Fallback pass in optimizer +- \#433 C++ SDK query result is not easy to use +- \#449 Add ShowPartitions example for C++ SDK +- \#470 Small raw files should not be build index +- \#584 Intergrate internal FAISS +- \#611 Remove MILVUS_CPU_VERSION +- \#634 FAISS GPU version is compiled with O0 +- \#737 Refactor server module to separate Grpc from server handler and scheduler + +## Task + +# Milvus 0.5.3 (2019-11-13) + +## Bug +- \#258 Bytes type in proto cause big-endian/little-endian problem + +## Feature + +## Improvement +- \#204 improve grpc performance in search +- \#207 Add more unittest for config set/get +- \#208 Optimize unittest to support run single test more easily +- \#284 Change C++ SDK to shared library +- \#260 C++ SDK README + +## Task + +# Milvus 0.5.2 (2019-11-07) + +## Bug +- \#194 Search faild: message="Table file doesn't exist" + +## Feature + +## Improvement +- \#190 Update default config:use_blas_threshold to 1100 and server version printout to 0.5.2 + +## Task + +# Milvus 0.5.1 (2019-11-04) + +## Bug +- \#134 JFrog cache error +- \#161 Search IVFSQHybrid crash on gpu +- \#169 IVF_FLAT search out of memory + +## Feature +- \#90 The server start error messages could be improved to enhance user experience +- \#104 test_scheduler core dump +- \#115 Using new structure for tasktable +- \#139 New config option use_gpu_threshold +- \#146 Add only GPU and only CPU version for IVF_SQ8 and IVF_FLAT +- \#164 Add CPU version for building index + +## Improvement +- \#64 Improvement dump function in scheduler +- \#80 Print version information into log during server start +- \#82 Move easyloggingpp into "external" directory +- \#92 Speed up CMake build process +- \#96 Remove .a file in milvus/lib for docker-version +- \#118 Using shared_ptr instead of weak_ptr to avoid performance loss +- \#122 Add unique id for Job +- \#130 Set task state MOVED after resource copy it completed +- \#149 Improve large query optimizer pass +- \#156 Not return error when search_resources and index_build_device set cpu +- \#159 Change the configuration name from 'use_gpu_threshold' to 'gpu_search_threshold' +- \#168 Improve result reduce +- \#175 add invalid config unittest + +## Task + +# Milvus 0.5.0 (2019-10-21) + +## Bug +- MS-568 Fix gpuresource free error +- MS-572 Milvus crash when get SIGINT +- MS-577 Unittest Query randomly hung +- MS-587 Count get wrong result after adding vectors and index built immediately +- MS-599 Search wrong result when table created with metric_type: IP +- MS-601 Docker logs error caused by get CPUTemperature error +- MS-605 Server going down during searching vectors +- MS-620 Get table row counts display wrong error code +- MS-622 Delete vectors should be failed if date range is invalid +- MS-624 Search vectors failed if time ranges long enough +- MS-637 Out of memory when load too many tasks +- MS-639 SQ8H index created failed and server hang +- MS-640 Cache object size calculate incorrect +- MS-641 Segment fault(signal 11) in PickToLoad +- MS-644 Search crashed with index-type: flat +- MS-647 grafana display average cpu-temp +- MS-652 IVFSQH quantization double free +- MS-650 SQ8H index create issue +- MS-653 When config check fail, Milvus close without message +- MS-654 Describe index timeout when building index +- MS-658 Fix SQ8 Hybrid can't search +- MS-665 IVF_SQ8H search crash when no GPU resource in search_resources +- \#9 Change default gpu_cache_capacity to 4 +- \#20 C++ sdk example get grpc error +- \#23 Add unittest to improve code coverage +- \#31 make clang-format failed after run build.sh -l +- \#39 Create SQ8H index hang if using github server version +- \#30 Some troubleshoot messages in Milvus do not provide enough information +- \#48 Config unittest failed +- \#59 Topk result is incorrect for small dataset + +## Improvement +- MS-552 Add and change the easylogging library +- MS-553 Refine cache code +- MS-555 Remove old scheduler +- MS-556 Add Job Definition in Scheduler +- MS-557 Merge Log.h +- MS-558 Refine status code +- MS-562 Add JobMgr and TaskCreator in Scheduler +- MS-566 Refactor cmake +- MS-574 Milvus configuration refactor +- MS-578 Make sure milvus5.0 don't crack 0.3.1 data +- MS-585 Update namespace in scheduler +- MS-606 Speed up result reduce +- MS-608 Update TODO names +- MS-609 Update task construct function +- MS-611 Add resources validity check in ResourceMgr +- MS-619 Add optimizer class in scheduler +- MS-626 Refactor DataObj to support cache any type data +- MS-648 Improve unittest +- MS-655 Upgrade SPTAG +- \#42 Put union of index_build_device and search resources to gpu_pool +- \#67 Avoid linking targets multiple times in cmake + +## Feature +- MS-614 Preload table at startup +- MS-627 Integrate new index: IVFSQHybrid +- MS-631 IVFSQ8H Index support +- MS-636 Add optimizer in scheduler for FAISS_IVFSQ8H + +## Task +- MS-554 Change license to Apache 2.0 +- MS-561 Add contributing guidelines, code of conduct and README docs +- MS-567 Add NOTICE.md +- MS-569 Complete the NOTICE.md +- MS-575 Add Clang-format & Clang-tidy & Cpplint +- MS-586 Remove BUILD_FAISS_WITH_MKL option +- MS-590 Refine cmake code to support cpplint +- MS-600 Reconstruct unittest code +- MS-602 Remove zilliz namespace +- MS-610 Change error code base value from hex to decimal +- MS-624 Re-organize project directory for open-source +- MS-635 Add compile option to support customized faiss +- MS-660 add ubuntu_build_deps.sh +- \#18 Add all test cases + +# Milvus 0.4.0 (2019-09-12) + +## Bug +- MS-119 The problem of combining the log files +- MS-121 The problem that user can't change the time zone +- MS-411 Fix metric unittest linking error +- MS-412 Fix gpu cache logical error +- MS-416 ExecutionEngineImpl::GpuCache has not return value cause crash +- MS-417 YAML sequence load disable cause scheduler startup failed +- MS-413 Create index failed and server exited +- MS-427 Describe index error after drop index +- MS-432 Search vectors params nprobe need to check max number +- MS-431 Search vectors params nprobe: 0/-1, expected result: raise exception +- MS-331 Crate Table : when table exists, error code is META_FAILED(code=15) rather than ILLEGAL TABLE NAME(code=9)) +- MS-430 Search no result if index created with FLAT +- MS-443 Create index hang again +- MS-436 Delete vectors failed if index created with index_type: IVF_FLAT/IVF_SQ8 +- MS-449 Add vectors twice success, once with ids, the other no ids +- MS-450 server hang after run stop_server.sh +- MS-458 Keep building index for one file when no gpu resource +- MS-461 Mysql meta unittest failed +- MS-462 Run milvus server twices, should display error +- MS-463 Search timeout +- MS-467 mysql db test failed +- MS-470 Drop index success, which table not created +- MS-471 code coverage run failed +- MS-492 Drop index failed if index have been created with index_type: FLAT +- MS-493 Knowhere unittest crash +- MS-453 GPU search error when nprobe set more than 1024 +- MS-474 Create index hang if use branch-0.3.1 server config +- MS-510 unittest out of memory and crashed +- MS-507 Dataset 10m-512, index type sq8,performance in-normal when set CPU_CACHE to 16 or 64 +- MS-543 SearchTask fail without exception +- MS-582 grafana displays changes frequently + +## Improvement +- MS-327 Clean code for milvus +- MS-336 Scheduler interface +- MS-344 Add TaskTable Test +- MS-345 Add Node Test +- MS-346 Add some implementation of scheduler to solve compile error +- MS-348 Add ResourceFactory Test +- MS-350 Remove knowhere submodule +- MS-354 Add task class and interface in scheduler +- MS-355 Add copy interface in ExcutionEngine +- MS-357 Add minimum schedule function +- MS-359 Add cost test in new scheduler +- MS-361 Add event in resource +- MS-364 Modify tasktableitem in tasktable +- MS-365 Use tasktableitemptr instead in event +- MS-366 Implement TaskTable +- MS-368 Implement cost.cpp +- MS-371 Add TaskTableUpdatedEvent +- MS-373 Add resource test +- MS-374 Add action definition +- MS-375 Add Dump implementation for Event +- MS-376 Add loader and executor enable flag in Resource avoid diskresource execute task +- MS-377 Improve process thread trigger in ResourceMgr, Scheduler and TaskTable +- MS-378 Debug and Update normal_test in scheduler unittest +- MS-379 Add Dump implementation in Resource +- MS-380 Update resource loader and executor, work util all finished +- MS-383 Modify condition variable usage in scheduler +- MS-384 Add global instance of ResourceMgr and Scheduler +- MS-389 Add clone interface in Task +- MS-390 Update resource construct function +- MS-391 Add PushTaskToNeighbourHasExecutor action +- MS-394 Update scheduler unittest +- MS-400 Add timestamp record in task state change function +- MS-402 Add dump implementation for TaskTableItem +- MS-406 Add table flag for meta +- MS-403 Add GpuCacheMgr +- MS-404 Release index after search task done avoid memory increment continues +- MS-405 Add delete task support +- MS-407 Reconstruct MetricsCollector +- MS-408 Add device_id in resource construct function +- MS-409 Using new scheduler +- MS-413 Remove thrift dependency +- MS-410 Add resource config comment +- MS-414 Add TaskType in Scheduler::Task +- MS-415 Add command tasktable to dump all tasktables +- MS-418 Update server_config.template file, set CPU compute only default +- MS-419 Move index_file_size from IndexParam to TableSchema +- MS-421 Add TaskLabel in scheduler +- MS-422 Support DeleteTask in Multi-GpuResource case +- MS-428 Add PushTaskByDataLocality in scheduler +- MS-440 Add DumpTaskTables in sdk +- MS-442 Merge Knowhere +- MS-445 Rename CopyCompleted to LoadCompleted +- MS-451 Update server_config.template file, set GPU compute default +- MS-455 Distribute tasks by minimal cost in scheduler +- MS-460 Put transport speed as weight when choosing neighbour to execute task +- MS-459 Add cache for pick function in tasktable +- MS-476 Improve search performance +- MS-482 Change search stream transport to unary in grpc +- MS-487 Define metric type in CreateTable +- MS-488 Improve code format in scheduler +- MS-495 cmake: integrated knowhere +- MS-496 Change the top_k limitation from 1024 to 2048 +- MS-502 Update tasktable_test in scheduler +- MS-504 Update node_test in scheduler +- MS-505 Install core unit test and add to coverage +- MS-508 Update normal_test in scheduler +- MS-532 Add grpc server unittest +- MS-511 Update resource_test in scheduler +- MS-517 Update resource_mgr_test in scheduler +- MS-518 Add schedinst_test in scheduler +- MS-519 Add event_test in scheduler +- MS-520 Update resource_test in scheduler +- MS-524 Add some unittest in event_test and resource_test +- MS-525 Disable parallel reduce in SearchTask +- MS-527 Update scheduler_test and enable it +- MS-528 Hide some config used future +- MS-530 Add unittest for SearchTask->Load +- MS-531 Disable next version code +- MS-533 Update resource_test to cover dump function +- MS-523 Config file validation +- MS-539 Remove old task code +- MS-546 Add simple mode resource_config +- MS-570 Add prometheus docker-compose file +- MS-576 Scheduler refactor +- MS-592 Change showtables stream transport to unary + +## Feature +- MS-343 Implement ResourceMgr +- MS-338 NewAPI: refine code to support CreateIndex +- MS-339 NewAPI: refine code to support DropIndex +- MS-340 NewAPI: implement DescribeIndex + +## Task +- MS-297 disable mysql unit test + +# Milvus 0.3.1 (2019-07-10) + +## Bug + +- MS-148 Disable cleanup if mode is read only +- MS-149 Fixed searching only one index file issue in distributed mode +- MS-153 Fix c_str error when connecting to MySQL +- MS-157 Fix changelog +- MS-190 Use env variable to switch mem manager and fix cmake +- MS-217 Fix SQ8 row count bug +- MS-224 Return AlreadyExist status in MySQLMetaImpl::CreateTable if table already exists +- MS-232 Add MySQLMetaImpl::UpdateTableFilesToIndex and set maximum_memory to default if config value = 0 +- MS-233 Remove mem manager log +- MS-230 Change parameter name: Maximum_memory to insert_buffer_size +- MS-234 Some case cause background merge thread stop +- MS-235 Some test cases random fail +- MS-236 Add MySQLMetaImpl::HasNonIndexFiles +- MS-257 Update bzip2 download url +- MS-288 Update compile scripts +- MS-330 Stability test failed caused by server core dumped +- MS-347 Build index hangs again +- MS-382 fix MySQLMetaImpl::CleanUpFilesWithTTL unknown column bug + +## Improvement +- MS-156 Add unittest for merge result functions +- MS-152 Delete assert in MySQLMetaImpl and change MySQLConnectionPool impl +- MS-204 Support multi db_path +- MS-206 Support SQ8 index type +- MS-208 Add buildinde interface for C++ SDK +- MS-212 Support Inner product metric type +- MS-241 Build Faiss with MKL if using Intel CPU; else build with OpenBlas +- MS-242 Clean up cmake and change MAKE_BUILD_ARGS to be user defined variable +- MS-245 Improve search result transfer performance +- MS-248 Support AddVector/SearchVector profiling +- MS-256 Add more cache config +- MS-260 Refine log +- MS-249 Check machine hardware during initialize +- MS-261 Update faiss version to 1.5.3 and add BUILD_FAISS_WITH_MKL as an option +- MS-266 Improve topk reduce time by using multi-threads +- MS-275 Avoid sqlite logic error excetion +- MS-278 add IndexStatsHelper +- MS-313 add GRPC +- MS-325 add grpc status return for C++ sdk and modify some format +- MS-278 Add IndexStatsHelper +- MS-312 Set openmp thread number by config +- MS-305 Add CPU core percent metric +- MS-310 Add milvus CPU utilization ratio and CPU/GPU temperature metrics +- MS-324 Show error when there is not enough gpu memory to build index +- MS-328 Check metric type on server start +- MS-332 Set grpc and thrift server run concurrently +- MS-352 Add hybrid index + +## Feature +- MS-180 Add new mem manager +- MS-195 Add nlist and use_blas_threshold conf +- MS-137 Integrate knowhere + +## Task + +- MS-125 Create 0.3.1 release branch +- MS-306 Optimize build efficiency + +# Milvus 0.3.0 (2019-06-30) + +## Bug +- MS-104 Fix unittest lcov execution error +- MS-102 Fix build script file condition error +- MS-80 Fix server hang issue +- MS-89 Fix compile failed, libgpufaiss.a link missing +- MS-90 Fix arch match incorrect on ARM +- MS-99 Fix compilation bug +- MS-110 Avoid huge file size + +## Improvement +- MS-82 Update server startup welcome message +- MS-83 Update vecwise to Milvus +- MS-77 Performance issue of post-search action +- MS-22 Enhancement for MemVector size control +- MS-92 Unify behavior of debug and release build +- MS-98 Install all unit test to installation directory +- MS-115 Change is_startup of metric_config switch from true to on +- MS-122 Archive criteria config +- MS-124 HasTable interface +- MS-126 Add more error code +- MS-128 Change default db path + +## Feature + +- MS-57 Implement index load/search pipeline +- MS-56 Add version information when server is started +- MS-64 Different table can have different index type +- MS-52 Return search score +- MS-66 Support time range query +- MS-68 Remove rocksdb from third-party +- MS-70 cmake: remove redundant libs in src +- MS-71 cmake: fix faiss dependency +- MS-72 cmake: change prometheus source to git +- MS-73 cmake: delete civetweb +- MS-65 Implement GetTableRowCount interface +- MS-45 Implement DeleteTable interface +- MS-75 cmake: change faiss version to 1.5.2; add CUDA gencode +- MS-81 Fix faiss ptx issue; change cuda gencode +- MS-84 cmake: add arrow, jemalloc and jsoncons third party; default build option OFF +- MS-85 add NetIO metric +- MS-96 add new query interface for specified files +- MS-97 Add S3 SDK for MinIO Storage +- MS-105 Add MySQL +- MS-130 Add prometheus_test +- MS-144 Add nprobe config +- MS-147 Enable IVF +- MS-130 Add prometheus_test + +## Task +- MS-74 Change README.md in cpp +- MS-88 Add support for arm architecture + +# Milvus 0.2.0 (2019-05-31) + +## Bug + +- MS-32 Fix thrift error +- MS-34 Fix prometheus-cpp thirdparty +- MS-67 Fix license check bug +- MS-76 Fix pipeline crash bug +- MS-100 CMake: fix AWS build issue +- MS-101 Change AWS build type to Release + +## Improvement + +- MS-20 Clean Code Part 1 + +## Feature + +- MS-5 Implement Auto Archive Feature +- MS-6 Implement SDK interface part 1 +- MS-16 Implement metrics without prometheus +- MS-21 Implement SDK interface part 2 +- MS-26 CMake. Add thirdparty packages +- MS-31 CMake: add prometheus +- MS-33 CMake: add -j4 to make third party packages build faster +- MS-27 Support gpu config and disable license build config in cmake +- MS-47 Add query vps metrics +- MS-37 Add query, cache usage, disk write speed and file data size metrics +- MS-30 Use faiss v1.5.2 +- MS-54 CMake: Change Thrift third party URL to github.com +- MS-69 Prometheus: add all proposed metrics + +## Task + +- MS-1 Add CHANGELOG.md +- MS-4 Refactor the vecwise_engine code structure +- MS-62 Search range to all if no date specified diff --git a/CODE_OF_CONDUCT.md b/CODE_OF_CONDUCT.md new file mode 100644 index 0000000000..296d5d9a07 --- /dev/null +++ b/CODE_OF_CONDUCT.md @@ -0,0 +1,48 @@ +# Milvus Code of Conduct + +In the interest of fostering an open and welcoming environment, we as contributors and maintainers pledge to making participation in our project and our community a harassment-free experience for everyone, regardless of age, body size, disability, ethnicity, gender identity and expression, level of experience, nationality, personal appearance, race, religion, or sexual identity and orientation. + +## Our Standards + +Examples of behavior that contributes to creating a positive environment include: + +- Using welcoming and inclusive language. +- Being respectful of differing viewpoints and experiences. +- Gracefully accepting constructive criticism. +- Focusing on what is best for the community. +- Showing empathy towards other community members. + +Examples of unacceptable behavior by participants include: + +- The use of sexualized language or imagery and unwelcome sexual attention or advances. +- Trolling, insulting/derogatory comments, and personal or political attacks. +- Public or private harassment. +- Publishing others' private information, such as a physical or electronic address, without explicit permission. +- Conduct which could reasonably be considered inappropriate for the forum in which it occurs. + +All Milvus forums and spaces are meant for professional interactions, and any behavior which could reasonably be considered inappropriate in a professional setting is unacceptable. + +## Our Responsibilities + +Project maintainers are responsible for clarifying the standards of acceptable behavior and are expected to take appropriate and fair corrective action in response to any instances of unacceptable behavior. + +Project maintainers have the right and responsibility to remove, edit, or reject comments, commits, code, wiki edits, issues, and other contributions that are not aligned to this Code of Conduct, or to ban temporarily or permanently any contributor for other behaviors that they deem inappropriate, threatening, offensive, or harmful. + +## Scope + +This Code of Conduct applies to all content on milvus.io, Milvus’s GitHub organization, or any other official Milvus web presence allowing for community interactions, as well as at all official Milvus events, whether offline or online. + +The Code of Conduct also applies within all project spaces and in public spaces whenever an individual is representing Milvus or its community. Examples of representing a project or community include using an official project e-mail address, posting via an official social media account, or acting as an appointed or de facto representative at an online or offline event. Representation of Milvus may be further defined and clarified by project maintainers. + +## Enforcement + +Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by contacting the project team at conduct@lfai.foundation. All complaints will be reviewed and investigated and will result in a response that is deemed necessary and appropriate to the circumstances. The project team is obligated to maintain confidentiality with regard to the reporter of an incident. Further details of specific enforcement policies may be posted separately. + +Project maintainers who do not follow or enforce the Code of Conduct in good faith may face temporary or permanent repercussions as determined by other members of the project's leadership. + +## Attribution + +This Code of Conduct is adapted from the [Contributor Covenant](https://www.contributor-covenant.org/), version 1.4, available at https://www.contributor-covenant.org/version/1/4/code-of-conduct.html + +For answers to common questions about this code of conduct, see https://www.contributor-covenant.org/faq + diff --git a/COMMUNITY.md b/COMMUNITY.md new file mode 100644 index 0000000000..ed96c0840e --- /dev/null +++ b/COMMUNITY.md @@ -0,0 +1,99 @@ +# Community Roles + + + +- [TSC Members](#tsc-members) +- [Committers](#committers) +- [Reviewers](#reviewers) +- [Contributors](#contributors) + + + +## TSC Members + +The Technical Steering Committee (TSC) functions as the core management team that oversees the Milvus community. The TSC decides the roadmap of the project and makes major decisions related to the community. + +TSC members have the following responsibilities: + +- Coordinate the technical direction of the project. +- Approve project or system proposals. +- Decide on formal releases of the project's software. +- Organize and remove sub-projects. +- Coordinate any marketing, events, or communications regarding the project. + +To become a TSC member, a committer must be sponsored by a TSC member and the sponsorship must be approved by 2/3 of all TSC members. + +The TSC members are listed below: + +- [cxie](https://github.com/cxie) +- [JinHai-CN](https://github.com/JinHai-CN) + +## Committers + +Committers lead project feature discussions and oversee the overall project quality. + +Committers have the following responsibilities: + +- Lead feature design discussions and implementation. +- Ensure the overall project quality and approve PRs. +- Participate in product release, feature planning, and roadmap design. +- Have a constructive and friendly attitude in all community interactions. +- Mentor reviewers and contributors. + +To become a committer, a reviewer must have contributed broadly throughout the Milvus project. A reviewer must also be sponsored by a committer and the sponsorship must be approved by the TSC. + +The committers are listed below: + +- [JinHai-CN](https://github.com/JinHai-CN) + +## Reviewers + +Reviewers review new code contributions and ensure the quality of existing code. + +Reviewers have the following responsibilities: + +- Participate in feature design discussion and implementation. +- Ensure the quality of owned code modules. +- Ensure the technical accuracy of documentation. +- Quickly respond to issues and PRs and conduct code reviews. + +To become a reviewer, a contributor must have provided continued and quality contribution to the Milvus project for at least 6 months and have contributed at least one major component where the contributor has taken an ownership role. + +The reviewers are listed below: + +- [XuPeng-SH](https://github.com/XuPeng-SH) +- [yhmo](https://github.com/yhmo) +- [scsven](https://github.com/scsven) +- [cydrain](https://github.com/cydrain) +- [shengjun1985](https://github.com/shengjun1985) + +## Contributors + +Contributors can be anyone who has successfully submitted at least one PR to the Milvus project. + +The contributors are listed below: + +- [ZhifengZhang-CN](https://github.com/ZhifengZhang-CN) +- [tinkerlin](https://github.com/tinkerlin) +- [youny626](https://github.com/youny626) +- [fishpenguin](https://github.com/fishpenguin) +- [BossZou](https://github.com/BossZou) +- [del-zhenwu](https://github.com/del-zhenwu) +- [jielinxu](https://github.com/jielinxu) +- [yamasite](https://github.com/yamasite) +- [Yukikaze-CZR](https://github.com/Yukikaze-CZR) +- [Heisenberg-Y](https://github.com/Heisenberg-Y) +- [sahuang](https://github.com/sahuang) +- [op-hunter](https://github.com/op-hunter) +- [GuanyunFeng](https://github.com/GuanyunFeng) +- [thywdy](https://github.com/thywdy) +- [erdustiggen](https://github.com/erdustiggen) +- [akihoni](https://github.com/akihoni) +- [shiyu22](https://github.com/shiyu22) +- [shengjh](https://github.com/shengjh) +- [dvzubarev](https://github.com/dvzubarev) +- [aaronjin2010](https://github.com/aaronjin2010) +- [ReigenAraka](https://github.com/ReigenAraka) +- [JackLCL](https://github.com/JackLCL) +- [Bennu-Li](https://github.com/Bennu-Li) +- [ABNER-1](https://github.com/ABNER-1) diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md new file mode 100644 index 0000000000..c5eb46a25c --- /dev/null +++ b/CONTRIBUTING.md @@ -0,0 +1,128 @@ +# Contributing to Milvus + +Contributions to Milvus are welcome from everyone. We strive to make the contribution process simple and straightforward. Up-to-date information can be found at [milvus.io](https://milvus.io/). + +The following are a set of guidelines for contributing to Milvus. Following these guidelines makes contributing to this project easy and transparent. These are mostly guidelines, not rules. Use your best judgment, and feel free to propose changes to this document in a pull request. + +As for everything else in the project, the contributions to Milvus are governed by our [Code of Conduct](CODE_OF_CONDUCT.md). + + +## What contributions can I make? + +Contributions to Milvus fall into the following categories. + +1. To report a bug or a problem with documentation, please file an [issue](https://github.com/milvus-io/milvus/issues/new/choose) providing the details of the problem. If you believe that the issue needs priority attention, please comment on the issue to notify the team. +2. To propose a new feature, please file a new feature request [issue](https://github.com/milvus-io/milvus/issues/new/choose). Describe the intended feature and discuss the design and implementation with the team and community. Once the team agrees on the plan, you can follow the [Contributing code](CONTRIBUTING.md#contributing-code) to implement it. +3. To implement a feature or bug-fix for an existing outstanding issue, follow the [Contributing code](CONTRIBUTING.md#contributing-code). If you need more context on a particular issue, comment on the issue to let people know. + +## How can I contribute? + +### Contributing code + +If you have improvements to Milvus, send us your pull requests! For those just getting started, see [GitHub workflow](#github-workflow). Make sure to refer to the related issue in the ccomment of your pull request and update [CHANGELOG.md](CHANGELOG.md). + +All submissions will be reviewed as quickly as possible. Once it is accepted, the status of the project to which it is associated will be changed to **Reviewer approved**. This means we are working on submitting your pull request to the internal repository. After the change has been submitted internally, your pull request will be merged automatically on GitHub. + +### GitHub workflow + +Generally, we follow the "fork-and-pull" Git workflow. + +1. [Fork](https://docs.github.com/en/github/getting-started-with-github/fork-a-repo) the repository on GitHub. +2. Clone your fork to your local machine with `git clone git@github.com:/milvus.git`. +3. Create a branch with `git checkout -b my-topic-branch`. +4. [Commit](https://docs.github.com/en/github/collaborating-with-issues-and-pull-requests/committing-changes-to-a-pull-request-branch-created-from-a-fork) changes to your own branch, then push to to GitHub with `git push --set-upstream origin my-topic-branch`. You must record your changes in [CHANGELOG.md](CHANGELOG.md) with issue numbers and descriptions. +5. Submit a [pull request](https://docs.github.com/en/github/collaborating-with-issues-and-pull-requests/about-pull-requests) so that we can review your changes. + +Remember to [sync your forked repository](https://docs.github.com/en/github/getting-started-with-github/fork-a-repo#keep-your-fork-synced) *before* submitting proposed changes upstream. If you have an existing local repository, please update it before you start, to minimize the chance of merge conflicts. + +```shell +git remote add upstream git@github.com:milvus-io/milvus.git +git checkout master +git pull upstream master +git checkout -b my-topic-branch +``` + +### General guidelines + +Before submitting your pull requests for review, make sure that your changes are consistent with the [coding style](CONTRIBUTING.md#coding-style), and run [unit tests](CONTRIBUTING.md#run-unit-test-with-code-coverage) to check your code coverage rate. + +- Include unit tests when you contribute new features, as they help to prove that your code works correctly, and also guard against future breaking changes to lower the maintenance cost. +- Bug fixes also require unit tests, because the presence of bugs usually indicates insufficient test coverage. +- Keep API compatibility in mind when you change code in Milvus. Reviewers of your pull request will comment on any API compatibility issues. +- When you contribute a new feature to Milvus, the maintenance burden is (by default) transferred to the Milvus team. This means that the benefit of the contribution must be compared against the cost of maintaining the feature. + +### Developer Certificate of Origin (DCO) + +All contributions to this project must be accompanied by acknowledgment of, and agreement to, the [Developer Certificate of Origin](https://developercertificate.org/). Acknowledgment of and agreement to the Developer Certificate of Origin _must_ be included in the comment section of each contribution and _must_ take the form of `Signed-off-by: {{Full Name}} <{{email address}}>` (without the `{}`). Contributions without this acknowledgment will be required to add it before being accepted. If contributors are unable or unwilling to agree to the Developer Certificate of Origin, their contribution will not be included. + +Contributors sign-off that they adhere to DCO by adding the following Signed-off-by line to commit messages: + +```text +This is my commit message + +Signed-off-by: Random J Developer +``` + +Git also has a `-s` command line option to append this automatically to your commit message: + +```shell +$ git commit -s -m 'This is my commit message' +``` + +## Coding Style +The coding style used in Milvus generally follow [Google C++ Style Guide](https://google.github.io/styleguide/cppguide.html). +And we made the following changes based on the guide: + +- 4 spaces for indentation +- Adopt .cpp file extension instead of .cc extension +- 120-character line length +- Camel-Cased file names + +### Format code + +Install clang-format +```shell +$ sudo apt-get install clang-format +$ rm cmake_build/CMakeCache.txt +``` +Check code style +```shell +$ ./build.sh -l +``` +To format the code +```shell +$ cd cmake_build +$ make clang-format +``` + +## Run unit test with code coverage + +Before submitting your Pull Request, make sure you have run unit test, and your code coverage rate is >= 90%. + +Install lcov +```shell +$ sudo apt-get install lcov +``` +Run unit test and generate code for code coverage check +```shell +$ ./build.sh -u -c +``` + +Run MySQL docker +```shell +docker pull mysql:latest +docker run -p 3306:3306 -e MYSQL_ROOT_PASSWORD=123456 -d mysql:latest +``` + +Run code coverage + +```shell +$ ./coverage.sh -u root -p 123456 -t 127.0.0.1 +``` + +Or start your own MySQL server, and then run code coverage + +```shell +$ ./coverage.sh -u ${MYSQL_USERNAME} -p ${MYSQL_PASSWORD} -t ${MYSQL_SERVER_IP} +``` + diff --git a/DESIGN.md b/DESIGN.md new file mode 100644 index 0000000000..a1d7e2736e --- /dev/null +++ b/DESIGN.md @@ -0,0 +1,13 @@ +# Milvus Design Documents + +If you would like to propose a new feature for Milvus, it is recommended that you submit a design document following the [design document template](https://docs.google.com/document/d/1KhWs9b53K6WQUZ_FGWhIaeTraaSqsLQF7v2v68sAh_c/edit?usp=sharing). + +The following list contains existing design documents for Milvus. + +- [Support DSL interface](https://docs.google.com/document/d/1U83LY36TyaG3WD67Q9HWg9saD3qQcz9BfMcEScgwQPM/edit?usp=sharing) +- [Support delete/search by ID, attribute filtering, ID de-duplication](https://docs.google.com/document/d/1CDKdTj_DnE90YaZrPgsMaphqOTkMdbKETNrsFKj_Bco/edit?usp=sharing) +- [Support write-ahead logging](https://docs.google.com/document/d/12N8RC_wJb2dvEKY9jrlh8hU_eH8jxQVBewoPuHNqcXE/edit?usp=sharing) +- [Support in-service config modification](https://docs.google.com/document/d/1pK1joWJgAHM5nVp3q005iLbLqU5bn9InWeBy0mRAoSg/edit?usp=sharing) +- [Support Multi-Storage](https://docs.google.com/document/d/1iwwLH4Jtm3OXIVb7jFYsfmcbOyX6AWZKaNJAaXC7-cw/edit?usp=sharing) +- [Support AVX-512](https://docs.google.com/document/d/1do6_JgRCYdcV95sTPE6rLoiBK8wAcZki5Ypp7jbgqK0/edit?usp=sharing) +- [Refactor Knowhere](https://docs.google.com/document/d/1HY27EXV4UjJhDEmJ9t4Rjh7I1sB8iJHvqvliM6HHLS8/edit?usp=sharing) diff --git a/INSTALL.md b/INSTALL.md new file mode 100644 index 0000000000..5a2c029214 --- /dev/null +++ b/INSTALL.md @@ -0,0 +1,317 @@ +# Install Milvus from Source Code + + + +- [Build from source](#build-from-source) + + - [Requirements](#requirements) + + - [Compilation](#compilation) + + - [Launch Milvus server](#launch-milvus-server) + +- [Compile Milvus on Docker](#compile-milvus-on-docker) + + - [Step 1 Pull Milvus Docker images](#step-1-pull-milvus-docker-images) + + - [Step 2 Start the Docker container](#step-2-start-the-docker-container) + + - [Step 3 Download Milvus source code](#step-3-download-milvus-source-code) + + - [Step 4 Compile Milvus in the container](#step-4-compile-milvus-in-the-container) + +- [Troubleshooting](#troubleshooting) + + - [Error message: `protocol https not supported or disabled in libcurl`](#error-message-protocol-https-not-supported-or-disabled-in-libcurl) + + - [Error message: `internal compiler error`](#error-message-internal-compiler-error) + + - [Error message: `error while loading shared libraries: libmysqlpp.so.3`](#error-message-error-while-loading-shared-libraries-libmysqlppso3) + + - [CMake version is not supported](#cmake-version-is-not-supported) + + + +## Build from source + +### Requirements + +- Operating system + + - Ubuntu 18.04 or higher + + - CentOS 7 + + > Note: If your Linux operating system does not meet the requirements, we recommend that you pull a Docker image of [Ubuntu 18.04](https://docs.docker.com/install/linux/docker-ce/ubuntu/) or [CentOS 7](https://docs.docker.com/install/linux/docker-ce/centos/) as your compilation environment. + +- GCC 7.0 or higher to support C++ 17 + +- CMake 3.14 or higher + +- Git + +For GPU-enabled version, you will also need: + +- CUDA 10.x (10.0, 10.1, 10.2) + +- NVIDIA driver 418 or higher + +### Compilation + +#### Step 1 Download Milvus source code and specify version + +Download Milvus source code, change directory and specify version (for example, 0.10.3): + +```shell +$ git clone https://github.com/milvus-io/milvus +$ cd ./milvus/core +$ git checkout 0.10.3 +``` + +#### Step 2 Install dependencies + +##### Install in Ubuntu + +```shell +$ ./ubuntu_build_deps.sh +``` + +##### Install in CentOS + +```shell +$ ./centos7_build_deps.sh +``` + +#### Step 3 Build Milvus source code + +If you want to use CPU-only: + +run `build.sh`: + +```shell +$ ./build.sh -t Release +``` + +If you want to use GPU-enabled: + +1. Add cuda library path to `LD_LIBRARY_PATH`: + +```shell +$ export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH +``` + +2. Add cuda binary path to `PATH`: + +```shell +$ export PATH=/usr/local/cuda/bin:$PATH +``` + +3. Add a `-g` parameter to run `build.sh`: + +```shell +$ ./build.sh -g -t Release +``` + +When the build completes, everything that you need to run Milvus is under `[Milvus root path]/core/milvus`. + +### Launch Milvus server + +```shell +$ cd [Milvus root path]/core/milvus +``` + +Add `lib/` directory to `LD_LIBRARY_PATH` + +```shell +$ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:[Milvus root path]/core/milvus/lib +``` + +Then start Milvus server: + +```shell +$ cd scripts +$ ./start_server.sh +``` + +To stop Milvus server, run: + +```shell +$ ./stop_server.sh +``` + +## Compile Milvus on Docker + +With the following Docker images, you should be able to compile Milvus on any Linux platform that runs Docker. To build a GPU supported Milvus, you need to install [NVIDIA Docker](https://github.com/NVIDIA/nvidia-docker/) first. + +### Step 1 Pull Milvus Docker images + +#### Ubuntu 18.04 + +Pull CPU-only image: + +```shell +$ docker pull milvusdb/milvus-cpu-build-env:v0.7.0-ubuntu18.04 +``` + +Pull GPU-enabled image: + +```shell +$ docker pull milvusdb/milvus-gpu-build-env:v0.7.0-ubuntu18.04 +``` + +#### CentOS 7 + +Pull CPU-only image: + +```shell +$ docker pull milvusdb/milvus-cpu-build-env:v0.7.0-centos7 +``` + +Pull GPU-enabled image: + +```shell +$ docker pull milvusdb/milvus-gpu-build-env:v0.7.0-centos7 +``` + +### Step 2 Start the Docker container + +Start a CPU-only container: + +```shell +$ docker run -it -p 19530:19530 -d +``` + +Start a GPU container: + +- For nvidia docker 2: + +```shell +$ docker run --runtime=nvidia -it -p 19530:19530 -d +``` + +- For nvidia container toolkit: + +```shell +docker run --gpus all -it -p 19530:19530 -d +``` + +To enter the container: + +```shell +$ docker exec -it [container_id] bash +``` + +### Step 3 Download Milvus source code + +Download Milvus source code: + +```shell +$ git clone https://github.com/milvus-io/milvus +``` + +To enter its core directory: + +```shell +$ cd ./milvus/core +``` + +Specify version (for example, 0.10.3): + +```shell +$ git checkout 0.10.3 +``` + +### Step 4 Compile Milvus in the container + +If you are using a CPU-only image: + +1. run `build.sh`: + +```shell +$ ./build.sh -t Release +``` + +2. Start Milvus server: + +```shell +$ ./start_server.sh +``` + +If you are using a GPU-enabled image: + +1. Add cuda library path to `LD_LIBRARY_PATH`: + +```shell +$ export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH +``` + +2. Add cuda binary path to `PATH`: + +```shell +$ export PATH=/usr/local/cuda/bin:$PATH +``` + +3. Add a `-g` parameter to run `build.sh`: + +```shell +$ ./build.sh -g -t Release +``` + +4. Start Milvus server: + +```shell +$ ./start_server.sh +``` + +## Troubleshooting + +### Error message: `protocol https not supported or disabled in libcurl` + +Follow the steps below to solve this problem: + +1. Make sure you have `libcurl4-openssl-dev` installed in your system. +2. Try reinstalling the latest CMake from source with `--system-curl` option: + + ```shell + $ ./bootstrap --system-curl + $ make + $ sudo make install + ``` + + If the `--system-curl` command doesn't work, you can also reinstall CMake in **Ubuntu Software** on your local computer. + +### Error message: `internal compiler error` + +Try increasing the memory allocated to Docker. If this doesn't work, you can reduce the number of threads in CMake build in `[Milvus root path]/core/build.sh`. + +```shell +make -j 8 install || exit 1 # The default number of threads is 8. +``` + +Note: You might also need to configure CMake build for faiss in `[Milvus root path]/core/src/index/thirdparty/faiss`. + +### Error message: `error while loading shared libraries: libmysqlpp.so.3` + +Follow the steps below to solve this problem: + +1. Check whether `libmysqlpp.so.3` is correctly installed. +2. If `libmysqlpp.so.3` is installed, check whether it is added to `LD_LIBRARY_PATH`. + +### CMake version is not supported + +Follow the steps below to install a supported version of CMake: + +1. Remove the unsupported version of CMake. +2. Get CMake 3.14 or higher. Here we get CMake 3.14. + + ```shell + $ wget https://cmake.org/files/v3.14/cmake-3.14.7-Linux-x86_64.tar.gz + ``` + +3. Extract the file and install CMake. + + ```shell + $ tar zxvf cmake-3.14.7-Linux-x86_64.tar.gz + $ mv cmake-3.14.7-Linux-x86_64 /opt/cmake-3.14.7 + $ ln -sf /opt/cmake-3.14.7/bin/* /usr/bin/ + ``` diff --git a/LICENSE b/LICENSE new file mode 100644 index 0000000000..261eeb9e9f --- /dev/null +++ b/LICENSE @@ -0,0 +1,201 @@ + Apache License + Version 2.0, January 2004 + http://www.apache.org/licenses/ + + TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION + + 1. Definitions. + + "License" shall mean the terms and conditions for use, reproduction, + and distribution as defined by Sections 1 through 9 of this document. + + "Licensor" shall mean the copyright owner or entity authorized by + the copyright owner that is granting the License. + + "Legal Entity" shall mean the union of the acting entity and all + other entities that control, are controlled by, or are under common + control with that entity. For the purposes of this definition, + "control" means (i) the power, direct or indirect, to cause the + direction or management of such entity, whether by contract or + otherwise, or (ii) ownership of fifty percent (50%) or more of the + outstanding shares, or (iii) beneficial ownership of such entity. + + "You" (or "Your") shall mean an individual or Legal Entity + exercising permissions granted by this License. + + "Source" form shall mean the preferred form for making modifications, + including but not limited to software source code, documentation + source, and configuration files. + + "Object" form shall mean any form resulting from mechanical + transformation or translation of a Source form, including but + not limited to compiled object code, generated documentation, + and conversions to other media types. + + "Work" shall mean the work of authorship, whether in Source or + Object form, made available under the License, as indicated by a + copyright notice that is included in or attached to the work + (an example is provided in the Appendix below). + + "Derivative Works" shall mean any work, whether in Source or Object + form, that is based on (or derived from) the Work and for which the + editorial revisions, annotations, elaborations, or other modifications + represent, as a whole, an original work of authorship. For the purposes + of this License, Derivative Works shall not include works that remain + separable from, or merely link (or bind by name) to the interfaces of, + the Work and Derivative Works thereof. + + "Contribution" shall mean any work of authorship, including + the original version of the Work and any modifications or additions + to that Work or Derivative Works thereof, that is intentionally + submitted to Licensor for inclusion in the Work by the copyright owner + or by an individual or Legal Entity authorized to submit on behalf of + the copyright owner. For the purposes of this definition, "submitted" + means any form of electronic, verbal, or written communication sent + to the Licensor or its representatives, including but not limited to + communication on electronic mailing lists, source code control systems, + and issue tracking systems that are managed by, or on behalf of, the + Licensor for the purpose of discussing and improving the Work, but + excluding communication that is conspicuously marked or otherwise + designated in writing by the copyright owner as "Not a Contribution." + + "Contributor" shall mean Licensor and any individual or Legal Entity + on behalf of whom a Contribution has been received by Licensor and + subsequently incorporated within the Work. + + 2. Grant of Copyright License. Subject to the terms and conditions of + this License, each Contributor hereby grants to You a perpetual, + worldwide, non-exclusive, no-charge, royalty-free, irrevocable + copyright license to reproduce, prepare Derivative Works of, + publicly display, publicly perform, sublicense, and distribute the + Work and such Derivative Works in Source or Object form. + + 3. Grant of Patent License. Subject to the terms and conditions of + this License, each Contributor hereby grants to You a perpetual, + worldwide, non-exclusive, no-charge, royalty-free, irrevocable + (except as stated in this section) patent license to make, have made, + use, offer to sell, sell, import, and otherwise transfer the Work, + where such license applies only to those patent claims licensable + by such Contributor that are necessarily infringed by their + Contribution(s) alone or by combination of their Contribution(s) + with the Work to which such Contribution(s) was submitted. If You + institute patent litigation against any entity (including a + cross-claim or counterclaim in a lawsuit) alleging that the Work + or a Contribution incorporated within the Work constitutes direct + or contributory patent infringement, then any patent licenses + granted to You under this License for that Work shall terminate + as of the date such litigation is filed. + + 4. Redistribution. You may reproduce and distribute copies of the + Work or Derivative Works thereof in any medium, with or without + modifications, and in Source or Object form, provided that You + meet the following conditions: + + (a) You must give any other recipients of the Work or + Derivative Works a copy of this License; and + + (b) You must cause any modified files to carry prominent notices + stating that You changed the files; and + + (c) You must retain, in the Source form of any Derivative Works + that You distribute, all copyright, patent, trademark, and + attribution notices from the Source form of the Work, + excluding those notices that do not pertain to any part of + the Derivative Works; and + + (d) If the Work includes a "NOTICE" text file as part of its + distribution, then any Derivative Works that You distribute must + include a readable copy of the attribution notices contained + within such NOTICE file, excluding those notices that do not + pertain to any part of the Derivative Works, in at least one + of the following places: within a NOTICE text file distributed + as part of the Derivative Works; within the Source form or + documentation, if provided along with the Derivative Works; or, + within a display generated by the Derivative Works, if and + wherever such third-party notices normally appear. The contents + of the NOTICE file are for informational purposes only and + do not modify the License. You may add Your own attribution + notices within Derivative Works that You distribute, alongside + or as an addendum to the NOTICE text from the Work, provided + that such additional attribution notices cannot be construed + as modifying the License. + + You may add Your own copyright statement to Your modifications and + may provide additional or different license terms and conditions + for use, reproduction, or distribution of Your modifications, or + for any such Derivative Works as a whole, provided Your use, + reproduction, and distribution of the Work otherwise complies with + the conditions stated in this License. + + 5. Submission of Contributions. Unless You explicitly state otherwise, + any Contribution intentionally submitted for inclusion in the Work + by You to the Licensor shall be under the terms and conditions of + this License, without any additional terms or conditions. + Notwithstanding the above, nothing herein shall supersede or modify + the terms of any separate license agreement you may have executed + with Licensor regarding such Contributions. + + 6. Trademarks. This License does not grant permission to use the trade + names, trademarks, service marks, or product names of the Licensor, + except as required for reasonable and customary use in describing the + origin of the Work and reproducing the content of the NOTICE file. + + 7. Disclaimer of Warranty. Unless required by applicable law or + agreed to in writing, Licensor provides the Work (and each + Contributor provides its Contributions) on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or + implied, including, without limitation, any warranties or conditions + of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A + PARTICULAR PURPOSE. You are solely responsible for determining the + appropriateness of using or redistributing the Work and assume any + risks associated with Your exercise of permissions under this License. + + 8. Limitation of Liability. In no event and under no legal theory, + whether in tort (including negligence), contract, or otherwise, + unless required by applicable law (such as deliberate and grossly + negligent acts) or agreed to in writing, shall any Contributor be + liable to You for damages, including any direct, indirect, special, + incidental, or consequential damages of any character arising as a + result of this License or out of the use or inability to use the + Work (including but not limited to damages for loss of goodwill, + work stoppage, computer failure or malfunction, or any and all + other commercial damages or losses), even if such Contributor + has been advised of the possibility of such damages. + + 9. Accepting Warranty or Additional Liability. While redistributing + the Work or Derivative Works thereof, You may choose to offer, + and charge a fee for, acceptance of support, warranty, indemnity, + or other liability obligations and/or rights consistent with this + License. However, in accepting such obligations, You may act only + on Your own behalf and on Your sole responsibility, not on behalf + of any other Contributor, and only if You agree to indemnify, + defend, and hold each Contributor harmless for any liability + incurred by, or claims asserted against, such Contributor by reason + of your accepting any such warranty or additional liability. + + END OF TERMS AND CONDITIONS + + APPENDIX: How to apply the Apache License to your work. + + To apply the Apache License to your work, attach the following + boilerplate notice, with the fields enclosed by brackets "[]" + replaced with your own identifying information. (Don't include + the brackets!) The text should be enclosed in the appropriate + comment syntax for the file format. We also recommend that a + file or class name and description of purpose be included on the + same "printed page" as the copyright notice for easier + identification within third-party archives. + + Copyright [yyyy] [name of copyright owner] + + Licensed under the Apache License, Version 2.0 (the "License"); + you may not use this file except in compliance with the License. + You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. diff --git a/NOTICE.md b/NOTICE.md new file mode 100644 index 0000000000..b7cf8653e6 --- /dev/null +++ b/NOTICE.md @@ -0,0 +1,24 @@ +| Name | License | +| ---------------- | -------------------------------------------------------------------------------------------------------------------------------| +| Boost | [Boost Software License](https://github.com/boostorg/boost/blob/master/LICENSE_1_0.txt) | +| FAISS | [MIT](https://github.com/facebookresearch/faiss/blob/master/LICENSE) | +| Gtest | [BSD 3-Clause](https://github.com/google/googletest/blob/master/LICENSE) | +| LAPACK | [LAPACK](https://github.com/Reference-LAPACK/lapack/blob/master/LICENSE) | +| MySQLPP | [LGPL 2.1](https://tangentsoft.com/mysqlpp/artifact/b128a66dab867923) | +| OpenBLAS | [BSD 3-Clause](https://github.com/xianyi/OpenBLAS/blob/develop/LICENSE) | +| Prometheus | [Apache 2.0](https://github.com/prometheus/prometheus/blob/master/LICENSE) | +| SQLite | [Public Domain](https://www.sqlite.org/copyright.html) | +| SQLite-ORM | [BSD 3-Clause](https://github.com/fnc12/sqlite_orm/blob/master/LICENSE) | +| yaml-cpp | [MIT](https://github.com/jbeder/yaml-cpp/blob/master/LICENSE) | +| ZLIB | [ZLIB](http://zlib.net/zlib_license.html) | +| libunwind | [MIT](https://github.com/libunwind/libunwind/blob/master/LICENSE) | +| gperftools | [BSD 3-Clause](https://github.com/gperftools/gperftools/blob/master/COPYING) | +| grpc | [Apache 2.0](https://github.com/grpc/grpc/blob/master/LICENSE) | +| EASYLOGGINGPP | [MIT](https://github.com/zuhd-org/easyloggingpp/blob/master/LICENSE) | +| Json | [MIT](https://github.com/nlohmann/json/blob/develop/LICENSE.MIT) | +| opentracing-cpp | [Apache 2.0](https://github.com/opentracing/opentracing-cpp/blob/master/LICENSE) | +| libfiu | [BOLA](https://github.com/albertito/libfiu/blob/master/LICENSE) | +| aws-sdk-cpp | [Apache 2.0](https://github.com/aws/aws-sdk-cpp/blob/master/LICENSE) | +| SPTAG | [MIT](https://github.com/microsoft/SPTAG/blob/master/LICENSE) | +| hnswlib | [Apache 2.0](https://github.com/nmslib/hnswlib/blob/master/LICENSE) | +| annoy | [Apache 2.0](https://github.com/spotify/annoy/blob/master/LICENSE) | diff --git a/README.md b/README.md index cb8698d8f5..57d0cc561f 100644 --- a/README.md +++ b/README.md @@ -1 +1,89 @@ -# Milvus Distributed +![Milvuslogo](https://github.com/milvus-io/docs/blob/master/v0.9.1/assets/milvus_logo.png) +[![Slack](https://img.shields.io/badge/Join-Slack-orange)](https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ) +![GitHub](https://img.shields.io/github/license/milvus-io/milvus) +![Docker pulls](https://img.shields.io/docker/pulls/milvusdb/milvus) + +[![Build Status](http://internal.zilliz.com:18080/jenkins/job/milvus-ci/job/master/badge/icon)](http://internal.zilliz.com:18080/jenkins/job/milvus-ci/job/master/) +[![CII Best Practices](https://bestpractices.coreinfrastructure.org/projects/3563/badge)](https://bestpractices.coreinfrastructure.org/projects/3563) +[![codecov](https://codecov.io/gh/milvus-io/milvus/branch/master/graph/badge.svg)](https://codecov.io/gh/milvus-io/milvus) +[![codebeat badge](https://codebeat.co/badges/e030a4f6-b126-4475-a938-4723d54ec3a7?style=plastic)](https://codebeat.co/projects/github-com-milvus-io-milvus-master) +[![CodeFactor Grade](https://www.codefactor.io/repository/github/milvus-io/milvus/badge)](https://www.codefactor.io/repository/github/milvus-io/milvus) +[![Codacy Badge](https://api.codacy.com/project/badge/Grade/c4bb2ccfb51b47f99e43bfd1705edd95)](https://app.codacy.com/gh/milvus-io/milvus?utm_source=github.com&utm_medium=referral&utm_content=milvus-io/milvus&utm_campaign=Badge_Grade_Dashboard) + +## What is Milvus + +As an open source vector database, Milvus is easy-to-use, highly reliable, robust, and blazing fast. Adopted by over 600 organizations and institutions worldwide, Milvus empowers applications in a variety of fields, including image processing, computer vision, natural language processing, voice recognition, recommender systems, drug discovery, and more. + +The following is Milvus architecture: + +![arch](https://github.com/milvus-io/docs/blob/master/v0.10.5/assets/milvus_arch.png) + +For more detailed introduction of Milvus and its architecture, see [Milvus overview](https://www.milvus.io/docs/overview.md). See Milvus [release notes](https://www.milvus.io/docs/release_notes.md) to keep up-to-date with its releases and updates. + +Milvus is an incubation-stage project at [LF AI & Data Foundation](https://lfaidata.foundation/). + +## Get started + +### Install Milvus + +See [Milvus install guide](https://www.milvus.io/docs/install_milvus.md) to install Milvus using Docker. To install Milvus from source code, see [build from source](INSTALL.md). + +### Try example programs + +Try an example program with Milvus using [Python](https://www.milvus.io/docs/example_code.md), [Java](https://github.com/milvus-io/milvus-sdk-java/tree/master/examples), [Go](https://github.com/milvus-io/milvus-sdk-go/tree/master/examples), or [C++ example code](https://github.com/milvus-io/milvus/tree/master/sdk/examples). + +## Supported clients + +- [Go](https://github.com/milvus-io/milvus-sdk-go) +- [Python](https://github.com/milvus-io/pymilvus) +- [Java](https://github.com/milvus-io/milvus-sdk-java) +- [C++](https://github.com/milvus-io/milvus/tree/master/sdk) +- [RESTful API](https://github.com/milvus-io/milvus/tree/master/core/src/server/web_impl) +- [Node.js](https://www.npmjs.com/package/@arkie-ai/milvus-client) (Contributed by [arkie](https://www.arkie.cn/)) + +## Application scenarios + +You can use Milvus to build intelligent systems in a variety of AI application scenarios. See [Milvus Scenarios](https://milvus.io/scenarios) for live demos. You can also see [Milvus Bootcamp](https://github.com/milvus-io/bootcamp) for detailed solutions and application scenarios. + +## Benchmark + +See our [test reports](https://github.com/milvus-io/milvus/tree/master/docs) for more information about performance benchmarking of different indexes in Milvus. + +## Roadmap + +To learn what's coming up soon in Milvus, read our [Roadmap](https://github.com/milvus-io/milvus/milestones). + +It is a Work in Progress, and is subject to reasonable adjustments when necessary. And we greatly appreciate any comments/requirements/suggestions regarding Milvus' roadmap.:clap: + +## Contribution guidelines + +Contributions are welcomed and greatly appreciated. Please read our [contribution guidelines](CONTRIBUTING.md) for detailed contribution workflow. This project adheres to the [code of conduct](CODE_OF_CONDUCT.md) of Milvus. You must abide by this code to participate. + +We use [GitHub issues](https://github.com/milvus-io/milvus/issues) to track issues and bugs. For general questions and public discussions, please join our community. + +## Join our community + +:heart: To connect with other users and contributors, you can join our [Slack channel](https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ). + +See our [community](https://github.com/milvus-io/community) repository to learn more about our governance and access more community resources. + + +## Resources + +- [Milvus.io](https://www.milvus.io) + +- [Milvus FAQ](https://www.milvus.io/docs/faq/operational_faq.md) + +- [Milvus Medium](https://medium.com/@milvusio) + +- [Milvus CSDN](https://zilliz.blog.csdn.net/) + +- [Milvus Twitter](https://twitter.com/milvusio) + +- [Milvus Facebook](https://www.facebook.com/io.milvus.5) + +- [Milvus design docs](DESIGN.md) + +## License + +[Apache License 2.0](LICENSE) diff --git a/README_CN.md b/README_CN.md new file mode 100644 index 0000000000..c62c51fef3 --- /dev/null +++ b/README_CN.md @@ -0,0 +1,95 @@ + +![Milvuslogo](https://github.com/milvus-io/docs/blob/master/v0.9.1/assets/milvus_logo.png) +[![Slack](https://img.shields.io/badge/Join-Slack-orange)](https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ) + +![GitHub](https://img.shields.io/github/license/milvus-io/milvus) +![Docker pulls](https://img.shields.io/docker/pulls/milvusdb/milvus) + +[![Build Status](http://internal.zilliz.com:18080/jenkins/job/milvus-ci/job/master/badge/icon)](http://internal.zilliz.com:18080/jenkins/job/milvus-ci/job/master/) +[![CII Best Practices](https://bestpractices.coreinfrastructure.org/projects/3563/badge)](https://bestpractices.coreinfrastructure.org/projects/3563) +[![codecov](https://codecov.io/gh/milvus-io/milvus/branch/master/graph/badge.svg)](https://codecov.io/gh/milvus-io/milvus) +[![codebeat badge](https://codebeat.co/badges/e030a4f6-b126-4475-a938-4723d54ec3a7?style=plastic)](https://codebeat.co/projects/github-com-milvus-io-milvus-master) +[![Codacy Badge](https://api.codacy.com/project/badge/Grade/c4bb2ccfb51b47f99e43bfd1705edd95)](https://app.codacy.com/gh/milvus-io/milvus?utm_source=github.com&utm_medium=referral&utm_content=milvus-io/milvus&utm_campaign=Badge_Grade_Dashboard) + +[English](README.md) | 中文版 + +# 欢迎来到 Milvus + +## Milvus 是什么 + +Milvus 是一款开源的特征向量数据库,具有使用方便、实用可靠、稳定高效和搜索迅速等特点,在全球范围内被六百家余组织和机构所采用。Milvus 已经被广泛应用于多个领域,其中包括图像处理、机器视觉、自然语言处理、语音识别、推荐系统以及新药发现等。 + +Milvus 的架构如下: + +![arch](https://github.com/milvus-io/docs/blob/master/v0.10.5/assets/milvus_arch.png) + +若要了解 Milvus 详细介绍和整体架构,请访问 [Milvus 简介](https://www.milvus.io/docs/overview.md)。你可以通过 [版本发布说明](https://www.milvus.io/docs/release_notes.md) 获取最新版本的功能和更新。 + +Milvus 是一个 [LF AI & Data 基金会](https://lfaidata.foundation/) 的孵化项目。获取更多,请访问 [lfai.foundation](https://lfaidata.foundation/)。 + +## Milvus 快速上手 + +### 安装 Milvus + +请参阅 [Milvus 安装指南](https://www.milvus.io/docs/install_milvus.md) 使用 Docker 容器安装 Milvus。若要基于源码编译,请访问 [源码安装](INSTALL.md)。 + +### 尝试示例代码 + +你可以尝试用 [Python](https://www.milvus.io/docs/example_code.md),[Java](https://github.com/milvus-io/milvus-sdk-java/tree/master/examples),[Go](https://github.com/milvus-io/milvus-sdk-go/tree/master/examples),或者 [C++](https://github.com/milvus-io/milvus/tree/master/sdk/examples) 运行 Milvus 示例代码。 + +## 支持的客户端 + +- [Go](https://github.com/milvus-io/milvus-sdk-go) +- [Python](https://github.com/milvus-io/pymilvus) +- [Java](https://github.com/milvus-io/milvus-sdk-java) +- [C++](https://github.com/milvus-io/milvus/tree/master/sdk) +- [RESTful API](https://github.com/milvus-io/milvus/tree/master/core/src/server/web_impl) +- [Node.js](https://www.npmjs.com/package/@arkie-ai/milvus-client) (由 [arkie](https://www.arkie.cn/) 提供) + +## 应用场景 + +Milvus 可以应用于多种 AI 场景。你可以访问 [Milvus 应用场景](https://milvus.io/scenarios) 体验在线场景展示。你也可以访问 [Milvus 训练营](https://github.com/milvus-io/bootcamp) 了解更详细的应用场景和解决方案。 + +## 性能基准测试 + +关于 Milvus 性能基准的更多信息,请参考 [测试报告](https://github.com/milvus-io/milvus/tree/master/docs)。 + +## 路线图 + +你可以参考我们的 [路线图](https://github.com/milvus-io/milvus/milestones),了解 Milvus 即将实现的新特性。 + +路线图尚未完成,并且可能会存在合理改动。我们欢迎各种针对路线图的意见、需求和建议。 + +## 贡献者指南 + +我们由衷欢迎你推送贡献。关于贡献流程的详细信息,请参阅 [贡献者指南](https://github.com/milvus-io/milvus/blob/master/CONTRIBUTING.md)。本项目遵循 Milvus [行为准则](https://github.com/milvus-io/milvus/blob/master/CODE_OF_CONDUCT.md)。如果你希望参与本项目,请遵守该准则的内容。 + +我们使用 [GitHub issues](https://github.com/milvus-io/milvus/issues) 追踪问题和补丁。若你希望提出问题或进行讨论,请加入我们的社区。 + +## 加入 Milvus 社区 + +欢迎加入我们的 [Slack 频道](https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ)以便与其他用户和贡献者进行交流。 + +## 加入 Milvus 技术交流微信群 + +![qrcode](https://github.com/milvus-io/docs/blob/master/v1.0.0/assets/qrcode.png) + +## 相关链接 + +- [Milvus.io](https://www.milvus.io) + +- [Milvus 常见问题](https://www.milvus.io/cn/docs/faq/operational_faq.md) + +- [Milvus Medium](https://medium.com/@milvusio) + +- [Milvus CSDN](https://zilliz.blog.csdn.net/) + +- [Milvus Twitter](https://twitter.com/milvusio) + +- [Milvus Facebook](https://www.facebook.com/io.milvus.5) + +- [Milvus 设计文档](DESIGN.md) + +## 许可协议 + +[Apache 许可协议 2.0 版](LICENSE) \ No newline at end of file diff --git a/RELEASE.md b/RELEASE.md new file mode 100644 index 0000000000..35f059d3d3 --- /dev/null +++ b/RELEASE.md @@ -0,0 +1,17 @@ +# Milvus Release Methodology and Criterias + +## Release methodology + +Milvus releases are packages that have been approved for general public release, with varying degrees of caveat regarding their perceived quality or potential for change. +They are stable releases intended for everyday usage by developers and non-developers. + +Project versioning follows the specification of [Semantic Versioning 2.0.0](https://semver.org/). + +## Release criteria + +- Milvus core test code coverage must be at least 90%. +- Reported bugs should not have any critical issues. +- All bugs, new features, enhancements must be tested. +- All documents need to be reviewed with no broken link. +- Pressure testing, stability testing, accuracy testing and performance testing results should be evaluated. + diff --git a/SECURITY.md b/SECURITY.md new file mode 100644 index 0000000000..5dd59cbf03 --- /dev/null +++ b/SECURITY.md @@ -0,0 +1,15 @@ +# Security Policy + +## Supported versions + +The following versions of Milvus are currently being supported with security updates. + +| Version | Supported | +| --------- | ------------------ | +| 0.6.0 | ✔️ | +| <= 0.5.3 | :x: | + +## Reporting a vulnerability + +To report a security vulnerability, please reach out to the Milvus team via . + diff --git a/SUPPORT.md b/SUPPORT.md new file mode 100644 index 0000000000..3cbe4ea182 --- /dev/null +++ b/SUPPORT.md @@ -0,0 +1,14 @@ +# Support for deploying and using Milvus + +We use GitHub for tracking bugs and feature requests. If you need any support for using Milvus, please refer to the following resources below. + +## Documentation +- [User Documentation](https://www.milvus.io/docs/install_milvus.md) +- [Troubleshooting Guide](https://www.milvus.io/docs/troubleshoot.md) +- [FAQ](https://www.milvus.io/docs/operational_faq.md) + +## Real-time chat +[Slack](https://join.slack.com/t/milvusio/shared_invite/enQtNzY1OTQ0NDI3NjMzLWNmYmM1NmNjOTQ5MGI5NDhhYmRhMGU5M2NhNzhhMDMzY2MzNDdlYjM5ODQ5MmE3ODFlYzU3YjJkNmVlNDQ2ZTk): The #general channel is the place where people offer support. + +## Other +[Bootcamp](https://github.com/milvus-io/bootcamp): It provides more scenario-based applications and demos of Milvus.