milvus/cmd/tools/binlogv2
sthuang 43c3c160ff
feat: [StorageV2] cmd binlog tool (#43648)
related: #39173 

Core Features
* Parquet File Analysis: Analyze Milvus binlog Parquet files with
metadata extraction
* MinIO Integration: Direct connection to MinIO storage for remote file
analysis
* Vector Data Deserialization: Specialized handling of Milvus vector
data in binlog files
* Interactive CLI: Command-line interface with interactive exploration

Analysis Capabilities
* Metadata & Vector Analysis: Extract schema info, row counts, and
vector statistics
* Data Export: Export data to JSON format with configurable limits
* Query Functionality: Search for specific records by ID
* Batch Processing: Analyze multiple Parquet files simultaneously

User Experience
* Verbose Output: Detailed logging for debugging
* Error Handling: Robust error handling for file access and parsing
* Flexible Output: Support for single file and batch analysis formats

---------

Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>
Co-authored-by: nico <109071306+NicoYuan1986@users.noreply.github.com>
2025-07-31 15:05:37 +08:00
..
parquet_analyzer feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00
.gitignore feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00
export_to_json.py feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00
minio_client.py feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00
minio_parquet_analyzer.py feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00
parquet_analyzer_cli.py feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00
requirements.txt feat: [StorageV2] cmd binlog tool (#43648) 2025-07-31 15:05:37 +08:00