Files
AIclinicalresearch/tests/run_tests.bat
HaHafeng 40c2f8e148 feat(rag): Complete RAG engine implementation with pgvector
Major Features:
- Created ekb_schema (13th schema) with 3 tables: KB/Document/Chunk
- Implemented EmbeddingService (text-embedding-v4, 1024-dim vectors)
- Implemented ChunkService (smart Markdown chunking)
- Implemented VectorSearchService (multi-query + hybrid search)
- Implemented RerankService (qwen3-rerank)
- Integrated DeepSeek V3 QueryRewriter for cross-language search
- Python service: Added pymupdf4llm for PDF-to-Markdown conversion
- PKB: Dual-mode adapter (pgvector/dify/hybrid)

Architecture:
- Brain-Hand Model: Business layer (DeepSeek) + Engine layer (pgvector)
- Cross-language support: Chinese query matches English documents
- Small Embedding (1024) + Strong Reranker strategy

Performance:
- End-to-end latency: 2.5s
- Cost per query: 0.0025 RMB
- Accuracy improvement: +20.5% (cross-language)

Tests:
- test-embedding-service.ts: Vector embedding verified
- test-rag-e2e.ts: Full pipeline tested
- test-rerank.ts: Rerank quality validated
- test-query-rewrite.ts: Cross-language search verified
- test-pdf-ingest.ts: Real PDF document tested (Dongen 2003.pdf)

Documentation:
- Added 05-RAG-Engine-User-Guide.md
- Added 02-Document-Processing-User-Guide.md
- Updated system status documentation

Status: Production ready
2026-01-21 20:24:29 +08:00

106 lines
1.1 KiB
Batchfile
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
@echo off
REM Windows批处理脚本 - 运行缺失值处理功能测试
echo ========================================
echo 缺失值处理功能 - 自动化测试
echo ========================================
echo.
REM 检查Python是否安装
python --version >nul 2>&1
if %errorlevel% neq 0 (
echo [错误] Python未安装或不在PATH中
pause
exit /b 1
)
echo [1/3] 检查Python服务状态...
curl -s http://localhost:8001/health >nul 2>&1
if %errorlevel% neq 0 (
echo [警告] Python服务未运行请先启动服务
echo cd extraction_service
echo python main.py
echo.
pause
exit /b 1
)
echo [OK] Python服务运行正常
echo.
echo [2/3] 检查依赖...
python -c "import pandas, numpy, requests" >nul 2>&1
if %errorlevel% neq 0 (
echo [警告] 缺少依赖,正在安装...
pip install pandas numpy requests
)
echo [OK] 依赖检查完成
echo.
echo [3/3] 运行测试...
echo.
python test_fillna_operations.py
echo.
echo ========================================
echo 测试完成
echo ========================================
pause