AIclinicalresearch/extraction_service/test_execute_simple.py
HaHafeng 75ceeb0653 hotfix(dc/tool-c): Fix compute formula validation and binning NaN serialization
Critical fixes:
1. Compute column: Add Chinese comma support in formula validation
   - Problem: Formula with Chinese comma failed validation
   - Fix: Add Chinese comma character to allowed_chars regex
   - Example: Support formulas like 'col1(kg)+ col2,col3'

2. Binning operation: Fix NaN serialization error
   - Problem: 'Out of range float values are not JSON compliant: nan'
   - Fix: Enhanced NaN/inf handling in binning endpoint
   - Added np.inf/-np.inf replacement before JSON serialization
   - Added manual JSON serialization with NaN->null conversion

3. Enhanced all operation endpoints for consistency
   - Updated the conditional and dropna endpoints with the same NaN/inf handling
   - Ensures all operations return JSON-compliant data
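To illustrate fix 1, here is a minimal sketch of a character-whitelist check that accepts the Chinese (full-width) comma U+FF0C. The actual pattern in compute.py is not shown on this page, so the name `ALLOWED_FORMULA_CHARS`, the helper `is_formula_allowed`, and the exact set of permitted characters are assumptions made for illustration only.

```python
import re

# Hypothetical whitelist; the real allowed_chars regex in compute.py may differ.
# \w covers letters, digits, underscore and CJK characters in Python 3 str patterns.
# \uFF0C is the Chinese (full-width) comma that the hotfix adds.
ALLOWED_FORMULA_CHARS = re.compile(r"[\w\s+\-*/().,\uFF0C]+")


def is_formula_allowed(formula: str) -> bool:
    """Return True if every character of the formula is in the whitelist."""
    return ALLOWED_FORMULA_CHARS.fullmatch(formula) is not None


print(is_formula_allowed("col1 + col2"))        # ASCII-only formula
print(is_formula_allowed("col1\uFF0Ccol2"))     # formula containing a Chinese comma
print(is_formula_allowed("col1; col2"))         # ';' is not in the whitelist
```

Using `fullmatch` rather than `match` means a single disallowed character anywhere in the formula rejects it, which is presumably the behaviour a validation step wants.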

Modified files:
- extraction_service/operations/compute.py: Add Chinese comma to regex
- extraction_service/main.py: Enhanced NaN handling in binning/conditional/dropna
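The NaN handling described in fix 2 can be sketched with the standard library alone. This is not the code from main.py (the commit mentions np.inf replacement, presumably on a DataFrame); the helper name `to_json_safe` and the sample rows are hypothetical. It shows the general idea of turning NaN and ±inf into null before serializing.

```python
import json
import math


def to_json_safe(value):
    """Recursively replace NaN and +/-inf floats with None (JSON null)."""
    if isinstance(value, float) and not math.isfinite(value):
        return None
    if isinstance(value, dict):
        return {key: to_json_safe(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [to_json_safe(item) for item in value]
    return value


# Hypothetical rows such as a binning step might produce for out-of-range values.
rows = [
    {"age": 25.0, "age_bin": float("nan")},
    {"age": float("inf"), "age_bin": 1.0},
]

# allow_nan=False makes json.dumps raise ValueError on any remaining NaN/inf
# instead of emitting the non-standard tokens NaN or Infinity.
payload = json.dumps(to_json_safe(rows), allow_nan=False)
print(payload)
```

Serializing with `allow_nan=False` acts as a safety net: if a non-finite value slips past the cleanup, the failure happens at the point of serialization rather than in the client.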

Status: Hotfix complete, ready for testing
2025-12-09 08:45:27 +08:00

"""简单的代码执行测试"""
import requests
import json
# 测试数据
test_data = [
{"patient_id": "P001", "age": 25, "gender": ""},
{"patient_id": "P002", "age": 65, "gender": ""},
{"patient_id": "P003", "age": 45, "gender": ""},
]
# 测试代码
test_code = """
df['age_group'] = df['age'].apply(lambda x: '老年' if x > 60 else '非老年')
print(f"处理完成,共 {len(df)} 行")
"""
print("=" * 60)
print("测试: Pandas代码执行")
print("=" * 60)
try:
response = requests.post(
"http://localhost:8000/api/dc/execute",
json={"data": test_data, "code": test_code},
timeout=10
)
print(f"\n状态码: {response.status_code}")
result = response.json()
print(json.dumps(result, indent=2, ensure_ascii=False))
if result.get("success"):
print("\n✅ 代码执行成功!")
print(f"结果数据: {len(result.get('result_data', []))}")
print(f"执行时间: {result.get('execution_time', 0):.3f}")
print(f"\n打印输出:\n{result.get('output', '')}")
print(f"\n结果数据示例:")
for row in result.get('result_data', [])[:3]:
print(f" {row}")
else:
print(f"\n❌ 代码执行失败: {result.get('error')}")
except Exception as e:
print(f"\n❌ 测试异常: {str(e)}")
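The script above only prints the response for manual inspection. As a possible follow-up, the same checks could be expressed as a small helper that returns a list of problems. The response fields (`success`, `error`, `result_data`) are taken from what the script reads; the helper name `check_execute_response`, its parameters, and the canned responses are hypothetical.

```python
def check_execute_response(result, expected_rows, new_column="age_group"):
    """Return a list of human-readable problems; an empty list means the response looks fine."""
    problems = []
    if not result.get("success"):
        problems.append(f"execution failed: {result.get('error')}")
        return problems

    rows = result.get("result_data", [])
    if len(rows) != expected_rows:
        problems.append(f"expected {expected_rows} rows, got {len(rows)}")

    # Every returned row should carry the column the submitted code created.
    missing = [index for index, row in enumerate(rows) if new_column not in row]
    if missing:
        problems.append(f"rows missing '{new_column}': {missing}")
    return problems


# Canned responses, shaped after the fields the script reads (assumed, not captured output).
ok_response = {
    "success": True,
    "result_data": [
        {"patient_id": "P001", "age": 25, "age_group": "non-elderly"},
        {"patient_id": "P002", "age": 65, "age_group": "elderly"},
    ],
}
failed_response = {"success": False, "error": "boom"}

print(check_execute_response(ok_response, expected_rows=2))
print(check_execute_response(failed_response, expected_rows=2))
```

Returning the problems instead of printing them keeps the check usable both from this script and from an assertion-based test runner.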