Features: - feat: Excel template generation and download (with examples) - feat: Excel file parsing in memory (cloud-native, no disk write) - feat: Field validation (title + abstract required) - feat: Smart deduplication (DOI priority + Title fallback) - feat: Literature preview table with statistics - feat: Complete submission flow (create project + import literatures) Components: - feat: Create excelUtils.ts with full Excel processing toolkit - feat: Enhance TitleScreeningSettings page with upload/preview/submit - feat: Update API interface signatures and export unified aslApi object Dependencies: - chore: Add xlsx library for Excel file processing Ref: Week 2 Frontend Development - Day 2 Scope: ASL Module MVP - Title Abstract Screening Cloud-Native: Memory parsing, no file persistence
88 lines
1.1 KiB
Markdown
88 lines
1.1 KiB
Markdown
# 医学NLP引擎
|
||
|
||
> **能力定位:** 通用能力层
|
||
> **复用率:** 14% (1个模块依赖)
|
||
> **优先级:** P2
|
||
> **状态:** ⏳ 待实现
|
||
|
||
---
|
||
|
||
## 📋 能力概述
|
||
|
||
医学NLP引擎负责:
|
||
- 医学实体识别(NER)
|
||
- 医学术语标准化
|
||
- 疾病/药物识别
|
||
|
||
---
|
||
|
||
## 📊 依赖模块
|
||
|
||
**1个模块依赖(14%复用率):**
|
||
1. **DC** - 数据清洗整理(病例数据NER提取)
|
||
|
||
---
|
||
|
||
## 💡 核心功能
|
||
|
||
### 1. 医学实体识别
|
||
- 疾病识别
|
||
- 药物识别
|
||
- 手术识别
|
||
- TNM分期提取
|
||
|
||
### 2. 术语标准化
|
||
- ICD编码
|
||
- ATC编码
|
||
|
||
### 3. 关系抽取
|
||
- 疾病-药物关系
|
||
- 症状-疾病关系
|
||
|
||
---
|
||
|
||
## 🏗️ 技术方案
|
||
|
||
### 云端版(高准确率)
|
||
```python
|
||
# 基于LLM API(Claude/GPT)
|
||
# JSON Mode结构化输出
|
||
```
|
||
|
||
### 单机版(隐私优先)
|
||
```python
|
||
# 基于spaCy + 医学模型
|
||
# 100%本地运行
|
||
```
|
||
|
||
---
|
||
|
||
## 🔗 相关文档
|
||
|
||
- [通用能力层总览](../README.md)
|
||
- [DC模块需求](../../03-业务模块/DC-数据清洗整理/README.md)
|
||
|
||
---
|
||
|
||
**最后更新:** 2025-11-06
|
||
**维护人:** 技术架构师
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|