# Medical NLP Engine
> **Capability tier:** Common Capability Layer
> **Reuse rate:** 14% (1 dependent module)
> **Priority:** P2
> **Status:** ⏳ Not yet implemented
---
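## 📋 Capability Overview
The Medical NLP Engine is responsible for:
- Medical named entity recognition (NER)
- Medical terminology normalization
- Disease and drug recognition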
---
## 📊 Dependent Modules
**1 dependent module (14% reuse rate):**
1. **DC** - Data Cleaning and Organization (NER extraction from case data)
---
## 💡 Core Features
### 1. Medical Entity Recognition
- Disease recognition
- Drug recognition
- Surgical procedure recognition
- TNM staging extraction
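
As an illustration of the TNM staging item above, here is a minimal rule-based sketch. The pattern, the `extract_tnm` name, and the returned dictionary shape are assumptions made for this example and are not part of any existing implementation. It only handles compact strings such as `pT2N1M0`, and it uses ASCII lookarounds instead of `\b` so that a stage embedded directly in Chinese text is still found.

```python
import re
from typing import Optional

# Hypothetical pattern for compact TNM strings such as "T2N1M0" or "pT1bN0M0".
# Assumptions: T, N and M are adjacent (optional spaces between them) and only
# the c/p/y prefixes are recognized.
TNM_PATTERN = re.compile(
    r"(?<![A-Za-z0-9])"              # not preceded by an ASCII letter or digit
    r"(?P<prefix>[cpy])?"
    r"T(?P<t>[0-4][a-d]?|is|x)\s*"
    r"N(?P<n>[0-3][a-c]?|x)\s*"
    r"M(?P<m>[01][a-c]?|x)"
    r"(?![A-Za-z0-9])",              # not followed by an ASCII letter or digit
    re.IGNORECASE,
)


def extract_tnm(text: str) -> Optional[dict]:
    """Return the first TNM stage found in text, or None if there is none."""
    match = TNM_PATTERN.search(text)
    if match is None:
        return None
    return {
        "prefix": (match.group("prefix") or "").lower(),
        "T": match.group("t"),
        "N": match.group("n"),
        "M": match.group("m"),
    }
```

A rule like this could serve as a cheap first pass before a model-based recognizer, since TNM strings are highly regular.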
### 2. Terminology Normalization
- ICD coding
- ATC coding
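
Terminology normalization can be pictured as mapping a free-text mention to a standard code. The sketch below assumes ICD-10 (the source does not name an ICD revision) and uses tiny hand-written tables. `ICD10_BY_TERM`, `ATC_BY_TERM`, and `normalize_term` are hypothetical names introduced for illustration only.

```python
from typing import Dict, Optional

# Illustrative lookup tables only. A real deployment would load the full
# ICD-10 and ATC releases instead of hard-coding a handful of entries.
ICD10_BY_TERM: Dict[str, str] = {
    "type 2 diabetes mellitus": "E11",
    "type 2 diabetes": "E11",
    "essential hypertension": "I10",
}
ATC_BY_TERM: Dict[str, str] = {
    "metformin": "A10BA02",
    "paracetamol": "N02BE01",
    "acetaminophen": "N02BE01",
}


def normalize_term(mention: str, table: Dict[str, str]) -> Optional[str]:
    """Look up a mention after lowercasing and collapsing whitespace."""
    key = " ".join(mention.lower().split())
    return table.get(key)
```

Exact-match lookup misses abbreviations and misspellings, so a production version would likely add synonym expansion or fuzzy matching on top.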
### 3. Relation Extraction
- Disease-drug relations
- Symptom-disease relations
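
One simple baseline for disease-drug relations is sentence-level co-occurrence: any disease and drug mentioned in the same sentence are proposed as a candidate pair. The source does not specify an extraction method, so the `Relation` type and `cooccurrence_relations` function below are hypothetical and only show the shape of the output.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Relation:
    head: str
    tail: str
    relation_type: str


def cooccurrence_relations(
    text: str, diseases: List[str], drugs: List[str]
) -> List[Relation]:
    """Propose a disease-drug relation for each pair sharing a sentence."""
    relations = []
    # Split on ASCII and CJK sentence terminators.
    for sentence in re.split(r"[.!?;。！？；]", text):
        lowered = sentence.lower()
        found_diseases = [d for d in diseases if d.lower() in lowered]
        found_drugs = [d for d in drugs if d.lower() in lowered]
        for disease in found_diseases:
            for drug in found_drugs:
                relations.append(Relation(disease, drug, "disease-drug"))
    return relations
```

Co-occurrence over-generates (it cannot tell "treated with" from "allergic to"), so its output is best treated as candidates for a later classification step.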
---
## 🏗️ Technical Approach
### Cloud Version (High Accuracy)
```python
# Based on an LLM API (Claude/GPT)
# Structured output via JSON mode
```
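
To make the structured-output idea concrete, the sketch below shows one way the cloud path could validate a model's JSON reply. It does not call any real provider SDK: `call_llm` is a hypothetical callable supplied by the caller, and the prompt wording, the `ALLOWED_TYPES` set, and the response shape are assumptions made for this example.

```python
import json
from typing import Callable, Dict, List

# Hypothetical entity types, loosely mirroring the core features listed above.
ALLOWED_TYPES = {"disease", "drug", "procedure", "tnm_stage"}


def build_prompt(text: str) -> str:
    """Build an instruction asking for a JSON-only reply (assumed wording)."""
    return (
        "Extract medical entities from the text below. Reply with JSON only, "
        'shaped as {"entities": [{"text": "...", "type": "..."}]}, where type '
        f"is one of {sorted(ALLOWED_TYPES)}.\n\nText: {text}"
    )


def extract_entities(text: str, call_llm: Callable[[str], str]) -> List[Dict[str, str]]:
    """Call the injected LLM function and keep only well-formed entities."""
    raw = call_llm(build_prompt(text))
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError:
        return []
    if not isinstance(payload, dict):
        return []
    entities = payload.get("entities", [])
    if not isinstance(entities, list):
        return []
    valid = []
    for item in entities:
        if not isinstance(item, dict):
            continue
        span, kind = item.get("text"), item.get("type")
        # Drop unknown types and spans that do not occur in the input text.
        if isinstance(span, str) and isinstance(kind, str) \
                and kind in ALLOWED_TYPES and span in text:
            valid.append({"text": span, "type": kind})
    return valid
```

Injecting `call_llm` keeps the validation logic independent of any particular provider and lets it be tested with a stub that returns a fixed string.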
### Standalone Version (Privacy First)
```python
# Based on spaCy + a medical model
# Runs 100% locally
```
---
## 🔗 Related Documents
- [Common Capability Layer overview](../README.md)
- [DC module requirements](../../03-业务模块/DC-数据清洗整理/README.md)
---
**Last updated:** 2025-11-06
**Maintainer:** Technical Architect