Go to file

HaHafeng beb7f7f559 feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

Core Components:
- PDFStorageService with Dify/OSS adapters
- LLM12FieldsService with Nougat-first + dual-model + 3-layer JSON parsing
- PromptBuilder for dynamic prompt assembly
- MedicalLogicValidator with 5 rules + fault tolerance
- EvidenceChainValidator for citation integrity
- ConflictDetectionService for dual-model comparison

Prompt Engineering:
- System Prompt (6601 chars, Section-Aware strategy)
- User Prompt template (PICOS context injection)
- JSON Schema (12 fields constraints)
- Cochrane standards (not loaded in MVP)

Key Innovations:
- 3-layer JSON parsing (JSON.parse + json-repair + code block extraction)
- Promise.allSettled for dual-model fault tolerance
- safeGetFieldValue for robust field extraction
- Mixed CN/EN token calculation

Integration Tests:
- integration-test.ts (full test)
- quick-test.ts (quick test)
- cached-result-test.ts (fault tolerance test)

Documentation Updates:
- Development record (Day 2-3 summary)
- Quality assurance strategy (full-text screening)
- Development plan (progress update)
- Module status (v1.1 update)
- Technical debt (10 new items)

Test Results:
- JSON parsing success rate: 100%
- Medical logic validation: 5/5 passed
- Dual-model parallel processing: OK
- Cost per PDF: CNY 0.10

Files: 238 changed, 14383 insertions(+), 32 deletions(-)
Docs: docs/03-涓氬姟妯″潡/ASL-AI鏅鸿兘鏂囩尞/05-寮€鍙戣褰?2025-11-22_Day2-Day3_LLM鏈嶅姟涓庨獙璇佺郴缁熷紑鍙?md

2025-11-22 22:21:12 +08:00

backend

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

docs

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

extraction_service

feat: add extraction_service (PDF/Docx/Txt) and update .gitignore to exclude venv

2025-11-16 15:32:44 +08:00

frontend

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

frontend-v2

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

测试记录

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

.editorconfig

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

.gitattributes

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

.gitignore

feat: add extraction_service (PDF/Docx/Txt) and update .gitignore to exclude venv

2025-11-16 15:32:44 +08:00

【给新AI】快速开始.md

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

AI Clinical Research PRD.txt

chore: project initialization - Day 4 environment setup

2025-10-10 15:14:54 +08:00

Dify完整部署方案.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

Dify部署监控.bat

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

docker-compose.yml

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

Git提交准备清单.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

package-lock.json

docs: update Day 7 summary and milestone progress

2025-10-10 17:57:10 +08:00

package.json

docs: update Day 7 summary and milestone progress

2025-10-10 17:57:10 +08:00

Phase2-UI优化总结.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase2-全文阅读模式-真实实现.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase2-快速测试清单.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase2-测试指南.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase2-问题9-Token限制与超时修复.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase2-问题9-快速验证.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase2-首次测试-修复总结.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase3-Day1-后端完成总结.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase3-快速参考.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

Phase3-最终收尾-测试指南.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

PHASE1-测试指南.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

README-Phase2测试.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

README-里程碑1.5完成.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

README.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

START-HERE-FOR-AI.md

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

START-HERE-FOR-NEW-AI.md

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

START-开始使用.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

stop-all-services.bat

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

yonghuduan_v6.html

chore: project initialization - Day 4 environment setup

2025-10-10 15:14:54 +08:00

一键启动.bat

refactor(asl): ASL frontend architecture refactoring with left navigation

2025-11-18 21:51:51 +08:00

优化方案总结.md

chore: project initialization - Day 4 environment setup

2025-10-10 15:14:54 +08:00

启动Dify.bat

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

启动指南.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

如何测试Phase2.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

对话系统实现方案对比.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

开发环境配置指南.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

快速修复-端口占用.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

快速参考-最终方案.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

快速测试指南-Week4.md

feat(asl): Implement full-text screening core LLM service and validation system (Day 1-3)

2025-11-22 22:21:12 +08:00

技术架构选型对比方案.md

chore: project initialization - Day 4 environment setup

2025-10-10 15:14:54 +08:00

智能引用功能-测试指南.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

查看端口占用.bat

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

检查测试环境.bat

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

测试API.bat

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

测试和启动.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

知识库需求调整说明.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

稿件审查功能-最终完成报告.md

chore: add remaining test docs, scripts and temp files

2025-11-16 15:44:55 +08:00

第一周开发指南.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

解决方案-前端获取数据失败.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

诊断问题.bat

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

调试指南.md

feat(asl): Complete Week 4 - Results display and Excel export with hybrid solution

2025-11-21 20:12:38 +08:00

配置Docker镜像加速器.md

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

重启所有服务.bat

docs: update progress for Day23-25

2025-10-12 10:01:10 +08:00

重启服务.bat

feat: Day 21-22 - knowledge base frontend completed, fix CORS and file upload issues

2025-10-11 15:40:12 +08:00

README.md

AI科研助手

专注于赋能临床及科研人员的智能化平台

📚 项目文档

📖 文档导航中心

📚 完整文档导航 ⭐ 查看所有设计文档和开发规范

🔗 快速链接

产品需求文档(PRD) - 了解产品需求
技术架构总览 - 了解技术方案
数据库设计文档 - 理解数据结构
API设计规范 - 掌握接口定义
开发里程碑 - 查看开发进度

🛠️ 子项目文档

⚙️ 后端开发指南 - Node.js + Fastify + Prisma
🎨 前端开发指南 - React + TypeScript + Ant Design

🏗️ 技术栈

前端

React 18 + TypeScript
Vite
TailwindCSS
Zustand
LobeChat组件

后端

Node.js + Fastify + TypeScript
Prisma ORM
PostgreSQL
Redis

第三方服务

Dify（RAG知识库）
DeepSeek-V3（主力LLM）
Qwen3（备用LLM）

✨ 核心功能

1. 智能问答系统

基于项目背景的上下文对话
支持@知识库引用
3种对话模式：RAG快速检索、全文阅读、批处理

2. 知识库管理

文档上传与管理（支持PDF/Word/Txt）
智能文本提取（Python微服务）
RAG检索优化（top_k=15, chunk_size=1500）

3. 批处理模式

批量提取结构化信息（3-50个文档）
预设模板+自定义Prompt
Excel导出

4. 稿件审查功能 ⭐ 新增

双维度智能评估：
- 稿约规范性评估（11个标准）
- 方法学评估（3个部分，20个检查点）
完整工作流程：
- Word文档上传（.doc/.docx）
- 实时进度展示
- 详细评估报告
- PDF导出+文本复制
多模型支持：DeepSeek-V3 / Qwen3-72B / Qwen-Long
独立导航入口：左侧菜单"稿件审查"

5. 12个智能体（规划中）

✅ 选题评价智能体（已完成）
⏳ 其他11个智能体（计划中）

🚀 快速开始

1. 启动基础服务

# 启动PostgreSQL和Redis
docker-compose up -d

2. 后端开发

cd backend
npm install
npm run dev

3. 前端开发

cd frontend
npm install
npm run dev

📦 目录结构

AIclinicalresearch/
├── frontend/           # 前端项目
├── backend/            # 后端项目
├── docs/               # 项目文档
├── docker-compose.yml  # Docker配置
└── README.md           # 本文件

🔑 环境变量

请参考 .env.example 文件配置环境变量。

📖 开发指南

请查看开发里程碑了解详细的开发计划。

📄 License

MIT

🔗 相关链接

📚 文档中心 - 完整的项目文档导航
⚙️ 后端项目 - 后端开发指南
🎨 前端项目 - 前端开发指南
🚀 快速启动指南 - 一步步启动项目
🐳 Dify部署方案 - Dify部署指南

当前开发阶段： 里程碑1 - Day 6（前端基础架构）
开发进度： 50% - 前后端基础架构已完成

Languages

TypeScript 83%

Python 6.2%

JavaScript 3.8%

CSS 3.2%

R 2.5%

Other 1.2%

README.md Unescape Escape

AI科研助手

📚 项目文档

📖 文档导航中心

🔗 快速链接

🛠️ 子项目文档

🏗️ 技术栈

前端

后端

第三方服务

✨ 核心功能

1. 智能问答系统

2. 知识库管理

3. 批处理模式

4. 稿件审查功能 ⭐ 新增

5. 12个智能体（规划中）

🚀 快速开始

1. 启动基础服务

2. 后端开发

3. 前端开发

📦 目录结构

🔑 环境变量

📖 开发指南

📄 License

🔗 相关链接

README.md