Files
AIclinicalresearch/docs/02-通用能力层/04-数据ETL引擎
HaHafeng 66255368b7 feat(admin): Add user management and upgrade to module permission system
Features - User Management (Phase 4.1):
- Database: Add user_modules table for fine-grained module permissions
- Database: Add 4 user permissions (view/create/edit/delete) to role_permissions
- Backend: UserService (780 lines) - CRUD with tenant isolation
- Backend: UserController + UserRoutes (648 lines) - 13 API endpoints
- Backend: Batch import users from Excel
- Frontend: UserListPage (412 lines) - list/filter/search/pagination
- Frontend: UserFormPage (341 lines) - create/edit with module config
- Frontend: UserDetailPage (393 lines) - details/tenant/module management
- Frontend: 3 modal components (592 lines) - import/assign/configure
- API: GET/POST/PUT/DELETE /api/admin/users/* endpoints

Architecture Upgrade - Module Permission System:
- Backend: Add getUserModules() method in auth.service
- Backend: Login API returns modules array in user object
- Frontend: AuthContext adds hasModule() method
- Frontend: Navigation filters modules based on user.modules
- Frontend: RouteGuard checks requiredModule instead of requiredVersion
- Frontend: Remove deprecated version-based permission system
- UX: Only show accessible modules in navigation (clean UI)
- UX: Smart redirect after login (avoid 403 for regular users)

Fixes:
- Fix UTF-8 encoding corruption in ~100 docs files
- Fix pageSize type conversion in userService (String to Number)
- Fix authUser undefined error in TopNavigation
- Fix login redirect logic with role-based access check
- Update Git commit guidelines v1.2 with UTF-8 safety rules

Database Changes:
- CREATE TABLE user_modules (user_id, tenant_id, module_code, is_enabled)
- ADD UNIQUE CONSTRAINT (user_id, tenant_id, module_code)
- INSERT 4 permissions + role assignments
- UPDATE PUBLIC tenant with 8 module subscriptions

Technical:
- Backend: 5 new files (~2400 lines)
- Frontend: 10 new files (~2500 lines)
- Docs: 1 development record + 2 status updates + 1 guideline update
- Total: ~4900 lines of code

Status: User management 100% complete, module permission system operational
2026-01-16 13:42:10 +08:00
..

数据ETL引擎

能力定位: 通用能力层
复用率: 29% (2个模块依赖)
优先级: P2
状态: 待实现


📋 能力概述

数据ETL引擎负责

  • Excel多表JOIN
  • 数据清洗
  • 数据转换
  • 数据验证

📊 依赖模块

2个模块依赖29%复用率):

  1. DC - 数据清洗整理(核心依赖)
  2. SSA - 智能统计分析(数据预处理)

💡 核心功能

1. Excel多表处理

  • 读取多个Excel文件
  • 自动JOIN操作
  • GROUP BY聚合

2. 数据清洗

  • 缺失值处理
  • 重复值处理
  • 异常值检测

3. 数据转换

  • 类型转换
  • 格式标准化

🏗️ 技术方案

云端版(最优)

# 基于Polars性能极高
class ETLEngine:
    def read_excel(self, files: List[File]) -> List[DataFrame]
    def join(self, dfs: List[DataFrame], keys: List[str]) -> DataFrame
    def clean(self, df: DataFrame, rules: Dict) -> DataFrame
    def export(self, df: DataFrame, format: str) -> bytes

单机版(兼容)

# 基于SQLite内存友好
# 分块读取数据库引擎处理JOIN

🔗 相关文档


最后更新: 2025-11-06
维护人: 技术架构师