feat(iit-manager): Integrate Dify knowledge base for hybrid retrieval

Completed features:
- Created Dify dataset (Dify_test0102) with 2 processed documents
- Linked test0102 project with Dify dataset ID
- Extended intent detection to recognize query_protocol intent
- Implemented queryDifyKnowledge method (semantic search Top 5)
- Integrated hybrid retrieval (REDCap data + Dify documents)
- Fixed AI hallucination bugs (intent detection + API field path)
- Developed debugging scripts
- Completed end-to-end testing (5 scenarios passed)
- Generated comprehensive documentation (600+ lines)
- Updated development plans and module status

Technical highlights:
- Single project single knowledge base architecture
- Smart routing based on user intent
- Prevent AI hallucination by injecting real data/documents
- Session memory for multi-turn conversations
- Reused LLMFactory for DeepSeek-V3 integration

Bug fixes:
- Fixed intent detection missing keywords
- Fixed Dify API response field path error

Testing: All scenarios verified in WeChat production environment

Status: Fully tested and deployed
This commit is contained in:
2026-01-04 15:44:11 +08:00
parent b47079b387
commit dfc472810b
162 changed files with 3093 additions and 62 deletions

View File

@@ -59,3 +59,4 @@ models/

View File

@@ -45,5 +45,6 @@ __version__ = '1.0.0'

View File

@@ -178,5 +178,6 @@ def get_missing_summary(df: pd.DataFrame) -> dict:

View File

@@ -138,5 +138,6 @@ def apply_filter(

View File

@@ -303,4 +303,5 @@ def get_unpivot_preview(

View File

@@ -312,5 +312,6 @@ if __name__ == "__main__":

View File

@@ -78,5 +78,6 @@ except Exception as e:

View File

@@ -58,5 +58,6 @@ except Exception as e: