Add MinerU document ingestion support

Normalize uploaded documents into structured markdown, add clearer parser
errors for missing dependencies, and cover the ingestion flow with
backend tests. This also replaces deprecated UTC timestamp helpers in
the touched backend paths so the knowledge pipeline stays warning-free.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-03-22 13:42:16 +08:00
parent a9ddf3c9b4
commit 3ee825aa90
20 changed files with 2159 additions and 156 deletions

View File

@@ -48,6 +48,10 @@ dependencies = [
# 工具
"python-dotenv>=1.0.0",
"httpx>=0.27.0",
"openpyxl>=3.1.0",
"python-docx>=1.1.0",
"mineru>=2.0.3",
"psutil>=6.1.0",
]
[project.optional-dependencies]