refactor(rag): introduce doc_service and migrate Document off legacy doc managers #1069
Open
ChenJiahaoST wants to merge 55 commits into LazyAGI:main
added 8 commits
March 18, 2026 14:14
ChenJiahaoST
commented
Apr 7, 2026
Collaborator
Author
ChenJiahaoST
left a comment
Up to, but not including, worker.py:
29 comments submitted so far
Collaborator
Author
Re-checked and addressed the review items up to …
ChenJiahaoST
commented
Apr 8, 2026
Collaborator
Author
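A supplementary note on the two mainline changes wrapped up in this round:
Among the remaining unresolved items in this round, the compact formatting already covered by the current diff, …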
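📌 PR Description
- `doc_service` main path: adds `DocServer`, a new `DocManager`, task / knowledge-base / document management interfaces, and matching local standalone/mock examples
- `Document`: drops the main-path dependency on the legacy `DocManager`/`DocListManager` and connects to the new `doc_service`/`DocumentProcessor` instead
- `dataset_path` import and directory-scanning capability, with support for `enable_path_monitoring`
- `/v1/chunks` chunk-viewing endpoint: supports paginated queries by `kb_id`/`doc_id`/`group`/`algo_id` and returns results in ascending `node.number` order
- Document-management semantics of `transfer`: supports `target_doc_id`, `target_filename`, `target_file_path`, and `target_metadata`
- `MapStore`/`OpenSearch`/`ElasticSearch`
- `DocManager`/`DocListManager` are marked as deprecated and are no longer the main recommended entry point
- Unit tests for `doc_service`/`Document`, brought to a PR-ready state
- Source-priority issue with an installed older `lazyllm` when `Document(manager=True/'ui')` starts a local sub-service, so that `manager='ui'` no longer fails by importing the old package
✅ Type of Change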
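🧪 How Has This Been Tested?
- 37 passed, 6 skipped
- python -m py_compile ...
- `/v1/chunks`, `transfer`, `cancel`, `delete_kb`, and related paths
- `Document(manager='ui')` no longer fails during initialization because a child process imports the old installed package
📷 Demo (Optional)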
⚡ Usage After Update
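A minimal sketch of the `/v1/chunks` query behavior described in the PR description (filter by `kb_id`/`doc_id`/`group`/`algo_id`, paginate, return in ascending `node.number` order). This is not the PR's implementation: it runs against an in-memory list rather than the real store, and the `Chunk` record, the `list_chunks` function, the `page`/`page_size` parameters, and the response shape are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional


@dataclass
class Chunk:
    # Hypothetical record shape; the real node schema in the PR may differ.
    kb_id: str
    doc_id: str
    group: str
    algo_id: str
    number: int  # stands in for node.number
    text: str


def list_chunks(
    chunks: List[Chunk],
    *,
    kb_id: Optional[str] = None,
    doc_id: Optional[str] = None,
    group: Optional[str] = None,
    algo_id: Optional[str] = None,
    page: int = 1,
    page_size: int = 20,
) -> Dict[str, Any]:
    """Filter by the given keys, sort by number ascending, and return one page."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")

    # Only filters that were actually supplied take part in matching.
    filters = {"kb_id": kb_id, "doc_id": doc_id, "group": group, "algo_id": algo_id}
    active = {key: value for key, value in filters.items() if value is not None}

    matched = [c for c in chunks if all(getattr(c, k) == v for k, v in active.items())]
    matched.sort(key=lambda c: c.number)  # ascending node.number

    start = (page - 1) * page_size
    return {
        "total": len(matched),
        "page": page,
        "page_size": page_size,
        "items": matched[start:start + page_size],
    }
```

Sorting before slicing keeps page boundaries stable across requests, which is presumably why a fixed ordering by `node.number` matters for a paginated viewer.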
🔄 Refactor Before & After (only for Refactor)
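Before:
- The `Document` main path depended on the legacy `DocManager`/`DocListManager`
- `DocListManager`

After:
- The `Document` main path switches to `DocImpl + DocumentProcessor + DocServer`
- `dataset_path` import and directory scanning are migrated to the new architecture
- `DocServer` becomes the new main entry point for document management
- Complete paginated viewing path: `doc_server -> parsing_service -> store`
- `DocManager`/`DocListManager` are kept for compatibility but are no longer the main recommended entry point

- The `DocManager`/`DocListManager` files are kept, but have been downgraded to legacy / deprecated entry points
- `transfer` only supports moving documents between knowledge bases within the same algorithm; cross-algorithm transfer is not supported
- `target_metadata` semantics: the source metadata is inherited and then overridden by the patch; it is not a wholesale replacement
- The `manager='ui'` initialization path has been fixed to prefer the source in the current workspace; if the runtime environment still has problems, the cause is more likely the installation / packaging environment
- The `6 skipped` in `tests/basic_tests/RAG/test_doc_processor.py` are pre-existing skips, not an issue introduced by this PR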
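The two `transfer` rules in the notes above (same-algorithm only, and `target_metadata` applied as a patch over inherited source metadata) can be sketched as follows. This is an illustration under assumptions, not the PR's code: `merge_target_metadata`, `plan_transfer`, the dict-based document and knowledge-base shapes, and the fallback to source values when a `target_*` argument is omitted are all hypothetical.

```python
from typing import Any, Dict, Optional


def merge_target_metadata(
    source_metadata: Dict[str, Any],
    target_metadata: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Inherit the source metadata, then override keys present in the patch."""
    merged = dict(source_metadata)        # copy, so the source stays untouched
    merged.update(target_metadata or {})  # patch override, not full replacement
    return merged


def plan_transfer(
    source_doc: Dict[str, Any],
    target_kb: Dict[str, Any],
    *,
    target_doc_id: Optional[str] = None,
    target_filename: Optional[str] = None,
    target_file_path: Optional[str] = None,
    target_metadata: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Build the target document record for a same-algorithm transfer."""
    if source_doc["algo_id"] != target_kb["algo_id"]:
        raise ValueError("cross-algorithm transfer is not supported")
    return {
        "kb_id": target_kb["kb_id"],
        "algo_id": source_doc["algo_id"],
        # Assumption: omitted target_* values fall back to the source values.
        "doc_id": target_doc_id or source_doc["doc_id"],
        "filename": target_filename or source_doc["filename"],
        "file_path": target_file_path or source_doc["file_path"],
        "metadata": merge_target_metadata(source_doc["metadata"], target_metadata),
    }
```

With a source metadata of `{"lang": "zh", "owner": "alice"}` and a patch of `{"owner": "bob"}`, the merged result keeps `lang` and replaces only `owner`.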