fix: strip thinking tags before all JSON parse strategies by haosenwang1018 · Pull Request #209 · HKUDS/RAG-Anything

haosenwang1018 · 2026-02-20T20:28:49Z

Closes #159

When using reasoning models (qwen2.5-think, deepseek-r1, etc.), the <think>...</think> tags were only stripped in _extract_all_json_candidates(), meaning the regex fallback strategy (_extract_fields_with_regex) still operated on the raw response including thinking content. This could cause it to extract content from the thinking section rather than the actual analysis.

This fix moves the think-tag stripping to the top of _robust_json_parse() so all downstream strategies work with clean model output.

…oning model output

LarFii · 2026-02-24T10:22:42Z

Thanks for the fix.

I do see one potential side effect to consider:

Potential data loss from global tag stripping
The current implementation removes <think>...</think> / <thinking>...</thinking> across the entire response before all parsing strategies.
If those tags appear as legitimate literal content inside the actual payload (e.g., in detailed_description), that content would be removed unintentionally.

Also a minor maintainability point:

Duplicated cleanup logic
The same think-tag stripping still exists in _extract_all_json_candidates(), so cleanup now happens in two places. It may be better to centralize this in one function to avoid drift.

Suggested refinement

Instead of global removal, strip only leading reasoning blocks (prefix-only), which still fixes the fallback issue while avoiding accidental mutation of valid body content.

fix: strip thinking tags before all JSON parse strategies to fix reas…

46be41e

…oning model output

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

fix: strip thinking tags before all JSON parse strategies#209

fix: strip thinking tags before all JSON parse strategies#209
haosenwang1018 wants to merge 1 commit intoHKUDS:mainfrom
haosenwang1018:fix/strip-think-tags-before-parse

haosenwang1018 commented Feb 20, 2026

Uh oh!

LarFii commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

haosenwang1018 commented Feb 20, 2026

Uh oh!

LarFii commented Feb 24, 2026

Suggested refinement

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants