
feat: enhance AI JSON response with sources, confidence, and tags#141

Open
sachin9058 wants to merge 2 commits into benodiwal:main from sachin9058:feature/add-response-metadata

Conversation

@sachin9058

Summary

This PR enhances the JSON response format by adding:

  • documentation sources (PostgreSQL/PostGIS)
  • a confidence score
  • contextual tags extracted from the explanation

Changes

  • Updated response_formatter.cpp to include metadata fields
  • Added metadata.cpp and metadata.hpp
  • Integrated metadata extraction based on explanation
  • Updated CMakeLists.txt
  • Added basic unit test for metadata logic

Impact

  • Improves trust and usability of AI responses
  • Allows users to verify answers using official documentation
  • Adds structured metadata for better interpretation

Notes

  • No breaking changes
  • Only extends existing JSON output
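
For illustration, the extended output might look like this (the field names are sketched from the summary above; the exact layout in the PR may differ):

```json
{
  "sql": "SELECT ST_Area(geom) FROM parcels;",
  "explanation": "Computes the area of each parcel geometry.",
  "confidence": 0.75,
  "sources": [
    { "title": "PostGIS Documentation", "url": "https://postgis.net/docs/" }
  ],
  "tags": ["postgis", "select"]
}
```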

@sachin9058 sachin9058 requested a review from benodiwal as a code owner April 5, 2026 20:55
Comment on lines +23 to +27
double calculateConfidence(const std::string& text) {
    if (text.length() > 150) return 0.9;
    if (text.length() > 50) return 0.75;
    return 0.6;
}
Contributor


How does this calculate the AI's confidence in the query?

Author


Good question! Currently the confidence score is a simple heuristic, not derived from the AI model itself.

Right now it’s based on the length of the generated explanation:

  • Longer explanations are assumed to be more detailed → higher confidence
  • Shorter ones → lower confidence

This is just an initial placeholder to provide a basic signal to users.

In future iterations, this could be improved by:

  • Incorporating model-provided confidence (if available)
  • Using response structure/quality signals
  • Integrating external validation or scoring mechanisms

Happy to refine this approach based on suggestions.
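
As one possible next step, the heuristic could mix several weak signals rather than length alone. A minimal sketch (hypothetical signals and weights, not part of this PR):

```cpp
#include <algorithm>
#include <string>

// Hypothetical sketch, not the PR's code: blend explanation length with
// simple structural signals instead of using length alone. The weights
// are arbitrary placeholders.
double calculateConfidence(const std::string& text) {
    double score = 0.5;                                            // baseline
    if (text.length() > 150)      score += 0.2;                    // detailed explanation
    else if (text.length() > 50)  score += 0.1;
    if (text.find("SELECT") != std::string::npos) score += 0.1;    // cites concrete SQL
    if (text.find('\n')     != std::string::npos) score += 0.1;    // structured, multi-line
    return std::min(score, 0.9);  // cap below 1.0: a heuristic is never certain
}
```

This is still a heuristic, so the cap keeps it from ever claiming full confidence.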

Comment on lines +8 to +18
if (text.find("ST_") != std::string::npos) {
    sources.push_back({
        {"title", "PostGIS Documentation"},
        {"url", "https://postgis.net/docs/"}});
}

if (text.find("SELECT") != std::string::npos) {
    sources.push_back({
        {"title", "PostgreSQL SELECT"},
        {"url", "https://www.postgresql.org/docs/current/sql-select.html"}});
}
Contributor


You only cover SELECT and PostGIS queries for documentation; I don't think this should be done this way.

Contributor

@sahitya-chandra left a comment


Thanks for the effort here! I'm a bit skeptical, though, about whether this belongs in the core extension.

The confidence score being derived from text length doesn't really reflect model confidence. And honestly, I'm not sure a confidence field makes sense here at all: the models that hallucinate a lot are exactly the ones where you'd want an uncertainty signal, but they're also the ones least capable of producing a reliable one. Stronger models, meanwhile, are good enough that you don't really need it. Either way, it could mislead users more than it helps.

The source linking feels a bit fragile too: if nearly every response points to the same top-level PostgreSQL or PostGIS page just because SELECT or ST_ appeared somewhere, those aren't really sources; it's more like a static footer. It could be genuinely useful if the links were actually tied to the specific functions or clauses in the generated query, but as-is it adds more noise than clarity.
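
For what function-level linking could look like, here is a hypothetical helper (the per-function URL pattern is assumed from PostGIS's documentation layout and should be verified against the live docs):

```cpp
#include <regex>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical sketch: link each PostGIS function actually used in the
// generated SQL to its own reference page, instead of emitting one
// generic link whenever "ST_" appears anywhere in the text.
std::vector<std::pair<std::string, std::string>>
linkSources(const std::string& sql) {
    // PostGIS reference pages appear to follow /docs/<FunctionName>.html;
    // this URL pattern is an assumption, not verified here.
    static const std::regex fn(R"(ST_[A-Za-z0-9]+)");
    std::vector<std::pair<std::string, std::string>> sources;
    std::set<std::string> seen;  // deduplicate repeated functions
    for (auto it = std::sregex_iterator(sql.begin(), sql.end(), fn);
         it != std::sregex_iterator(); ++it) {
        const std::string name = it->str();
        if (seen.insert(name).second) {
            sources.emplace_back(name, "https://postgis.net/docs/" + name + ".html");
        }
    }
    return sources;
}
```

Called on `SELECT ST_Intersects(a, b), ST_Area(a) FROM t;`, this yields one link per distinct function used, which is closer to real sources than a static footer.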

More broadly, I'm not sure metadata enrichment like this is really the extension's job. Its core responsibility is generating valid SQL from natural language; annotating and interpreting the AI's explanation feels more like something a client or UI layer should own. And once this is in, we're on the hook for keeping these heuristics updated.

Might be worth opening a discussion thread first to see if there's consensus on whether this is the right place for
it?

cc @benodiwal @probablyArth - curious what you think about the scope here

@sachin9058
Author

sachin9058 commented Apr 7, 2026

@MohamedKamal000 @sahitya-chandra
Thanks a lot for the thoughtful feedback, really appreciate you taking the time to review this.

I get the concerns you raised. The current version was more of an initial attempt, but I agree that:

  • the confidence score based on text length isn’t a reliable signal and could be misleading
  • the source linking is too generic right now and not actually tied to specific parts of the query
  • and more importantly, this might not be the right place to handle this kind of metadata

My intention was to make responses a bit more transparent and easier to trust, but I see how putting this in the core extension adds scope and maintenance overhead.

I’m open to taking a different approach here — maybe moving this to the client/UI side, or reworking it so the metadata is actually meaningful and tied to specific SQL constructs.

Happy to iterate on this, or pause and open a discussion first if that makes more sense.

Let me know what you think 👍
