Nova 2 Sonic support #3212
```python
# Simulate a long network delay.
# You can continue chatting while waiting for this to complete.
# With Nova 2 Sonic (the default model), the assistant will respond
# appropriately once the function call is complete.
```
I think I'm witnessing a bit of buggy behavior in the model.
When there are multiple in-flight tool calls, the model seems prone to getting confused and "mixing up" the results when they come back.
For example, I see things like this:
User: "Hey, can you tell me the weather in San Diego?"
--> Tool call kicks off with ID "foo"
User: "Actually, hang on. I'm in Washington, D.C."
--> Tool call kicks off with ID "bar"
--> Tool result arrives with ID "foo" (corresponding to San Diego) and is reported to the model.
Assistant: "The weather in Washington, D.C. is nice, with a temperature of 65 degrees."
--> Tool result arrives with ID "bar" (corresponding to Washington, D.C.) and is reported to the model.
Assistant: "Actually in Washington, D.C. the temperature is 70 degrees."
Interesting. In theory, the tool_call_id should be enough to handle this.
Have you tried invoking different functions at the same time to check whether it works correctly in that case?
Hm, if I fire all of the tool calls at the exact same time (e.g. "tell me the weather in San Diego, Washington, D.C., and New York"), tool_call_id does seem to be enough to tell the model which is which...the problem seems to arise only when they're offset...
Ah, another clue! Maybe it has something to do with whether you interact with the model while the function calls are in-flight...so, going back to the previous example ("tell me the weather in San Diego, Washington, D.C., and New York"), if I then ask the model to tell me a fun fact while I'm waiting for the weather results, when the weather results arrive they might get scrambled (the model might think San Diego's weather is New York's, etc.).
Another thing I'm running into: if there are multiple in-flight tool calls at the same time, then when they come back, the model fires off another unnecessary, duplicate tool call.
And one last thing: if I hammer the model too hard by interrupting in-flight tool calls with new tool calls, it occasionally crashes with a "System instability detected" error.
But this is all somewhat aggressive testing. In the "normal" case of one tool call at a time (probably most real-world usage falls under this umbrella), things seem to be working reasonably well.
Yeah, I think so. And in this case, since the service is handling the context on its side, I'm not sure there's much we can do to prevent it.
```python
def _is_assistant_response_trigger_needed(self) -> bool:
    # Assistant response trigger audio is only needed with the older model
    return not self._is_first_generation_sonic_model()
```
Very glad about this 😄
```diff
     region: str,
-    model: str = "amazon.nova-sonic-v1:0",
     voice_id: str = "matthew",  # matthew, tiffany, amy
+    model: str = "amazon.nova-2-sonic-v1:0",
```
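For users of the service, this means a bare construction now gets Nova 2 Sonic, and the first-generation model stays available by pinning it explicitly. A hedged sketch (the class name and import path are assumptions on my part; only the parameters come from the quoted signature):

```python
# Assumed import path and class name -- check the actual module in the repo.
from pipecat.services.aws_nova_sonic import AWSNovaSonicLLMService

# Default model is amazon.nova-2-sonic-v1:0 after this PR.
llm = AWSNovaSonicLLMService(region="us-east-1", voice_id="matthew")

# Pin the first-generation model explicitly if you still need the old behavior:
legacy_llm = AWSNovaSonicLLMService(
    region="us-east-1",
    model="amazon.nova-sonic-v1:0",
    voice_id="matthew",  # matthew, tiffany, amy
)
```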
One thing I've noticed in testing this new model is that some guardrails are very strong. Some of the things I usually do during testing are occasionally treated as prohibited.
For example, a few times when I asked the model to tell me a short story, it told me it didn't want to accidentally infringe on intellectual property, so it couldn't fulfill my request.
Another time, when I asked for suggestions of things to do for fun while in Washington, D.C., it said it didn't want to answer, to avoid suggesting criminal or violent activities.
In both cases, once the guardrail was "triggered", it became very hard to continue the conversation without completely changing topics.
In the short-story scenario, for example, a follow-up request for a 3-sentence story was also blocked. The model then suggested "safer" topics, like general story arcs or story-writing tips. Each time I said "OK, let's discuss that", it still said it couldn't discuss those topics, for the same reason (not infringing on intellectual property).
In the fun-things-to-do-around-town scenario, I followed up with things like "what about, like, going to museums?" and it still said it couldn't answer, for the same reason (it didn't want to suggest something harmful).
Yeah, it looks like there are too many guardrails in this case now.
```
- Made the assistant-response-trigger hack a no-op. It's only needed for the
  older Nova Sonic model.
```
Cool. So they have fixed this. 🙌🎉
filipi87 left a comment:
LGTM. 🚀
…fetching function to help the model associate a tool response with a tool call...if you interrupt the model while more than one function call is outbound, it seemingly can get confused about which tool result goes with which call (see the sketch below).
For another PR: cross-modal input (switching between voice and text input).
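As a hedged illustration of that association hack (the function and field names here are hypothetical, not the PR's actual code): one way a fetching function can help is by echoing the original arguments back alongside the result, so the pairing survives even if the model muddles concurrent tool_call_ids:

```python
# Hypothetical sketch -- names and fields are illustrative, not the PR's code.
# The idea: return the request context together with the result so result and
# call can be paired without relying solely on tool_call_id.
import asyncio

async def fetch_weather(tool_call_id: str, city: str) -> dict:
    await asyncio.sleep(1.0)  # stand-in for the real network fetch
    return {
        "tool_call_id": tool_call_id,   # the id this call was issued under
        "request": {"city": city},      # echo of the original arguments
        "result": {"conditions": "sunny", "temperature_f": 65},
    }
```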