
Decouple model_context from AssistantAgent #4681

Merged: 13 commits, Dec 20, 2024

Conversation

@akurniawan (Contributor) commented Dec 12, 2024

Why are these changes needed?

This change decouples model_context from AssistantAgent to provide flexibility in augmenting the messages coming into the agent. This could be useful when we have a specific template for an agent that is part of a GroupChat; see the discussion in #4668.

Usage example

```python
class MyModelContext(ChatCompletionContext):
    ...

my_model_context = MyModelContext(...)
agent = AssistantAgent(..., model_client=..., model_context=my_model_context)
team = RoundRobinGroupChat(participants=[agent], ...)
```
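To make the idea concrete, here is a self-contained sketch of a custom context that transforms and buffers messages in `add_message`. The classes below are minimal stand-ins for illustration only; the real `ChatCompletionContext` interface lives in autogen-core and differs in detail:

```python
from typing import List


class LLMMessage:
    """Minimal stand-in for an LLM message (illustrative only)."""

    def __init__(self, content: str) -> None:
        self.content = content


class ChatCompletionContext:
    """Stand-in base class: stores the messages that will be sent to the model."""

    def __init__(self) -> None:
        self._messages: List[LLMMessage] = []

    def add_message(self, message: LLMMessage) -> None:
        self._messages.append(message)

    def get_messages(self) -> List[LLMMessage]:
        return list(self._messages)


class MyModelContext(ChatCompletionContext):
    """Applies a template to each message and keeps only the last N."""

    def __init__(self, buffer_size: int = 5) -> None:
        super().__init__()
        self._buffer_size = buffer_size

    def add_message(self, message: LLMMessage) -> None:
        # Transform the message before it is stored.
        message.content = f"[templated] {message.content}"
        super().add_message(message)
        # Drop the oldest messages once the buffer overflows.
        self._messages = self._messages[-self._buffer_size:]
```

With a context like this, the agent would see only the transformed, buffered view of the conversation.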

Related issue number

Checks

@akurniawan (Contributor Author)

@akurniawan please read the following Contributor License Agreement (CLA). If you agree with the CLA, please reply with the following information.

@microsoft-github-policy-service agree [company="{your company}"]

Options:

  • (default - no company specified) I have sole ownership of intellectual property rights to my Submissions and I am not making Submissions in the course of work for my employer.
@microsoft-github-policy-service agree
  • (when company given) I am making Submissions in the course of work for my employer (or my employer has intellectual property rights in my Submissions by contract or applicable law). I have permission from my employer to make Submissions and enter into this Agreement on behalf of my employer. By signing below, the defined term “You” includes me and my employer.
@microsoft-github-policy-service agree company="Microsoft"

Contributor License Agreement

@microsoft-github-policy-service agree

@rysweet rysweet requested review from ekzhu and Copilot December 12, 2024 16:22


Copilot reviewed 1 out of 1 changed files in this pull request and generated no suggestions.

Comments skipped due to low confidence (1)

python/packages/autogen-agentchat/src/autogen_agentchat/agents/_assistant_agent.py:317

  • The error message 'No tools are available.' could be more descriptive. Consider changing it to 'No tools or handoff tools are available for execution.'
raise ValueError("No tools are available.")
@rysweet rysweet requested a review from jackgerrits December 12, 2024 16:25
@ekzhu (Collaborator) commented Dec 13, 2024

@victordibia please take a look at this and see the implications for #4438.

@akurniawan thanks for the PR, since this is very closely related to the aforementioned PR, please change this to DRAFT state and we will need to coordinate the work.

@victordibia (Collaborator) commented Dec 14, 2024

thanks @akurniawan , @ekzhu, @jackgerrits
PR is looking good.

Current PR

From my understanding this PR does the following:

  • adds model_context as an argument to AssistantAgent, based on the ChatCompletionContext class
  • uses this model_context when one is passed, or otherwise instantiates a default based on UnboundedBufferedChatCompletionContext
  • The expected usage/benefit is that, at design time, the developer can pass in their own model_context object and control how and what context is stored by overriding the add_message method, e.g., transform the structure of a message before it is added, drop some messages, replace them, etc.

@akurniawan .. can you confirm the above is correct? Also, if possible can you update the PR description with an example usage ...e.g., below?

```python
class MyModelContext(ChatCompletionContext):
    ...

my_model_context = MyModelContext(...)
agent = AssistantAgent(..., model_client=..., model_context=my_model_context)
team = RoundRobinGroupChat(participants=[agent], ...)
```

PR #4438 - Memory in AgentChat

#4438 focuses on

  • adding a memory argument to AssistantAgent.
  • At runtime, if a memory object is passed, effort is made to query it to update the context message just in time (RAG) by retrieving from the memory store. E.g., I pass in a memory argument with my preferences (e.g., I am vegan), and that is retrieved and added to a query like "provide a meal plan". (Items can be added to or removed from memory external to the assistant agent, resulting in dynamic behaviour at runtime.)

My feeling is that there are differences: while model_context is focused on fixed, design-time updates to all messages, memory is geared towards just-in-time updates. I feel both can coexist:

  • model_context is used to transform all messages
  • memory is used to append relevant items each time the agent receives a message. The implementation would add the retrieved memory messages to llm_messages.

Thoughts welcome.
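The split described above can be sketched as follows. All names here (`Memory`, `retrieve`, `build_llm_messages`) are hypothetical, for illustration only, and not autogen APIs:

```python
from typing import List


class Memory:
    """Toy memory store that is queried just in time, outside the model context."""

    def __init__(self) -> None:
        self._facts: List[str] = []

    def add(self, fact: str) -> None:
        self._facts.append(fact)

    def retrieve(self, query: str) -> List[str]:
        # Toy word-overlap relevance check; a real store would use embeddings.
        words = set(query.lower().split())
        return [f for f in self._facts if words & set(f.lower().split())]


def build_llm_messages(history: List[str], memory: Memory, query: str) -> List[str]:
    # Retrieved memory messages are appended to llm_messages at call time,
    # while any model_context transformations would already have been
    # applied to every message in `history` as it was stored.
    return history + memory.retrieve(query) + [query]
```

For example, after `memory.add("dietary preference: vegan meal plans only")`, a query such as "provide a meal plan" would pick up the stored preference even though the assistant agent itself never saw it.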

@ekzhu (Collaborator) commented Dec 14, 2024

My feeling is that there can be two things:

  1. memory, which has a method that generates or transforms a model context
  2. model context, an object that contains the LLMMessage that will be sent to the model

I think that the current model context interface is not quite complete, because a model context should also include tools or tool schema -- something a memory module should also provide augmentation for.

In this regard, I think the model context object could potentially be transient -- the agent only creates one on demand when model inference is performed.
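A hedged sketch of that transient-context idea, with hypothetical names (nothing below is an actual autogen interface): the agent assembles a short-lived context bundling both messages and tool schemas right before inference, and lets each memory module augment it:

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class TransientModelContext:
    """Short-lived bundle of everything a single model call consumes."""

    messages: List[str] = field(default_factory=list)
    tool_schemas: List[Dict[str, Any]] = field(default_factory=list)


# A memory module takes the context and returns an augmented context.
MemoryModule = Callable[[TransientModelContext], TransientModelContext]


def prepare_inference_context(
    history: List[str],
    tools: List[Dict[str, Any]],
    memory_modules: List[MemoryModule],
) -> TransientModelContext:
    # Build a fresh context on demand for this one inference call.
    ctx = TransientModelContext(messages=list(history), tool_schemas=list(tools))
    # Each memory module may rewrite messages or contribute extra tools.
    for module in memory_modules:
        ctx = module(ctx)
    return ctx
```

The design point is that, because the context is rebuilt per call, memory augmentation can cover tools as well as messages without the agent holding long-lived mutable state.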

@akurniawan akurniawan marked this pull request as draft December 14, 2024 05:54
@akurniawan akurniawan force-pushed the agent_context branch 2 times, most recently from 1942071 to 1966d7e Compare December 14, 2024 05:56
@akurniawan (Contributor Author)

@victordibia yes that's correct and I have updated the PR description as suggested

@ekzhu (Collaborator) left a review comment

@akurniawan thanks.

@victordibia and I have discussed and we think this PR can be marked as Ready for Review.

@ekzhu ekzhu marked this pull request as ready for review December 16, 2024 20:22
@ekzhu (Collaborator) commented Dec 18, 2024

@akurniawan I think this PR is almost at the finish line. I believe we just need to resolve the few remaining issues.

Would you like us to take over and get it done?

@ekzhu (Collaborator) commented Dec 18, 2024

@akurniawan I am going to push some changes.

@akurniawan (Contributor Author)

> @akurniawan I think this PR is almost at the finish line. I believe we just need to resolve the few remaining issues. Would you like us to take over and get it done?

Hey sorry was not available yesterday. I can complete the changes you suggested today.

@akurniawan
Copy link
Contributor Author

It seems you have already changed it; thanks for the help!

@ekzhu (Collaborator) commented Dec 19, 2024

You are welcome 😃

Join our office hours and discord to discuss more.

https://aka.ms/autogen-officehour

https://aka.ms/autogen-discord

@victordibia (Collaborator) left a review comment

Looks good, thanks.

to add memory, we can do something like:

```python
...
# Inner messages.
inner_messages: List[AgentEvent | ChatMessage] = []

# Transform the model context for each "memory source".
if memory and isinstance(memory, list):
    for m in memory:
        self._model_context = await m.transform(self._model_context, transform=True)
```

@ekzhu, what do you think?

@husseinmozannar (Contributor) left a review comment

We can either use a list or the ChatCompletionContext without anything breaking, right?

@ekzhu (Collaborator) commented Dec 20, 2024

> Looks good, thanks. To add memory, we can do something like ... (snippet quoted above). @ekzhu, what do you think?

Yes. That's what I was thinking. The model context is essentially the input, or the "working set", that the model consumes.

@ekzhu (Collaborator) commented Dec 20, 2024

> We can either use a list or the ChatCompletionContext without anything breaking, right?

Right now no one is using this new constructor argument, so it will not be a breaking change.

@ekzhu (Collaborator) commented Dec 20, 2024

> Looks good, thanks. To add memory, we can do something like ... (snippet quoted above). @ekzhu, what do you think?

Perhaps the memory can directly mutate the context. Right now the code snippet just shows a direct update. We want to be prepared for a scenario in which a remote model context is hosted on a service, e.g., the OpenAI Assistant API's thread.
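One way to picture that: if memory mutates the context only through its public interface, the same memory code works whether messages live locally or in a remote thread. Everything below is illustrative; `RemoteThreadContext`, `inject_memory`, and the client are not real autogen or OpenAI APIs:

```python
from typing import Any, Dict, List, Tuple


class RemoteThreadContext:
    """Context whose messages live on a remote service (illustrative only)."""

    def __init__(self, client: Any, thread_id: str) -> None:
        self._client = client
        self._thread_id = thread_id

    def add_message(self, content: str) -> None:
        # Hypothetical HTTP call: the service owns the message list, so
        # callers cannot hold or replace it directly.
        self._client.post(f"/threads/{self._thread_id}/messages", {"content": content})


def inject_memory(context: RemoteThreadContext, facts: List[str]) -> None:
    # Memory mutates the context in place via its interface rather than
    # replacing the object, so a remote context behaves the same as a local one.
    for fact in facts:
        context.add_message(f"[memory] {fact}")
```

The design choice here is that `inject_memory` never assumes it can read or rebuild the message list, which is exactly what a service-hosted thread would forbid.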

@victordibia (Collaborator)

> Perhaps the memory can directly mutate the context. Right now the code snippet just shows a direct update. We want to be prepared for a scenario in which a remote model context is hosted on a service, e.g., the OpenAI Assistant API's thread.

Makes sense ... we can discuss this on the memory PR once this is merged.
Looks ready to merge.

@ekzhu ekzhu merged commit c989181 into microsoft:main Dec 20, 2024
48 checks passed
@akurniawan akurniawan deleted the agent_context branch December 20, 2024 06:52