I see that the system prompt is constructed as a system message immediately followed by an assistant message containing some metadata about the context.
https://github.com/alexzhang13/rlm/blob/main/rlm/utils/prompts.py#L139
Seems like this would be OOD for a lot of os models that were IFT'd with system-user-assistant turn order, so curious why this approach was taken.