Google Gen AI: Context caching breaks with generateObject / generateContent - #3333

Open
chanmathew opened this issue Oct 22, 2024 · 7 comments
Labels: ai/provider, bug (Something isn't working)


chanmathew commented Oct 22, 2024

Description

Hi there,

I'm trying to implement context caching with the Gemini models, but it keeps returning an error saying:

CachedContent can not be used with GenerateContent request setting system_instruction, tools or tool_config. Proposed fix: move those values to CachedContent from GenerateContent request.

But I'm not passing in system_instruction, tools, or tool_config in my request. Not sure if I'm doing something wrong here?

import { z } from 'zod';

export const extractedPlacesSchema = z.object({
	results: z.array(
		z.object({
			name: z.string(),
			link: z.string().nullable(),
			image: z.string().nullable(),
			city: z.string().nullable(),
			country: z.string().nullable()
		})
	)
});

import { generateObject } from 'ai';
import { google } from '@ai-sdk/google';
// cacheManager is a GoogleAICacheManager from '@google/generative-ai/server'

const newCache = await cacheManager.create({
	model,
	displayName: crypto.randomUUID(),
	systemInstruction: prompt,
	contents: [
		{
			role: 'user',
			parts: [{ text }]
		}
	],
	ttlSeconds: 60
});

const { object } = await generateObject({
	model: google(model, {
		cachedContent: newCache.name,
		safetySettings
	}),
	temperature: 0,
	schema: extractedPlacesSchema,
	prompt: 'Extract from the provided content.'
});

It seems to work fine with generateText; however, as soon as I use generateObject or generateContent, it fails with that error.

Also, just to clarify: if the system prompt is already included via cacheManager, I don't need to specify it again in generateObject, right?
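For reference, the API's restriction can be expressed as a small local check. This is a hypothetical helper (not part of the AI SDK or @google/generative-ai; the name and shape are my own) that flags which request-level fields conflict with cachedContent:

```typescript
// Fields that the GenerateContent API refuses to accept alongside
// cachedContent; they must live in the cached content instead.
interface GenerateContentBody {
	cachedContent?: string;
	systemInstruction?: unknown;
	tools?: unknown;
	toolConfig?: unknown;
}

// Returns the names of the conflicting fields (empty when the request
// is compatible with its cache, or uses no cache at all).
function cachedContentConflicts(body: GenerateContentBody): string[] {
	if (!body.cachedContent) return [];
	return (['systemInstruction', 'tools', 'toolConfig'] as const).filter(
		(key) => body[key] !== undefined
	);
}
```

Applied to the requestBodyValues from the full error below, this would flag systemInstruction as the only conflicting field.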

Code example

Here is the full error. You can see in requestBodyValues that a systemInstruction object is being passed, potentially injected by the SDK; perhaps that is the issue?

 Error on attempt 3/3: APICallError [AI_APICallError]: CachedContent can not be used with GenerateContent request setting system_instruction, tools or tool_config.

 Proposed fix: move those values to CachedContent from GenerateContent request.
     at file:///Users/mathew/Dev/talewind-app/node_modules/@ai-sdk/provider-utils/dist/index.mjs:431:14
     at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
     at async postToApi (file:///Users/mathew/Dev/talewind-app/node_modules/@ai-sdk/provider-utils/dist/index.mjs:336:28)
     at async GoogleGenerativeAILanguageModel.doGenerate (file:///Users/mathew/Dev/talewind-app/node_modules/@ai-sdk/google/dist/index.mjs:364:50)
     at async fn (/Users/mathew/Dev/talewind-app/node_modules/ai/dist/index.mjs:2049:33)
     at async eval (/Users/mathew/Dev/talewind-app/node_modules/ai/dist/index.mjs:299:22)
     at async _retryWithExponentialBackoff (/Users/mathew/Dev/talewind-app/node_modules/ai/dist/index.mjs:129:12)
     at async fn (/Users/mathew/Dev/talewind-app/node_modules/ai/dist/index.mjs:2017:34)
     at async eval (/Users/mathew/Dev/talewind-app/node_modules/ai/dist/index.mjs:299:22)
     at async processChunk (/Users/mathew/Dev/talewind-app/apps/creator-app/src/lib/server/crawl.server.ts:224:24) {
   cause: undefined,
   url: 'https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-flash-002:generateContent',
   requestBodyValues: {
     generationConfig: {
       topK: undefined,
       maxOutputTokens: undefined,
       temperature: 0,
       topP: undefined,
       stopSequences: undefined,
       responseMimeType: 'application/json',
       responseSchema: [Object]
     },
     contents: [ [Object] ],
     systemInstruction: { parts: [Array] },
     safetySettings: [ [Object], [Object], [Object], [Object] ],
     cachedContent: 'cachedContents/h7yj08xq8pe3'
   },
   statusCode: 400,
   responseHeaders: {
     'alt-svc': 'h3=":443"; ma=2592000,h3-29=":443"; ma=2592000',
     'cache-control': 'private',
     'content-encoding': 'gzip',
     'content-type': 'application/json; charset=UTF-8',
     date: 'Tue, 22 Oct 2024 18:20:08 GMT',
     server: 'scaffolding on HTTPServer2',
     'server-timing': 'gfet4t7; dur=84',
     'transfer-encoding': 'chunked',
     vary: 'Origin, X-Origin, Referer',
     'x-content-type-options': 'nosniff',
     'x-frame-options': 'SAMEORIGIN',
     'x-xss-protection': '0'
   },
   responseBody: '{\n' +
     '  "error": {\n' +
     '    "code": 400,\n' +
     '    "message": "CachedContent can not be used with GenerateContent request setting system_instruction, tools or tool_config.\\n\\nProposed fix: move those values to CachedContent from GenerateContent request.",\n' +
     '    "status": "INVALID_ARGUMENT"\n' +
     '  }\n' +
     '}\n',
   isRetryable: false,
   data: {
     error: {
       code: 400,
       message: 'CachedContent can not be used with GenerateContent request setting system_instruction, tools or tool_config.\n' +
         '\n' +
         'Proposed fix: move those values to CachedContent from GenerateContent request.',
       status: 'INVALID_ARGUMENT'
     }
   },
   [Symbol(vercel.ai.error)]: true,
   [Symbol(vercel.ai.error.AI_APICallError)]: true
 }
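Until the injected systemInstruction stops conflicting with cachedContent, one conceivable workaround is to sanitize the outgoing request body before it reaches the API. This is only a sketch under assumptions: stripCachedConflicts is a hypothetical name, not an SDK API, and it presumes you can intercept the serialized body (e.g. via a custom fetch passed to the provider, if the provider version accepts one):

```typescript
// Hypothetical workaround sketch (stripCachedConflicts is not an SDK
// API): given a serialized GenerateContent request body, drop the
// fields the API rejects alongside cachedContent. The cached content
// is expected to carry the system instruction instead.
function stripCachedConflicts(rawBody: string): string {
	const body = JSON.parse(rawBody);
	if (body.cachedContent) {
		delete body.systemInstruction;
		delete body.tools;
		delete body.toolConfig;
	}
	return JSON.stringify(body);
}
```

Note this silently discards any request-level system instruction, so it only makes sense when the cache already contains it.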

Additional context

"@ai-sdk/google": "^0.0.51",
"ai": "^3.4.18",
"@google/generative-ai": "^0.21.0",

@lgrammel lgrammel added bug Something isn't working ai/provider labels Oct 23, 2024
@lgrammel (Collaborator)

I can see how this happens with tool mode. Can you try mode: 'json' and see if that helps?

@chanmathew (Author)

@lgrammel Added that, but still the same error unfortunately.

@Albin0903

Still no solution?

@radicalgeek

I have started to get this error too, in code that was working before.

@Albin0903

It seems that now only version 1.5-001 supports cached content, lol.


radicalgeek commented Jan 9, 2025

Confirmed: caching is still working with 1.5-001. Not sure what that means for the future; I hope this is just a temporary mistake, since the 1.5 models are supposed to be "stable". 1.5-001 does seem to be much stricter about the minimum token requirement, though: I did not need anywhere near as many tokens to create a cache with 1.5-002 when it was working.

@ItsWendell

Here's a method I use to make context caching work, even with function calling. It's hacky, and the SDK could use some improvements to better facilitate this.

#3212 (comment)
