You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a multi-tenant SaaS application where I need to keep track of the Unstructured quota each tenant consumes.
Is there a way to track the amount of billed pages on a per request basis, the same way LLMs return the consumed tokens in each API request?
If not, replicating the same logic Unstructured uses to count pages for each different file type may be an impediment for us to continue using Unstructured, as we won't be able to track the tenant consumption in an appropriate manner.
The problem
Page count
Unstructured supports a wide range of different file types. It is not easy to replicate the page count for each file type: pdf, pptx, docx etc. The result is that we don't know the actual page count we are being billed for on a per request basis.
Strategy
Unstructured supports the auto strategy as an input in the API. However, the actual used strategy depends on the contents of the file. It is not trivial to replicate the same logic Unstructured uses internally to choose the actual strategy that will be used to partition the file. The result is that we don't know the actual strategy we are being billed for on a per request basis.
Arguments
Cost tracking is a real problem. So real that there are products in the market that focus on providing insights about the usage of different services (e.g. Helicone, Keywords AI, Sentry etc).
Example
The Unstructured API could add headers to provide more information about the request, keeping it backwards compatible without modifying the response body used today (a JSON array containing the partition elements).
constclient=newUnstructuredClient({serverURL: process.env.UNSTRUCTURED_BASE_URL,security: {apiKeyAuth: process.env.UNSTRUCTURED_API_KEY}})constresponse=awaitclient.general.partition({partitionParameters: {files: {content: fileBuffer,fileName: fileName},strategy: 'auto'}})// Is it possible to update the Unstructured API to return this information? 🤔 constactualUsedStrategy=response.rawResponse.headers.get('x-unstructured-strategy')constactualPagesConsumed=parseInt(response.rawResponse.headers.get('x-unstructured-pages'))
Benchmarks
Many providers that support the OpenAI SDK provide transparency about the API consumption and pricing.
constopenai=newOpenAI()constresponse=awaitclient.chat.completions.create({model: 'gpt-4o',messages: [{role: 'user',content: 'Hi'}]})// We can calculate the actual cost of the request 🥳 constactualModelUsed=response.modelconstpromptTokensUsage=result.usage?.prompt_tokens??0constcompletionTokensUsage=result.usage?.completion_tokens??0
OpenAI has different prices for different gpt-4o snapshots.
When we send the model as gpt-4o, it is an alias for a specific snapshot that we are billed for.
This way, we can keep track of our usage in a very straightforward manner.
Benefits
Transparency is increased, raising Unstructured to be on par of other AI providers regarding billing
Every Unstructured customer could benefit from this improvement, increasing its adoption as an important tool in the AI age
The API maintains its backwards compatibility
The text was updated successfully, but these errors were encountered:
Context
I have a multi-tenant SaaS application where I need to keep track of the Unstructured quota each tenant consumes.
Is there a way to track the amount of billed pages on a per request basis, the same way LLMs return the consumed tokens in each API request?
If not, replicating the same logic Unstructured uses to count pages for each different file type may be an impediment for us to continue using Unstructured, as we won't be able to track the tenant consumption in an appropriate manner.
The problem
Page count
Unstructured supports a wide range of different file types. It is not easy to replicate the page count for each file type: pdf, pptx, docx etc. The result is that we don't know the actual page count we are being billed for on a per request basis.
Strategy
Unstructured supports the
auto
strategy as an input in the API. However, the actual used strategy depends on the contents of the file. It is not trivial to replicate the same logic Unstructured uses internally to choose the actual strategy that will be used to partition the file. The result is that we don't know the actual strategy we are being billed for on a per request basis.Arguments
Cost tracking is a real problem. So real that there are products in the market that focus on providing insights about the usage of different services (e.g. Helicone, Keywords AI, Sentry etc).
Example
The Unstructured API could add headers to provide more information about the request, keeping it backwards compatible without modifying the response body used today (a JSON array containing the partition elements).
Benchmarks
Many providers that support the OpenAI SDK provide transparency about the API consumption and pricing.
OpenAI has different prices for different
gpt-4o
snapshots.When we send the model as
gpt-4o
, it is an alias for a specific snapshot that we are billed for.This way, we can keep track of our usage in a very straightforward manner.
Benefits
The text was updated successfully, but these errors were encountered: