-
Notifications
You must be signed in to change notification settings - Fork 219
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for files API endpoints #184
Conversation
fe780bb
to
e5f221f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Exciting!!!
Some late night thoughts, I think prepareInputs
needs a second pass. And I do think we should consider the surface area of the files api but otherwise I am so excited to get this in.
@mattt I've incorporated the feedback, updated the code to use a Types have been updated and the tests now verify the |
@mattt I've removed the |
Co-authored-by: Aron Carroll <[email protected]>
This allows the user to determine if file uploads, falling back to base64 should be used or just sticking to one approach. The tests have been updated to validate the file upload payload and to ensure the url is correctly passed to the prediction create method.
Rebased and fixed merge conflicts. Anything blocking this from shipping? |
@mattt bump |
In #198, we added support for automatically encoding binary prediction inputs as base64-encoded data URIs.
This PR adds support for the new (write-only) files API. By default, each binary input is uploaded and replaced with a URL to the created file. Before the inputs are sent to the model to be run, Replicate's API rewrites the URL to make the file downloadable.
From the perspective of the caller, the out-of-the-box experience is the same: You pass a file handle or blog as an input, and that's it — everything just works. And with the files API, things work much better for larger files, whose data URI encoding can cause write timeouts and other problems.
What I'm less clear about is how to customize this behavior. My current solution is two-fold:
prepareInputs
property to the clienttransformFileInputsToBase64EncodedDataURIs
andtransformFileInputsToReplicateFileURLs
helper functionsBy default,
prepareInputs
tries to upload files and falls back to data URI encoding (for example if the files API is unavailable for some reason).Users can override this to customize the behavior however they like. For example, they could only base64-encode inputs, conditionalize based on file size / content, or throw an error.
I'm pretty sure we need it to be this flexible. But is it too clunky? Would love to hear your thoughts, @aron @zeke.