[Feature Request] Multi-threaded writes in pull-based ingestion #17875
Labels
enhancement
Enhancement or improvement to existing feature or request
Indexing
Indexing, Bulk Indexing and anything related to indexing
untriaged
Uh oh!
There was an error while loading. Please reload this page.
Is your feature request related to a problem? Please describe
The current pull-based ingestion design decouples poller and writer for better performance but uses a single writer thread. This can further be improved by supporting multi-threaded writes.
Describe the solution you'd like
Multi-threaded writes will be supported as follows:
This solution guarantees that the updates for the same document are sequentially processed, in the same order visible to the consumer. Versioning will be supported when the underlying streaming source does not provide ordering guarantees.
Note that a message without an ID field will result in an auto-generated ID at runtime, and can be mapped to different partitions on retries. This however should not affect the correctness of the data as subsequent updates to the same document must provide the ID field.
Shard recovery will be handled in a multi-threaded write scenario as follows:
Related component
Indexing
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: