repositories Search Results · repo:aisingapore/web_scale_pdf_processing_pipeline language:"Jupyter Notebook"
Filter by
0 files
(90 ms)0 files
inaisingapore/web_scale_pdf_processing_pipeline (press backspace or delete to remove)Data pipeline used to process web scale data for pretraining LLMs. This pipeline was used for extracting text at high accuracy from open …
- Jupyter Notebook
- 0
- Updated 8 days ago
Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.