examples

Vespa Builder

Apr 23, 2025

456daa2 · Apr 23, 2025

Name	Name	Last commit message	Last commit date
parent directory ..
agentic-streamlit-chatbot	agentic-streamlit-chatbot	Update streamlit_vespa_app.py	Apr 3, 2025
aws/lambda	aws/lambda	correct logo serving	Sep 12, 2024
document-processing	document-processing	Bind only to 127.0.0.1 in examples (#1594 )	Dec 10, 2024
embedder-auto-training-evaluation	embedder-auto-training-evaluation	data-proofer-ignore trec.nist.gov link	Sep 16, 2024
embedding-service	embedding-service	correct logo serving	Sep 12, 2024
fasthtml-demo	fasthtml-demo	Merge pull request #1682 from vespa-engine/renovate/pypi-starlette-vu…	Mar 25, 2025
generic-request-processing	generic-request-processing	Update Vespa version to 8.511.14.	Apr 23, 2025
google-cloud/cloud-functions	google-cloud/cloud-functions	Bump golang.org/x/net in /examples/google-cloud/cloud-functions/go	Mar 13, 2025
http-api-using-request-handlers-and-processors	http-api-using-request-handlers-and-processors	Update Vespa version to 8.511.14.	Apr 23, 2025
in-context-learning	in-context-learning	Bind only to 127.0.0.1 in examples (#1594 )	Dec 10, 2024
joins	joins	correct logo serving	Sep 12, 2024
lucene-linguistics	lucene-linguistics	Update Vespa version to 8.511.14.	Apr 23, 2025
model-deployment	model-deployment	correct logo serving	Sep 12, 2024
model-exporting	model-exporting	Duplicate model files for now	Apr 9, 2025
multiple-bundles-lib	multiple-bundles-lib	Update Vespa version to 8.511.14.	Apr 23, 2025
multiple-bundles	multiple-bundles	Update Vespa version to 8.511.14.	Apr 23, 2025
operations	operations	Update Vespa version to 8.511.14.	Apr 23, 2025
part-purchases-demo	part-purchases-demo	link update	Mar 13, 2025
predicate-fields	predicate-fields	Update Vespa version to 8.511.14.	Apr 23, 2025
reranker	reranker	update logos	Sep 12, 2024
vespa-chinese-linguistics	vespa-chinese-linguistics	Update Vespa version to 8.511.14.	Apr 23, 2025
README.md	README.md	Update README.md	Mar 26, 2025

README.md

Vespa Code And Operational Examples

Vespa grouping and facets for organizing results

logo Grouping Results demonstrates Vespa grouping and faceting for query time result analytics. Read more in Vespa grouping.

Vespa Predicate Fields

logo predicate-fields uses Vespa's predicate field type to implement indexing of document side boolean expressions. Boolean document side constraints allows the document to specify which type of queries it can be retrieved for. Predicate fields allow expressing logic like "this document should only be visible in search for readers in age range 20 to 30" or "this product should only be visible in search during campaign hours".

Vespa custom linguistics Integration

The logo vespa-chinese-linguistics app demonstrates integrating custom linguistic processing, in this case a Chinese tokenizer Jieba.

Vespa custom HTTP api using request handlers and processors

logo http-api-using-request-handlers-and-processors demonstrates how to build custom HTTP apis, building REST interfaces with custom handlers and renderers. See also Custom HTTP Api tutorial.

Vespa container plugins with multiple OSGI bundles

logo multiple-bundles is a technical sample application demonstrating how to use multiple OSGI bundles for custom plugins (searchers, handlers, renderers).

Distributed joins

logo Joins shows possibilities for doing distributed query time joins. This is for use cases where parent-child is not sufficient.

Document processing

logo Document-processingbuilds on album-recommendation to show some of the possibilities for doing custom document processing in Java.

Generic request processing

logo generic-request-processing Generic request-response processing sample application.

Lucene Linguistics

logo lucene-linguistics contains two sample application packages:

A bare minimal app.
Shows advanced configuration of the Lucene based Linguistics implementation.

Lambda functions in AWS and Google Cloud

logo aws/lambda and logo google-cloud/cloud-functions have examples of (lambda) functions for accessing data and logs with the cloud providers.

Automatic data generation for training embedders using LLMs

logo embedder-auto-training-evaluation does automatic data generation using the ChatGPT API. This in order to train an embedder to perform better for information retrieval on specific datasets without labor-intensive and expensive manual training data annotation.

Machine learned embedder models enable efficient similarity computations, but training these models requires large amounts of (often manually) annotated data. The aim of this app is to investigate whether Large Language Models (LLMs), such as GPT-3.5-turbo, can be employed to generate synthetic data for training embedder, without extensive manual intervention.

The repository contains scripts and notebooks to:

Prepare datasets
Generate training data for datasets using an LLM
Train an embedder
Evaluate performance

Embedding service (WORK IN PROGRESS)

logo embedding-service demonstrates how a Java handler component can be used to process HTTP requests. In this application, a handler is used to implement an embedding service, which takes a string as an input and returns a vector embedding of that string.

FastHTML Vespa frontend

logo FastHTML Vespa frontend is a simple frontend for the Vespa search engine. It is built using FastHTML and written in pure Python. Features:

Simple search interface, with links to search results.
Accordion with full JSON-response from Vespa.
SQLite DB for storing queries.
Admin authentication for viewing and downloading queries.
Deployment options - Docker + Huggingface spaces.

ONNX Model export and deployment example

Use logo model-deployment to generate a model in ONNX format in the models directory, by running the ONNXModelExport notebook.

Model exporting

logo Model exporting demonstrates how to export a Huggingface sentence-transformer model to ONNX format.

Reranker sample application

logo reranker is a stateless application which re-ranks results obtained from another Vespa application. While this does not result in good performance and is not recommended for production, it is useful when you want to quickly do ranking experiments without rewriting application data.

Categorize using an LLM

logo In-Context Learning This is a set of scripts/installs to back up the presentation using In-Context Learning at:

Agentic Chatbot using Vespa

logo agentic-streamlit-chatbot This simple Streamlit application demonstrates how to use LangGraph agentic framework to develop an E-commerce chatbot using Vespa as a retrieval tool.

logo agentic-streamlit-chatbot This Streamlit application shows a more advanced example on how to use LangGraph agentic framework to develop an E-commerce chatbot enabling a conversational search with human in the loop feedback with yql query generation using Vespa query builder.

For any questions, please register at the Vespa Slack and discuss in the general channel.

Operations

See operations for sample applications for multinode clusters, deployed in various infrastructure like Kubernetes. Also find examples for CI/CD, security and monitoring.

Note: Applications with pom.xml are Java/Maven projects and must be built before being deployed. Refer to the Developer Guide for more information.

Contribute to the Vespa sample applications.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

examples

examples

README.md

Vespa Code And Operational Examples

Vespa grouping and facets for organizing results

Vespa Predicate Fields

Vespa custom linguistics Integration

Vespa custom HTTP api using request handlers and processors

Vespa container plugins with multiple OSGI bundles

Distributed joins

Document processing

Generic request processing

Lucene Linguistics

Lambda functions in AWS and Google Cloud

Automatic data generation for training embedders using LLMs

Embedding service (WORK IN PROGRESS)

FastHTML Vespa frontend

ONNX Model export and deployment example

Model exporting

Reranker sample application

Categorize using an LLM

Agentic Chatbot using Vespa

Operations

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Vespa Code And Operational Examples

Vespa grouping and facets for organizing results

Vespa Predicate Fields

Vespa custom linguistics Integration

Vespa custom HTTP api using request handlers and processors

Vespa container plugins with multiple OSGI bundles

Distributed joins

Document processing

Generic request processing

Lucene Linguistics

Lambda functions in AWS and Google Cloud

Automatic data generation for training embedders using LLMs

Embedding service (WORK IN PROGRESS)

FastHTML Vespa frontend

ONNX Model export and deployment example

Model exporting

Reranker sample application

Categorize using an LLM

Agentic Chatbot using Vespa

Operations