build(deps): bump com.google.protobuf:protobuf-java from 4.28.2 to 4.35.0 by dependabot[bot] · Pull Request #13 · spectrayan/spector

dependabot · 2026-05-20T21:33:37Z

Bumps com.google.protobuf:protobuf-java from 4.28.2 to 4.35.0.

Commits

See full diff in compare view

- QuantizationType enum (NONE, SCALAR_INT8)
- ScalarQuantizer with min/max calibration and INT8 encoding
- QuantizedCosineSimilarity and QuantizedDotProduct SIMD kernels
- SimilarityFunction updated with quantized variants
- ScalarQuantizerTest for encode/decode and batch operations
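The min/max calibration and INT8 encoding named in this commit can be illustrated with a minimal sketch. It is not the project's ScalarQuantizer: the class name `ScalarQuantizerSketch`, its method names, and the use of a single global min/max range (rather than, say, per-dimension ranges) are assumptions made for illustration only. Storing one byte per dimension instead of a 4-byte float is where a 4x memory reduction would come from.

```java
/** Hypothetical sketch of min/max scalar quantization; not the project's ScalarQuantizer. */
final class ScalarQuantizerSketch {
    private final float min;
    private final float scale; // width of one quantization step: (max - min) / 255

    private ScalarQuantizerSketch(float min, float max) {
        this.min = min;
        // Guard against a constant data set, where max == min.
        this.scale = (max > min) ? (max - min) / 255f : 1f;
    }

    /** Calibrates one global [min, max] range from sample vectors (an assumption). */
    static ScalarQuantizerSketch calibrate(float[][] samples) {
        float min = Float.POSITIVE_INFINITY;
        float max = Float.NEGATIVE_INFINITY;
        for (float[] v : samples) {
            for (float x : v) {
                min = Math.min(min, x);
                max = Math.max(max, x);
            }
        }
        if (min > max) {
            throw new IllegalArgumentException("no sample values to calibrate on");
        }
        return new ScalarQuantizerSketch(min, max);
    }

    /** Maps each float to one of 256 levels and stores it as a signed byte. */
    byte[] encode(float[] v) {
        byte[] out = new byte[v.length];
        for (int i = 0; i < v.length; i++) {
            int level = Math.round((v[i] - min) / scale);
            level = Math.max(0, Math.min(255, level)); // clamp out-of-range inputs
            out[i] = (byte) (level - 128);             // shift 0..255 to -128..127
        }
        return out;
    }

    /** Reconstructs approximate floats; the error is at most half a step for in-range values. */
    float[] decode(byte[] codes) {
        float[] out = new float[codes.length];
        for (int i = 0; i < codes.length; i++) {
            out[i] = (codes[i] + 128) * scale + min;
        }
        return out;
    }
}
```

Calibrating on vectors whose values span -1.0 to 1.0 gives a step of 2/255, so a decoded value differs from the original by roughly 0.004 at most.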

- DiskHnswWriter for binary HNSW graph serialization
- DiskHnswIndex for mmap-based read-only index loading
- QuantizedHnswIndex with INT8 scalar quantization (4x memory reduction)
- BM25Index and HnswIndex performance improvements
- DiskHnswIndexTest and QuantizedHnswIndexTest

- ProductQuantizer: K-Means++ codebook training, PQ encode/decode, ADC distance computation, batch encoding
- IvfPqIndex: full IVF-PQ implementing VectorIndex SPI with cluster assignment, residual-based PQ encoding, and multi-probe search
- PostingList: per-cluster growable storage for PQ codes
- 14 tests: PQ training/encode/decode/ADC + IVF-PQ search/recall/sorting
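The ADC (asymmetric distance computation) step mentioned for ProductQuantizer can be sketched as follows. This is a generic illustration of the technique, assuming squared Euclidean distance and already-trained codebooks. The class `AdcSketch`, its method names, and the `codebooks[subspace][centroid][dimension]` layout are hypothetical and are not taken from the project.

```java
/** Hypothetical sketch of PQ asymmetric distance computation; not the project's ProductQuantizer. */
final class AdcSketch {

    /**
     * Precomputes, for each subspace, the squared distance from the query's
     * sub-vector to every centroid in that subspace's codebook.
     * Assumes query.length is divisible by the number of subspaces.
     */
    static float[][] buildTable(float[] query, float[][][] codebooks) {
        int subspaces = codebooks.length;
        int subDim = query.length / subspaces;
        float[][] table = new float[subspaces][];
        for (int s = 0; s < subspaces; s++) {
            int centroids = codebooks[s].length;
            table[s] = new float[centroids];
            for (int c = 0; c < centroids; c++) {
                float sum = 0f;
                for (int j = 0; j < subDim; j++) {
                    float d = query[s * subDim + j] - codebooks[s][c][j];
                    sum += d * d;
                }
                table[s][c] = sum;
            }
        }
        return table;
    }

    /** Approximate squared distance to an encoded vector: one table lookup per subspace. */
    static float distance(float[][] table, byte[] codes) {
        float sum = 0f;
        for (int s = 0; s < table.length; s++) {
            sum += table[s][codes[s] & 0xFF]; // treat the code byte as unsigned
        }
        return sum;
    }
}
```

The table is built once per query, so scoring each stored code costs only as many lookups as there are subspaces, regardless of the original vector dimension.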

- Reranker SPI interface for pluggable re-ranking strategies
- LlmReranker: listwise relevance scoring using Ollama generate API with prompt-based 0-10 scoring and graceful fallback
- HybridSearchOrchestrator: integrated optional re-ranking post-processing
- LlmRerankerTest: fallback behavior, empty input, topK limiting

- spector-gpu Maven module with Panama FFM CUDA bindings
- GpuCapability: runtime CUDA detection (device count, name, memory)
- GpuBatchSimilarity: SIMD-optimized batch dot product and cosine similarity using FMA Vector API operations
- CudaKernelLauncher: PTX module loader, function resolver, kernel launcher with grid/block configuration
- batch_similarity.cu: CUDA kernels for batch_cosine, batch_dot, batch_l2 with block-level shared memory reduction
- 14 tests: GPU detection, batch similarity correctness, CUDA launcher

…hitecture

- spector-cluster Maven module with gRPC/protobuf integration
- spector_search.proto: 6 RPC definitions (vector, keyword, hybrid search, ingest, health check, stats)
- ClusterCoordinator: fan-out/merge query execution via virtual threads with consistent hash shard routing
- ShardNode: gRPC server wrapping SpectorEngine
- SpectorSearchServiceImpl: full gRPC service delegating to local engine
- RemoteShardClient: type-safe gRPC client for all 5 RPC methods
- ClusterConfig: multi-node endpoint configuration with replication
- ClusterConfigTest: routing, hash consistency, topology tests
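The commit message mentions consistent hash shard routing but gives no details of the scheme. One common way to get that property is a hash ring with virtual nodes, sketched below. The class `HashRingSketch`, the CRC32 hash, and the count of 64 virtual nodes per shard are all assumptions for illustration, not the ClusterCoordinator's actual design.

```java
import java.nio.charset.StandardCharsets;
import java.util.Map;
import java.util.TreeMap;
import java.util.zip.CRC32;

/** Hypothetical consistent-hash ring; not the project's ClusterCoordinator or ClusterConfig. */
final class HashRingSketch {
    private static final int VIRTUAL_NODES = 64; // assumed value, chosen for illustration
    private final TreeMap<Long, String> ring = new TreeMap<>();

    /** Places several virtual nodes for the shard around the ring. */
    void addShard(String shard) {
        for (int i = 0; i < VIRTUAL_NODES; i++) {
            ring.put(hash(shard + "#" + i), shard);
        }
    }

    /** Removes every ring position owned by the shard. */
    void removeShard(String shard) {
        ring.values().removeIf(shard::equals);
    }

    /** Routes a key to the first shard at or after the key's hash, wrapping around. */
    String route(String key) {
        if (ring.isEmpty()) {
            throw new IllegalStateException("no shards registered");
        }
        Map.Entry<Long, String> owner = ring.ceilingEntry(hash(key));
        return (owner != null ? owner : ring.firstEntry()).getValue();
    }

    private static long hash(String s) {
        CRC32 crc = new CRC32();
        crc.update(s.getBytes(StandardCharsets.UTF_8));
        return crc.getValue();
    }
}
```

With this layout, removing a shard only reassigns the keys that shard owned. Keys routed to the remaining shards keep their placement.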

…rEngine

- IndexType enum (HNSW, IVF_PQ) for configurable index strategy
- SpectorConfig: added indexType, ivfNlist, ivfNprobe, pqSubspaces with builder methods (withIvfPq) and auto-defaults
- SpectorEngine: IVF-PQ auto-training pipeline that buffers ingested vectors and trains PQ codebooks after nlist*40 samples
- Backward-compatible 7-arg constructor preserved
- 4 new tests: auto-training, keyword search during buffering, config builder, auto-defaults
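The buffer-then-train behavior described for SpectorEngine can be sketched as below. Only the nlist*40 threshold comes from the commit message. The class `AutoTrainBufferSketch`, the `Consumer` callback that stands in for codebook training, and the method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical sketch of buffer-then-train ingestion; not SpectorEngine's code. */
final class AutoTrainBufferSketch {
    private final int trainingThreshold;
    private final Consumer<List<float[]>> trainer; // stand-in for PQ codebook training
    private final List<float[]> buffer = new ArrayList<>();
    private boolean trained;

    AutoTrainBufferSketch(int nlist, Consumer<List<float[]>> trainer) {
        // The commit message gives the threshold as nlist * 40 samples.
        this.trainingThreshold = Math.multiplyExact(nlist, 40);
        this.trainer = trainer;
    }

    /** Buffers the vector until the threshold is hit; returns true if this call triggered training. */
    boolean ingest(float[] vector) {
        if (trained) {
            return false; // a real index would encode and insert directly at this point
        }
        buffer.add(vector);
        if (buffer.size() < trainingThreshold) {
            return false;
        }
        trainer.accept(List.copyOf(buffer)); // train once on everything buffered so far
        buffer.clear();
        trained = true;
        return true;
    }

    boolean isTrained() {
        return trained;
    }

    int buffered() {
        return buffer.size();
    }
}
```

With nlist = 2 the threshold is 80, so the 80th ingested vector triggers training and later vectors bypass the buffer.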

- HeavyPerformanceBenchmark: keyword/vector/hybrid at 50K-100K scale
- IvfPqBenchmark: IVF-PQ search, PQ encode/decode, ADC distance, batch cosine similarity at 10K-50K scale
- ConcurrencyBenchmark: multi-threaded search throughput
- IngestionBenchmark: document ingestion throughput
- PerformanceTestRunner: standalone runner with formatted results

- pom.xml: added spector-gpu, spector-cluster modules to reactor and dependencyManagement
- README.md: expanded architecture (13 modules), 5 new features, updated comparison table (quantization, IVF-PQ, GPU, LLM, distributed), updated test suite (316+ tests), added roadmap checklist

Extract ~300 lines of duplicated graph traversal code (greedyClosest, searchLayer, selectNeighbors, addConnection, getNeighbors, setNeighbors) into AbstractHnswIndex base class with three template method hooks:

- computeDistance(float[], int): distance from query to stored node
- getNodeVector(int): float32 vector retrieval for pruning
- storeVector(int, float[]): vector storage on insertion

HnswIndex: 413 -> 76 lines (-81%)
QuantizedHnswIndex: 476 -> 226 lines (-53%)

All 316+ tests passing, zero regressions.
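The template method structure described here can be sketched in miniature. The three hook signatures follow the commit message. Everything else is an assumption: the class names are hypothetical, and a linear scan stands in for the real graph traversal (greedyClosest, searchLayer, and so on), which is far more involved.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical miniature of a template-method index base class; not AbstractHnswIndex. */
abstract class AbstractIndexSketch {
    private int size;

    // Hooks named after the commit message; subclasses decide how vectors are stored.
    protected abstract float computeDistance(float[] query, int nodeId);
    protected abstract float[] getNodeVector(int nodeId); // used for pruning in the real class; unused here
    protected abstract void storeVector(int nodeId, float[] vector);

    /** Shared insertion logic: assigns the next node id and delegates storage. */
    public final int add(float[] vector) {
        int id = size++;
        storeVector(id, vector);
        return id;
    }

    /** Shared search logic; a linear scan stands in for the graph traversal. Returns -1 if empty. */
    public final int nearest(float[] query) {
        int best = -1;
        float bestDistance = Float.POSITIVE_INFINITY;
        for (int id = 0; id < size; id++) {
            float d = computeDistance(query, id);
            if (d < bestDistance) {
                bestDistance = d;
                best = id;
            }
        }
        return best;
    }
}

/** Float32 variant: stores raw vectors and uses squared Euclidean distance. */
final class Float32IndexSketch extends AbstractIndexSketch {
    private final List<float[]> vectors = new ArrayList<>();

    @Override
    protected float computeDistance(float[] query, int nodeId) {
        float[] v = vectors.get(nodeId);
        float sum = 0f;
        for (int i = 0; i < query.length; i++) {
            float d = query[i] - v[i];
            sum += d * d;
        }
        return sum;
    }

    @Override
    protected float[] getNodeVector(int nodeId) {
        return vectors.get(nodeId);
    }

    @Override
    protected void storeVector(int nodeId, float[] vector) {
        vectors.add(vector.clone()); // ids are sequential, so nodeId equals the list position
    }
}
```

A quantized variant would override the same three hooks to store bytes and compute distances on the compressed form, which is how the shared traversal code can stay in one place.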

- VectorIndex: add default isReadOnly() method (returns false)
- DiskHnswIndex: override isReadOnly() to return true
- KeywordIndex: add default remove(String id) method
- BM25Index: expose existing removeDoc() logic via KeywordIndex.remove()

Completes the deletion API path across the engine.
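The pattern in this commit, default interface methods that existing implementations inherit and selected implementations override, can be sketched as follows. The interface and class names are hypothetical stand-ins. The return type of `remove` and its default behavior (throwing) are assumptions, since the commit message does not state them.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a vector index SPI with a default capability flag. */
interface VectorIndexSketch {
    default boolean isReadOnly() {
        return false; // existing implementations stay writable without any code change
    }
}

final class WritableIndexSketch implements VectorIndexSketch { }

final class DiskIndexSketch implements VectorIndexSketch {
    @Override
    public boolean isReadOnly() {
        return true; // a memory-mapped, read-only index opts out of writes
    }
}

/** Hypothetical stand-in for a keyword index SPI with an optional remove operation. */
interface KeywordIndexSketch {
    /** Assumed contract: returns true if the document existed and was removed. */
    default boolean remove(String id) {
        throw new UnsupportedOperationException("remove is not supported by this index");
    }
}

final class Bm25IndexSketch implements KeywordIndexSketch {
    private final Map<String, String> docs = new HashMap<>();

    void add(String id, String text) {
        docs.put(id, text);
    }

    // Pre-existing internal logic, now exposed through the interface method.
    private boolean removeDoc(String id) {
        return docs.remove(id) != null;
    }

    @Override
    public boolean remove(String id) {
        return removeDoc(id);
    }
}
```

Callers can check `isReadOnly()` before attempting a write instead of testing for a concrete class.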

- Add gpuEnabled, rerankerEnabled, rerankerOllamaUrl, rerankerModel, rerankerMaxCandidates fields to SpectorConfig record
- Add with*() builder-style methods for GPU and reranker config
- Add IVF-PQ computed defaults (effectiveNlist, effectiveNprobe, etc.)
- Add spector-gpu dependency to engine POM

EngineComponentFactory now always creates writable components, then loads existing data from disk via addPrebuilt(). Eliminates the mutually exclusive read-only vs fresh-empty startup paths.

Flow: create writable store/index → load ID mappings → copy DiskHnswIndex graph into writable HnswIndex → load DocumentStore.

All data survives restarts while remaining fully writable for new ingestion.

With the unified startup path, forceWritable is no longer needed:

- SpectorConfig: remove forceWritable field + withForceWritable()
- SpectorConfigFactory: remove from EngineDefaults record
- SpectorRuntime: remove 3-arg from() overload
- IngestCommand: use standard 2-arg from()
- Tests: update path expectations (.spector/index, .spector/memory)

- Change engine data-directory default from .spector-data to .spector/index
- Change memory persistence-path default from .spector-memory to .spector/memory
- Align with code defaults in SpectorConfigFactory
- Update .gitignore to use single .spector/ entry
- Update all READMEs with corrected paths

- 40-byte binary record layout (RFC-style wire format)
- Chunked file architecture with rolling and snapshot-driven truncation
- Corruption Recovery Strategy: torn writes vs mid-log bit rot taxonomy
- Compaction & Garbage Collection policy with snapshot-aware truncation
- CRDT merge semantics and cloud replication architecture
- Compression (DEFLATE) and configuration reference

- Update .spector-memory → .spector/memory in all docs
- Update .spector-data → .spector/index in MCP server docs
- Fix modules nav: add spector-node/config/metrics/dist, remove stale server/cluster
- Delete stale spector-server.md and spector-cluster.md doc pages
- Register extra_css and mermaid-init.js in mkdocs.yml (was missing)

…dex improvements

- ShardedDiskHnswIndex and ShardedDiskHnswWriter for multi-shard persistence
- AbstractHnswIndex shared base for HNSW variants
- DiskHnswIndex and DiskHnswWriter updates
- QuantizedHnswIndex enhancements
- SpectorIndex and SpectorShard refinements

…lectDaemon improvements

- MemoryWal: ReentrantLock for virtual thread safety, corruption recovery
- WalCorruptionException for explicit corruption signaling
- CognitiveRecordLayout and SynapticHeaderConstants alignment
- CloudSync CRDT merge semantics
- DefaultSpectorMemory composition
- ReflectDaemon hippocampus refinements

- README updated with new module structure and directory paths
- CHANGELOG reflects server→node, cluster removal, WAL improvements
- Architecture, API, getting-started, and configuration docs updated
- Scripts updated for .spector/ directory and MCP config
- Deploy configuration added
- CONTRIBUTING guide updated

- SpectorProperties test defaults
- MappedVectorStore test updates
- SpectorToolRegistryTest updates
- MemoryWal persistence and integration tests
- Commons concurrent utility tests
- Spring auto-config tests
- Update roadmap skill scripts

Rules:
- Project-wide coding standards, architecture boundaries, git conventions

Skills:
- coding-standards: Java 25, Panama FFM, SIMD, naming, class structure
- code-review: 8-step structured review with checklist
- doc-sync: code-to-documentation sync mapping
- incremental-commits: dependency-ordered commit grouping

Workflows:
- feature-development: requirements to commit end-to-end
- pr-review: diff analysis to verdict
- module-lifecycle: add/remove/rename Maven modules
- documentation-update: write, build, verify docs
- release-prep: test, changelog, version bump, tag
- perf-investigation: profile, optimize, benchmark

Bumps [com.google.protobuf:protobuf-java](https://github.com/protocolbuffers/protobuf) from 4.28.2 to 4.35.0.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases)
- [Commits](https://github.com/protocolbuffers/protobuf/commits)

---
updated-dependencies:
- dependency-name: com.google.protobuf:protobuf-java
  dependency-version: 4.35.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

dependabot · 2026-06-01T13:57:41Z

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting `@dependabot ignore this major version` or `@dependabot ignore this minor version`. You can also ignore all major, minor, or patch releases for a dependency by adding an `ignore` condition with the desired `update_types` to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.

sbharatjoshi added 30 commits May 13, 2026 12:25

Initial commit

c71bc29

chore: bootstrap Maven multi-module project with JDK 25 + Vector API flags

0c3ae50

feat(core): add SIMD-accelerated similarity kernels (DotProduct, Cosine, Euclidean, VectorOps)

392fb53

feat(storage): add Panama MemorySegment vector stores (InMemory + Mmap) with zero-copy I/O

5cd2173

feat(index): add HNSW vector index and BM25 keyword index with StandardAnalyzer

f0c5ac2

feat(query): add hybrid search orchestrator with RRF fusion on virtual threads

cc11948

feat(engine): add SpectorEngine facade with config, lifecycle, and ingestion pipeline

87ed856

feat(server): add Javalin REST API with virtual threads and JMH benchmark scaffold

0ab8607

docs: add open-source repo files (LICENSE, NOTICE, CoC, CONTRIBUTING, SECURITY, README, CI, templates)

5a2a5a1

feat(index): add StemmingAnalyzer with simplified Porter stemmer and double-consonant dedup

3ec5999

feat(index): add ContentExtractor (XML/JSON/Java object) and extended HNSW recall tests

fe9507c

feat(query): add QueryParser with directive syntax (mode:, k:) and auto-detect

55d09fe

feat(server): add global error handler, integration tests, and javalin-testtools dependency

be8c646

perf(bench): add JMH benchmarks for SIMD kernels, HNSW search, and BM25 scoring

145d696

refactor: extract spector-commons module with ContentExtractor, TextChunker, TextUtils and add chunked ingestion for large documents

c862b3d

feat(commons): add streaming chunker, token-level chunker, and WordTokenizer for large document support

462166e

feat(embed): add EmbeddingProvider SPI and Ollama implementation with auto-embed engine integration

56aa477

feat(storage): add disk persistence and quantized vector store

7aedb4a

- PersistenceMode enum (IN_MEMORY, DISK, MMAP)
- IndexFileFormat for binary HNSW serialization
- QuantizedVectorStore with INT8 compression
- InMemoryVectorStore concurrent access improvements

sbharatjoshi and others added 24 commits May 28, 2026 23:53

fix(engine): update persistence test to check index_shards/ directory

9c81a13

Engine close() now uses ShardedDiskHnswWriter which writes to index_shards/ directory, not a single index.spct file.

refactor: remove spector-server module (merged into spector-node)

2b6a8b5

The standalone Armeria server has been consolidated into spector-node which provides REST + gRPC + SSE + cluster coordination.

refactor: remove spector-cluster module (deferred to V3 roadmap)

fe03537

Cluster coordination, shard management, and replication are planned for V3. Removing premature scaffolding to reduce build surface and test noise.

build: remove server/cluster from reactor, update POM dependencies

380a473

- Remove spector-server and spector-cluster from root POM modules
- Update dependency versions and module cross-references

refactor(config): add PersistenceFiles, enhance SpectorConfig and SpectorProperties

c33692e

- PersistenceFiles record for centralized persistence file naming
- SpectorConfig enhancements for sharded index persistence
- SpectorProperties improvements for typed config access

refactor(engine): EngineComponentFactory with PersistenceFiles, sharded persistence on close

8525572

- EngineComponentFactory now accepts PersistenceFiles for disk I/O
- DefaultSpectorEngine.close() persists via ShardedDiskHnswWriter
- EngineIngestion updated for new component wiring

feat(storage): ShardedMappedVectorStore, VectorStoreFactory improvements

0ad8300

- ShardedMappedVectorStore for multi-shard vector persistence
- ShardedIndexFormat for shard file layout
- MappedVectorStore and VectorStoreFactory refinements

refactor(runtime): SpectorRuntime composition root updates

c9262b3

- Updated wiring for engine + memory + ingestion pipeline
- Aligned with unified .spector/ directory structure

feat(commons): MemoryPinning and NativeOsMemory concurrent utilities

ee35fae

- MemoryPinning for virtual thread memory safety
- NativeOsMemory for OS-level memory operations

feat(bench): CognitiveMemoryBenchmark, updated benchmark suites

f82bcdf

- CognitiveMemoryBenchmark for memory subsystem performance
- Updated concurrency, ingestion, and index operation benchmarks

feat(spring): Spring AI auto-configuration and resource support

0b2f730

- Spring AI integration auto-config
- Resource configuration files

feat(node): spector-node unified server module

a9d7e2e

- Armeria-based HTTP REST + gRPC + SSE event streaming
- Cluster coordination and node management
- Replaces spector-server with full-featured unified node

feat(metrics): spector-metrics Prometheus + JVM instrumentation module

833abd6

- Prometheus metric collectors
- JVM memory and GC instrumentation
- Health check endpoints

dependabot Bot changed the title ~~chore(deps): bump com.google.protobuf:protobuf-java from 4.28.2 to 4.35.0~~ build(deps): bump com.google.protobuf:protobuf-java from 4.28.2 to 4.35.0 May 30, 2026

dependabot Bot force-pushed the dependabot/maven/com.google.protobuf-protobuf-java-4.35.0 branch from 779abcc to ca6f634 Compare May 30, 2026 00:16

sbharatjoshi closed this Jun 1, 2026

sbharatjoshi force-pushed the main branch from 834f839 to 7ba87d8 Compare June 1, 2026 13:57

dependabot Bot deleted the dependabot/maven/com.google.protobuf-protobuf-java-4.35.0 branch June 1, 2026 13:57

dependabot[bot] wants to merge 168 commits into main from dependabot/maven/com.google.protobuf-protobuf-java-4.35.0
