Generate validated, citation-rich research reports from a single Markdown brief. goresearch plans queries, searches the web (optionally via SearxNG), fetches and extracts readable text, and can call an external OpenAI-compatible LLM to synthesize a clean Markdown report with numbered citations, references, and an evidence-check appendix. It runs entirely on standard chat-completions APIs—no proprietary search features. Local LLM containers are not provided.
- End-to-end pipeline: brief parsing → planning → search → fetch/extract → selection/dedup → budgeting → synthesis → validation → verification → rendering.
- Grounded synthesis: strictly uses supplied extracts; numbered inline citations map to a final References section.
- Evidence check: second pass extracts claims, maps supporting sources, and flags weakly supported statements.
- Deterministic and scriptable: low-temperature prompts, structured logs, and explicit flags/envs.
- Pluggable search: defaults to self-hosted SearxNG; adapters can be swapped without changing the rest.
- Polite fetching: user agent, timeouts, redirect caps, content-type checks, and optional HTTP cache with conditional requests.
- Public web only: blocks localhost/private IPs and URL-embedded credentials to avoid private services.
- Token budgeting: proportional truncation prevents dropping sources while fitting model context.
- Reproducibility: embedded manifest and sidecar JSON record URLs and content digests used in synthesis.
- Dry run: plan queries and select URLs without calling the model.
Prerequisites:

- Go 1.23+ (module toolchain `go1.24.6`)
- An external OpenAI-compatible server endpoint you control (model name and API key)
- Optional: a SearxNG instance URL (and API key if required)
Install the CLI directly:

```bash
go install github.com/hyperifyio/goresearch/cmd/goresearch@latest
```

Or build from source:

```bash
git clone https://github.com/hyperifyio/goresearch
cd goresearch
go build -o bin/goresearch ./cmd/goresearch
```
This example mirrors the documented use case and requires a real OpenAI‑compatible API you host or have access to (set via env).
- Write `request.md`:

```markdown
# Enable HSTS correctly on Nginx (with preload) — decision brief

Audience: Engineering leadership
Tone: Practical, risk-aware
Target length: 1200 words
Key questions: correct header, validation steps, preload caveats, rollback.
```
- Export your LLM settings:

```bash
# Example (replace with your endpoint and model)
export LLM_BASE_URL="https://your-llm.example.com/v1"
export LLM_MODEL="your/model-id"
```
- Run goresearch against live web search via SearxNG:

```bash
goresearch \
  -input request.md \
  -output report.md \
  -llm.base "$LLM_BASE_URL" \
  -llm.model "$LLM_MODEL" \
  -searx.url "http://localhost:8888"
```
Open `report.md`. You should see an executive summary, analysis with bracketed citations like `[3]`, a References list with URLs, an Evidence check appendix, and a reproducibility footer.
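As a rough sketch, the rendered report has roughly this shape (headings, dates, and references below are illustrative placeholders, not literal output):

```markdown
# Enable HSTS correctly on Nginx (with preload)
Run date: 2025-01-01

## Executive summary
HSTS tells browsers to connect over HTTPS only. [1]

## Analysis
...sections from the outline, with bracketed citations like [2]...

## Risks and limitations
...

## References
1. HTTP Strict Transport Security - https://example.com/hsts
2. ...

## Evidence check
...
```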
You can configure via flags or environment variables.
Environment variables:
- `LLM_BASE_URL`: base URL for the OpenAI-compatible server you use
- `LLM_MODEL`: model name
- `LLM_API_KEY`: API key for the server
- `SEARX_URL`: SearxNG base URL (e.g., `https://searx.example.com`)
- `SEARX_KEY` (optional): SearxNG API key
- `TOPIC_HASH` (optional): included for traceability in cache scoping
Primary flags (with defaults):

- `-input` (default: `request.md`): path to the input Markdown research request
- `-output` (default: `report.md`): path for the final Markdown report
- `-searx.url`: SearxNG base URL
- `-searx.key`: SearxNG API key (optional)
- `-searx.ua`: custom User-Agent for SearxNG requests (default identifies goresearch)
- `-search.file`: path to a JSON file providing offline search results for a file-based provider
- `-llm.base`: OpenAI-compatible base URL (external)
- `-llm.model`: model name
- `-llm.key`: API key
- `-max.sources` (default: 12): total sources cap
- `-max.perDomain` (default: 3): per-domain cap
- `-max.perSourceChars` (default: 12000): per-source character limit for excerpts
- `-min.snippetChars` (default: 0): minimum snippet characters required to keep a search result
- `-lang` (default: empty): language hint, e.g. `en` or `fi`
- `-dry-run` (default: false): plan and select without calling the LLM
- `-v` (default: false): verbose console output (progress); detailed logs are controlled via `-log.level`
- `-log.level` (default: info): structured log level for the log file: trace|debug|info|warn|error|fatal|panic
- `-log.file` (default: `logs/goresearch.log`): path to write structured JSON logs
- `-debug-verbose` (default: false): allow logging raw chain-of-thought (CoT) for debugging Harmony/tool-call interplay; off by default
- `-cache.dir` (default: `.goresearch-cache`): cache directory
- `-cache.maxAge` (default: 0): purge cache entries older than this duration (e.g. `24h`, `7d`); 0 disables
- `-cache.clear` (default: false): clear the entire cache before the run
- `-cache.topicHash`: optional topic hash to scope the cache (accepted for traceability)
- `-cache.strictPerms` (default: false): restrict cache permissions at rest (0700 dirs, 0600 files)
- `-robots.overrideDomains` (default from env `ROBOTS_OVERRIDE_DOMAINS`): comma-separated domain allowlist to ignore robots.txt; requires `-robots.overrideConfirm`
- `-robots.overrideConfirm` (default: false): second confirmation flag required to activate the robots override allowlist
- `-domains.allow` (comma-separated): only allow these hosts/domains; subdomains included
- `-domains.deny` (comma-separated): block these hosts/domains; takes precedence over allow
- `-tools.enable` (default: false): enable the tool-orchestrated chat mode
- `-tools.dryRun` (default: false): do not execute tools; append structured dry-run envelopes
- `-tools.maxCalls` (default: 32): maximum number of tool calls per run
- `-tools.maxWallClock` (default: 0): wall-clock cap for the tool loop (e.g., `30s`); 0 disables
- `-tools.perToolTimeout` (default: 10s): per-tool execution timeout
- `-tools.mode` (default: `harmony`): chat protocol mode: `harmony` or `legacy`
- `-verify` / `-no-verify` (default: `-verify`): enable or disable the fact-check verification pass and Evidence check appendix
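As an illustration, a run that pins sources to a couple of documentation hosts and skips the verification pass might look like this (the domains and caps are placeholders):

```bash
goresearch \
  -input request.md -output report.md \
  -llm.base "$LLM_BASE_URL" -llm.model "$LLM_MODEL" -llm.key "$LLM_API_KEY" \
  -searx.url "$SEARX_URL" \
  -domains.allow nginx.org,developer.mozilla.org \
  -max.sources 8 -no-verify
```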
For a comprehensive, auto-generated list of all flags and environment variables, see: docs/cli-reference.md.
Important: On Apple M2 virtual machines (including this development environment), Docker is not available due to nested virtualization limits. Use the non-Docker alternatives documented below for SearxNG. This repository no longer provides local LLM containers.
- Docker Desktop with Compose v2 (or Docker Engine + `docker compose` CLI)
- Recommended: ≥4 CPUs and ≥8 GB RAM for your external LLM service
- Network access to pull images on first run
goresearch is a CLI that you run on the host. Docker Compose is only used to provide optional dependencies. For the nginx HSTS use case:
- Configure your external LLM endpoint via `LLM_BASE_URL` and `LLM_MODEL`.
- Start SearxNG locally if you want web search; otherwise use `-search.file` for offline/file-based search.
Start only SearxNG (dependency) from the base compose file:

```bash
docker compose up -d searxng
```
Optional services live in `docker-compose.optional.yml` and can be brought up as needed (TLS proxy only):

```bash
# TLS reverse proxy via Caddy — optional (needs base + optional files)
docker compose -f docker-compose.yml -f docker-compose.optional.yml --profile tls up -d caddy-tls
```
Compose will read a local `.env` file when present and also respects exported shell variables. Useful settings (a sample file follows this list):

- `LLM_BASE_URL`: base URL for your LLM server
- `LLM_MODEL`: model identifier known to your LLM server
- `LLM_API_KEY`: API key if your server requires one (not baked into images)
- `SEARX_URL`: internal URL for SearxNG (default `http://searxng:8080`)
- `SSL_VERIFY`: enable SSL certificate verification; set to `false` for self-signed certificates (default `true`)
- `APP_UID`/`APP_GID`: host user/group IDs to avoid permission issues on bind mounts (e.g., `APP_UID=$(id -u) APP_GID=$(id -g)` before `make up`)
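A minimal `.env` sketch (all values are placeholders; substitute your own and keep real keys out of version control):

```bash
# .env: read by docker compose; do not commit real keys
LLM_BASE_URL=https://your-llm.example.com/v1
LLM_MODEL=your/model-id
LLM_API_KEY=replace-me
SEARX_URL=http://searxng:8080
SSL_VERIFY=true
```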
You can also set CLI flags at run time when invoking `goresearch` directly inside the `research-tool` container.
- Services declare health checks: `searxng` probes the root (`/`) or `/status` depending on version.
- Check health via:

  ```bash
  docker compose ps
  docker compose logs -f --tail=100
  ```

- From the host, you can also poll readiness without Docker using the helper:

  ```bash
  make wait
  ```
- Permissions on `./reports` or `./.goresearch-cache`:
  - Export your IDs: `export APP_UID=$(id -u) APP_GID=$(id -g)` and re-run `make up`.
  - Or run the ownership fixer once:

    ```bash
    docker compose -f docker-compose.yml -f docker-compose.permissions.yml.example run --rm fix-perms
    ```

- No ports exposed / cannot reach services from host:
  - By default, the stack is on an internal network only. To expose ports temporarily:

    ```bash
    docker compose -f docker-compose.yml -f docker-compose.override.yml.example up -d
    ```

    This maps `llm-openai` to `localhost:8080` and `searxng` to `localhost:8888`.
- LLM not ready or “model not found” errors:
  - Ensure `LLM_MODEL` matches a model your server has installed. Consult your LLM server docs for installing or selecting a model.
- Docker unavailable on this machine (e.g., Apple M2 VM):
  - Use the non-Docker options in “Running SearxNG without Docker (optional)” and point `goresearch` at those endpoints.
These convenience targets manage the optional local stack defined in `docker-compose.yml`.
Important: On Apple M2 virtual machines (including this development environment), Docker is not available due to nested virtualization limits. Skip these and use the non-Docker alternatives documented below (e.g., a Homebrew or venv SearxNG and a locally run LLM server). On machines with Docker, you can use:
```bash
# Start dev profile (tool + searxng)
make up

# Tail logs
make logs

# Rebuild and recreate dev services
make rebuild

# Stop services (keeps caches)
make down

# Run tests; if Docker is available, starts the test profile with stub-llm
make test

# Prune cache volumes and local cache dir
make clean
```
By default, the Compose stack runs on a private internal network and does not publish any ports to the host. To temporarily expose SearxNG for local troubleshooting, use the provided override file:

```bash
docker compose -f docker-compose.yml -f docker-compose.override.yml.example up -d
```

This maps `searxng` to `localhost:8888`. Remove the override file from the command to restore the default private-only behavior.
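Before pointing goresearch at the exposed instance, you can confirm it responds with a plain HTTP probe (adjust the port if you changed the mapping):

```bash
curl -fsS http://localhost:8888/ > /dev/null && echo "SearxNG is reachable"
```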
Basic:

```bash
goresearch -input request.md -output report.md -llm.base "$LLM_BASE_URL" -llm.model "$LLM_MODEL" -llm.key "$LLM_API_KEY" -searx.url "$SEARX_URL"
```

Verbose with language hint and tighter domain cap:

```bash
goresearch -v -log.level debug -lang en -max.perDomain 2 -input request.md -output report.md \
  -llm.base "$LLM_BASE_URL" -llm.model "$LLM_MODEL" -llm.key "$LLM_API_KEY" \
  -searx.url "$SEARX_URL" -searx.key "$SEARX_KEY"
```
We no longer document use cases without a real LLM in the quick start. Use `-dry-run` only for debugging.
If you cannot run Docker locally, you can still use SearxNG:

- Homebrew (macOS):

  ```bash
  brew install searxng
  # Configuration lives under /usr/local/etc/searxng/ or /opt/homebrew/etc/searxng/
  # Start the service (paths may vary by Homebrew prefix):
  searxng run &
  ```

- Python venv:

  ```bash
  python3 -m venv .venv
  . .venv/bin/activate
  pip install searxng
  SEARXNG_SETTINGS_PATH=$(pwd)/searxng-settings.yml searxng run
  ```

Then point goresearch to it:

```bash
goresearch -searx.url http://localhost:8888
```
Offline and stubbed modes are for tests only and are not supported in user workflows.
- HTTP cache: stores bodies and headers keyed by URL; uses ETag/Last-Modified for conditional revalidation.
- LLM cache: caches request/response pairs by a normalized prompt digest and model name.
- Invalidation (see the example after this list):
  - `-cache.maxAge 24h` to purge entries older than 24 hours (HTTP and LLM caches)
  - `-cache.clear` to clear the cache directory before a run (bypasses reads for that run)
  - `-cache.strictPerms` to restrict the cache at rest (0700 dirs, 0600 files)
- Manifest: `report.md` includes a Manifest section and a `report.md.manifest.json` sidecar listing URLs and SHA-256 digests of the excerpts used.
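For example, to force a completely fresh run while keeping strict on-disk permissions (endpoint values come from your environment as above):

```bash
goresearch -input request.md -output report.md \
  -llm.base "$LLM_BASE_URL" -llm.model "$LLM_MODEL" \
  -searx.url "$SEARX_URL" \
  -cache.clear -cache.strictPerms
```

The sidecar is plain JSON, so it can be inspected with standard tooling, e.g. `jq . report.md.manifest.json`.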
See the dedicated guide for how to read the Evidence check appendix and use the embedded and sidecar manifest during audit: docs/verification-and-manifest.md.
goresearch is designed for the public web and behaves politely by default. The following rules are enforced:
- Robots.txt compliance (default on): Before fetching a URL, the tool evaluates cached `/robots.txt` rules for the host using the configured User-Agent. It enforces Allow/Disallow with longest-path precedence and respects per-agent sections and wildcards. Redirects that would land on a disallowed path are short-circuited.
- Crawl-delay: If the matched agent section declares `Crawl-delay`, requests to that host are spaced accordingly, in addition to global concurrency limits.
- Missing robots policy: If `/robots.txt` returns 404, the tool proceeds as allowed. If it returns 401/403/5xx or times out, the host is treated as temporarily disallowed for this run and retried on a subsequent run or after cache expiry.
- Opt-out signals for AI/TDM reuse: The fetcher denies reuse when any of these signals are present:
  - `X-Robots-Tag` headers containing `noai` or `notrain` (scoped or unscoped)
  - HTML `<meta name="robots|googlebot|x-robots-tag">` with `noai`/`notrain`
  - HTTP `Link` headers with `rel="tdm-reservation"`
  - HTML `<link rel="tdm-reservation">` in the document head

  Skipped URLs and the specific reason are recorded in logs and the run manifest under “skipped due to robots/opt-out”.
- Public web only: Localhost and private IP address targets are blocked by default.
Override mechanism for controlled environments: `-robots.overrideDomains example.com,docs.internal` ignores robots.txt for the listed domains, and only takes effect when `-robots.overrideConfirm` is also provided. The override affects robots.txt evaluation only; AI/TDM opt-out signals (`noai`, `notrain`, TDM reservation) remain enforced.
These safeguards exist to keep usage respectful of site operators and content owners. If you need to test against mirrors or internal docs, prefer adding those hosts to the explicit override allowlist for the duration of your run.
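An illustrative override invocation (the domains are placeholders; both flags must be present for the override to take effect):

```bash
goresearch -input request.md -output report.md \
  -llm.base "$LLM_BASE_URL" -llm.model "$LLM_MODEL" \
  -searx.url "$SEARX_URL" \
  -robots.overrideDomains example.com,docs.internal \
  -robots.overrideConfirm
```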
For common failures (cache, robots/opt-out denials, LLM endpoint issues) and how to raise verbosity to debug, see docs/troubleshooting-and-faq.md.
Run all tests:

```bash
go test ./...
```
What’s covered today:
- Planner fallback when the model output is invalid or unavailable
- Synthesis transient-error retry policy (single short backoff)
- Normalization, extraction, selection, budgeting, and citation validation
- Verification pass including deterministic fallback
Deterministic integration tests use in-process stubbed LLM clients (no containers) to avoid network variance. To preview queries and selection without calling an LLM, use dry run:

```bash
goresearch -input request.md -output report.md -dry-run -searx.url "$SEARX_URL"
```
Planned work and open items are tracked in `FEATURE_CHECKLIST.md`. Contributions that implement and check off items are especially welcome.
Issues and pull requests are welcome. Please:
- Keep changes focused and covered by tests (unit and/or integration as appropriate).
- Explain intent in plain language and link to the relevant checklist item or issue.
- Follow Go naming and formatting conventions.
If you plan a larger change or new adapter, open an issue first to discuss approach.
Report bugs and ask questions on the GitHub issue tracker: https://github.com/hyperifyio/goresearch/issues
No license file is present yet. Until a license is added, usage is governed by standard copyright; please open an issue if you need explicit terms.
Actively developed; APIs and flags are stable enough for day-to-day use, but expect iterative improvements.
A concise view of the primary pipeline stages and boundaries. See the full architecture write-up and detailed diagram in `docs/architecture.md`.
```mermaid
flowchart LR
  Search --> Fetch --> Extract --> Select --> Synthesize --> Validate --> Verify
```
Scope and goal. The tool reads a single Markdown file that describes a research request in natural language, for example “Detailed documentation about Cursor MDC format including examples”. It automatically plans a web research strategy, discovers and fetches relevant public web pages, extracts readable text, and asks a local OpenAI-compatible model to synthesize a validated Markdown document. The output is a self-contained report with a predictable structure, numbered inline citations, a references section with working URLs, and a short limitations and evidence-check appendix. The system must operate using only the standard chat completions style API that OpenAI-compatible servers expose. No OpenAI Search API or proprietary search features are used. The primary LLM target is a local OSS GPT endpoint that implements the chat completions interface.
High-level architecture. The application is a single binary command-line program composed of loosely coupled modules: request parsing, planning, search, fetch and extract, selection and deduplication, token budgeting, synthesis, verification, rendering, caching, and observability. Each module exposes a narrow interface so implementations can be swapped without affecting the rest. For example, the search module can point to a self-hosted meta-search engine such as SearxNG, or to a minimal HTML search scraper, without changing how results flow to the selector.
End-to-end flow. The run begins by loading the input Markdown file and reducing it to a concise research brief composed of topic, optional audience and tone hints if present, and an optional maximum word count if the file contains an explicit constraint. The planner invokes the local LLM with a system instruction that forbids narrative output and requests only structured data. The single user message contains the brief text and asks the model to propose a small set of precise web queries that together cover the topic comprehensively and to draft an outline with section headings. The tool parses the LLM’s response, tolerates minor deviations such as extra whitespace, and falls back to simple heuristic queries based on the topic if the response cannot be parsed. The search module executes the queries against a configured provider. The default provider is a self-hosted SearxNG instance over HTTP with an API key or IP allowlist, because this keeps the dependency controllable while avoiding vendor lock-in. As an optional alternative for development environments, the module can use a generic web search HTML endpoint if allowed by that site’s terms. The program aggregates results, normalizes URLs, removes duplicates, and ranks candidates by textual diversity and domain diversity. The selector enforces a maximum overall source count and a per-domain cap to avoid citation monoculture.
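A minimal sketch of the per-domain cap described above, in Go (the `Result` type is a hypothetical stand-in; the real selector also ranks for textual and domain diversity):

```go
// Package selection sketches the per-domain cap described above.
package selection

import "net/url"

// Result is a minimal stand-in for a search hit (hypothetical type).
type Result struct {
	Title   string
	URL     string
	Snippet string
}

// CapPerDomain keeps at most maxPerDomain results per host and at most
// maxTotal overall, preserving the incoming ranking order.
func CapPerDomain(results []Result, maxPerDomain, maxTotal int) []Result {
	perHost := make(map[string]int)
	kept := make([]Result, 0, maxTotal)
	for _, r := range results {
		u, err := url.Parse(r.URL)
		if err != nil {
			continue // skip unparseable URLs
		}
		host := u.Hostname()
		if perHost[host] >= maxPerDomain {
			continue // this domain already hit its cap
		}
		perHost[host]++
		kept = append(kept, r)
		if len(kept) >= maxTotal {
			break
		}
	}
	return kept
}
```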
Content fetching and extraction. The fetcher issues HTTP GET requests with a configurable timeout, a descriptive user agent string, and polite rate limiting. It records ETag and Last-Modified values to support conditional revalidation on repeat runs. It follows redirects within a modest hop limit and rejects non-HTTP and data URLs. The extractor only processes HTML and XHTML content types for the baseline version and declines to parse binary formats. It constructs a lightweight DOM and extracts text from semantic containers such as main and article if present, else body, and keeps structural elements like headings, paragraphs, list items, and code blocks. Common navigation chrome, cookie banners, and footer boilerplate are reduced using simple density heuristics that prioritize longer, content-rich blocks. The extracted text is normalized to Unicode, whitespace is collapsed, and near-duplicate lines are removed. Each document carries its canonical URL, detected title, and extracted text. Documents that produce too little meaningful text are discarded early.
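A sketch of the container preference in Go using `golang.org/x/net/html` (illustrative only; the real extractor layers density heuristics and boilerplate reduction on top of this):

```go
// Package extract sketches the semantic-container preference above.
package extract

import (
	"strings"

	"golang.org/x/net/html"
)

// findContainer returns the first <main> or <article> element,
// falling back to <body> when neither is present.
func findContainer(doc *html.Node) *html.Node {
	var main, body *html.Node
	var walk func(*html.Node)
	walk = func(n *html.Node) {
		if n.Type == html.ElementNode {
			switch n.Data {
			case "main", "article":
				if main == nil {
					main = n
				}
			case "body":
				if body == nil {
					body = n
				}
			}
		}
		for c := n.FirstChild; c != nil; c = c.NextSibling {
			walk(c)
		}
	}
	walk(doc)
	if main != nil {
		return main
	}
	return body
}

// Text collects trimmed text nodes beneath the chosen container.
func Text(doc *html.Node) string {
	var b strings.Builder
	var walk func(*html.Node)
	walk = func(n *html.Node) {
		if n.Type == html.TextNode {
			if t := strings.TrimSpace(n.Data); t != "" {
				b.WriteString(t)
				b.WriteByte('\n')
			}
		}
		for c := n.FirstChild; c != nil; c = c.NextSibling {
			walk(c)
		}
	}
	if root := findContainer(doc); root != nil {
		walk(root)
	}
	return b.String()
}
```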
Source selection and budgeting. After extraction, the tool applies a token budget. It caps the number of documents and truncates each document’s text to a maximum number of characters that together fit within the model’s input limit once prompts and system messages are accounted for. If the combined extracts still exceed the target, the tool reduces each extract proportionally rather than dropping entire sources, unless a source is clearly redundant. The selection prefers diversity across domains and viewpoints when the topic invites it and prefers primary documentation when the topic is a software specification or format.
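A sketch of the proportional reduction in Go, using characters as the budget unit (the real tool applies a conservative character-to-token multiplier and reserves room for prompts):

```go
// Package budget sketches the proportional truncation described above.
package budget

// ShrinkProportionally trims every extract by the same ratio so the
// combined size fits the budget without dropping whole sources.
// Byte-based slicing for brevity; production code should respect
// UTF-8 rune boundaries.
func ShrinkProportionally(extracts []string, budgetChars int) []string {
	total := 0
	for _, e := range extracts {
		total += len(e)
	}
	if total <= budgetChars {
		return extracts
	}
	ratio := float64(budgetChars) / float64(total)
	out := make([]string, len(extracts))
	for i, e := range extracts {
		keep := int(float64(len(e)) * ratio)
		if keep > len(e) {
			keep = len(e)
		}
		out[i] = e[:keep]
	}
	return out
}
```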
Synthesis prompt and grounding. The synthesizer crafts two messages. The system message defines the assistant role as a careful technical writer and explicitly requires the use of only the provided sources for factual claims. It explains the expected document structure, prescribes numbered inline citations using square brackets that map to the numbered source list, and instructs the model to state uncertainty when evidence is insufficient. The user message provides the brief, the required or suggested section headings from the outline if available, a target length, the numbered list of sources with titles and URLs, and the excerpts for each source. The tool asks for a single Markdown document that contains a title and run date, an executive summary written as short lines rather than decorative bullets, a main analysis section organized by headings, a risks and limitations section, and finally a references section where items are numbered to match the inline citations. Temperature is kept low to preserve determinism and reduce embellishment. The model is not asked to reveal hidden chain of thought and is prohibited from inventing sources.
Validation and claim checking. Once the draft is produced, the tool validates citation syntax and ensures that every bracketed citation number refers to an existing entry in the references section. Any broken or out-of-range citation markers are flagged and either removed or mapped to a best effort reference. The tool then engages the model in a second, short fact-checking pass. The system role now declares a fact-checker, and the user message contains the generated document and the numbered source list. The model is asked to extract a set of key claims, map each to the minimal set of supporting source indices, assign a confidence level, and flag any claims that are weakly supported or unsupported by the supplied sources. The result is turned into an evidence map that is appended to the output. If the verification call fails or produces invalid data, the program proceeds without the appendix and records a warning.
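A sketch of the citation-range check in Go, assuming citations are plain bracketed numbers (the real validator also repairs or removes broken markers):

```go
// Package cite sketches the bracketed-citation validation above.
package cite

import (
	"regexp"
	"strconv"
)

var citationRe = regexp.MustCompile(`\[(\d+)\]`)

// FindBroken returns citation numbers in doc that do not map to an
// entry in a references list of length numRefs.
func FindBroken(doc string, numRefs int) []int {
	seen := make(map[int]bool)
	var broken []int
	for _, m := range citationRe.FindAllStringSubmatch(doc, -1) {
		n, err := strconv.Atoi(m[1])
		if err != nil || seen[n] {
			continue
		}
		seen[n] = true
		if n < 1 || n > numRefs {
			broken = append(broken, n) // out of range: flag for repair
		}
	}
	return broken
}
```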
Output contract. The output is always valid Markdown that renders cleanly in GitHub and static site generators. The heading hierarchy follows the outline but avoids decorative flourishes. Inline citations are plain bracketed numbers that correspond to the final references section. The references list includes both page titles and full URLs so the document remains useful offline. The evidence map is a simple narrative paragraph describing which claims are well supported and which require caution, accompanied by a compact line-based listing of claims and their cited indices without complex formatting. The generated document includes a reproducibility footer that records the model name, the base URL of the LLM endpoint, the number of sources ingested, and whether caching was in effect.
Configuration and runtime controls. The program is configured through environment variables and flags so it can be scripted. The LLM base URL, model name, and API key are required and point to a local OpenAI-compatible service such as an OSS GPT runtime. The maximum number of sources, per-domain cap, per-source character limit, timeouts, language hint, and output path are adjustable. Search provider configuration is explicit and kept separate from LLM settings. A dry-run switch prints the planned queries and selected URLs without calling the model to assist in debugging and to provide transparency.
Caching and reproducibility. A local cache directory stores fetched page bodies keyed by URL plus a content hash of important request headers, as well as LLM request-response pairs keyed by a normalized prompt digest and model name. On subsequent runs, the tool reuses valid cache entries or revalidates HTTP documents using conditional requests. The cache can be invalidated by age, by explicit flags, or by topic hash to support iterative research while preserving determinism. The final report embeds a compact manifest that lists the canonical URLs and their content digests used in synthesis so downstream users can audit what was read.
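A sketch of the LLM cache key in Go, assuming whitespace collapse as the normalization step (the actual digest may normalize differently):

```go
// Package llmcache sketches the prompt-digest keying described above.
package llmcache

import (
	"crypto/sha256"
	"encoding/hex"
	"strings"
)

// Key derives a stable cache key from the model name and a
// whitespace-normalized prompt, so identical requests reuse one entry.
func Key(model, prompt string) string {
	normalized := strings.Join(strings.Fields(prompt), " ")
	sum := sha256.Sum256([]byte(model + "\x00" + normalized))
	return hex.EncodeToString(sum[:])
}
```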
Observability and logging. The tool emits concise progress to standard error by default and writes detailed structured JSON logs to a file (`goresearch.log` by default). It logs planned queries, chosen sources, fetch durations, extraction sizes, token budget estimates, and LLM latency. Sensitive values such as API keys are never logged. Increase detail by setting `-log.level debug` (or `LOG_LEVEL=debug`) and optionally `-v` to print more progress to the console. An additional `-debug-verbose` mode can print raw prompts for deep troubleshooting.
Error handling and resilience. Network errors, search outages, and extraction failures are isolated per source, allowing the run to proceed with remaining documents. If the planner fails to return parseable queries, the system composes a small set of deterministic fallbacks by combining the topic with intent words such as specification, documentation, tutorial, and reference in the configured language. If synthesis fails with a transient LLM error, a single retry with a short backoff is attempted. The program returns a nonzero exit code only when no usable sources are found or when the LLM cannot produce any body text.
Security, compliance, and politeness. The fetcher respects robots meta tags and avoids crawling behind search result pages. It rate limits concurrent requests and sets a clear user agent that identifies the tool and a contact URL. The program is designed for public web pages and does not authenticate to private services. It does not exfiltrate secrets because the only outbound destinations are the configured search endpoint, the fetched sites, and the local LLM endpoint. The cache directory can be encrypted at rest if the runtime environment requires it.
Language and localization. The planner and synthesizer accept a language hint. When provided, queries include that language and the synthesizer writes the report in the same language. The selector does not hard filter by language, because some authoritative sources may be in English even when the requested language is Finnish; instead, the tool prefers sources whose page language matches the hint when diversity allows.
Testing strategy. The project includes deterministic unit tests for URL normalization, HTML extraction, deduplication, token budgeting, and citation validation. It includes integration tests that run against a stub LLM server which returns canned JSON for planning and a canned Markdown for synthesis to verify the control flow without needing a real model. It includes record and replay fixtures for representative web pages so extraction logic is stable across versions. Golden outputs are compared with tolerances for timestamps and minor whitespace. The verification pass is tested with synthetic documents that contain both properly cited and deliberately uncited claims to ensure unsupported statements are flagged.
Performance profile. The fetch and extract stage runs concurrently up to a configurable limit to avoid overwhelming a site or the network. Token budgeting is computed from measured character counts with a conservative multiplier so prompts fit within the local model’s context. The synthesizer runs in a single pass for simplicity, which is sufficient for short to medium reports. Streaming output can be added later, but the baseline waits for full completion to simplify validation.
Extensibility. The search module can be extended with additional adapters without touching the rest of the program as long as each adapter yields a list of title, URL, and short snippet. The extractor can incorporate a proper readability algorithm or integrate a site-specific ruleset for popular documentation sites. The synthesis prompts can be swapped by configuration to tune style and citation strictness per project. A future extension can add optional PDF ingestion with a small text extractor when the site hosts authoritative PDFs, guarded by a per-run switch so users can control binary parsing.
Constraints and limitations. The quality of synthesis depends on the local model’s instruction following and context window. Some topics may be poorly covered by publicly accessible pages or hidden behind paywalls. HTML extraction cannot perfectly remove boilerplate on all sites, which may reduce token efficiency. The tool does not guarantee legal or medical accuracy and should be treated as an assistant for drafting rather than an oracle. The verification appendix increases confidence but is not a substitute for human review.
Operational summary. The user provides a single Markdown file containing the research request and runs the program with the LLM endpoint and search configuration. The planner asks the local model for queries and an outline. The searcher retrieves candidates and the fetcher extracts text. The selector applies diversity and budget rules. The synthesizer produces a clean Markdown document grounded in the extracts with numbered citations. The validator fixes citation issues and the verifier builds an evidence map. The renderer writes the final document to disk, along with a manifest that describes models and sources so the result is auditable and reproducible.