Skip to content

Possible to omit document content altogether? #16

@ronjakoi

Description

@ronjakoi

My crawler does a language detection on crawled documents and then assigns data such as "content", "title" and "description" to different fields based on the language detected. I use ScriptTagger for this.

So my Solr schema doesn't actually have a "content" field, but the Norconex Solr Committer still sends a field called "content". This results in an error:

org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at https://myserver.fi/solr/mycollection: ERROR: [doc=https://www.example.com/] unknown field 'content'

The documentation says I can use <sourceContentField> and <targetContentField> to rename the content field. But is there a way to remove it completely? After all I've in a way already renamed it with the ScriptTagger in the Importer phase.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions