My crawler does a language detection on crawled documents and then assigns data such as "content", "title" and "description" to different fields based on the language detected. I use ScriptTagger for this.
So my Solr schema doesn't actually have a "content" field, but the Norconex Solr Committer still sends a field called "content". This results in an error:
org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at https://myserver.fi/solr/mycollection: ERROR: [doc=https://www.example.com/] unknown field 'content'
The documentation says I can use <sourceContentField> and <targetContentField> to rename the content field. But is there a way to remove it completely? After all I've in a way already renamed it with the ScriptTagger in the Importer phase.
My crawler does a language detection on crawled documents and then assigns data such as "content", "title" and "description" to different fields based on the language detected. I use
ScriptTaggerfor this.So my Solr schema doesn't actually have a "content" field, but the Norconex Solr Committer still sends a field called "content". This results in an error:
The documentation says I can use
<sourceContentField>and<targetContentField>to rename the content field. But is there a way to remove it completely? After all I've in a way already renamed it with theScriptTaggerin the Importer phase.