- Prevent the "unprintable KeyError" problem from happening.
- Reverse sort core keywords.
- Exports list of objects rather than list of tuples, to help ES directly indexing the content.
- Do not use keywords as dictionary keys, rather as elements in a list.
- Supports '·' author separator
- Support utf8 author-keywords
- Enforce utf8 also for non PDF files in extractor.
- Changes dict export format for author keywords, into an improved and semantic way.
- Renames keys in dict export to be lower case and separated by _.
- Drop trailing dots in author keywords.
- Changes module to be compatible with Invenio 3.
- Fixes a crash when trying to discover a taxonomy when CLASSIFIER_WORKDIR is set to None.
- Updates minimum dependencies of Invenio packages to newer versions.
- Removes a bug in bibclassify_keyword_analyzer.py. If a combination is found via a synonym or regexp it is no longer thrown away just because the components of the combination are not found in the text.
- Adds missing invenio_base dependency.
- Initial public release.