Introduction
A French Lemmatizer in Python based on the LEFFF (Lexique des Formes Fléchies du Français / Lexicon of French inflected forms) is a large-scale morphological and syntactic lexicon for French.
Main reference:
[Sagot,2010] Sagot, B. (2010). The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French. In 7th international conference on Language Resources and Evaluation (LREC 2010). Retrieved from https://hal.archives-ouvertes.fr/file/index/docid/521242/filename/lrec10lefff.pdf
Benoît Sagot Webpage about LEFFF
http://alpage.inria.fr/~sagot/lefff-en.html
More precisely, we use the morphological lexicon only: .mlex file) which has a simple format in CSV (4 fields separated by '\ t')
Tagset format FRMG - from the ALPAGE project since 2004
Tagset
License
Copyright (C) 2017 Claude Coulombe
Licensed under the Apache License, Version 2.0 (the 'License'); you may not use this file except in compliance with the License. You may obtain a copy of the License at
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an 'AS IS' BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.