FORMAS aims to study, analyze and evaluate semantic-based approaches. Our main research areas are based on five major pillars on semantic-based area: Methods, Ontology, Information Extraction, Interoperability and Big Data. We deeply analyze each one of these approaches focusing on obtaining high levels of semantic and pragmatic comprehension. Visite our website for more informations.
-
CSIS method interoperates Syntactic, Semantic and Pragmatic into University Surveillance models providing image captioning for operators and systems.
-
DIGGER method uses LLMs to provide a QA for the CDC legal documents.
- Brazilian Consumer Protection Code: a methodology for a dataset to Question-Answer (QA) Models @PADAWAN2024
- [Towards a Corpus Methodology for LLMs in the Legal Domain](STIL 2025)
-
DptOIE method extract triples from Universal Dependencies (UD) format.
- DptOIE: a Portuguese open information extraction based on dependency analysis. @AIR JOURNAL
- [DPToie-Python]
-
PortNOIE is a new version of DPTOIE-Neural.
- PortNOIE: A Neural Framework for Open Information Extraction for the Portuguese Language @PROPOR2024
- Extração de Informação Aberta com LLM para a Língua Portuguesa @LINGUAMATICA
- Exploring Open Information Extraction for Portuguese Using Large Language Models @PROPOR2024
- Scaling and Adapting Large Language Models for Portuguese Open Information Extraction: A Comparative Study of Fine-Tuning and LoRA @BRACIS2024
-
PTOIE-Flair is a pt-br OpenIE model.
-
PragmaticOIE method uses a rule-based approach to extract facts in Portuguese in a first pragmatic level.
-
ImageCaptioningPT methods to generate image captioning in the Portuguese language.
- Towards Image Captioning for the Portuguese Language: Evaluation on a Translated Dataset @ICEIS2023
- ... @JBCS2025
-
ALiBWeb is a Web system to map Brazilian dialectology areas.
- ALiBWeb: estado da arte e perspectivas futuras @WORKINGPAPER