Anabela Barreiro

@inesc-id.pt

Human Language Technology Lab
INESC-ID Lisboa

RESEARCH INTERESTS

Human Language Technology * Language Learning * Machine Translation * Multilingualism * Cross-Language NLP * Language Varieties * Paraphrasing * Corporate Language * Linguistic Resources * Ontologies * Linguistic Humor

Scopus Publications

  • Parafrasário: a variety-based paraphrasary for Portuguese
    Anabela Barreiro, Ida Rebelo-Arnold, Cristina Mota
    Language Resources and Evaluation, 2026
  • Quando as Máquinas “Pensam” Antropomorfização no Discurso sobre IA Riscos Conceptuais e Proposta de Léxico Não Antropomórfico para PLN em Português
    17th International Conference on Computational Processing of Portuguese, PROPOR 2026, 2026
  • Paraphrase and translation: the importance of being close
    Diana Santos, Anabela Barreiro
    Open Research Europe, 2025
    This article explores the concept of paraphrasing within computational linguistics, seeking to enrich its understanding by drawing parallels with translation studies and especially machine translation. It highlights two distinct yet related tasks, paraphrase generation and paraphrase detection, and points out the many (sometimes implicit) contact points in evaluation between translation and paraphrasing. We claim that the concept of near-synonymy or near-equivalence is a shared concern of both disciplines, and its formalization should be pursued.
  • Large Language Models and OpenLogos: An Educational Case Scenario
    Andrijana Pavlova, Branislav Gerazov, Anabela Barreiro
    Open Research Europe, 2024
    Large Language Models (LLMs) offer advanced text generation capabilities, sometimes surpassing human abilities. However, their use without proper expertise poses significant challenges, particularly in educational contexts. This article explores different facets of natural language generation (NLG) within the educational realm, assessing its advantages and disadvantages, particularly concerning LLMs. It addresses concerns regarding the opacity of LLMs and the potential bias in their generated content, advocating for transparent solutions. Therefore, it examines the feasibility of integrating OpenLogos expert-crafted resources into language generation tools used for paraphrasing and translation. In the context of the Multi3Generation COST Action (CA18231), we have been emphasizing the significance of incorporating OpenLogos into language generation processes, and the need for clear guidelines and ethical standards in generative models involving multilingual, multimodal, and multitasking capabilities. The Multi3Generation initiative strives to progress NLG research for societal welfare, including its educational applications. It promotes inclusive models inspired by the Logos Model, prioritizing transparency, human control, preservation of language principles and meaning, and acknowledgment of the expertise of resource creators. We envision a scenario where OpenLogos can contribute significantly to inclusive AI-supported education. Ethical considerations and limitations related to AI implementation in education are explored, highlighting the importance of maintaining a balanced approach consistent with traditional educational principles. Ultimately, the article advocates for educators to adopt innovative tools and methodologies to foster dynamic learning environments that facilitate linguistic development and growth.
  • UniDive: A COST Action on Universality, Diversity and Idiosyncrasy in Language Technology
    3rd Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages, SIGUL 2024 at LREC-COLING 2024, Workshop Proceedings, 2024
  • Multi3Generation: Multitask, Multilingual, and Multimodal Language Generation
    Elena Lloret, Anabela Barreiro, Mehul Bhatt, Alberto Bugarín-Diz, Gianfranco E. Modoni, et al.
    Open Research Europe, 2023
    The purpose of this article is to highlight the critical importance of language generation today. In particular, language generation is explored from three aspects, multitasking, multilinguality, and multimodality, which play a crucial role for the NLG community. We present the activities conducted within the Multi3Generation COST Action (CA18231), as well as current trends and future perspectives for multitask, multilingual and multimodal language generation.
  • Linguistic resources for paraphrase generation in Portuguese: a lexicon-grammar approach
    Anabela Barreiro, Cristina Mota, Jorge Baptista, Lucília Chacoto, Paula Carvalho
    Language Resources and Evaluation, 2022
    This paper presents a new linguistic resource for the generation of paraphrases in Portuguese, based on the lexicon-grammar framework. The resource components include (i) a lexicon-grammar based dictionary of 2,100 predicate nouns co-occurring with the support verb ser de (‘be of’), such as in ser de uma ajuda inestimável (‘be of invaluable help’); (ii) a lexicon-grammar based dictionary of 6,000 predicate nouns co-occurring with the support verb fazer (‘do’ or ‘make’), such as in fazer uma comparação (‘make a comparison’); and (iii) a lexicon-grammar based dictionary of 5,000 human intransitive adjectives, co-occurring with the copula verbs ser and/or estar (‘be’), such as in ser simpático (‘be kind’) or estar entusiasmado (‘be enthusiastic’). A set of local grammars explores the properties described in these linguistic resources, enabling a variety of text transformation tasks for paraphrasing applications. The paper highlights the complementary and synergistic components and integration efforts, and presents some preliminary evaluation results on the inclusion of such resources in the eSPERTo paraphrase generation system.
  • Multi3Generation: Multitask, Multilingual, Multimodal Language Generation
    EAMT 2022, Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022
  • Introducing an implicit crowdsourcing opportunity to teachers
    Call for Background, 2021
  • Paraphrasing Emotions in Portuguese
    Cristina Mota, Diana Santos, Anabela Barreiro
    Communications in Computer and Information Science, 2021
  • Causal Discourse Connectors in the Teaching of Spanish as a Foreign Language (SFL) for Portuguese Learners Using NooJ
    Andrea Rodrigo, Silvia Reyes, Cristina Mota, Anabela Barreiro
    Communications in Computer and Information Science, 2020
  • One book, two language varieties
    Anabela Barreiro, Ida Rebelo-Arnold, Fernando Batista, Isabel Garcez, Tanara Zingano Kuhn
    Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2020
  • Creating expert knowledge by relying on language learners: A generic approach for mass-producing language resources by combining implicit crowdsourcing and language learning
    LREC 2020, 12th International Conference on Language Resources and Evaluation, Conference Proceedings, 2020
  • In Other Words (POP)
    Anabela Marques Barreiro, Jorge Baptista, Renata Vieira, Paulo Quaresma
    Linguamatica, 2019
  • The Lexicon-Grammar of Predicate Nouns with ser de in Port4NooJ
    Cristina Mota, Jorge Baptista, Anabela Barreiro
    Communications in Computer and Information Science, 2019
  • EP–BP paraphrastic alignments of verbal constructions involving the clitic pronoun lhe
    Ida Rebelo-Arnold, Anabela Marques Barreiro, Paulo Quaresma, Cristina Mota
    Linguamatica, 2019
  • Automated paraphrasing of Portuguese informal into formal language
    Anabela Marques Barreiro, Ida Rebelo-Arnold, Jorge Baptista, Cristina Mota, Isabel Garcez
    Linguamatica, 2019
  • Paraphrastic Variance between European and Brazilian Portuguese
    COLING 2018, 27th International Conference on Computational Linguistics, Proceedings of the 5th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2018, 2018
  • Integrating the Lexicon-Grammar of Predicate Nouns with Support Verb fazer into Port4NooJ
    Cristina Mota, Lucília Chacoto, Anabela Barreiro
    Communications in Computer and Information Science, 2018
  • Port4NooJ v3.0: Integrated linguistic resources for Portuguese NLP
    Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, 2016
  • Generating paraphrases of human intransitive adjective constructions with Port4NooJ
    Cristina Mota, Paula Carvalho, Francisco Raposo, Anabela Barreiro
    Communications in Computer and Information Science, 2016
  • eSPERTo’s paraphrastic knowledge applied to question-answering and summarization
    Cristina Mota, Anabela Barreiro, Francisco Raposo, Ricardo Ribeiro, Sérgio Curto, et al.
    Communications in Computer and Information Science, 2016
  • OpenLogos semantico-syntactic knowledge-rich bilingual dictionaries
    Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, 2014
  • Linguistic evaluation of support verb constructions by OpenLogos and Google Translate
    Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, 2014
  • Cross-language semantic relations between English and Portuguese
    CEUR Workshop Proceedings, 2011