Human Language Technology * Language Learning * Machine Translation * Multilingualism * Cross-Language NLP * Language Varieties * Paraphrasing * Corporate Language * Linguistic Resources * Ontologies * Linguistic Humor
Quando as Máquinas “Pensam” Antropomorfização no Discurso sobre IA Riscos Conceptuais e Proposta de Léxico Não Antropomórfico para PLN em Português 17th International Conference on Computational Processing of Portuguese Propor 2026, 2026
Paraphrase and translation: the importance of being close Diana Santos, Anabela Barreiro Open Research Europe, 2025 This article explores the concept of paraphrasing within computational linguistics, seeking to enrich its understanding by drawing parallels with translation studies and especially machine translation. It highlights the existence of two distinct yet related tasks: paraphrase generation and paraphrase detection, as well as points out the many (sometimes implicit) contact points in evaluation in both translation and paraphrasing. We claim that the concept of near-synonymy or near- equivalence is a shared concern of both disciplines, and its formalization should be pursued.
Large Language Models and OpenLogos: An Educational Case Scenario Andrijana Pavlova, Branislav Gerazov, Anabela Barreiro Open Research Europe, 2024 Large Language Models (LLMs) offer advanced text generation capabilities, sometimes surpassing human abilities. However, their use without proper expertise poses significant challenges, particularly in educational contexts. This article explores different facets of natural language generation (NLG) within the educational realm, assessing its advantages and disadvantages, particularly concerning LLMs. It addresses concerns regarding the opacity of LLMs and the potential bias in their generated content, advocating for transparent solutions. Therefore, it examines the feasibility of integrating OpenLogos expert-crafted resources into language generation tools used for paraphrasing and translation. In the context of the Multi3Generation COST Action (CA18231), we have been emphasizing the significance of incorporating OpenLogos into language generation processes, and the need for clear guidelines and ethical standards in generative models involving multilingual, multimodal, and multitasking capabilities. The Multi3Generation initiative strives to progress NLG research for societal welfare, including its educational applications. It promotes inclusive models inspired by the Logos Model, prioritizing transparency, human control, preservation of language principles and meaning, and acknowledgment of the expertise of resource creators. We envision a scenario where OpenLogos can contribute significantly to inclusive AI-supported education. Ethical considerations and limitations related to AI implementation in education are explored, highlighting the importance of maintaining a balanced approach consistent with traditional educational principles. Ultimately, the article advocates for educators to adopt innovative tools and methodologies to foster dynamic learning environments that facilitate linguistic development and growth.
UniDive: A COST Action on Universality, Diversity and Idiosyncrasy in Language Technology 3rd Annual Meeting of the Elra ISCA Special Interest Group on Under Resourced Languages Sigul 2024 at Lrec Coling 2024 Workshop Proceedings, 2024
Multi3Generation: Multitask, Multilingual, and Multimodal Language Generation Elena Lloret, Anabela Barreiro, Mehul Bhatt, Alberto Bugarín-Diz, Gianfranco E. Modoni, et al. Open Research Europe, 2023 The purpose of this article is to highlight the critical importance of language generation today. In particular, language generation is explored from the following three aspects: multi-modality, multilinguality, which play crucial role for NLG community. We present the activities conducted within the Multi3Generation COST Action (CA18231), as well as current trends and future perspectives for multitask, multilingual and multimodal language generation.
Linguistic resources for paraphrase generation in portuguese: a lexicon-grammar approach Anabela Barreiro, Cristina Mota, Jorge Baptista, Lucília Chacoto, Paula Carvalho Language Resources and Evaluation, 2022 This paper presents a new linguistic resource for the generation of paraphrases in Portuguese, based on the lexicon-grammar framework. The resource components include (i) a lexicon-grammar based dictionary of 2.100 predicate nouns co-occurring with the support verb ser de (‘be of’), such as in ser de uma ajuda inestimável (‘be of invaluable help’); (ii) a lexicon-grammar based dictionary of 6.000 predicate nouns co-occurring with the support verb fazer (‘do’ or ‘make’), such as in fazer uma comparação (‘make a comparison’); and (iii) a lexicon-grammar based dictionary of 5.000 human intransitive adjectives, co-occurring with the copula verbs ser and/or estar (‘be’), such as in ser simpático (‘be kind’) or estar entusiasmado (‘be enthusiastic’). A set of local grammars explore the properties described in these linguistic resources, enabling a variety of text transformation tasks for paraphrasing applications. The paper highlights the complementary and synergistic components and inA. Barreiro INESC-ID Lisboa E-mail: anabela.barreiro@inesc-id.pt C. Mota INESC-ID Lisboa E-mail: cmota@ist.utl.pt J. Baptista Universidade do Algarve INESC-ID Lisboa E-mail: jbaptis@ualg.pt L. Chacoto Universidade do Algarve IELT E-mail: lchacoto@ualg.pt P. Carvalho INESC-ID Lisboa E-mail: pcc@inesc-id.pt 2 Anabela Barreiro et al. tegration efforts, and presents some preliminary evaluation results on the inclusion of such resources in the eSPERTo paraphrase generation system.
Multi3Generation: Multitask, Multilingual, Multimodal Language Generation Eamt 2022 Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022
Introducing an implicit crowdsourcing opportunity to teachers Call for Background, 2021
One book, two language varieties Anabela Barreiro, Ida Rebelo-Arnold, Fernando Batista, Isabel Garcez, Tanara Zingano Kuhn Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2020
Creating expert knowledge by relying on language learners: A generic approach for mass-producing language resources by combining implicit crowdsourcing and language learning Lrec 2020 12th International Conference on Language Resources and Evaluation Conference Proceedings, 2020
In Other Words (POP) Anabela Marques Barreiro, Jorge Baptista, Renata Vieira, Paulo Quaresma Linguamatica, 2019
Paraphrastic Variance between European and Brazilian Portuguese Coling 2018 27th International Conference on Computational Linguistics Proceedings of the 5th Workshop on Nlp for Similar Languages Varieties and Dialects Vardial 2018, 2018
Port4NooJ v3.0: Integrated linguistic resources for Portuguese NLP Proceedings of the 10th International Conference on Language Resources and Evaluation Lrec 2016, 2016
OpenLogos semantico-syntactic knowledge-rich bilingual dictionaries Proceedings of the 9th International Conference on Language Resources and Evaluation Lrec 2014, 2014
Linguistic evaluation of support verb constructions by openlogos and google translate Proceedings of the 9th International Conference on Language Resources and Evaluation Lrec 2014, 2014
Cross-language semantic relations between english and portuguese Ceur Workshop Proceedings, 2011