Damien Sileo

Scopus Publications

A benchmark of expert-level academic questions to assess AI capabilities
Long Phan, Alice Gatti, Nathaniel Li, Adam Khoja, Ryan Kim, et al.
Nature, 2026
Logic Haystacks: Probing LLMs' Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding)
Damien Sileo
Eacl 2026 19th Conference of the European Chapter of the Association for Computational Linguistics Proceedings of the Conference Vol 1 Long Papers, 2026
Generating Explanations in Medical Question-Answering by Expectation Maximization Inference over Evidence
Wei Sun, Mingxiao Li, Damien Sileo, Jesse Davis, Marie-Francine Moens
ACM Transactions on Computing for Healthcare, 2025
Medical Question Answering (medical QA) systems play an essential role in assisting healthcare workers in finding answers to their questions. However, it is not sufficient to merely provide answers by medical QA systems because users might want explanations, that is, more analytic statements in natural language that describe the elements and context that support the answer. To do so, we propose a novel approach for generating natural language explanations for answers predicted by medical QA systems. As high-quality medical explanations require additional medical knowledge, so that our system extracts knowledge from medical textbooks to enhance the quality of explanations during the explanation generation process. Concretely, we designed an Expectation-Maximization approach that makes inferences about the evidence found in these texts, offering an efficient way to focus attention on lengthy evidence passages. Experimental results, conducted on two datasets MQAE-diag and MQAE, demonstrate the effectiveness of our framework for reasoning with textual evidence. Our approach outperforms state-of-the-art models, achieving a significant improvement of 6.13 and 5.47 percentage points on the Rouge-L score; 6.49 and 5.28 percentage points on the Bleu-4 score on the MQAE-diag and MQAE datasets.
τ TAU-EVAL: A Unified Evaluation Framework for Useful and Private Text Anonymization
Gabriel Loiseau, Damien Sileo, Damien Riquet, Maxime Meyer, Marc Tommasi
Emnlp 2025 2025 Conference on Empirical Methods in Natural Language Processing Proceedings of the System Demonstrations, 2025
BRIDGING THE DATA PROVENANCE GAP ACROSS TEXT, SPEECH, AND VIDEO
13th International Conference on Learning Representations Iclr 2025, 2025
A large-scale audit of dataset licensing and attribution in AI
Shayne Longpre, Robert Mahari, Anthony Chen, Naana Obeng-Marnu, Damien Sileo, et al.
Nature Machine Intelligence, 2024
The race to train language models on vast, diverse and inconsistently documented datasets raises pressing legal and ethical concerns. To improve data transparency and understanding, we convene a multi-disciplinary effort between legal and machine learning experts to systematically audit and trace more than 1,800 text datasets. We develop tools and standards to trace the lineage of these datasets, including their source, creators, licences and subsequent use. Our landscape analysis highlights sharp divides in the composition and focus of data licenced for commercial use. Important categories including low-resource languages, creative tasks and new synthetic data all tend to be restrictively licenced. We observe frequent miscategorization of licences on popular dataset hosting sites, with licence omission rates of more than 70% and error rates of more than 50%. This highlights a crisis in misattribution and informed use of popular datasets driving many recent breakthroughs. Our analysis of data sources also explains the application of copyright law and fair use to finetuning data. As a contribution to continuing improvements in dataset transparency and responsible use, we release our audit, with an interactive user interface, the Data Provenance Explorer, to enable practitioners to trace and filter on data provenance for the most popular finetuning data collections: www.dataprovenance.org.
tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework
2024 Joint International Conference on Computational Linguistics Language Resources and Evaluation Lrec Coling 2024 Main Conference Proceedings, 2024
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars
Damien Sileo
Emnlp 2024 2024 Conference on Empirical Methods in Natural Language Processing Proceedings of the Conference, 2024
Generating Multiple-Choice Questions for Medical QA with Distractors and Cue-Masking
2024 Joint International Conference on Computational Linguistics Language Resources and Evaluation Lrec Coling 2024 Main Conference Proceedings, 2024
DISRPT: A Multilingual, Multi-domain, Cross-framework Benchmark for Discourse Processing
2024 Joint International Conference on Computational Linguistics Language Resources and Evaluation Lrec Coling 2024 Main Conference Proceedings, 2024
Consent in Crisis: The Rapid Decline of the AI Data Commons
Advances in Neural Information Processing Systems, 2024
Spatiotemporal self-supervised pre-Training on satellite imagery improves food insecurity prediction
Ruben Cartuyvels, Tom Fierens, Emiel Coppieters, Marie-Francine Moens, Damien Sileo
Environmental Data Science, 2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Transactions on Machine Learning Research, 2023
Probing neural language models for understanding of words of estimative probability
Damien Sileo, Marie-francine Moens
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
Damien Sileo, Antoine Lernould
Findings of the Association for Computational Linguistics Emnlp 2023, 2023
Analysis and Prediction of NLP models via Task Embeddings
2022 Language Resources and Evaluation Conference Lrec 2022, 2022
Zero-Shot Recommendation as Language Modeling
Damien Sileo, Wout Vossen, Robbe Raymaekers
Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2022
A Pragmatics-Centered Evaluation Framework for Natural Language Understanding
2022 Language Resources and Evaluation Conference Lrec 2022, 2022
LIIR at SemEval-2021 task 6: Detection of Persuasion Techniques In Texts and Images using CLIP features
Erfan Ghadery, Damien Sileo, Marie-Francine Moens
Semeval 2021 15th International Workshop on Semantic Evaluation Proceedings of the Workshop, 2021
DiscSense: Automated semantic analysis of discourse markers
Lrec 2020 12th International Conference on Language Resources and Evaluation Conference Proceedings, 2020
Question Answering When Knowledge Bases are Incomplete
Camille Pradel, Damien Sileo, Álvaro Rodrigo, Anselmo Peñas, Eneko Agirre
Lecture Notes in Computer Science, 2020
Mining discourse markers for unsupervised sentence representation learning
Damien Sileo, Tim Van De Cruys, Camille Pradel, Philippe Muller
Naacl Hlt 2019 2019 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Proceedings of the Conference, 2019
Composition of sentence embeddings: Lessons from statistical relational learning
Damien Sileo, Tim Van De Cruys, Camille Pradel, Philippe Muller
Sem@naacl Hlt 2019 8th Joint Conference on Lexical and Computational Semantics, 2019
Semantic role analysis for automatic summarization
Extraction Et Gestion Des Connaissances Egc 2018, 2018

RECENT SCHOLAR PUBLICATIONS

Distilling Human-Aligned Privacy Sensitivity Assessment from Large Language Models
G Loiseau, D Sileo, D Riquet, M Meyer, M Tommasi
arXiv preprint arXiv:2603.29497 , 2026
2026
Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training
V Lacombe, V Quesnel, D Sileo
arXiv preprint arXiv:2603.02208 , 2026
2026
Citations: 3
Logic Haystacks: Probing LLMs’ Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding)
D Sileo
Proceedings of the 19th Conference of the European Chapter of the … , 2026
2026
Citations: 1
Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization
G Loiseau, D Sileo, D Riquet, M Meyer, M Tommasi
arXiv preprint arXiv:2602.20743 , 2026
2026
A benchmark of expert-level academic questions to assess AI capabilities
Center for AI Safety Phan Long agibenchmark@ safe. ai 1 Gatti Alice 1 Li ...
Nature 649 (8099), 1139-1146 , 2026
2026
Citations: 513
MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts
E Lanzeray, S Meilliez, M Ruelle, D Sileo
arXiv preprint arXiv:2601.18790 , 2026
2026
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Identification
D Sileo
Proceedings of the 14th International Joint Conference on Natural Language … , 2025
2025
Citations: 6
Tau-Eval: A Unified Evaluation Framework for Useful and Private Text Anonymization
G Loiseau, D Sileo, D Riquet, M Meyer, M Tommasi
Proceedings of the 2025 Conference on Empirical Methods in Natural Language … , 2025
2025
Citations: 7
Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem
V Quesnel, D Sileo
arXiv preprint arXiv:2509.06809 , 2025
2025
Citations: 1
Bridging the data provenance gap across text, speech, and video
S Longpre, N Singh, M Cherep, K Tiwary, J Materzynska, W Brannon, ...
International Conference on Learning Representations 2025, 60592-60670 , 2025
2025
Citations: 27
Generating Explanations in Medical Question-Answering by Expectation Maximization Inference over Evidence
W Sun, M Li, D Sileo, J Davis, MF Moens
ACM Transactions on Computing for Healthcare 6 (2), 1-23 , 2025
2025
Citations: 6
Tarot: Task-oriented authorship obfuscation using policy optimization methods
G Loiseau, D Sileo, D Riquet, M Meyer, M Tommasi
Proceedings of the Sixth Workshop on Privacy in Natural Language Processing … , 2025
2025
Citations: 5
Recipient Profiling: Predicting Characteristics from Messages
M Borquez, M Keller, M Perrot, D Sileo
arXiv preprint arXiv:2412.12954 , 2024
2024
Citations: 1
Consent in crisis: The rapid decline of the ai data commons
S Longpre, R Mahari, A Lee, C Lund, H Oderinwale, W Brannon, ...
Advances in Neural Information Processing Systems 37, 108042-108087 , 2024
2024
Citations: 130
Scaling synthetic logical reasoning datasets with context-sensitive declarative grammars
D Sileo
Proceedings of the 2024 Conference on Empirical Methods in Natural Language … , 2024
2024
Citations: 6
A large-scale audit of dataset licensing and attribution in AI
S Longpre, R Mahari, A Chen, N Obeng-Marnu, D Sileo, W Brannon, ...
Nature Machine Intelligence 6 (8), 975-987 , 2024
2024
Citations: 218
DISRPT: A multilingual, multi-domain, cross-framework benchmark for discourse processing
C Braud, A Zeldes, L Rivière, YJ Liu, P Muller, D Sileo, T Aoyama
Proceedings of the 2024 Joint International Conference on Computational … , 2024
2024
Citations: 18
tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework
D Sileo
Proceedings of the 2024 Joint International Conference on Computational … , 2024
2024
Citations: 81
Generating multiple-choice questions for medical question answering with distractors and cue-masking
D Sileo, K Uma, MF Moens
Proceedings of the 2024 Joint International Conference on Computational … , 2024
2024
Citations: 9
Consent in crisis: The rapid decline of the ai data commons, 2024
S Longpre, R Mahari, A Lee, C Lund, H Oderinwale, W Brannon, ...
URL https://arxiv. org/abs/2407.14933 , 2024
2024
Citations: 8

MOST CITED SCHOLAR PUBLICATIONS

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
Transactions on machine learning research , 2023
2023
Citations: 2653
A benchmark of expert-level academic questions to assess AI capabilities
Center for AI Safety Phan Long agibenchmark@ safe. ai 1 Gatti Alice 1 Li ...
Nature 649 (8099), 1139-1146 , 2026
2026
Citations: 513
A large-scale audit of dataset licensing and attribution in AI
S Longpre, R Mahari, A Chen, N Obeng-Marnu, D Sileo, W Brannon, ...
Nature Machine Intelligence 6 (8), 975-987 , 2024
2024
Citations: 218
Consent in crisis: The rapid decline of the ai data commons
S Longpre, R Mahari, A Lee, C Lund, H Oderinwale, W Brannon, ...
Advances in Neural Information Processing Systems 37, 108042-108087 , 2024
2024
Citations: 130
Nl-augmenter: A framework for task-sensitive natural language augmentation
K Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ...
Northern European Journal of Language Technology 9 , 2023
2023
Citations: 106
Zero-Shot Recommendation as Language Modeling
D Sileo, W Vossen, R Raymaekers
European Conference on Information Retrieval, 223-230 , 2022
2022
Citations: 94
Mining Discourse Markers for Unsupervised Sentence Representation Learning
D Sileo, T Van-De-Cruys, C Pradel, P Muller
Proceedings of the 2019 Conference of the North American Chapter of the … , 2019
2019
Citations: 83
tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework
D Sileo
Proceedings of the 2024 Joint International Conference on Computational … , 2024
2024
Citations: 81
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
D Sileo, A Lernould
Findings of the Association for Computational Linguistics: EMNLP 2023, 4570–4577 , 2023
2023
Citations: 41
Bridging the data provenance gap across text, speech, and video
S Longpre, N Singh, M Cherep, K Tiwary, J Materzynska, W Brannon, ...
International Conference on Learning Representations 2025, 60592-60670 , 2025
2025
Citations: 27
The data provenance initiative: A large scale audit of dataset licensing & attribution in ai, 2023
S Longpre, R Mahari, A Chen, N Obeng-Marnu, D Sileo, W Brannon, ...
URL https://arxiv. org/abs/2310.16787 , 2023
2023
Citations: 21
A Pragmatics-Centered Evaluation Framework for Natural Language Understanding
D Sileo, P Muller, T Van de Cruys, C Pradel
Proceedings of the Thirteenth Language Resources and Evaluation Conference … , 2022
2022
Citations: 19
DISRPT: A multilingual, multi-domain, cross-framework benchmark for discourse processing
C Braud, A Zeldes, L Rivière, YJ Liu, P Muller, D Sileo, T Aoyama
Proceedings of the 2024 Joint International Conference on Computational … , 2024
2024
Citations: 18
DiscSense: Automated Semantic Analysis of Discourse Markers
D Sileo, T Van de Cruys, C Pradel, P Muller
Proceedings of The 12th Language Resources and Evaluation Conference, 991-999 , 2020
2020
Citations: 18
Probing neural language models for understanding of words of estimative probability
D Sileo, MF Moens
Proceedings of the 12th Joint Conference on Lexical and Computational … , 2023
2023
Citations: 16
Composition of Embeddings: Lessons from Statistical Relational Learning
D Sileo, T Van de Cruys, C Pradel, P Muller
8th Joint Conference on Lexical and Computational Semantics (SEM 2019), 33-43 , 2019
2019
Citations: 10
Generating multiple-choice questions for medical question answering with distractors and cue-masking
D Sileo, K Uma, MF Moens
Proceedings of the 2024 Joint International Conference on Computational … , 2024
2024
Citations: 9
Visual Grounding Strategies for Text-Only Natural Language Processing
D Sileo
Proceedings of the Third Workshop on Beyond Vision and LANguage: inTEgrating … , 2021
2021
Citations: 9
Consent in crisis: The rapid decline of the ai data commons, 2024
S Longpre, R Mahari, A Lee, C Lund, H Oderinwale, W Brannon, ...
URL https://arxiv. org/abs/2407.14933 , 2024
2024
Citations: 8
Analysis and Prediction of NLP Models Via Task Embeddings
D Sileo, MF Moens
Proceedings of The 13th Language Resources and Evaluation Conference, LREC 2022 , 2022
2022
Citations: 8

Damien Sileo

RESEARCH, TEACHING, or OTHER INTERESTS

Scopus Publications

RECENT SCHOLAR PUBLICATIONS

MOST CITED SCHOLAR PUBLICATIONS