I'm an Assistant Professor in the Department of Artificial Intelligence at the Wroclaw University of Science and Technology, where I earned both my Ph.D. in computer science (2018) and M.Sc. Eng. degree (2012). I am the AI/ML Team Leader and Senior ML/NLP Data Scientist for the CLARIN-BIZ and PLLuM projects. My passion for natural language processing (NLP) has spanned over a decade, with a keen interest in machine learning techniques. I've published over 90 scientific papers at prominent conferences, including ACL, ICDM, EMNLP, and more. My current endeavors involve pioneering deep learning models for subjective tasks such as emotion and sentiment analysis. I'm also delving into cross-lingual knowledge transfer and language-agnostic models. My contributions have been integral to CrisisDetector, StockBrief, Sentimenti, CLARIN-PL, and PLLuM projects. I enjoy imparting knowledge on data science, AI's role in NLP, and building sophisticated deep neural networks.
RESEARCH, TEACHING, or OTHER INTERESTS
Computer Science, Artificial Intelligence, Computer Science Applications, Signal Processing
Typology of Image Crises Using Large Language Models: A Novel Approach to Crisis Classification Grzegorz Chodak, Dariusz Tworzydło, Aleksander Szczęsny, Przemysław Kazienko, Oliwier Kaszyca, et al. Journal of Contingencies and Crisis Management, 2025 Image crises pose significant challenges for organizations and public figures, often requiring rapid identification and classification to mitigate reputational damage. This study introduces a novel typology of brand crises and demonstrates its application using large language models (LLMs) to enhance crisis detection and classification. We review the current state of knowledge of brand crises and LLMs, underlining their relevance in real‐world text analytics tasks. Based on an analysis of 300 actual crisis cases, we propose an original typology that captures various types and causes of crises. Our methodology combines expert data annotation with automatic crisis type annotation using a generative LLM. This approach enables both classification and early detection of crises in media texts. The results demonstrate that the GPT‐4‐turbo achieved strong performance in distinguishing ideological from nonideological crises (accuracy: 0.903; F1: 0.874), while GPT‐5 with a 2‐shot prompt and GPT‐4o‐mini excelled in identifying affected actors (accuracy and F1: 0.984). Performance was comparatively lower for detailed cause classification, highlighting the greater complexity of fine‐grained categorizations. This study highlights the potential and limitations of LLMs in developing automated crisis management systems to enhance organizational resilience.
Improving LLM-Based Recommender Systems with User-Controllable Profiles Stanisł, aw Woźniak, Jacek Duszenko, Jan Kocoń, Przemysaw Kazienko Www Companion 2025 Companion Proceedings of the ACM Web Conference 2025, 2025 Large Language Models (LLMs) have demonstrated significant potential across various domains, including their application in recommendation systems (RS). In this paper, we propose a method that emphasizes user control, thereby increasing the role of the human within the system. Our research investigates the effectiveness of a variety of LLMs in capturing and using user preferences for recommendation tasks. The findings reveal that incorporating user controllability into RS can enhance performance by up to 50%. Furthermore, the results highlight that textual and user-controlled representations of preferences, called user-controllable profiles, outperform historical data to improve recommendation quality.
Personalized Large Language Models Stanisław Woźniak, Bartłomiej Koptyra, Arkadiusz Janz, Przemysław Kazienko, Jan Kocoń IEEE International Conference on Data Mining Workshops Icdmw, 2024
PALS: Personalized Active Learning for Subjective Tasks in NLP Kamil Kanclerz, Konrad Karanowski, Julita Bielaniewicz, Marcin Gruza, Piotr Miłkowski, et al. Emnlp 2023 2023 Conference on Empirical Methods in Natural Language Processing Proceedings, 2023
Capturing Human Perspectives in NLP: Questionnaires, Annotations, and Biases Ceur Workshop Proceedings, 2023
RWKV: Reinventing RNNs for the Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, et al. Findings of the Association for Computational Linguistics Emnlp 2023, 2023
StudEmo: A Non-aggregated Review Dataset for Personalized Emotion Recognition 1st Workshop on Perspectivist Approaches to Disagreement in Nlp Nlperspectives 2022 as Part of Language Resources and Evaluation Conference Lrec 2022 Workshop, 2022
MultiEmo: Language-Agnostic Sentiment Analysis Piotr Miłkowski, Marcin Gruza, Przemysław Kazienko, Joanna Szołomicka, Stanisław Woźniak, et al. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2022
Multitask Personalized Recognition of Emotions Evoked by Textual Content Piotr Milkowski, Stanislaw Saganowski, Marcin Gruza, Przemyslaw Kazienko, Maciej Piasecki, et al. 2022 IEEE International Conference on Pervasive Computing and Communications Workshops and Other Affiliated Events Percom Workshops 2022, 2022
What if Ground Truth is Subjective? Personalized Deep Neural Hate Speech Detection 1st Workshop on Perspectivist Approaches to Disagreement in Nlp Nlperspectives 2022 as Part of Language Resources and Evaluation Conference Lrec 2022 Workshop, 2022
Towards a contextualised spatial-diachronic history of literature: mapping emotional representations of the city and the country in Polish fiction from 1864 to 1939 Proceedings International Conference on Computational Linguistics Coling, 2022
Neuro-Symbolic Models for Sentiment Analysis Jan Kocoń, Joanna Baran, Marcin Gruza, Arkadiusz Janz, Michał Kajstura, et al. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2022
Evaluating Natural Language Processing tools for Polish during PolEval 2019 Łukasz Kobyliński, Maciej Ogrodniczuk, Jan Kocoń, Michał Marcińczuk, Aleksander Smywiński-Pohl, et al. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2022
Personal bias in prediction of emotions elicited by textual opinions Piotr Milkowski, Marcin Gruza, Kamil Kanclerz, Przemyslaw Kazienko, Damian Grimling, et al. Acl Ijcnlp 2021 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing Proceedings of the Student Research Workshop, 2021
Controversy and conformity: From generalized to personalized aggressiveness detection Kamil Kanclerz, Alicja Figas, Marcin Gruza, Tomasz Kajdanowicz, Jan Kocon, et al. Acl Ijcnlp 2021 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing Proceedings of the Conference, 2021
Propagation of emotions, arousal and polarity in WordNet using heterogeneous structured synset embeddings Proceedings of the 10th Global Wordnet Conference, 2020
Inforex - A collaborative system for text corpora annotation and analysis G4.19 Research Group, Department of Computational Intelligence, Faculty of Computer Science and Management, Wrocław University of Technology, Wrocław, Poland, Michał Marcińczuk, Marcin Oleksy, Jan Kocoń International Conference Recent Advances in Natural Language Processing Ranlp, 2017
Recognition of Genuine Polish suicide notes Wrocław University of Science and Technology, Wrocław, Poland, Maciej Piasecki, Ksenia Młynarczyk, Jan Kocoń International Conference Recent Advances in Natural Language Processing Ranlp, 2017
Liner2 - A generic framework for named entity recognition Bsnlp 2017 6th Workshop on Balto Slavic Natural Language Processing at the 15th Conference of the European Chapter of the Association for Computational Linguistics Eacl 2017, 2017
Heterogeneous named entity similarity function Jan Kocoń, Maciej Piasecki Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2012
Inforex - A web-based tool for text corpus management and semantic annotation Proceedings of the 8th International Conference on Language Resources and Evaluation Lrec 2012, 2012
RECENT SCHOLAR PUBLICATIONS
What properties of reasoning supervision are associated with improved downstream model quality? M Langner, D Pihulski, J Eliasz, M Rajkowski, P Kazienko, M Piasecki, ... arXiv preprint arXiv:2605.13290 , 2026 2026
Exploring the future of psychometrics from a Large Language Model perspective: A case study analysis W Mieleszczenko-Kowszewicz, J Bielaniewicz, K Kanclerz, J Kocoń, ... Computers in Human Behavior Reports 22, 101060 , 2026 2026
Breaking the Illusion of Reasoning in Polish LLMs: Quality over Quantity of Thought D Pihulski, M Langner, J Eliasz, P Kazienko, J Kocon, T Ferdinan Findings of the Association for Computational Linguistics: EACL 2026, 1796-1811 , 2026 2026 Citations: 1
Architectural Concepts for Integrating Fundamental Drives and Emotions Into Artificial Intelligence T Ferdinan, W Mieleszczenko-Kowszewicz, J Kocoń, P Kazienko IEEE Intelligent Systems 40 (6), 91-98 , 2025 2025 Citations: 1
Typology of Image Crises Using Large Language Models: A Novel Approach to Crisis Classification G Chodak, D Tworzydło, A Szczęsny, P Kazienko, O Kaszyca, K Bilski, ... Journal of Contingencies and Crisis Management 33 (4), e70092 , 2025 2025 Citations: 1
CLARIN-PL: a user centred language technology infrastructure: M. Piasecki et al. M Piasecki, A Dziob, A Janz, J Kocoń, T Naskrȩt, M Oleksy, E Rudnicka, ... Language Resources and Evaluation 59 (4), 4493-4528 , 2025 2025 Citations: 4
The PLLuM Instruction Corpus P Pęzik, F Żarnecki, K Kaczyński, A Cichosz, Z Deckert, M Garnys, ... arXiv preprint arXiv:2511.17161 , 2025 2025 Citations: 1
PLLuM: A Family of Polish Large Language Models J Kocoń, M Piasecki, A Janz, T Ferdinan, Ł Radliński, B Koptyra, M Oleksy, ... arXiv preprint arXiv:2511.03823 , 2025 2025 Citations: 5
Divide, Cache, Conquer: Dichotomic Prompting for Efficient Multi-Label LLM-Based Classification M Langner, J Eliasz, E Rudnicka, J Kocoń arXiv preprint arXiv:2511.03830 , 2025 2025 Citations: 1
Global piqa: Evaluating physical commonsense reasoning across 100+ languages and cultures TA Chang, C Arnett, A Eldesokey, A Sadallah, A Kashar, A Daud, ... arXiv preprint arXiv:2510.24081 , 2025 2025 Citations: 11
Language, Culture, and Ideology: Personalizing Offensiveness Detection in Political Tweets with Reasoning LLMs D Pihulski, J Kocoń arXiv preprint arXiv:2510.02351 , 2025 2025 Citations: 1
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL D Pihulski, K Charchut, V Novogrodskaia, J Kocoń arXiv preprint arXiv:2510.02350 , 2025 2025 Citations: 1
Predicting stock prices with ChatGPT-annotated Reddit sentiment: Hype or reality? M Kmak, K Chmurzyński, K Matejuk, P Kotzbach, J Kocoń International Conference on Computational Science, 307-322 , 2025 2025 Citations: 1
Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset J Wąsala, B Wrzalski, K Noculak, Y Tarasenko, O Krupa, J Kocoń, ... International Conference on Computational Science, 119-134 , 2025 2025 Citations: 1
SupResDiffGAN a new approach for the Super-Resolution task D Kopeć, W Kozłowski, M Wizerkaniuk, D Krutul, J Kocoń, M Zięba International Conference on Computational Science, 66-80 , 2025 2025 Citations: 6
AggTruth: Contextual Hallucination Detection using Aggregated Attention Scores in LLMs P Matys, J Eliasz, K Kiełczyński, M Langner, T Ferdinan, J Kocoń, ... International Conference on Computational Science, 227-243 , 2025 2025 Citations: 3
Backtranslation and paraphrasing in the llm era? comparing data augmentation methods for emotion classification Ł Radliński, M Guściora, J Kocoń International Conference on Computational Science, 3-17 , 2025 2025 Citations: 4
Integrating personalized and contextual information in fine-grained emotion recognition in text: A multi-source fusion approach with explainability A Ngo, J Kocoń Information Fusion 118, 102966 , 2025 2025 Citations: 10
Improving llm-based recommender systems with user-controllable profiles S Woźniak, J Duszenko, J Kocoń, P Kazienko Companion Proceedings of the ACM on Web Conference 2025, 2102-2111 , 2025 2025 Citations: 9
Fortifying nlp models against poisoning attacks: The power of personalized prediction architectures T Ferdinan, J Kocoń Information Fusion 114, 102692 , 2025 2025 Citations: 9
MOST CITED SCHOLAR PUBLICATIONS
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... Transactions on machine learning research , 2023 2023 Citations: 2646
Rwkv: Reinventing rnns for the transformer era B Peng, E Alcaide, Q Anthony, A Albalak, S Arcadinho, S Biderman, ... Findings of the association for computational linguistics: EMNLP 2023, 14048 … , 2023 2023 Citations: 1301
ChatGPT: Jack of all trades, master of none J Kocoń, I Cichecki, O Kaszyca, M Kochanek, D Szydło, J Baran, ... Information fusion 99, 101861 , 2023 2023 Citations: 1198
Offensive, aggressive, and hate speech analysis: From data-centric to human-centered approach J Kocoń, A Figas, M Gruza, D Puchalska, T Kajdanowicz, P Kazienko Information Processing & Management 58 (5), 102643 , 2021 2021 Citations: 169
Eagle and finch: Rwkv with matrix-valued states and dynamic recurrence B Peng, D Goldstein, Q Anthony, A Albalak, E Alcaide, S Biderman, ... arXiv preprint arXiv:2404.05892 , 2024 2024 Citations: 162
Personalized large language models S Woźniak, B Koptyra, A Janz, P Kazienko, J Kocoń 2024 IEEE International Conference on Data Mining Workshops (ICDMW), 511-520 , 2024 2024 Citations: 77
Multi-level sentiment analysis of PolEmo 2.0: Extended corpus of multi-domain consumer reviews J Kocoń, P Miłkowski, M Zaśko-Zielińska Proceedings of the 23rd Conference on Computational Natural Language … , 2019 2019 Citations: 73
Liner2–a customizable framework for proper names recognition for Polish M Marcińczuk, J Kocoń, M Janicki Intelligent Tools for Building a Scientific Information Platform: Advanced … , 2013 2013 Citations: 61
Learning personal human biases and representations for subjective tasks in natural language processing J Kocoń, M Gruza, J Bielaniewicz, D Grimling, K Kanclerz, P Miłkowski, ... 2021 IEEE international conference on data mining (ICDM), 1168-1173 , 2021 2021 Citations: 60
Human-centered neural reasoning for subjective content processing: Hate speech, emotions, and humor P Kazienko, J Bielaniewicz, M Gruza, K Kanclerz, K Karanowski, ... Information Fusion 94, 43-65 , 2023 2023 Citations: 55
Personal bias in prediction of emotions elicited by textual opinions P Miłkowski, M Gruza, K Kanclerz, P Kazienko, D Grimling, J Kocon Proceedings of the 59th annual meeting of the association for computational … , 2021 2021 Citations: 54
Cross-lingual deep neural transfer learning in sentiment analysis K Kanclerz, P Miłkowski, J Kocoń Procedia Computer Science 176, 128-137 , 2020 2020 Citations: 54
Controversy and conformity: from generalized to personalized aggressiveness detection K Kanclerz, A Figas, M Gruza, T Kajdanowicz, J Kocoń, D Puchalska, ... Proceedings of the 59th Annual Meeting of the Association for Computational … , 2021 2021 Citations: 49
Clarin-emo: Training emotion recognition models using human annotation and chatgpt B Koptyra, A Ngo, Ł Radliński, J Kocoń International conference on computational science, 365-379 , 2023 2023 Citations: 44
What if ground truth is subjective? personalized deep neural hate speech detection K Kanclerz, M Gruza, K Karanowski, J Bielaniewicz, P Miłkowski, J Kocoń, ... Proceedings of the 1st Workshop on Perspectivist Approaches to NLP@ LREC2022 … , 2022 2022 Citations: 40
plWordNet as a basis for large emotive lexicons of Polish A Janz, J Kocon, M Piasecki, M Zasko-Zielinska Proceedings of Human Language Technologies as a Challenge for Computer … , 2017 2017 Citations: 37
Neuro-symbolic models for sentiment analysis J Kocoń, J Baran, M Gruza, A Janz, M Kajstura, P Kazienko, W Korczyński, ... International conference on computational science, 667-681 , 2022 2022 Citations: 35
Multiemo: Multilingual, multilevel, multidomain sentiment analysis corpus of consumer reviews J Kocoń, P Miłkowski, K Kanclerz International Conference on Computational Science, 297-312 , 2021 2021 Citations: 34
Studemo: A non-aggregated review dataset for personalized emotion recognition A Ngo, A Candri, T Ferdinan, J Kocoń, W Korczynski Proceedings of the 1st Workshop on Perspectivist Approaches to NLP@ LREC2022 … , 2022 2022 Citations: 28
Multitask personalized recognition of emotions evoked by textual content P Miłkowski, S Saganowski, M Gruza, P Kazienko, M Piasecki, J Kocoń 2022 IEEE International Conference on Pervasive Computing and Communications … , 2022 2022 Citations: 27