Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs Flint Xiaofeng Fan, Cheston Tan, Roger Wattenhofer, Yew-Soon Ong Proceedings of the International Joint Conference on Neural Networks, 2025 The rise of reinforcement learning (RL) in critical real-world applications demands a fundamental rethinking of privacy in AI systems. Traditional privacy frameworks, designed to protect isolated data points, fall short for sequential decision-making systems where sensitive information emerges from temporal patterns, behavioral strategies, and collaborative dynamics. Modern RL paradigms, such as federated RL (FedRL) and RL with human feedback (RLHF) in large language models (LLMs), exacerbate these challenges by introducing complex, interactive, and context-dependent learning environments that traditional methods do not address. In this position paper, we argue for a new privacy paradigm built on four core principles: multi-scale protection, behavioral pattern protection, collaborative privacy preservation, and context-aware adaptation. These principles expose inherent tensions between privacy, utility, and interpretability that must be navigated as RL systems become more pervasive in high-stakes domains like healthcare, autonomous vehicles, and decision support systems powered by LLMs. To tackle these challenges, we call for the development of new theoretical frameworks, practical mechanisms, and rigorous evaluation methodologies that collectively enable effective privacy protection in sequential decision-making systems.
FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems Aamas, 2025
Revisiting the Byzantine Resilience of Federated Reinforcement Learning: A Distillation Perspective Wenzheng Jiang, Ji Wang, Zhengyi Zhong, Jiangzhou Liao, Xiaomin Zhu, Flint Xiaofeng Fan Proceedings of the IEEE International Conference on Trust Security and Privacy in Computing and Communications Trustcom, 2025 Federated reinforcement learning (FRL) enhances sample efficiency while preserving data privacy. However, standard FRL frameworks rely on aggregating model parameters or gradients, making them vulnerable to Byzantine attacks. Current Byzantine-resilient approaches primarily focus on server-side robust aggregations, leaving the fundamental vulnerability of transmitting parameters unaddressed. In this paper, we revisit Byzantine resilience in FRL from the knowledge distillation (KD) perspective. KD-based FRL uploads policy representations instead of policy parameters. This framework-level shift fundamentally constrains the attack surface. We theoretically prove traditional FRL suffers unbounded corruption from Byzantine agents, whereas KD-based FRL converges to an ${\mathcal{O}}(\alpha )$-stationary point under α-fraction adversaries, formalizing the accuracy-robustness trade-off. Empirical validation confirms the Byzantine resilience of KD-based FRL: it maintains near-optimal performance across diverse attacks and even withstands Byzantine fractions up to 0.9. Our theoretical guarantees and experiments demonstrate distillation endows FRL with fundamentally stronger resilience.
FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems Aamas, 2025
Diversifying Policy Behaviors via Extrinsic Behavior Curiosity Proceedings of Machine Learning Research, 2025
SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems Nathan Corecco, Giorgio Piatti, Luca A. Lanzendörfer, Flint Xiaofeng Fan, Roger Wattenhofer Frontiers in Artificial Intelligence and Applications, 2024 Reinforcement learning (RL) has gained popularity in the realm of recommender systems due to its ability to optimize long-term rewards and guide users in discovering relevant content. However, the successful implementation of RL in recommender systems is challenging because of several factors, including the limited availability of online data for training on-policy methods. This scarcity requires expensive human interaction for online model training. Furthermore, the development of effective evaluation frameworks that accurately reflect the quality of models remains a fundamental challenge in recommender systems. To address these challenges, we propose a comprehensive framework for synthetic environments that simulate human behavior by harnessing the capabilities of large language models (LLMs). We complement our framework with in-depth ablation studies and demonstrate its effectiveness with experiments on movie and book recommendations. Using LLMs as synthetic users, this work introduces a modular and novel framework to train RL-based recommender systems. The software, including the RL environment, is publicly available on https://github.com/SUBER-Team/SUBER.
Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems Aamas, 2024
FedHQL: Federated Heterogeneous Q-Learning Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems Aamas, 2023
FEDERATED NEURAL BANDITS 11th International Conference on Learning Representations Iclr 2023, 2023
Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee Advances in Neural Information Processing Systems, 2021
RECENT SCHOLAR PUBLICATIONS
Unveiling and Mitigating Untargeted Poisoning Attacks on Federated Knowledge Graph Embedding W Jiang, K Liang, W Huang, X Zhang, Z Xu, G Wan, C Tan, FX Fan, ... Proceedings of the ACM Web Conference 2026, 2569-2580 , 2026 2026
Why Do We Suffer for Fun? Ordeal Pleasure in Souls-like Games FX Fan arXiv preprint arXiv:2603.26677 , 2026 2026
Information fidelity in tool-using llm agents: A martingale analysis of the model context protocol FX Fan, C Tan, R Wattenhofer, YS Ong arXiv preprint arXiv:2602.13320 , 2026 2026 Citations: 2
Provably Reliable Tool-Using LLM Agents: Formal Guarantees on Error Accumulation in the Model Context Protocol (MCP) FX Fan, C Tan, R Wattenhofer, YS Ong 2026
Revisiting the Byzantine Resilience of Federated Reinforcement Learning: A Distillation Perspective W Jiang, J Wang, Z Zhong, J Liao, X Zhu, FX Fan 2025 IEEE 24th International Conference on Trust, Security and Privacy in … , 2025 2025
Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs FX Fan, C Tan, R Wattenhofer, YS Ong International Joint Conference on Neural Networks (IJCNN) 2025 , 2025 2025 Citations: 6
FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation W Jiang, J Wang, X Zhang, W Bao, C Tan, FX Fan International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2025 2025 Citations: 7
FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF FX Fan, C Tan, YS Ong, R Wattenhofer, WT Ooi International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2024 2024 Citations: 20
From Myth to Market: Lessons from Black Myth: Wukong's Success FX Fan Digital Games Research Association (DiGRA) 2025 , 2024 2024 Citations: 6
Diversifying Policy Behaviors with Extrinsic Behavioral Curiosity Z Wan, X Yu, DM Bossens, Y Lyu, Q Guo, FX Fan, YS Ong, I Tsang arXiv preprint arXiv:2410.06151 , 2024 2024 Citations: 4
An LLM-based Recommender System Environment N Corecco, G Piatti, LA Lanzendörfer, FX Fan, R Wattenhofer KDD 2024 workshop on Generative AI for Recommender Systems and Personalization , 2024 2024 Citations: 10
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening HY Mak, FX Fan, LA Lanzendörfer, C Tan, WT Ooi, R Wattenhofer AAMAS 2024 workshop on Adaptive and Learning Agents (ALA 2024) , 2024 2024 Citations: 8
Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence P Jordan, F Grötschla, FX Fan, R Wattenhofer International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2024 2024 Citations: 12
Chapter 14-Federated sequential decision making: Bayesian optimization, reinforcement learning, and beyond Z Dai, FX Fan, C Tan, TN Hoang, BKH Low, P Jaillet Federated Learning. Academic Press , 2024 2024 Citations: 3
Federated sequential decision making: Bayesian optimization, reinforcement learning, and beyond Z Dai, FX Fan, C Tan, TN Hoang, BKH Low, P Jaillet Federated Learning, 257-279 , 2024 2024 Citations: 6
Quality diversity imitation learning Z Wan, X Yu, DM Bossens, Y Lyu, Q Guo, FX Fan, I Tsang 2024 Citations: 7
SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems N Corecco, G Piatti, LA Lanzendörfer, FX Fan, R Wattenhofer European Conference on Artificial Intelligence (ECAI) 2024 , 2023 2023 Citations: 17
Reinforcement Learning of TSP Heuristics with Message Passing Neural Networks L Holbein, Y Schmid ETH (MSc) student thesis , 2023 2023
Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning X Lu, FX Fan, T Wang ICML 2023 Workshop Frontiers4LCD , 2023 2023 Citations: 19
FedHQL: Federated Heterogeneous Q-Learning FX Fan, Y Ma, Z Dai, C Tan, BKH Low, R Wattenhofer International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2023 2023 Citations: 24
MOST CITED SCHOLAR PUBLICATIONS
Fault-tolerant federated reinforcement learning with theoretical guarantee X Fan, Y Ma, Z Dai, W Jing, C Tan, BKH Low Advances in neural information processing systems (NeurIPS) 2021 , 2021 2021 Citations: 126
Federated neural bandit Z Dai, Y Shu, A Verma, FX Fan, BKH Low, P Jaillet International Conference on Learning Representations (ICLR) 2023 , 2022 2022 Citations: 40
FedHQL: Federated Heterogeneous Q-Learning FX Fan, Y Ma, Z Dai, C Tan, BKH Low, R Wattenhofer International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2023 2023 Citations: 24
FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF FX Fan, C Tan, YS Ong, R Wattenhofer, WT Ooi International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2024 2024 Citations: 20
Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning X Lu, FX Fan, T Wang ICML 2023 Workshop Frontiers4LCD , 2023 2023 Citations: 19
SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems N Corecco, G Piatti, LA Lanzendörfer, FX Fan, R Wattenhofer European Conference on Artificial Intelligence (ECAI) 2024 , 2023 2023 Citations: 17
Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence P Jordan, F Grötschla, FX Fan, R Wattenhofer International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2024 2024 Citations: 12
An LLM-based Recommender System Environment N Corecco, G Piatti, LA Lanzendörfer, FX Fan, R Wattenhofer KDD 2024 workshop on Generative AI for Recommender Systems and Personalization , 2024 2024 Citations: 10
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening HY Mak, FX Fan, LA Lanzendörfer, C Tan, WT Ooi, R Wattenhofer AAMAS 2024 workshop on Adaptive and Learning Agents (ALA 2024) , 2024 2024 Citations: 8
FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation W Jiang, J Wang, X Zhang, W Bao, C Tan, FX Fan International Conference on Autonomous Agents and Multiagent Systems (AAMAS … , 2025 2025 Citations: 7
Quality diversity imitation learning Z Wan, X Yu, DM Bossens, Y Lyu, Q Guo, FX Fan, I Tsang 2024 Citations: 7
Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs FX Fan, C Tan, R Wattenhofer, YS Ong International Joint Conference on Neural Networks (IJCNN) 2025 , 2025 2025 Citations: 6
From Myth to Market: Lessons from Black Myth: Wukong's Success FX Fan Digital Games Research Association (DiGRA) 2025 , 2024 2024 Citations: 6
Federated sequential decision making: Bayesian optimization, reinforcement learning, and beyond Z Dai, FX Fan, C Tan, TN Hoang, BKH Low, P Jaillet Federated Learning, 257-279 , 2024 2024 Citations: 6
Diversifying Policy Behaviors with Extrinsic Behavioral Curiosity Z Wan, X Yu, DM Bossens, Y Lyu, Q Guo, FX Fan, YS Ong, I Tsang arXiv preprint arXiv:2410.06151 , 2024 2024 Citations: 4
Chapter 14-Federated sequential decision making: Bayesian optimization, reinforcement learning, and beyond Z Dai, FX Fan, C Tan, TN Hoang, BKH Low, P Jaillet Federated Learning. Academic Press , 2024 2024 Citations: 3
Information fidelity in tool-using llm agents: A martingale analysis of the model context protocol FX Fan, C Tan, R Wattenhofer, YS Ong arXiv preprint arXiv:2602.13320 , 2026 2026 Citations: 2
Unveiling and Mitigating Untargeted Poisoning Attacks on Federated Knowledge Graph Embedding W Jiang, K Liang, W Huang, X Zhang, Z Xu, G Wan, C Tan, FX Fan, ... Proceedings of the ACM Web Conference 2026, 2569-2580 , 2026 2026
Why Do We Suffer for Fun? Ordeal Pleasure in Souls-like Games FX Fan arXiv preprint arXiv:2603.26677 , 2026 2026
Provably Reliable Tool-Using LLM Agents: Formal Guarantees on Error Accumulation in the Model Context Protocol (MCP) FX Fan, C Tan, R Wattenhofer, YS Ong 2026