Lakshmanan Kailasam

I joined IIT (BHU) in 2017. Earlier I completed Ph.D from IISc., Bangalore and had post-doc experience from IIT Bombay, University of Leoben, Austria and National University of Singapore. My research interests are in machine learning where I work mostly in reinforcement learning and its applications. I'm also associated with the Coforge Data and AI Lab and the Indian Knowledge Systems Center of Excellence at IIT (BHU).

EDUCATION

PhD Computer Science and Automation, Indian Institute of Science, Bangalore - 2013

RESEARCH, TEACHING, or OTHER INTERESTS

Computer Science, Artificial Intelligence, Computer Science Applications

Scopus Publications

Influence maximization independent of seed set size
K. Lakshmanan
Operations Research Letters, 2025
A deep actor critic reinforcement learning framework for learning to rank
Vaibhav Padhye, Kailasam Lakshmanan
Neurocomputing, 2023
Proximal policy optimization based hybrid recommender systems for large scale recommendations
Vaibhav Padhye, Kailasam Lakshmanan, Amrita Chaturvedi
Multimedia Tools and Applications, 2023
Stochastic Arrow-Hurwicz Algorithm for Path Selection and Rate Allocation in Self-Backhauled mmWave Networks
Abhinav Sharma, K. Lakshmanan, Ruchir Gupta, Atul Gupta
IEEE Communications Letters, 2022
Resource Allocation in Edge Computing: A Game-Theoretic Perspective
Sumit Kumar, Ruchir Gupta, K Lakshmanan
2022 IEEE Global Conference on Computing Power and Communication Technologies Globconpt 2022, 2022
Edge computing is a novel computing framework that offers decentralized computing power in order to reduce latency and save bandwidth. This technology is critical in the fast-evolving app market, where app vendors hire edge servers to serve their users. To compete with others, each app vendor wants to employ the edge servers in such a way that it costs the least. Therefore, all app vendors need a solution that minimizes the cost of hiring edge servers. This paper proposes an Edge Server Hiring game (ESHGame), a game-theoretical approach to the problem that formulates it as a potential game. We demonstrate in ESHGame that app vendors collectively achieve at least one pure Nash equilibrium (PNE). We also set an upper bound on the price of stability in this game.
Multicasting: A Game-Theoretic Method for Constructing an Efficient Multicast Tree
Sumit Kumar, Ruchir Gupta, K Lakshmanan
4th International Conference on Inventive Research in Computing Applications Icirca 2022 Proceedings, 2022
In multicast routing, the path search process can be formulated as a network game where end-users are considered self-motivated players. Every end-user must pay a cost for the path to get the data from the source. Therefore, end-users want to construct the path from the source at the minimum possible cost. In order to get the minimum cost, end-users will keep modifying paths, which leads to instability. When no end-user can do better by modifying his path, the stable state is considered a Pure Nash equilibrium (PNE). The quality of PNE depends on the cost-sharing mechanism among the end-users of edges. Bringing PNE close to the social optimum (minimum total cost for all end-users) has always been a goal for various cost-sharing strategies. It constructs a cost-effective multicast tree for group communication. Many authors analyzed the cost-sharing mechanisms based on the Shapley value concept, far from the social optimum. Our study aims to identify a weighted cost-sharing mechanism and formulate the path construction process as a Multicast Tree Construction Game (MTCGame). This game employs an MTC algorithm to reach PNE. We prove that at least one PNE always exists in MTCGame. We analyze the quality of the PNE numerically.
Comment on 'Federated Learning with Differential Privacy: Algorithms and Performance Analysis'
Krishnan Rajkumar, Antriksh Goswami, K. Lakshmanan, Ruchir Gupta
IEEE Transactions on Information Forensics and Security, 2022
A Game-Theoretic Approach for Increasing Resource Utilization in Edge Computing Enabled Internet of Things
Sumit Kumar, Ruchir Gupta, K. Lakshmanan, Vipin Maurya
IEEE Access, 2022
Edge computing is a new paradigm that reduces latency and saves bandwidth by deploying edge servers in different geographic locations. This technology plays a crucial role in the rapidly growing apps market for IoT devices as app vendors can hire computing resources on edge servers to serve their app users. An effective allocation of edge computing resources to different apps is needed to maximize resource utilization and serve the most app users at the lowest cost. We refer to this as an Edge Resource Allocation (ERA) problem. In this paper, we propose an Edge Resource Allocation Game (ERAGame), a game-theoretic approach that formulates the ERA problem by appropriately pricing the multi-tenant edge servers. The proposed approach gives a Pure Nash Equilibrium (PNE) solution to the ERA problem. For this, we design an ERA algorithm using ERAGame under which the system converges to PNE. For fast convergence to PNE, the edge servers are partitioned into different groups, enabling the ERA algorithm to run in parallel on all edge servers within each group. We prove that ERAGame is a potential game that guarantees at least one PNE under the ERA algorithm. We evaluate that the price of stability of ERAGame is at most O(log n). The performance of the proposed algorithm is examined through simulation.
PILHNB: Popularity, interests, location used hidden Naive Bayesian-based model for link prediction in dynamic social networks
Ashwini Kumar Singh, Kailasam Lakshmanan
Neurocomputing, 2021
Transition based discount factor for model free algorithms in reinforcement learning
Abhinav Sharma, Ruchir Gupta, K. Lakshmanan, Atul Gupta
Symmetry, 2021
Reinforcement Learning (RL) enables an agent to learn control policies for achieving its long-term goals. One key parameter of RL algorithms is a discount factor that scales down future cost in the state’s current value estimate. This study introduces and analyses a transition-based discount factor in two model-free reinforcement learning algorithms: Q-learning and SARSA, and shows their convergence using the theory of stochastic approximation for finite state and action spaces. This causes an asymmetric discounting, favouring some transitions over others, which allows (1) faster convergence than constant discount factor variant of these algorithms, which is demonstrated by experiments on the Taxi domain and MountainCar environments; (2) provides better control over the RL agents to learn risk-averse or risk-taking policy, as demonstrated in a Cliff Walking experiment.
Multi-time scale smoothed functional with nesterov's acceleration
Abhinav Sharma, K. Lakshmanan, Ruchir Gupta, Atul Gupta
IEEE Access, 2021
A Novel Cloud-Assisted Secure Deep Feature Classification Framework for Cancer Histopathology Images
Abhinav Kumar, Sanjay Kumar Singh, K Lakshmanan, Sonal Saxena, Sameer Shrivastava
ACM Transactions on Internet Technology, 2021
CoMHisP: A Novel Feature Extractor for Histopathological Image Classification Based on Fuzzy SVM with Within-Class Relative Density
Abhinav Kumar, Sanjay Kumar Singh, Sonal Saxena, Amit Kumar Singh, Sameer Shrivastava, et al.
IEEE Transactions on Fuzzy Systems, 2021
A parameter-free affinity based clustering
Bhaskar Mukhoty, Ruchir Gupta, Lakshmanan K., Mayank Kumar
Applied Intelligence, 2020
Beam Alignment for mmWave Using Non-Stationary Bandits
Ruchir Gupta, K. Lakshmanan, Abhay Kumar Sah
IEEE Communications Letters, 2020
Deep feature learning for histopathological image classification of canine mammary tumors and human breast cancer
Abhinav Kumar, Sanjay Kumar Singh, Sonal Saxena, K. Lakshmanan, Arun Kumar Sangaiah, et al.
Information Sciences, 2020
Accelerated Reinforcement Learning
K. Lakshmanan
2017 14th IEEE India Council International Conference Indicon 2017, 2018
Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization
K. Lakshmanan, Shalabh Bhatnagar
Computational Optimization and Applications, 2017
Multiscale Q-learning with linear function approximation
Shalabh Bhatnagar, K. Lakshmanan
Discrete Event Dynamic Systems Theory and Applications, 2016
Improved regret bounds for undiscounted continuous reinforcement learning
32nd International Conference on Machine Learning Icml 2015, 2015
A novel Q-learning algorithm with function approximation for constrained Markov decision processes
K. Lakshmanan, Shalabh Bhatnagar
2012 50th Annual Allerton Conference on Communication Control and Computing Allerton 2012, 2012
An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes
Shalabh Bhatnagar, K. Lakshmanan
Journal of Optimization Theory and Applications, 2012
Smoothed functional and quasi-newton algorithms for routing in multi-stage queueing network with constraints
K. Lakshmanan, Shalabh Bhatnagar
Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2011

RECENT SCHOLAR PUBLICATIONS

Learning in Function Spaces: An Unified Functional Analytic View of Supervised and Unsupervised Learning
K Lakshmanan
arXiv preprint arXiv:2603.14272 , 2026
2026
A Short Survey of Averaging Techniques in Stochastic Gradient Methods
K Lakshmanan
arXiv preprint arXiv:2603.09634 , 2026
2026
Influence maximization independent of seed set size
K Lakshmanan
Operations Research Letters 62, 107309 , 2025
2025
Recursive Class Field Theory: A Computable Framework for Abelian Extensions
K Lakshmanan
2025
On the Minimality of the Conductor in Rank Bounds for Elliptic Curves
K Lakshmanan
arXiv preprint arXiv:2506.20175 , 2025
2025
Computational complexity of finding subgroups of a given order
K Lakshmanan
arXiv preprint arXiv:2503.11238 , 2025
2025
Citations: 1
Primes and Bivariate Polynomials without Constant Terms: A Recursive Algorithm
K Lakshmanan
arXiv preprint arXiv:2405.10519 , 2024
2024
Uncomputability of Global Optima for Nonconvex Functions in the Oracle Model
K Lakshmanan
arXiv preprint arXiv:2401.09436 , 2023
2023
A deep actor critic reinforcement learning framework for learning to rank
V Padhye, K Lakshmanan
Neurocomputing 547, 126314 , 2023
2023
Citations: 35
Proximal policy optimization based hybrid recommender systems for large scale recommendations
V Padhye, K Lakshmanan, A Chaturvedi
Multimedia Tools and Applications 82 (13), 20079-20100 , 2023
2023
Citations: 13
Comment on “Federated learning with differential privacy: Algorithms and performance analysis”
K Rajkumar, A Goswami, K Lakshmanan, R Gupta
IEEE Transactions on Information Forensics and Security 17, 3922-3924 , 2022
2022
Citations: 12
Resource allocation in edge computing: A game-theoretic perspective
S Kumar, R Gupta, K Lakshmanan
2022 IEEE Global Conference on Computing, Power and Communication … , 2022
2022
Citations: 3
Multicasting: A Game-Theoretic Method for Constructing an Efficient Multicast Tree
S Kumar, R Gupta, K Lakshmanan
2022 4th International Conference on Inventive Research in Computing … , 2022
2022
Citations: 1
A game-theoretic approach for increasing resource utilization in edge computing enabled internet of things
S Kumar, R Gupta, K Lakshmanan, V Maurya
IEEE Access 10, 57974-57989 , 2022
2022
Citations: 29
An unsupervised software fault prediction approach using threshold derivation
R Kumar, A Chaturvedi, L Kailasam
IEEE Transactions on Reliability 71 (2), 911-932 , 2022
2022
Citations: 24
Stochastic arrow-hurwicz algorithm for path selection and rate allocation in self-backhauled mmWave networks
A Sharma, K Lakshmanan, R Gupta, A Gupta
IEEE Communications Letters 26 (3), 716-720 , 2021
2021
Citations: 2
PILHNB: Popularity, interests, location used hidden Naive Bayesian-based model for link prediction in dynamic social networks
AK Singh, K Lakshmanan
Neurocomputing 461, 562-576 , 2021
2021
Citations: 25
Link prediction-based influence maximization in online social networks
AK Singh, L Kailasam
Neurocomputing 453, 151-163 , 2021
2021
Citations: 41
Multi-time scale smoothed functional with nesterov’s acceleration
A Sharma, K Lakshmanan, R Gupta, A Gupta
IEEE Access 9, 113489-113499 , 2021
2021
Citations: 4
Transition based discount factor for model free algorithms in reinforcement learning
A Sharma, R Gupta, K Lakshmanan, A Gupta
Symmetry 13 (7), 1197 , 2021
2021
Citations: 7

MOST CITED SCHOLAR PUBLICATIONS

An online actor–critic algorithm with function approximation for constrained markov decision processes
S Bhatnagar, K Lakshmanan
Journal of Optimization Theory and Applications 153 (3), 688-708 , 2012
2012
Citations: 347
Deep feature learning for histopathological image classification of canine mammary tumors and human breast cancer
A Kumar, SK Singh, S Saxena, K Lakshmanan, AK Sangaiah, H Chauhan, ...
Information Sciences 508, 405-421 , 2020
2020
Citations: 288
CoMHisP: A novel feature extractor for histopathological image classification based on fuzzy SVM with within-class relative density
A Kumar, SK Singh, S Saxena, AK Singh, S Shrivastava, K Lakshmanan, ...
IEEE Transactions on Fuzzy Systems 29 (1), 103-117 , 2020
2020
Citations: 52
Improved regret bounds for undiscounted continuous reinforcement learning
K Lakshmanan, R Ortner, D Ryabko
International conference on machine learning, 524-532 , 2015
2015
Citations: 50
Link prediction-based influence maximization in online social networks
AK Singh, L Kailasam
Neurocomputing 453, 151-163 , 2021
2021
Citations: 41
A deep actor critic reinforcement learning framework for learning to rank
V Padhye, K Lakshmanan
Neurocomputing 547, 126314 , 2023
2023
Citations: 35
A game-theoretic approach for increasing resource utilization in edge computing enabled internet of things
S Kumar, R Gupta, K Lakshmanan, V Maurya
IEEE Access 10, 57974-57989 , 2022
2022
Citations: 29
Beam alignment for mmWave using non-stationary bandits
R Gupta, K Lakshmanan, AK Sah
IEEE Communications Letters 24 (11), 2619-2622 , 2020
2020
Citations: 28
PILHNB: Popularity, interests, location used hidden Naive Bayesian-based model for link prediction in dynamic social networks
AK Singh, K Lakshmanan
Neurocomputing 461, 562-576 , 2021
2021
Citations: 25
An unsupervised software fault prediction approach using threshold derivation
R Kumar, A Chaturvedi, L Kailasam
IEEE Transactions on Reliability 71 (2), 911-932 , 2022
2022
Citations: 24
A novel cloud-assisted secure deep feature classification framework for cancer histopathology images
A Kumar, SK Singh, K Lakshmanan, S Saxena, S Shrivastava
ACM Transactions on Internet Technology (TOIT) 21 (2), 1-22 , 2021
2021
Citations: 22
Multiscale Q-learning with linear function approximation
S Bhatnagar, K Lakshmanan
Discrete Event Dynamic Systems 26 (3), 477-509 , 2016
2016
Citations: 15
Proximal policy optimization based hybrid recommender systems for large scale recommendations
V Padhye, K Lakshmanan, A Chaturvedi
Multimedia Tools and Applications 82 (13), 20079-20100 , 2023
2023
Citations: 13
A novel Q-learning algorithm with function approximation for constrained Markov decision processes
K Lakshmanan, S Bhatnagar
2012 50th Annual Allerton Conference on Communication, Control, and … , 2012
2012
Citations: 13
Comment on “Federated learning with differential privacy: Algorithms and performance analysis”
K Rajkumar, A Goswami, K Lakshmanan, R Gupta
IEEE Transactions on Information Forensics and Security 17, 3922-3924 , 2022
2022
Citations: 12
A parameter-free affinity based clustering
B Mukhoty, R Gupta, L K, M Kumar
Applied Intelligence 50 (12), 4543-4556 , 2020
2020
Citations: 9
Transition based discount factor for model free algorithms in reinforcement learning
A Sharma, R Gupta, K Lakshmanan, A Gupta
Symmetry 13 (7), 1197 , 2021
2021
Citations: 7
Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization
K Lakshmanan, S Bhatnagar
Computational Optimization and Applications 66 (3), 533-556 , 2017
2017
Citations: 5
Multi-time scale smoothed functional with nesterov’s acceleration
A Sharma, K Lakshmanan, R Gupta, A Gupta
IEEE Access 9, 113489-113499 , 2021
2021
Citations: 4
Smoothed functional and quasi-Newton algorithms for routing in multi-stage queueing network with constraints
K Lakshmanan, S Bhatnagar
International Conference on Distributed Computing and Internet Technology … , 2011
2011
Citations: 4

GRANT DETAILS

PI, AI-assisted Vulnerability Detection, Coforge (Ongoing)

PI, Algorithm with Provable Guarantees for Dynamic Social Networks, DST SERB MATRICS (Ongoing)

Co-PI, Mathematics and Astronomy, Indian Knowledge Systems Center of Excellence @ IIT (BHU) (Ongoing)