NAND KUMAR YADAV

@iiita.ac.in

Research Scholar
Indian Institute Of Information Technology Allahabad

RESEARCH INTERESTS

Deep Learning Computer Vision Image and Vision Processing
7

Scopus Publications

Scopus Publications

  • TranscoderGAN: Transformer-Inception Based Encoder-Decoder Generative Adversarial Network for Thermal-Visible Face Transformation
    Nand Kumar Yadav, Satish Kumar Singh, Shiv Ram Dubey
    Communications in Computer and Information Science, 2026
  • SCL-GAN: Spatially-Correlative Lightweight GAN for Efficient and High-Fidelity Thermal-Visible Face Synthesis
    Nand Kumar Yadav, Rayeesa Mehmood, Rodrigue Rizk, KC Santosh
    Proceedings International Conference on Image Processing Icip, 2025
    This work introduces SCL-GAN (Spatially-Correlative Lightweight GAN), a novel architecture for facial image reconstruction using thermal face images, designed for efficient execution on edge devices such as the NVIDIA Jetson board. The proposed architecture leverages spatial feature correlations across thermal-visible modalities while maintaining a low parameter count and FLOPs. Experimental results show that SCL-GAN achieves a 68.57% reduction in computational cost (GMac) and a 71.71% reduction in trainable parameters, compared to baseline models. Moreover, we observe consistent improvements in image quality metrics, including a 5.05% increase in SSIM, 4.49% reduction in VGG-FaceLoss, and a 27.83% reduction in FID on the WHU-IIP dataset. On the CVBL-CHILD dataset, SCL-GAN demonstrates an 11.70% SSIM improvement, 18.21% VGG-FaceLoss reduction, and a 47.88% drop in FID. The code is available at: https://github.com/GANGREEK/SCL-GAN.git.
  • ISA-GAN: inception-based self-attentive encoder–decoder network for face synthesis using delineated facial images
    Nand Kumar Yadav, Satish Kumar Singh, Shiv Ram Dubey
    Visual Computer, 2024
  • DCT-SwinGAN: Leveraging DCT and Swin Transformer for Face Synthesis from Sketch and Thermal Domains
    Haresh Kumar Kotadiya, Satish Kumar Singh, Shiv Ram Dubey, Nand Kumar Yadav
    Communications in Computer and Information Science, 2024
  • TVA-GAN: attention guided generative adversarial network for thermal to visible image transformations
    Nand Kumar Yadav, Satish Kumar Singh, Shiv Ram Dubey
    Neural Computing and Applications, 2023
  • CSA-GAN: Cyclic synthesized attention guided generative adversarial network for face synthesis
    Nand Kumar Yadav, Satish Kumar Singh, Shiv Ram Dubey
    Applied Intelligence, 2022
  • MobileAR-GAN: MobileNet-Based Efficient Attentive Recurrent Generative Adversarial Network for Infrared-to-Visual Transformations
    Nand Kumar Yadav, Satish Kumar Singh, Shiv Ram Dubey
    IEEE Transactions on Instrumentation and Measurement, 2022
    Deep learning has recently shown outstanding performance for different applications, including image-to-image translation by generative adversarial networks (GANs). However, GAN models are very complex as build with multiple deep networks and require huge computational resources for the training and inference. Hence, the real-time deployment of GAN models is not feasible at present. In this article, we propose MobileNet-based efficient attentive recurrent GAN (MobileAR-GAN) for resource-constrained infrared-to-visual translation. The proposed model utilizes the lightweight MobileNet and enhances its capacity using the attention and recurrent modules, leading to an efficient yet effective model. We consider the infrared to visible image translation task to validate the efficiency and performance of the proposed model. The proposed MobileAR-GAN outperforms most of the existing GAN models in terms of both the efficiency and the quality of the generated images. We also test the MobileAR-GAN model over the resource-limited Jetson TX2 board with very compelling results. The proposed model shows promising results over state-of-the-art methods. Compared to lightweight models, such as Pix2pix and GAN-Compression methods, an average improvement gain of 19.19% and 17.05% is observed by the proposed model in terms of SSIM metric. It is observed that the proposed model can be deployed on edge devices to transform the images taken at night time using an infrared camera to the corresponding visible images with satisfactory performance.