Bruno Luiggi Macchiavello Espinoza

@unb.br

Associate Professor at the Computer Science Department
Universidad de Brasilia

Bruno Luiggi Macchiavello Espinoza

RESEARCH, TEACHING, or OTHER INTERESTS

Information Systems, Computer Science, Signal Processing
45

Scopus Publications

Scopus Publications

  • Learning-Based Image Compression With Parameter-Adaptive Rate-Constrained Loss
    Nilson D. Guerin, Renam Castro da Silva, Bruno Macchiavello
    IEEE Signal Processing Letters, 2024
    In recent years, the crucial task of image compression has been addressed by end-to-end neural network methods. However, achieving fine-grained rate control in this new paradigm has presented challenges. In our previous work, we explored mismatches in rate estimation during target-rate-oriented training and proposed heuristics involving costly parameter searches as a solution. This work proposes a lightweight approach, which dynamically adapts loss parameters to mitigate rate estimation issues, ensuring precise target rate attainment. Inspired by Reinforcement Learning, our method exhibits performance comparable to preceding approaches on the Kodak dataset in terms of PSNR. Additionally, it reduces computational training costs.
  • Rate-constrained learning-based image compression
    Nilson D. Guerin, Renam Castro da Silva, Matheus C. de Oliveira, Henrique C. Jung, Luiz Gustavo R. Martins, et al.
    Signal Processing Image Communication, 2022
  • Improved two-dimensional dynamic S-EMG Signal compression with robust automatic segmentation
    Francisco A.O. Nascimento, Marcel H. Trabuco, Bruno Macchiavello, Davi B. Gusmão, Marcus V.C. Costa
    Biomedical Signal Processing and Control, 2021
  • Trust and reputation multiagent-driven model for distributed transcoding on fog-edge
    Ceur Workshop Proceedings, 2021
  • Learning-based End-to-End Video Compression Using Predictive Coding
    Matheus C. de Oliveira, Luiz G. R. Martins, Henrique Costa Jung, Nilson Donizete Guerin, Renam Castro da Silva, et al.
    Proceedings 2021 34th Sibgrapi Conference on Graphics Patterns and Images Sibgrapi 2021, 2021
    Driven by the growing demand for video applications, deep learning techniques have become alternatives for implementing end-to-end encoders to achieve applicable compression rates. Conventional video codecs exploit both spatial and temporal correlation. However, due to some restrictions (e.g. computational complexity), they are commonly limited to linear transformations and translational motion estimation. Autoencoder models open up the way for exploiting predictive end-to-end video codecs without such limitations. This paper presents an entire learning-based video codec that exploits spatial and temporal correlations. The presented codec extends the idea of P-frame prediction presented in our previous work. The architecture adopted for I-frame coding is defined by a variational autoencoder with non-parametric entropy modeling. Besides an entropy model parameterized by a hyperprior, the inter-frame encoder architecture has two other independent networks, responsible for motion estimation and residue prediction. Experimental results indicate that some improvements still have to be incorporated into our codec to overcome the all-intra coding set up regarding the traditional algorithms High Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC).
  • Multi-Mode Intra Prediction for Learning-Based Image Compression
    Henrique Costa Jung, Nilson Donizete Guerin, Raphael Soares Ramos, Bruno Macchiavello, Eduardo Peixoto, et al.
    Proceedings International Conference on Image Processing Icip, 2020
    In recent years image compression techniques based on deep learning have achieved great success and their performances are gradually reaching the methods crafted by experts, such as JPEG, WebP, and Better Portable Graphics (BPG). A technique that is fundamental for modern image and video codecs is intra prediction, which takes advantage of local redundancy to predict the pixels from previously encoded neighbors. In this paper, we use Convolutional Neural Networks (CNN) to develop a new intra-picture prediction mode. More specifically, we propose a multi-mode intra prediction approach that uses two CNN-based prediction modes and all intra modes previously implemented in the High Efficiency Video Coding (HEVC) standard. We also propose a bit allocation technique that increases the bitstream only if the reconstruction error is significantly reduced. Experimental results evince a significant and consistent performance increase compared to other approaches that use a similar backbone architecture, with 28% bitrate reduction compared to the baseline codec.
  • Joint motion and residual information latent representation for P-frame coding
    Renam Castro da Silva, Nilson Donizete Guerin, Pedro Sanches, Henrique Costa Jung, Eduardo Peixoto, et al.
    IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2020
    This paper proposes an inter-frame prediction frame encoding for the P-frame video compression challenge of the Workshop and Challenge on Learned Image Compression (CLIC). For this challenge, we use an uncompressed reference (previous) frame to compress the current frame. So, this is not a complete solution for learning-based video compression. The main goal is to represent a set of frames with an average of 0.075 bpp (bits per pixel), which is a very low bitrate. A restriction on the model size is also requested to avoid overfitting. Here we propose an autoencoder architecture that jointly represents the motion and residue information at the latent space. Three trained models were used to achieve the target bpp and a bit allocation algorithm is also proposed to optimize the quality performance of the encoded dataset.
  • A sub-aperture image selection refinement method for progressive light field transmission
    Wallace Bruno S. de Souza, Bruno Macchiavello, Eduardo Peixoto, Edson M. Hung, Gene Cheung
    2018 IEEE 20th International Workshop on Multimedia Signal Processing Mmsp 2018, 2018
    Light field cameras capture the emanated light from a scene. This type of images allows for changing point of views or focal points by processing the captured information. Recently, a Progressive Light Field Communication (PLFC) was proposed. PLFC addresses an interactive Light Field (LF) streaming framework, where a client requests a certain view or focal point and a server synthesizes and transmits each requested image as a linear combination of Sub-Aperture Images (SAI). The main idea of PLFC is that as the virtual views are transmitted, the client gradually learns information about the LF, so eventually the client may posses enough information to locally create the virtual view at the required quality, avoiding the transmission of a new image. In order to PLFC work, an optimization algorithm which selects the SAIs that are used to create a certain virtual view is requested. Here, we improve over the previous PLFC proposal by presenting a method that focuses on a refinement algorithm for SAI selection, using dynamic Quantization Parameter (QP) during encoding, using an automatic method to determine the Lagrangian multiplier during optimization and modifying how the initial required cache is created. These proposed changes in the algorithm produce significant gains. The results shows gains up to 85.8% on BD-rate compared to trivial LF transmissions, whereas they're up to 32.8% compared to previous PLFC.
  • Progressive sub-aperture image recovery for interactive light field data streaming
    Eduardo Peixoto, Bruno Macchiavello, Edson Mintsu Hung, Gene Cheung
    Proceedings International Conference on Image Processing Icip, 2018
    Due to the large size of a light field image, compressing and transmitting the entire data to a client before rendering any image for observation would incur a significant startup delay. In response, in interactive light field streaming (ILFS) a server synthesizes and transmits a new viewpoint image as a combination of sub-aperture images (SAIs) per user request. However, in so doing the client relies entirely on the server for reconstruction of every requested image. In this paper, we extend a previous proposal of progressive light field data transmission strategy, where the client can incrementally learn SAIs over time. Specifically, requested focal-point images are synthesized using carefully chosen weighted linear combinations of SAIs, so that recovery of SAIs amounts to inversion of a lower-triangular weight matrix-a matrix structure that enables SAI recovery without amplifying quantization noise due to lossy image coding. We design an objective function to encourage specific combinations of SAIs to increase rank of the lower-triangular weight matrix for fast SAI recovery. This new proposal reduces the size of the initial user's cache and the total number of transmitted images compared to our previous work. Experimental results show that our scheme can outperform ILFS by up to 70% in terms of BD-rate.
  • S-EMG Signal Compression in One-Dimensional and Two-Dimensional Approaches
    Marcel H. Trabuco, Marcus V. C. Costa, Bruno Macchiavello, Francisco Assis de O. Nascimento
    IEEE Journal of Biomedical and Health Informatics, 2018
    This paper presents algorithms designed for one-dimensional (1-D) and 2-D surface electromyographic (S-EMG) signal compression. The 1-D approach is a wavelet transform based encoder applied to isometric and dynamic S-EMG signals. An adaptive estimation of the spectral shape is used to carry out dynamic bit allocation for vector quantization of transformed coefficients. Thus, an entropy coding is applied to minimize redundancy in quantized coefficient vector and to pack the data. In the 2-D approach algorithm, the isometric or dynamic S-EMG signal is properly segmented and arranged to build a 2-D representation. The high efficient video codec is used to encode the signal, using 16-bit-depth precision, all possible coding/prediction unit sizes, and all intra-coding modes. The encoders are evaluated with objective metrics, and a real signal data bank is used. Furthermore, performance comparisons are also shown in this paper, where the proposed methods have outperformed other efficient encoders reported in the literature.
  • Progressive communication for interactive light field image data streaming
    Eduardo Peixoto, Bruno Macchiavello, Edson Mintsu Hung, Camilo Dorea, Gene Cheung
    Proceedings International Conference on Image Processing Icip, 2017
  • Touchless-to-touch fingerprint systems compatibility method
    P. Salum, D. Sandoval, A. Zaghetto, B. Macchiavello, C. Zaghetto
    Proceedings International Conference on Image Processing Icip, 2017
  • Predicting vehicle trajectories from surveillance video in a real scenario with Histogram of Oriented Gradient
    Computer Science Research Notes, 2017
  • Human action recognition in videos: A comparative evaluation of the classical and velocity adaptation space-time interest points techniques
    Computer Science Research Notes, 2017
  • Text-dependent User Verification of Handwritten Words and Signatures on Mobile Devices
    Nilson Donizete Guerin, Flavio de Barros Vidal, Bruno Macchiavello
    Computer Journal, 2016
  • Context adaptive mode sorting for fast HEVC mode decision
    S. G. Blasi, E. Peixoto, B. Macchiavello, E. M. Hung, I. Zupancic, et al.
    Proceedings International Conference on Image Processing Icip, 2015
  • Handwritten text verification on mobile devices
    Visapp 2015 10th International Conference on Computer Vision Theory and Applications Visigrapp Proceedings, 2015
  • Fast H.264/AVC to HEVC transcoding based on machine learning
    Eduardo Peixoto, Bruno Macchiavello, Ricardo. L. de Queiroz, Edson Mintsu Hung
    2014 International Telecommunications Symposium ITS 2014 Proceedings, 2014
  • A fast HEVC transcoder based on content modeling and early termination
    E. Peixoto, B. Macchiavello, E. M. Hung, R. L. de Queiroz
    2014 IEEE International Conference on Image Processing Icip 2014, 2014
  • A model and simulation framework for exploring potential impacts of land use policies: The Brazilian cerrado case
    Carolina G. Abreu, Cassio G. C. Coelho, Celia G. Ralha, Bruno Macchiavello
    Proceedings of the Annual Hawaii International Conference on System Sciences, 2014
  • Low-saliency prior for disocclusion hole filling in DIBR-synthesized images
    Bruno Macchiavello, Camilo Dorea, Edson M. Hung, Gene Cheung, Ivan Bajic
    ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2014
  • Loss-resilient coding of texture and depth for free-viewpoint video conferencing
    Bruno Macchiavello, Camilo Dorea, Edson M. Hung, Gene Cheung, Wai-Tian Tan
    IEEE Transactions on Multimedia, 2014
  • On the impact of packet-loss impairments on visual attention mechanisms
    Judith Redi, Ingrid Heynderickx, Bruno Macchiavello, Mylene Farias
    Proceedings IEEE International Symposium on Circuits and Systems, 2013
  • A multi-agent model system for land-use change simulation
    Célia G. Ralha, Carolina G. Abreu, Cássio G.C. Coelho, Alexandre Zaghetto, Bruno Macchiavello, et al.
    Environmental Modelling and Software, 2013
  • The MASE design experience
    Proceedings 20th International Congress on Modelling and Simulation Modsim 2013, 2013
  • Saliency-cognizant robust view synthesis in free viewpoint video streaming
    Bruno Macchiavello, Camilo Dorea, Edson M. Hung, Gene Cheung, Wai-tian Tan
    2013 IEEE International Conference on Image Processing Icip 2013 Proceedings, 2013
  • An H.264/AVC to HEVC video transcoder based on mode mapping
    E. Peixoto, B. Macchiavello, E. M. Hung, A. Zaghetto, T. Shanableh, et al.
    2013 IEEE International Conference on Image Processing Icip 2013 Proceedings, 2013
  • CQR codes: Colored quick-response codes
    Max E. Vizcarra Melgar, Alexandre Zaghetto, Bruno Macchiavello, Anderson C. A. Nascimento
    IEEE International Conference on Consumer Electronics Berlin Icce Berlin, 2012
  • HEVC-based scanned document compression
    Alexandre Zaghetto, Bruno Macchiavello, Ricardo L. de Queiroz
    Proceedings International Conference on Image Processing Icip, 2012
  • Reference frame selection for loss-resilient texture & depth map coding in multiview video conferencing
    Bruno Macchiavello, Camilo Dorea, Edson M. Hung, Gene Cheung, Wai-tian Tan
    Proceedings International Conference on Image Processing Icip, 2012
  • Reference frame selection for loss-resilient depth map coding in multiview video conferencing
    Bruno Macchiavello, Camilo Dorea, Edson M. Hung, Gene Cheung, Wai-Tian Tan
    Proceedings of SPIE the International Society for Optical Engineering, 2012
  • Compression of touchless multiview fingerprints
    Nelson C. Francisco, Alexandre Zaghetto, Bruno Macchiavello, Eduardo A. B. da Silva, Mamede Lima-Marques, et al.
    Bioms 2011 2011 IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications Proceedings, 2011
  • Mixed-resolution distributed video codec without motion estimation at the encoder
    B. Macchiavello, E. M. Hung, R. L. de Queiroz, D. Mukherjee
    Proceedings International Conference on Image Processing Icip, 2010
  • Mixed resolution framework for distributed multiview coding
    Diogo C. Garcia, Camilo C. Dórea, Bruno Macchiavello, Ricardo de Queiroz, Debargha Mukherjee
    Proceedings of SPIE the International Society for Optical Engineering, 2010
  • Semi-automatic algorithm for construction of the left ventricular area variation curve over a complete cardiac cycle
    Salvador A Melo, Bruno Macchiavello, Marcelino M Andrade, João LA Carvalho, Hervaldo S Carvalho, et al.
    Biomedical Engineering Online, 2010
  • On the use of motion-based frame rejection in temporal averaging denoising for segmentation of echocardiographic image sequences
    M. do Carmo dos Reis, J.L.A. Carvalho, B.L. Macchiavello, D.F. Vasconcelos, A.F. da Rocha, et al.
    Proceedings of the 31st Annual International Conference of the IEEE Engineering in Medicine and Biology Society Engineering the Future of Biomedicine Embc 2009, 2009
  • Iterative side-information generation in a mixed resolution wyner-ziv framework
    B. Macchiavello, D. Mukherjee, R.L. de Queiroz
    IEEE Transactions on Circuits and Systems for Video Technology, 2009
  • H∞ filtering for rectangular discrete-time descriptor systems
    João Y. Ishihara, Marco H. Terra, B.M. Espinoza
    Automatica, 2009
  • Side-information generation for temporally and spatially scalable Wyner-Ziv codecs
    Bruno Macchiavello, Fernanda Brandi, Eduardo Peixoto, Ricardo L. de Queiroz, Debargha Mukherjee
    Eurasip Journal on Image and Video Processing, 2009
  • Parameter estimation for an H.264-based distributed video coder
    B. Macchiavello, R. L. de Queiroz, D. Mukherjee
    Proceedings International Conference on Image Processing Icip, 2008
  • Semi-automatic detection of the left ventricular border
    Proceedings of the 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBS 08 Personalized Healthcare Through Technology, 2008
  • A statistical model for a mixed resolution Wyner-Ziv Framework
    Pcs 2007 26th Picture Coding Symposium, 2007
  • A simple reversed-complexity Wyner-Ziv video coding mode based on a spatial reduction framework
    Proceedings of SPIE the International Society for Optical Engineering, 2007
  • Motion-based side-information generation for a scalable Wyner-Ziv video coder
    B. Macchiavello, R. L. de Queiroz, D. Mukherjee
    Proceedings International Conference on Image Processing Icip, 2006
  • H∞ estimation and array algorithms for discrete-time descriptor systems
    Proceedings of the IEEE Conference on Decision and Control, 2006