George Darmiton da Cunha Cavalcanti

@cin.ufpe.br

Computer Science
Universidade Federal de Pernambuco



                 

https://researchid.co/darmiton

RESEARCH, TEACHING, or OTHER INTERESTS

Artificial Intelligence

158

Scopus Publications

5183

Scholar Citations

33

Scholar h-index

104

Scholar i10-index

Scopus Publications

  • Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
    Lucas O. Teixeira, Diego Bertolini, Luiz S. Oliveira, George D. C. Cavalcanti, and Yandre M. G. Costa

    Springer Science and Business Media LLC

  • Recent advances in applications of machine learning in reward crowdfunding success forecasting
    George D. C. Cavalcanti, Wesley Mendes-Da-Silva, Israel José dos Santos Felipe, and Leonardo A. Santos

    Springer Science and Business Media LLC

  • Ensemble of Convolutional Neural Networks for Sparse-View Cone-Beam Computed Tomography
    Carlos A. Alves Junior, Luis F. Alves Pereira, George D. C. Cavalcanti, and Tsang Ing Ren

    IEEE
    Risks related to excessive exposure of patients to ionizing radiation are a significant concern in the medical community. Several approaches based on Convolutional Neural Networks (CNNs) have been proposed to develop safer and more reliable sparse-view Computed Tomography (SVCT) systems. Most of those solutions process tomographic data within 2D slices individually. However, recent works have shown that 3D models - that exploit data correlation among adjacent slices - can outperform previous 2D models. Once the kernel size in most of those 3D models is not bigger than 5 x 5 x 5, such inter-slice exploration is restricted to a limited neighborhood, resulting in minor inter-slice analysis during the training/validation phase. To efficiently exploit data correlation among the coronal, axial, and sagittal views of the SVCT volume, we propose an ensemble of four 2D CNNs. Three of them are used to process the orthogonal SVCT volume views separately, and the fourth CNN combines the outputs from the previous three networks. Since our final architecture is highly deep, we also present a training method in stages to avoid the non-convergence of the deepest layers. We conducted experiments using head cone-beam Computed Tomography (CBCT) scans extensively used in imaged guided radiotherapy (IGTR) during brain tumor treatment. Our method presented superior results in reducing reconstruction artifacts of SVCT volumes compared to the state-of-the-art 2D and 3D models.

  • The Gated Recurrent Conditional Generative Adversarial Network (GRC-GAN): application to denoising of low-dose CT images
    Mateus Baltazar de Almeida, Luis F. Alves Pereira, Tsang Ing Ren, George D. C. Cavalcanti, and Jan Sijbers

    IEEE
    The ionizing radiation that propagates through the human body at Computed Tomography (CT) exams is known to be carcinogenic. For this reason, the development of methods for image reconstruction that operate with reduced radiation doses is essential. If we reduce the electrical current in the electrically powered X-ray tubes of CT scanners, the amount of radiation that passes through the human body during a CT exam is reduced. However, significant image noise emerges in the reconstructed CT slices if standard reconstruction methods are applied. To estimate routine-dose CT images from low-dose CT images and thus reduce noise, the Conditional Generative Adversarial Network (cGAN) was recently proposed in the literature. In this work, we introduce the Gated Recurrent Conditional Generative Adversarial Network (GRC-GAN) that is based on the usage of network gates to learn the specific regions of the input image to be updated using the cGAN denoising operation. Moreover, the GRC-GAN is executed recurrently in multiple time steps. At each time step, different parts of the input image are denoised. As a result, our GRC-GAN better focus on the denoise criterium than the regular cGAN in the LoDoPaB-CT benchmark.

  • A two-level sampling strategy for pruning methods applied to credit scoring
    Luiz Vieira e Silva Filho and George DC Cavalcanti

    IEEE
    Multiple Classifiers Systems (MCS) are based on the idea that the combination of the opinion of several experts can generate better results than when only one expert is used. Several MCS techniques have been developed; each one has its strengths and weaknesses depending on the context in which they are applied. This work presents a two-level sampling strategy for pruning methods that are applied to the credit scoring task. The first step of the proposal is to generate a pool using two well-known sampling methods, bagging and random subspace, that work complementarity in order to produce a diverse pool. After, a pruning method reduces the generated pool maintaining only the most competent classifiers. So, the proposal improves the MCS regarding the accuracy and the computational effort, since only a small percentage of the original pool is stored. The proposed architecture is evaluated in a credit scoring application, and the results showed that the proposed architecture obtained better accuracy rates than the single best approach and literature methods. These results were also obtained with ensembles whose sizes were around 20% of the original pools generated in the training phase.

  • Online local pool generation for dynamic classifier selection
    Mariana A. Souza, George D.C. Cavalcanti, Rafael M.O. Cruz, and Robert Sabourin

    Elsevier BV

  • FIRE-DES++: Enhanced online pruning of base classifiers for dynamic ensemble selection
    Rafael M.O. Cruz, Dayvid V.R. Oliveira, George D.C. Cavalcanti, and Robert Sabourin

    Elsevier BV

  • Dynamic classifier selection: Recent advances and perspectives
    Rafael M.O. Cruz, Robert Sabourin, and George D.C. Cavalcanti

    Elsevier BV

  • A study on combining dynamic selection and data preprocessing for imbalance learning
    Anandarup Roy, Rafael M.O. Cruz, Robert Sabourin, and George D.C. Cavalcanti

    Elsevier BV

  • IntensityPatches and RegionPatches for image recognition
    Tiago B.A. de Carvalho, Maria A.A. Sibaldo, Ing Ren Tsang, George D.C. Cavalcanti, Jan Sijbers, and Ing Jyh Tsang

    Elsevier BV

  • Combining sentence similarities measures to identify paraphrases
    Rafael Ferreira, George D.C. Cavalcanti, Fred Freitas, Rafael Dueire Lins, Steven J. Simske, and Marcelo Riss

    Elsevier BV

  • Prototype selection for dynamic classifier and ensemble selection
    Rafael M. O. Cruz, Robert Sabourin, and George D. C. Cavalcanti

    Springer Science and Business Media LLC

  • Online pruning of base classifiers for Dynamic Ensemble Selection
    Dayvid V.R. Oliveira, George D.C. Cavalcanti, and Robert Sabourin

    Elsevier BV

  • META-DES.Oracle: Meta-learning and feature selection for dynamic ensemble selection
    Rafael M.O. Cruz, Robert Sabourin, and George D.C. Cavalcanti

    Elsevier BV

  • Bias effect on predicting market trends with EMD
    Dennis Carnelossi Furlaneto, Luiz S. Oliveira, David Menotti, and George D.C. Cavalcanti

    Elsevier BV

  • Combining dissimilarity spaces for text categorization
    Roberto H.W. Pinheiro, George D.C. Cavalcanti, and Ing Ren Tsang

    Elsevier BV

  • Nonlinear combination method of forecasters applied to PM time series
    Paulo S.G. de Mattos Neto, George D.C. Cavalcanti, and Francisco Madeiro

    Elsevier BV

  • Dynamic ensemble selection VS K-NN: Why and when dynamic selection obtains higher classification performance?
    Rafael M. O. Cruz, Hiba H. Zakane, Robert Sabourin, and George D. C. Cavalcanti

    IEEE
    Multiple classifier systems focus on the combination of classifiers to obtain better performance than a single robust one. These systems unfold three major phases: pool generation, selection and integration. One of the most promising MCS approaches is Dynamic Selection (DS), which relies on finding the most competent classifier or ensemble of classifiers to predict each test sample. The majority of the DS techniques are based on the K-Nearest Neighbors (K-NN) definition, and the quality of the neighborhood has a huge impact on the performance of DS methods. In this paper, we perform an analysis comparing the classification results of DS techniques and the K-NN classifier under different conditions. Experiments are performed on 18 state-of-the-art DS techniques over 30 classification datasets and results show that DS methods present a significant boost in classification accuracy even though they use the same neighborhood as the K-NN. The reasons behind the outperformance of DS techniques over the K-NN classifier reside in the fact that DS techniques can deal with samples with a high degree of instance hardness (samples that are located close to the decision border) as opposed to the K-NN. In this paper, not only we explain why DS techniques achieve higher classification performance than the K-NN but also when DS should be used.

  • Analyzing different prototype selection techniques for dynamic classifier and ensemble selection
    Rafael M. O. Cruz, Robert Sabourin, and George D. C. Cavalcanti

    IEEE
    In dynamic selection (DS) techniques, only the most competent classifiers, for the classification of a specific test sample are selected to predict the sample's class labels. The more important step in DES techniques is estimating the competence of the base classifiers for the classification of each specific test sample. The classifiers' competence is usually estimated using the neighborhood of the test sample defined on the validation samples, called the region of competence. Thus, the performance of DS techniques is sensitive to the distribution of the validation set. In this paper, we evaluate six prototype selection techniques that work by editing the validation data in order to remove noise and redundant instances. Experiments conducted using several state-of-the-art DS techniques over 30 classification problems demonstrate that by using prototype selection techniques we can improve the classification accuracy of DS techniques and also significantly reduce the computational cost involved.

  • On the characterization of the Oracle for dynamic classifier selection
    Mariana A. Souza, George D. C. Cavalcanti, Rafael M. O. Cruz, and Robert Sabourin

    IEEE
    The Oracle model has been used not only for comparison between techniques but also in the design of different methods in Multiple Classifier Systems (MCS). Even though the model represents the ideal classifier selection scheme, Dynamic Classifier Selection (DCS) techniques present a large performance gap from the Oracle. This means that, for a significant number of instances, the DCS techniques are not able to select a competent classifier, despite the Oracles assurance of its presence in the pool. Given that issue, this work aims to investigate the reasons why the Oracle model may not be well suited for guiding the search for a promising pool of classifiers for DCS techniques. For this purpose, a pool generation method that guarantees an Oracle accuracy rate of 100% in the training set is proposed. This method is further used to analyse the behavior of DCS techniques when the presence of at least one competent classifier in the pool for each training sample is assured. Experiments show that integrating Oracle information in the generation phase of an MCS has little impact on the gap between the accuracy rates of DCS techniques and the Oracle. Moreover, it is also shown that, for a theoretical limit of 100%, the DCS techniques were only able to select a competent classifier for at most 85% of the instances, on average. Results suggest that the Oracle is not the best guide for generating a pool of classifiers for DCS, for the model is performed globally whilst DCS techniques work with local data only.

  • Optimizing speaker-specific filter banks for speaker verification
    Hector N. B. Pinheiro, Fernando M. P. Neto, Adriano L. I. Oliveira, Tsang Ing Ren, George D. C. Cavalcanti, and Andre G. Adami

    IEEE
    In this work, we investigate speaker-specific filter banks for text-independent speaker verification. The proposed method performs an heuristic search for the best filter-bank configuration using the Artificial Bee Colony (ABC) algorithm and a proper fitness function for the standard i-vectors/PLDA-based speaker verification system. Furthermore, filter-bank decorrelated amplitudes are used instead of the cepstral coefficients produced by Discrete Cosine Transform (DCT). In the experiments, the proposed method is compared to standard Mel and linear scales in both cases where the decorrelation is performed using DCT and high-pass filtering. The comparison is performed on the MIT Mobile Device Speaker Verification Corpus in a gender-dependent trial scheme. The proposed method outperformed the baseline systems in almost all the test sets for both genders. Performance gains of 4.6% and 26.0% are achieved for male and female speakers, respectively.

  • Speaker segmentation using i-vector in meetings domain
    Leonardo V. Neri, Hector N.B. Pinheiro, Ing Ren Tsang, George D. da C. Cavalcanti, and Andre G. Adami

    IEEE
    In this paper, we propose a speaker segmentation method for meeting audio based on i-vector. The motivation is to utilize the Total Variability (TV) framework as a feature extractor and to exploit the potential of modeling the speaker and channel variabilities for speaker segmentation in meetings. A distance-based segmentation method is designed with the cosine distance. A sliding window with variable length searches for speaker turns, through the distance between the i-vectors extracted from two segments with the same size. The experiments are conducted on the AMI Meeting Corpus, covering several conversation scenarios. For the training data of the UBM and TV matrix, 5 conversations from AMI Meeting Corpus are sampled. Other 10 conversations from AMI Meeting Corpus to compose the test data. The experiments show an improvement in the MDR and FAR curves compared with the FixSlid approach with different distance metrics, and for most of the operating points when compared with the classical BIC based WinGrow. The proposed method has on average a better computational performance, improving in 61.5% compared with the XBIC based FixSlid, and improving in 86.7% compared with the BIC based WinGrow.

  • Inline discrete tomography system: Application to agricultural product inspection
    Luis F. Alves Pereira, Eline Janssens, George D.C. Cavalcanti, Ing Ren Tsang, Mattias Van Dael, Pieter Verboven, Bart Nicolai, and Jan Sijbers

    Elsevier BV

  • A perturbative approach for enhancing the performance of time series forecasting
    Paulo S.G. de Mattos Neto, Tiago A.E. Ferreira, Aranildo R. Lima, Germano C. Vasconcelos, and George D.C. Cavalcanti

    Elsevier BV

  • A prediction classifier architecture to forecast device status on smart environments
    Bruno S. C. M. Vilar, Cezar P. Schroeder, Cristina Wada, Rayanne H. Bezerra, Leonardo L. A. Heitzmann, Rafael Simionato, and George D. C. Cavalcanti

    IEEE
    In smart environments, the extraction of relevant information in large volumes of data collected from intelligent devices is a crucial issue. The extracted information can assist in automation of user activities and on daily chores, either suggesting or even changing the state of devices based on his/her routine. In this work, we propose a prediction architecture which combines an innovative preprocessing strategy with some well known classification algorithms for the environment automation. The preprocessing enhances the datasets by including features and organizing them in structures that improve the classification results. We verify which preprocessing parameters have significant impact on prediction performance using datasets collected from a real home equipped with sensors. In simulations, the avNNet, mlp and C5.0 classifiers attained the higher accuracies using Friedman and Nemenyi statistical tests, but none of them outperformed the others in all scenarios using this architecture.

RECENT SCHOLAR PUBLICATIONS

  • Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
    LO Teixeira, D Bertolini, LS Oliveira, GDC Cavalcanti, YMG Costa
    Neural Computing and Applications, 1-18 2024

  • Music Genre Classification Using Contrastive Dissimilarity
    GH Costanzi, LO Teixeira, GZ Felipe, GDC Cavalcanti, YMG Costa
    2024 31st International Conference on Systems, Signals and Image Processing 2024

  • Recent advances in applications of machine learning in reward crowdfunding success forecasting
    GDC Cavalcanti, W Mendes-Da-Silva, IJ dos Santos Felipe, LA Santos
    Neural Computing and Applications, 1-17 2024

  • Resampling strategies for imbalanced regression: a survey and empirical analysis
    JG Avelino, GDC Cavalcanti, RMO Cruz
    Artificial Intelligence Review 57 (4), 82 2024

  • Microservices performance forecast using dynamic Multiple Predictor Systems
    WRM Santos, AR Sampaio Jr, NS Rosa, GDC Cavalcanti
    Engineering Applications of Artificial Intelligence 129, 107649 2024

  • Meta-Scaler: A Meta-Learning Framework for the Selection of Scaling Techniques
    LBV de Amorim, GDC Cavalcanti, RMO Cruz
    IEEE Transactions on Neural Networks and Learning Systems 2024

  • Subconcept perturbation-based classifier for within-class multimodal data
    GDC Cavalcanti, RJO Soares, EL Arajo
    Neural Computing and Applications 36, 2479–2491 2024

  • Fake news detection: Taxonomy and comparative study
    F Farhangian, RMO Cruz, GDC Cavalcanti
    Information Fusion, 102140 2024

  • Fault distance estimation for transmission lines with dynamic regressor selection
    LA Ensina, LES de Oliveira, RMO Cruz, GDC Cavalcanti
    Neural Computing and Applications 36, 1741–1759 2024

  • A dynamic multiple classifier system using graph neural network for high dimensional overlapped data
    MA Souza, R Sabourin, GDC Cavalcanti, RMO Cruz
    Information Fusion, 102145 2024

  • A hybrid system based on ensemble learning to model residuals for time series forecasting
    DSOS Jnior, PSG de Mattos Neto, JFL de Oliveira, GDC Cavalcanti
    Information Sciences 649, 119614 2023

  • A post-selection algorithm for improving dynamic ensemble selection methods
    PRG Cordeiro, GDC Cavalcanti, RMO Cruz
    2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2023

  • GNN-DES: A New End-to-End Dynamic Ensemble Selection Method Based on Multi-label Graph Neural Network
    M de Araujo Souza, R Sabourin, GDC Cavalcanti, RMO e Cruz
    International Workshop on Graph-Based Representations in Pattern Recognition 2023

  • Dynamic ensemble algorithm post-selection using Hardness-aware Oracle
    PRG Cordeiro, GDC Cavalcanti, RMO Cruz
    IEEE Access 2023

  • Security Relevant Methods of Android’s API Classification: A Machine Learning Empirical Evaluation
    WM Rodrigues, FN Walmsley, GDC Cavalcanti, RMO Cruz
    IEEE Transactions on Computers 2023

  • OLP++: An online local classifier for high dimensional data
    MA Souza, R Sabourin, GDC Cavalcanti, RMO Cruz
    Information Fusion 90, 120-137 2023

  • Gender Bias Propagation on Hate Speech: An Analysis at Feature-Level
    FRS Nascimento, G Cavalcanti, D Costa-Abreu
    Available at SSRN 4517546 2023

  • Exploring Automatic Hate Speech Detection on Social Media: A Focus on Content-Based Analysis
    FRS Nascimento, GDC Cavalcanti, MD Costa-Abreu
    SAGE Open 13 (2) 2023

  • Cancer Identification in Enteric Nervous System Preclinical Images Using Handcrafted and Automatic Learned Features
    GZ Felipe, LO Teixeira, RM Pereira, JN Zanoni, SRG Souza, L Nanni, ...
    Neural Processing Letters, 1-22 2023

  • The choice of scaling technique matters for classification performance
    LBV de Amorim, GDC Cavalcanti, RMO Cruz
    Applied Soft Computing 133, 109924 2023

MOST CITED SCHOLAR PUBLICATIONS

  • Dynamic classifier selection: Recent advances and perspectives
    RMO Cruz, R Sabourin, GDC Cavalcanti
    Information Fusion 41, 195-216 2018
    Citations: 447

  • Assessing sentence scoring techniques for extractive text summarization
    R Ferreira, L de Souza Cabral, RD Lins, GP e Silva, F Freitas, ...
    Expert Systems with Applications 40 (14), 5755-5764 2013
    Citations: 370

  • META-DES: A Dynamic Ensemble Selection Framework using Meta-Learning
    RMO Cruz, R Sabourin, GDC Cavalcanti, TI Ren
    Pattern Recognition 2015
    Citations: 274

  • A graph-based friend recommendation system using genetic algorithm
    NB Silva, R Tsang, GDC Cavalcanti, J Tsang
    IEEE Congress on Evolutionary Computation (CEC), 233-239 2010
    Citations: 157

  • Impact of lung segmentation on the diagnosis and explanation of COVID-19 in chest X-ray images
    LO Teixeira, RM Pereira, D Bertolini, LS Oliveira, L Nanni, ...
    Sensors 21 (21), 7116 2021
    Citations: 141

  • Text line segmentation based on morphology and histogram projection
    RP dos Santos, GS Clemente, TI Ren, GDC Cavalcanti
    International Conference on Document Analysis and Recognition (ICDAR), 651-655 2009
    Citations: 124

  • Combining Diversity Measures for Ensemble Pruning
    GDC Cavalcanti, LS Oliveira, TJM Moura, GV Carvalho
    Pattern Recognition Letters, http://dx.doi.org/10.1016/j.patrec.2016 2016
    Citations: 121

  • A study on combining dynamic selection and data preprocessing for imbalance learning
    A Roy, RMO Cruz, R Sabourin, GDC Cavalcanti
    Neurocomputing 286, 179-192 2018
    Citations: 120

  • DESlib: A Dynamic ensemble selection library in Python
    RMO Cruz, LG Hafemann, R Sabourin, GDC Cavalcanti
    Journal of Machine Learning Research 21 (8), 1-5 2020
    Citations: 118

  • Unsupervised Retinal Vessel Segmentation Using Combined Filters
    WS Oliveira, JV Teixeira, TI Ren, GDC Cavalcanti, J Sijbers
    PLOS ONE 2016
    Citations: 118

  • Semi-supervised clustering for MR brain image segmentation
    NM Portela, GDC Cavalcanti, TI Ren
    Expert Systems with Applications 41 (4), 1492-1497 2014
    Citations: 107

  • META-DES.Oracle: Meta-learning and feature selection for dynamic ensemble selection
    RMO Cruz, R Sabourin, GDC Cavalcanti
    Information Fusion 38, 84-103 2017
    Citations: 103

  • A global-ranking local feature selection method for text categorization
    RHW Pinheiro, GDC Cavalcanti, RF Correa, TI Ren
    Expert Systems with Applications 39 (17), 12851-12857 2012
    Citations: 91

  • A Proposal for Path Loss Prediction in Urban Environments using Support Vector Regression
    R Timoteo, DC Cunha, GDC Cavalcanti
    AICT 2014, The Tenth Advanced International Conference on Telecommunications 2014
    Citations: 85

  • An approach to feature selection for keystroke dynamics systems based on PSO and feature weighting
    GLF Azevedo, GDC Cavalcanti, ECB Carvalho Filho
    IEEE Congress on Evolutionary Computation (CEC), 3577-3584 2007
    Citations: 83

  • Online Pruning of Base Classifiers for Dynamic Ensemble Selection
    DVR Oliveira, GDC Cavalcanti, R Sabourin
    Pattern Recognition 2017
    Citations: 80

  • The choice of scaling technique matters for classification performance
    LBV de Amorim, GDC Cavalcanti, RMO Cruz
    Applied Soft Computing 133, 109924 2023
    Citations: 78

  • Data-driven global-ranking local feature selection methods for text categorization
    RHW Pinheiro, GDC Cavalcanti, TI Ren
    Expert Systems with Applications 42 (4), 1941–1949 2015
    Citations: 69

  • FIRE-DES++: Enhanced Online Pruning of Base Classifiers for Dynamic Ensemble Selection
    RMO Cruz, DVR Oliveira, GDC Cavalcanti, R Sabourin
    Pattern Recognition 85, 149-160 2019
    Citations: 62

  • Handwritten digit recognition using multiple feature extraction techniques and classifier ensemble
    RMO Cruz, GDC Cavalcanti, TI Ren
    17th International conference on systems, signals and image processing, 215-218 2010
    Citations: 58