George Darmiton da Cunha Cavalcanti

@cin.ufpe.br

Computer Science
Universidade Federal de Pernambuco

https://researchid.co/darmiton

RESEARCH, TEACHING, or OTHER INTERESTS

Artificial Intelligence

158

Scopus Publications

5183

Scholar Citations

Scholar h-index

104

Scholar i10-index

Scopus Publications

Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
Lucas O. Teixeira, Diego Bertolini, Luiz S. Oliveira, George D. C. Cavalcanti, and Yandre M. G. Costa
Springer Science and Business Media LLC

Recent advances in applications of machine learning in reward crowdfunding success forecasting
George D. C. Cavalcanti, Wesley Mendes-Da-Silva, Israel José dos Santos Felipe, and Leonardo A. Santos
Springer Science and Business Media LLC

Ensemble of Convolutional Neural Networks for Sparse-View Cone-Beam Computed Tomography
Carlos A. Alves Junior, Luis F. Alves Pereira, George D. C. Cavalcanti, and Tsang Ing Ren
IEEE
Risks related to excessive exposure of patients to ionizing radiation are a significant concern in the medical community. Several approaches based on Convolutional Neural Networks (CNNs) have been proposed to develop safer and more reliable sparse-view Computed Tomography (SVCT) systems. Most of those solutions process tomographic data within 2D slices individually. However, recent works have shown that 3D models - that exploit data correlation among adjacent slices - can outperform previous 2D models. Once the kernel size in most of those 3D models is not bigger than 5 x 5 x 5, such inter-slice exploration is restricted to a limited neighborhood, resulting in minor inter-slice analysis during the training/validation phase. To efficiently exploit data correlation among the coronal, axial, and sagittal views of the SVCT volume, we propose an ensemble of four 2D CNNs. Three of them are used to process the orthogonal SVCT volume views separately, and the fourth CNN combines the outputs from the previous three networks. Since our final architecture is highly deep, we also present a training method in stages to avoid the non-convergence of the deepest layers. We conducted experiments using head cone-beam Computed Tomography (CBCT) scans extensively used in imaged guided radiotherapy (IGTR) during brain tumor treatment. Our method presented superior results in reducing reconstruction artifacts of SVCT volumes compared to the state-of-the-art 2D and 3D models.

The Gated Recurrent Conditional Generative Adversarial Network (GRC-GAN): application to denoising of low-dose CT images
Mateus Baltazar de Almeida, Luis F. Alves Pereira, Tsang Ing Ren, George D. C. Cavalcanti, and Jan Sijbers
IEEE
The ionizing radiation that propagates through the human body at Computed Tomography (CT) exams is known to be carcinogenic. For this reason, the development of methods for image reconstruction that operate with reduced radiation doses is essential. If we reduce the electrical current in the electrically powered X-ray tubes of CT scanners, the amount of radiation that passes through the human body during a CT exam is reduced. However, significant image noise emerges in the reconstructed CT slices if standard reconstruction methods are applied. To estimate routine-dose CT images from low-dose CT images and thus reduce noise, the Conditional Generative Adversarial Network (cGAN) was recently proposed in the literature. In this work, we introduce the Gated Recurrent Conditional Generative Adversarial Network (GRC-GAN) that is based on the usage of network gates to learn the specific regions of the input image to be updated using the cGAN denoising operation. Moreover, the GRC-GAN is executed recurrently in multiple time steps. At each time step, different parts of the input image are denoised. As a result, our GRC-GAN better focus on the denoise criterium than the regular cGAN in the LoDoPaB-CT benchmark.

A two-level sampling strategy for pruning methods applied to credit scoring
Luiz Vieira e Silva Filho and George DC Cavalcanti
IEEE
Multiple Classifiers Systems (MCS) are based on the idea that the combination of the opinion of several experts can generate better results than when only one expert is used. Several MCS techniques have been developed; each one has its strengths and weaknesses depending on the context in which they are applied. This work presents a two-level sampling strategy for pruning methods that are applied to the credit scoring task. The first step of the proposal is to generate a pool using two well-known sampling methods, bagging and random subspace, that work complementarity in order to produce a diverse pool. After, a pruning method reduces the generated pool maintaining only the most competent classifiers. So, the proposal improves the MCS regarding the accuracy and the computational effort, since only a small percentage of the original pool is stored. The proposed architecture is evaluated in a credit scoring application, and the results showed that the proposed architecture obtained better accuracy rates than the single best approach and literature methods. These results were also obtained with ensembles whose sizes were around 20% of the original pools generated in the training phase.

Online local pool generation for dynamic classifier selection
Mariana A. Souza, George D.C. Cavalcanti, Rafael M.O. Cruz, and Robert Sabourin
Elsevier BV

FIRE-DES++: Enhanced online pruning of base classifiers for dynamic ensemble selection
Rafael M.O. Cruz, Dayvid V.R. Oliveira, George D.C. Cavalcanti, and Robert Sabourin
Elsevier BV

Dynamic classifier selection: Recent advances and perspectives
Rafael M.O. Cruz, Robert Sabourin, and George D.C. Cavalcanti
Elsevier BV

A study on combining dynamic selection and data preprocessing for imbalance learning
Anandarup Roy, Rafael M.O. Cruz, Robert Sabourin, and George D.C. Cavalcanti
Elsevier BV

IntensityPatches and RegionPatches for image recognition
Tiago B.A. de Carvalho, Maria A.A. Sibaldo, Ing Ren Tsang, George D.C. Cavalcanti, Jan Sijbers, and Ing Jyh Tsang
Elsevier BV

Combining sentence similarities measures to identify paraphrases
Rafael Ferreira, George D.C. Cavalcanti, Fred Freitas, Rafael Dueire Lins, Steven J. Simske, and Marcelo Riss
Elsevier BV

Prototype selection for dynamic classifier and ensemble selection
Rafael M. O. Cruz, Robert Sabourin, and George D. C. Cavalcanti
Springer Science and Business Media LLC

Online pruning of base classifiers for Dynamic Ensemble Selection
Dayvid V.R. Oliveira, George D.C. Cavalcanti, and Robert Sabourin
Elsevier BV

META-DES.Oracle: Meta-learning and feature selection for dynamic ensemble selection
Rafael M.O. Cruz, Robert Sabourin, and George D.C. Cavalcanti
Elsevier BV

Bias effect on predicting market trends with EMD
Dennis Carnelossi Furlaneto, Luiz S. Oliveira, David Menotti, and George D.C. Cavalcanti
Elsevier BV

Combining dissimilarity spaces for text categorization
Roberto H.W. Pinheiro, George D.C. Cavalcanti, and Ing Ren Tsang
Elsevier BV

Nonlinear combination method of forecasters applied to PM time series
Paulo S.G. de Mattos Neto, George D.C. Cavalcanti, and Francisco Madeiro
Elsevier BV

Dynamic ensemble selection VS K-NN: Why and when dynamic selection obtains higher classification performance?
Rafael M. O. Cruz, Hiba H. Zakane, Robert Sabourin, and George D. C. Cavalcanti
IEEE
Multiple classifier systems focus on the combination of classifiers to obtain better performance than a single robust one. These systems unfold three major phases: pool generation, selection and integration. One of the most promising MCS approaches is Dynamic Selection (DS), which relies on finding the most competent classifier or ensemble of classifiers to predict each test sample. The majority of the DS techniques are based on the K-Nearest Neighbors (K-NN) definition, and the quality of the neighborhood has a huge impact on the performance of DS methods. In this paper, we perform an analysis comparing the classification results of DS techniques and the K-NN classifier under different conditions. Experiments are performed on 18 state-of-the-art DS techniques over 30 classification datasets and results show that DS methods present a significant boost in classification accuracy even though they use the same neighborhood as the K-NN. The reasons behind the outperformance of DS techniques over the K-NN classifier reside in the fact that DS techniques can deal with samples with a high degree of instance hardness (samples that are located close to the decision border) as opposed to the K-NN. In this paper, not only we explain why DS techniques achieve higher classification performance than the K-NN but also when DS should be used.

Analyzing different prototype selection techniques for dynamic classifier and ensemble selection
Rafael M. O. Cruz, Robert Sabourin, and George D. C. Cavalcanti
IEEE
In dynamic selection (DS) techniques, only the most competent classifiers, for the classification of a specific test sample are selected to predict the sample's class labels. The more important step in DES techniques is estimating the competence of the base classifiers for the classification of each specific test sample. The classifiers' competence is usually estimated using the neighborhood of the test sample defined on the validation samples, called the region of competence. Thus, the performance of DS techniques is sensitive to the distribution of the validation set. In this paper, we evaluate six prototype selection techniques that work by editing the validation data in order to remove noise and redundant instances. Experiments conducted using several state-of-the-art DS techniques over 30 classification problems demonstrate that by using prototype selection techniques we can improve the classification accuracy of DS techniques and also significantly reduce the computational cost involved.

On the characterization of the Oracle for dynamic classifier selection
Mariana A. Souza, George D. C. Cavalcanti, Rafael M. O. Cruz, and Robert Sabourin
IEEE
The Oracle model has been used not only for comparison between techniques but also in the design of different methods in Multiple Classifier Systems (MCS). Even though the model represents the ideal classifier selection scheme, Dynamic Classifier Selection (DCS) techniques present a large performance gap from the Oracle. This means that, for a significant number of instances, the DCS techniques are not able to select a competent classifier, despite the Oracles assurance of its presence in the pool. Given that issue, this work aims to investigate the reasons why the Oracle model may not be well suited for guiding the search for a promising pool of classifiers for DCS techniques. For this purpose, a pool generation method that guarantees an Oracle accuracy rate of 100% in the training set is proposed. This method is further used to analyse the behavior of DCS techniques when the presence of at least one competent classifier in the pool for each training sample is assured. Experiments show that integrating Oracle information in the generation phase of an MCS has little impact on the gap between the accuracy rates of DCS techniques and the Oracle. Moreover, it is also shown that, for a theoretical limit of 100%, the DCS techniques were only able to select a competent classifier for at most 85% of the instances, on average. Results suggest that the Oracle is not the best guide for generating a pool of classifiers for DCS, for the model is performed globally whilst DCS techniques work with local data only.

Optimizing speaker-specific filter banks for speaker verification
Hector N. B. Pinheiro, Fernando M. P. Neto, Adriano L. I. Oliveira, Tsang Ing Ren, George D. C. Cavalcanti, and Andre G. Adami
IEEE
In this work, we investigate speaker-specific filter banks for text-independent speaker verification. The proposed method performs an heuristic search for the best filter-bank configuration using the Artificial Bee Colony (ABC) algorithm and a proper fitness function for the standard i-vectors/PLDA-based speaker verification system. Furthermore, filter-bank decorrelated amplitudes are used instead of the cepstral coefficients produced by Discrete Cosine Transform (DCT). In the experiments, the proposed method is compared to standard Mel and linear scales in both cases where the decorrelation is performed using DCT and high-pass filtering. The comparison is performed on the MIT Mobile Device Speaker Verification Corpus in a gender-dependent trial scheme. The proposed method outperformed the baseline systems in almost all the test sets for both genders. Performance gains of 4.6% and 26.0% are achieved for male and female speakers, respectively.

Speaker segmentation using i-vector in meetings domain
Leonardo V. Neri, Hector N.B. Pinheiro, Ing Ren Tsang, George D. da C. Cavalcanti, and Andre G. Adami
IEEE
In this paper, we propose a speaker segmentation method for meeting audio based on i-vector. The motivation is to utilize the Total Variability (TV) framework as a feature extractor and to exploit the potential of modeling the speaker and channel variabilities for speaker segmentation in meetings. A distance-based segmentation method is designed with the cosine distance. A sliding window with variable length searches for speaker turns, through the distance between the i-vectors extracted from two segments with the same size. The experiments are conducted on the AMI Meeting Corpus, covering several conversation scenarios. For the training data of the UBM and TV matrix, 5 conversations from AMI Meeting Corpus are sampled. Other 10 conversations from AMI Meeting Corpus to compose the test data. The experiments show an improvement in the MDR and FAR curves compared with the FixSlid approach with different distance metrics, and for most of the operating points when compared with the classical BIC based WinGrow. The proposed method has on average a better computational performance, improving in 61.5% compared with the XBIC based FixSlid, and improving in 86.7% compared with the BIC based WinGrow.

Inline discrete tomography system: Application to agricultural product inspection
Luis F. Alves Pereira, Eline Janssens, George D.C. Cavalcanti, Ing Ren Tsang, Mattias Van Dael, Pieter Verboven, Bart Nicolai, and Jan Sijbers
Elsevier BV

A perturbative approach for enhancing the performance of time series forecasting
Paulo S.G. de Mattos Neto, Tiago A.E. Ferreira, Aranildo R. Lima, Germano C. Vasconcelos, and George D.C. Cavalcanti
Elsevier BV

A prediction classifier architecture to forecast device status on smart environments
Bruno S. C. M. Vilar, Cezar P. Schroeder, Cristina Wada, Rayanne H. Bezerra, Leonardo L. A. Heitzmann, Rafael Simionato, and George D. C. Cavalcanti
IEEE
In smart environments, the extraction of relevant information in large volumes of data collected from intelligent devices is a crucial issue. The extracted information can assist in automation of user activities and on daily chores, either suggesting or even changing the state of devices based on his/her routine. In this work, we propose a prediction architecture which combines an innovative preprocessing strategy with some well known classification algorithms for the environment automation. The preprocessing enhances the datasets by including features and organizing them in structures that improve the classification results. We verify which preprocessing parameters have significant impact on prediction performance using datasets collected from a real home equipped with sensors. In simulations, the avNNet, mlp and C5.0 classifiers attained the higher accuracies using Friedman and Nemenyi statistical tests, but none of them outperformed the others in all scenarios using this architecture.

RECENT SCHOLAR PUBLICATIONS

Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
LO Teixeira, D Bertolini, LS Oliveira, GDC Cavalcanti, YMG Costa
Neural Computing and Applications, 1-18 2024

Music Genre Classification Using Contrastive Dissimilarity
GH Costanzi, LO Teixeira, GZ Felipe, GDC Cavalcanti, YMG Costa
2024 31st International Conference on Systems, Signals and Image Processing 2024

Recent advances in applications of machine learning in reward crowdfunding success forecasting
GDC Cavalcanti, W Mendes-Da-Silva, IJ dos Santos Felipe, LA Santos
Neural Computing and Applications, 1-17 2024

Resampling strategies for imbalanced regression: a survey and empirical analysis
JG Avelino, GDC Cavalcanti, RMO Cruz
Artificial Intelligence Review 57 (4), 82 2024

Microservices performance forecast using dynamic Multiple Predictor Systems
WRM Santos, AR Sampaio Jr, NS Rosa, GDC Cavalcanti
Engineering Applications of Artificial Intelligence 129, 107649 2024

Meta-Scaler: A Meta-Learning Framework for the Selection of Scaling Techniques
LBV de Amorim, GDC Cavalcanti, RMO Cruz
IEEE Transactions on Neural Networks and Learning Systems 2024

Subconcept perturbation-based classifier for within-class multimodal data
GDC Cavalcanti, RJO Soares, EL Arajo
Neural Computing and Applications 36, 2479–2491 2024

Fake news detection: Taxonomy and comparative study
F Farhangian, RMO Cruz, GDC Cavalcanti
Information Fusion, 102140 2024

Fault distance estimation for transmission lines with dynamic regressor selection
LA Ensina, LES de Oliveira, RMO Cruz, GDC Cavalcanti
Neural Computing and Applications 36, 1741–1759 2024

A dynamic multiple classifier system using graph neural network for high dimensional overlapped data
MA Souza, R Sabourin, GDC Cavalcanti, RMO Cruz
Information Fusion, 102145 2024

A hybrid system based on ensemble learning to model residuals for time series forecasting
DSOS Jnior, PSG de Mattos Neto, JFL de Oliveira, GDC Cavalcanti
Information Sciences 649, 119614 2023

A post-selection algorithm for improving dynamic ensemble selection methods
PRG Cordeiro, GDC Cavalcanti, RMO Cruz
2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2023

GNN-DES: A New End-to-End Dynamic Ensemble Selection Method Based on Multi-label Graph Neural Network
M de Araujo Souza, R Sabourin, GDC Cavalcanti, RMO e Cruz
International Workshop on Graph-Based Representations in Pattern Recognition 2023

Dynamic ensemble algorithm post-selection using Hardness-aware Oracle
PRG Cordeiro, GDC Cavalcanti, RMO Cruz
IEEE Access 2023

Security Relevant Methods of Android’s API Classification: A Machine Learning Empirical Evaluation
WM Rodrigues, FN Walmsley, GDC Cavalcanti, RMO Cruz
IEEE Transactions on Computers 2023

OLP++: An online local classifier for high dimensional data
MA Souza, R Sabourin, GDC Cavalcanti, RMO Cruz
Information Fusion 90, 120-137 2023

Gender Bias Propagation on Hate Speech: An Analysis at Feature-Level
FRS Nascimento, G Cavalcanti, D Costa-Abreu
Available at SSRN 4517546 2023

Exploring Automatic Hate Speech Detection on Social Media: A Focus on Content-Based Analysis
FRS Nascimento, GDC Cavalcanti, MD Costa-Abreu
SAGE Open 13 (2) 2023

Cancer Identification in Enteric Nervous System Preclinical Images Using Handcrafted and Automatic Learned Features
GZ Felipe, LO Teixeira, RM Pereira, JN Zanoni, SRG Souza, L Nanni, ...
Neural Processing Letters, 1-22 2023

The choice of scaling technique matters for classification performance
LBV de Amorim, GDC Cavalcanti, RMO Cruz
Applied Soft Computing 133, 109924 2023

MOST CITED SCHOLAR PUBLICATIONS

Dynamic classifier selection: Recent advances and perspectives
RMO Cruz, R Sabourin, GDC Cavalcanti
Information Fusion 41, 195-216 2018
Citations: 447

Assessing sentence scoring techniques for extractive text summarization
R Ferreira, L de Souza Cabral, RD Lins, GP e Silva, F Freitas, ...
Expert Systems with Applications 40 (14), 5755-5764 2013
Citations: 370

META-DES: A Dynamic Ensemble Selection Framework using Meta-Learning
RMO Cruz, R Sabourin, GDC Cavalcanti, TI Ren
Pattern Recognition 2015
Citations: 274

A graph-based friend recommendation system using genetic algorithm
NB Silva, R Tsang, GDC Cavalcanti, J Tsang
IEEE Congress on Evolutionary Computation (CEC), 233-239 2010
Citations: 157

Impact of lung segmentation on the diagnosis and explanation of COVID-19 in chest X-ray images
LO Teixeira, RM Pereira, D Bertolini, LS Oliveira, L Nanni, ...
Sensors 21 (21), 7116 2021
Citations: 141

Text line segmentation based on morphology and histogram projection
RP dos Santos, GS Clemente, TI Ren, GDC Cavalcanti
International Conference on Document Analysis and Recognition (ICDAR), 651-655 2009
Citations: 124

Combining Diversity Measures for Ensemble Pruning
GDC Cavalcanti, LS Oliveira, TJM Moura, GV Carvalho
Pattern Recognition Letters, http://dx.doi.org/10.1016/j.patrec.2016 2016
Citations: 121

A study on combining dynamic selection and data preprocessing for imbalance learning
A Roy, RMO Cruz, R Sabourin, GDC Cavalcanti
Neurocomputing 286, 179-192 2018
Citations: 120

DESlib: A Dynamic ensemble selection library in Python
RMO Cruz, LG Hafemann, R Sabourin, GDC Cavalcanti
Journal of Machine Learning Research 21 (8), 1-5 2020
Citations: 118

Unsupervised Retinal Vessel Segmentation Using Combined Filters
WS Oliveira, JV Teixeira, TI Ren, GDC Cavalcanti, J Sijbers
PLOS ONE 2016
Citations: 118

Semi-supervised clustering for MR brain image segmentation
NM Portela, GDC Cavalcanti, TI Ren
Expert Systems with Applications 41 (4), 1492-1497 2014
Citations: 107

META-DES.Oracle: Meta-learning and feature selection for dynamic ensemble selection
RMO Cruz, R Sabourin, GDC Cavalcanti
Information Fusion 38, 84-103 2017
Citations: 103

A global-ranking local feature selection method for text categorization
RHW Pinheiro, GDC Cavalcanti, RF Correa, TI Ren
Expert Systems with Applications 39 (17), 12851-12857 2012
Citations: 91

A Proposal for Path Loss Prediction in Urban Environments using Support Vector Regression
R Timoteo, DC Cunha, GDC Cavalcanti
AICT 2014, The Tenth Advanced International Conference on Telecommunications 2014
Citations: 85

An approach to feature selection for keystroke dynamics systems based on PSO and feature weighting
GLF Azevedo, GDC Cavalcanti, ECB Carvalho Filho
IEEE Congress on Evolutionary Computation (CEC), 3577-3584 2007
Citations: 83

Online Pruning of Base Classifiers for Dynamic Ensemble Selection
DVR Oliveira, GDC Cavalcanti, R Sabourin
Pattern Recognition 2017
Citations: 80

The choice of scaling technique matters for classification performance
LBV de Amorim, GDC Cavalcanti, RMO Cruz
Applied Soft Computing 133, 109924 2023
Citations: 78

Data-driven global-ranking local feature selection methods for text categorization
RHW Pinheiro, GDC Cavalcanti, TI Ren
Expert Systems with Applications 42 (4), 1941–1949 2015
Citations: 69

FIRE-DES++: Enhanced Online Pruning of Base Classifiers for Dynamic Ensemble Selection
RMO Cruz, DVR Oliveira, GDC Cavalcanti, R Sabourin
Pattern Recognition 85, 149-160 2019
Citations: 62

Handwritten digit recognition using multiple feature extraction techniques and classifier ensemble
RMO Cruz, GDC Cavalcanti, TI Ren
17th International conference on systems, signals and image processing, 215-218 2010
Citations: 58