Knowledge Integrity in Large Language Models: A State-of-The-Art Review Vadivel Abishethvarman, Fariza Sabrina, Paul Kwan Information Switzerland, 2025 Large Language Models (LLMs) are emerging technologies and a growing research trend in Artificial General Intelligence (AGI), which envisions a future where machines can think and learn like humans across a wide range of tasks. Information generated by LLMs is essentially the prediction of next tokens in Natural Language Processing (NLP) tasks. However, the generated content is always subject to issues of truthfulness and hallucinations. The information and knowledge integrity of LLM-generated content therefore remains subjective. Exploring recent literature on the integrity of LLMs in a systematic manner is both timely and essential. Moreover, ensuring the reliability of LLMs in real-world applications is critical. Various approaches have been explored to promote information and knowledge integrity in LLMs, including adversarial training, data augmentation, and calibration methods. However, beyond these techniques, other strategies also contribute to maintaining knowledge integrity. This paper specifically focuses on three such approaches: knowledge distillation, semantic integrity, and provenance tracking, which play essential roles in ensuring that LLMs generate accurate, consistent, and trustworthy information. Knowledge distillation enhances model efficiency by transferring knowledge from larger models to smaller ones while preserving essential learning without compromising knowledge integrity. This reduces hallucinations. Semantic integrity safeguards consistency and strengthens the robustness of generated outputs. It is concurrently checking the meaningfulness of the outputs with the context. Provenance tracking improves transparency and trustworthiness through mechanisms such as data lineage and explainability, thereby ensuring the credibility of the LLM-generated responses. This review suggests that knowledge distillation, semantic integrity, and provenance tracking can enhance the reliability of LLM outputs, with prior studies reporting reductions in hallucination rates, improvements in robustness, and gains in factual consistency.
A Collaborative Model for Integrating Teacher and GenAI into Future Education Tayab D. Memon, Paul Kwan Techtrends, 2025 The integration of Generative AI (GenAI) technology in education is set to transform traditional teaching and learning models. The age of teacher-led instructional pedagogy, where teachers are the sole providers of knowledge and students being mere receivers, is long outdated. Teachers are confronted with the choice of embracing new Artificial Intelligence (AI) technologies or risking obsolescence, potentially falling behind a new generation of AI-skilled educators. To maximise the benefits, teachers and the new generation of AI tools must coexist in a collaborative relationship, underscoring a new model of pedagogy for the new age. This paper proposes a collaborative model where GenAI and teachers are envisioned to work together to optimise educational outcomes of current and future learners. Beginning with a review of recent trends of AI integration in education, we describe the proposed model that we envision will underscore the roles of teachers and GenAI technology in future teaching and learning scenarios. Innovatively, we categorise the degree of involvement—minor, medium, major—by teachers and GenAI in developing learners' cognitive, psychomotor, and affective skills, as informed by Bloom's taxonomy. This categorisation underpins how GenAI and human collaboration can effectively enhance various learner skills. Additionally, key issues including privacy, bias, emotional impacts, and job displacement are discussed, alongside opportunities for improving education quality and accessibility. We argue that aspects of our model have started to appear in educational scenarios in countries including Australia, USA, and China. We conclude by highlighting the implications and impacts of rapid and sustained advancements of AI will have on current and future educational policies and the frontiers for educational research.
Reimagining Flipped Learning via Bloom’s Taxonomy and Student–Teacher–GenAI Interactions Paul Kwan, Rajan Kadel, Tayab D. Memon, Saad S. Hashmi Education Sciences, 2025 This paper explores how generative artificial intelligence (GenAI) technologies, such as ChatGPT 4o and other AI-based conversational models, can be applied to flipped learning pedagogy to achieve enhanced learning outcomes for students. By applying Bloom’s taxonomy to intentionally align educational objectives to the key phases of flipped learning, our study proposes a model for assigning learning activities to pre-class, in-class, and post-class contexts that can be enhanced by the integration of GenAI. In the pre-class phase, GenAI tools can facilitate personalised content delivery, enabling students to grasp fundamental concepts at their own pace. During class, the interactions between students, teacher, and GenAI encourage collaborative learning and real-time feedback. Post-class activities utilise GenAI to reinforce knowledge, provide instant feedback, and support continuous learning through summarisation and content generation. Furthermore, our model articulates the synergies between the three key actors: interactions between students and teachers, learning support provided by GenAI to students, and use of GenAI by teachers to enhance their teaching strategies. These human–AI interactions fundamentally reshape the flipped learning experience, making it more adaptive, engaging, and supportive of the development of 21st-century skills such as critical thinking, collaboration, communication, and creativity.
An Adaptive Approach in Channel Quantization for Small Cells Based on Per-Receiver Antenna Quantization Sanjeeb Shrestha, Xiaoying Kong, Paul Kwan, Xiaojing Huang IEEE Access, 2025 The widespread deployment of small cells (SCs) plays a crucial role in enhancing system capacity, coverage, and quality of service (QoS) for smart applications. However, due to the dynamic nature of user demands and the limited resources available, SCs cannot support large quantization codebooks, which are typically more suitable for macro cells (MCs) in finite rate feedback (FRF)-based multiple input single output (MISO) systems. In this paper, we propose an adaptive quantization approach for SCs that adjusts the codebook size based on the number of receiver antennas. Additionally, we address the issue of code quantization error (CQE), which arises when two distinct channels are quantized using the same code, as well as the average system error (AvgSysErr), which can increase due to elevated CQE. Our analysis shows that for SCs to achieve convergence of AvgSysErr with FRF-based MISO systems, the probability of non-unique codes in the quantization codebook must be less than <inline-formula> <tex-math notation="LaTeX">$\\frac {1}{N}$ </tex-math></inline-formula>, where N is the number of antennas at the transmitter. Similarly, the lower bound for the non-unique code probability must be less than or equal to <inline-formula> <tex-math notation="LaTeX">$\\varepsilon $ </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">$\\varepsilon $ </tex-math></inline-formula> represents the difference between the non-unique code probabilities of <inline-formula> <tex-math notation="LaTeX">$\\frac {1}{N_{1}}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$\\frac {1}{N_{2}}$ </tex-math></inline-formula>, given <inline-formula> <tex-math notation="LaTeX">$\\frac {1}{N_{1}}\\gt \\frac {1}{N_{2}}$ </tex-math></inline-formula> (where <inline-formula> <tex-math notation="LaTeX">$N_{1}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$N_{2}$ </tex-math></inline-formula> denote the number of antennas at the transmitter).
Fusion of UAV-Acquired Visible Images and Multispectral Data by Applying Machine-Learning Methods in Crop Classification Zuojun Zheng, Jianghao Yuan, Wei Yao, Paul Kwan, Hongxun Yao, et al. Agronomy, 2024 The sustainable development of agriculture is closely related to the adoption of precision agriculture techniques, and accurate crop classification is a fundamental aspect of this approach. This study explores the application of machine learning techniques to crop classification by integrating RGB images and multispectral data acquired by UAVs. The study focused on five crops: rice, soybean, red bean, wheat, and corn. To improve classification accuracy, the researchers extracted three key feature sets: band values and vegetation indices, texture features extracted from a grey-scale co-occurrence matrix, and shape features. These features were combined with five machine learning models: random forest (RF), support vector machine (SVM), k-nearest neighbour (KNN) based, classification and regression tree (CART) and artificial neural network (ANN). The results show that the Random Forest model consistently outperforms the other models, with an overall accuracy (OA) of over 97% and a significantly higher Kappa coefficient. Fusion of RGB images and multispectral data improved the accuracy by 1–4% compared to using a single data source. Our feature importance analysis showed that band values and vegetation indices had the greatest impact on classification results. This study provides a comprehensive analysis from feature extraction to model evaluation, identifying the optimal combination of features to improve crop classification and providing valuable insights for advancing precision agriculture through data fusion and machine learning techniques.
Development of a Device and Algorithm Research for Akhal-Teke Activity Level Analysis Xuan Chen, Fuzhong Li, Jinxing Li, Qijie Fan, Paul Kwan, et al. Applied Sciences Switzerland, 2024 This study demonstrated that wearable devices can distinguish between different levels of horse activity, categorized into three types based on the horse’s gaits: low activity (standing), medium activity (walking), and high activity (trotting, cantering, and galloping). Current research in activity level classification predominantly relies on deep learning techniques, known for their effectiveness but also their demand for substantial data and computational resources. This study introduces a combined acceleration threshold behavior recognition method tailored for wearable hardware devices, enabling these devices to classify the activity levels of horses directly. The approach comprises three sequential phases: first, a combined acceleration interval counting method utilizing a non-linear segmentation strategy for preliminary classification; second, a statistical analysis of the variance among these segments, coupled with multi-level threshold processing; third, a method using variance-based proximity classification for recognition. The experimental results show that the initial stage achieved an accuracy of 87.55% using interval counting, the second stage reached 90.87% with variance analysis, and the third stage achieved 91.27% through variance-based proximity classification. When all three stages are combined, the classification accuracy improves to 92.74%. Extensive testing with the Xinjiang Wild Horse Group validated the feasibility of the proposed solution and demonstrated its practical applicability in real-world scenarios.
Intensifying learner engagement and focus by a block mode flipped learning pedagogy Flipped Classrooms and Learning Perspectives Opportunities and Challenges, 2024
Analysis and Comparison of New-Born Calf Standing and Lying Time Based on Deep Learning Wenju Zhang, Yaowu Wang, Leifeng Guo, Greg Falzon, Paul Kwan, et al. Animals, 2024 Standing and lying are the fundamental behaviours of quadrupedal animals, and the ratio of their durations is a significant indicator of calf health. In this study, we proposed a computer vision method for non-invasively monitoring of calves’ behaviours. Cameras were deployed at four viewpoints to monitor six calves on six consecutive days. YOLOv8n was trained to detect standing and lying calves. Daily behavioural budget was then summarised and analysed based on automatic inference on untrained data. The results show a mean average precision of 0.995 and an average inference speed of 333 frames per second. The maximum error in the estimated daily standing and lying time for a total of 8 calf-days is less than 14 min. Calves with diarrhoea had about 2 h more daily lying time (p < 0.002), 2.65 more daily lying bouts (p < 0.049), and 4.3 min less daily lying bout duration (p = 0.5) compared to healthy calves. The proposed method can help in understanding calves’ health status based on automatically measured standing and lying time, thereby improving their welfare and management on the farm.
Using a video-based critique process to support studio pedagogies in distance education – A tool and pilot study Ascilite 2016 Conference Proceedings 33rd International Conference of Innovation Practice and Research in the Use of Educational Technologies in Tertiary Education Show Me the Learning, 2016
A high performance, agent-based simulation of old world screwworm fly lifecycle and dispersal using a graphics processing unit (GPU) platform Proceedings 20th International Congress on Modelling and Simulation Modsim 2013, 2013
Modelling the spread of livestock disease on a national scale: The case for a hybrid approach Proceedings 20th International Congress on Modelling and Simulation Modsim 2013, 2013
Parallel evolutionary computation in R Cedric Gondro, Paul Kwan Multidisciplinary Computational Intelligence Techniques Applications in Business Engineering and Medicine, 2012
Learning gradients with Gaussian processes Xinwei Jiang, Junbin Gao, Tianjiang Wang, Paul W. Kwan Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2010
L-1 LASSO modeling and its Bayesian inference Junbin Gao, Michael Antolovich, Paul W. Kwan Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2008
Twin kernel embedding Y. Guo, J. Gao, P.W. Kwan IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008
Content-based retrieval of kamon images by image smoothing and relaxation Proceedings of the IASTED International Conference on Modelling Identification and Control Mic, 2005
A New Method on Assigning Function Types to Line Segments for Function Approximation-based Image Coding IEEE Pacific RIM Conference on Communications Computers and Signal Processing Proceedings, 2003
A Method on Tracking Common Boundaries of Color Regions in Function Approximation-based Image Coding IEEE Pacific RIM Conference on Communications Computers and Signal Processing Proceedings, 2003
TAST - Trademark Application Assistant IEEE International Conference on Image Processing, 2002
Connecting image similarity retrieval with consistent labeling problem by introducing a Match-all label IEEE International Conference on Fuzzy Systems, 2001
Trademark retrieval by relaxation matching on fluency function approximated image contours IEEE Pacific RIM Conference on Communications Computers and Signal Processing Proceedings, 2001
On an image contour compression method using Compactly Supported Sampling Functions IEEE Pacific RIM Conference on Communications Computers and Signal Processing Proceedings, 2001
RECENT SCHOLAR PUBLICATIONS
Knowledge Integrity in Large Language Models: A State-of-The-Art Review V Abishethvarman, F Sabrina, P Kwan Information 16 (12), 1076 , 2025 2025 Citations: 4
On the quantitative analysis of assessment scores with implicit and explicit constraints S Shrestha, X Kong, P Kwan Studies in Educational Evaluation 87, 101509 , 2025 2025
A collaborative model for integrating teacher and genai into future education TD Memon, P Kwan TechTrends, 1-15 , 2025 2025 Citations: 30
Reimagining flipped learning via bloom’s taxonomy and student–teacher–GenAI interactions P Kwan, R Kadel, TD Memon, SS Hashmi Education Sciences 15 (4), 465 , 2025 2025 Citations: 22
An adaptive approach in channel quantization for small cells based on per-receiver antenna quantization S Shrestha, X Kong, P Kwan, X Huang IEEE Access , 2025 2025 Citations: 2
Fusion of UAV-acquired visible images and multispectral data by applying machine-learning methods in crop classification Z Zheng, J Yuan, W Yao, P Kwan, H Yao, Q Liu, L Guo Agronomy 14 (11), 2670 , 2024 2024 Citations: 34
Development of a Device and Algorithm Research for Akhal-Teke Activity Level Analysis X Chen, F Li, J Li, Q Fan, P Kwan, W Zheng, L Guo Applied Sciences 14 (13), 5424 , 2024 2024 Citations: 1
Intensifying learner engagement and focus by a block mode flipped learning pedagogy P Kwan, R Kadel, TD Memon, SS Hashmi Flipped classrooms and learning: Perspectives, opportunities and challenges … , 2024 2024 Citations: 4
Analysis and comparison of new-born calf standing and lying time based on deep learning W Zhang, Y Wang, L Guo, G Falzon, P Kwan, Z Jin, Y Li, W Wang Animals 14 (9), 1324 , 2024 2024 Citations: 9
Knowledge Integrity in Large Language Models: A State-of-The-Art Review. Information 2025, 16, 1076 V Abishethvarman, F Sabrina, P Kwan Adv. Neural Inf. Process. Syst 37, 1502-1530 , 2024 2024
Automated Livestock Vocalisation Detection in Farm Acoustic Environments J Bishop, M Welch, D Paul, P Kwan, G Falzon University of New England , 2023 2023
Confluence: A robust non-IoU alternative to non-maxima suppression in object detection AJ Shepley, G Falzon, P Kwan, L Brankovic IEEE transactions on pattern analysis and machine intelligence 45 (10 … , 2023 2023 Citations: 69
An empirical study of students’ perception of and key factors affecting overall satisfaction in an intensive block mode and flipped classroom P Kwan, TD Memon, SS Hashmi, F Rhode, R Kadel Education Sciences 12 (8), 535 , 2022 2022 Citations: 19
A deep learning model to predict student learning outcomes in LMS using CNN and LSTM AS Aljaloud, DM Uliyan, A Alkhalil, M Abd Elrhman, AFM Alogali, ... IEEE Access 10, 85255-85265 , 2022 2022 Citations: 109
Improving wheat yield prediction accuracy using LSTM-RF framework based on UAV thermal infrared and multispectral imagery Y Shen, B Mercatoris, Z Cao, P Kwan, L Guo, H Yao, Q Cheng Agriculture 12 (6), 892 , 2022 2022 Citations: 87
Biometric Identification of Cattle via Muzzle Print Patterns and Deep Learning in a Few-Shot Learning Context A Shojaeipour, F Cowley, G Falzon, D Paul University of New England , 2021 2021
Automated muzzle detection and biometric identification via few-shot deep transfer learning of mixed breed cattle A Shojaeipour, G Falzon, P Kwan, N Hadavi, FC Cowley, D Paul Agronomy 11 (11), 2365 , 2021 2021 Citations: 95
Studying learner’s perception of attaining graduate attributes in capstone project units using online flipped classroom TD Memon, M Jurin, P Kwan, T Jan, N Sidnal, N Nafi Education Sciences 11 (11), 698 , 2021 2021 Citations: 24
Promoting Usage of Deep Learning Object Detection in Ecology by Improving Performance and Accessibility-Dataset AJ Shepley, G Falzon, P Kwan University of New England , 2021 2021
Automated location invariant animal detection in camera trap images using publicly available data sources A Shepley, G Falzon, P Meek, P Kwan Ecology and Evolution 11 (9), 4494-4506 , 2021 2021 Citations: 38
MOST CITED SCHOLAR PUBLICATIONS
Classification of crops and weeds from digital images: A support vector machine approach F Ahmed, HA Al-Mamun, ASMH Bari, E Hossain, P Kwan Crop Protection 40, 98-104 , 2012 2012 Citations: 296
Automated cattle counting using Mask R-CNN in quadcopter vision system B Xu, W Wang, G Falzon, P Kwan, L Guo, G Chen, A Tait, D Schneider Computers and Electronics in Agriculture 171, 105300 , 2020 2020 Citations: 271
Genome-wide association study of body weight in Australian Merino sheep reveals an orthologous region on OAR6 to human and bovine genomic regions affecting height and weight HA Al-Mamun, P Kwan, SA Clark, MH Ferdosi, R Tellam, C Gondro Genetics Selection Evolution 47 (1), 66 , 2015 2015 Citations: 230
Designing a multi-stage multivariate empirical mode decomposition coupled with ant colony optimization and random forest model to forecast monthly solar radiation R Prasad, M Ali, P Kwan, H Khan Applied energy 236, 778-792 , 2019 2019 Citations: 219
Genome-wide linkage disequilibrium and genetic diversity in five populations of Australian domestic sheep HA Al-Mamun, S a Clark, P Kwan, C Gondro Genetics Selection Evolution 47 (1), 90 , 2015 2015 Citations: 188
Sparse kernel learning with LASSO and Bayesian inference algorithm J Gao, PW Kwan, D Shi Neural networks 23 (2), 257-264 , 2010 2010 Citations: 166
Livestock classification and counting in quadcopter aerial images using Mask R-CNN B Xu, W Wang, G Falzon, P Kwan, L Guo, Z Sun, C Li International Journal of Remote Sensing 41 (21), 8121-8142 , 2020 2020 Citations: 136
A hybrid modeling approach to simulating foot-and-mouth disease outbreaks in Australian livestock RA Bradhurst, SE Roche, IJ East, P Kwan, MG Garner Frontiers in Environmental Science 3, 17 , 2015 2015 Citations: 118
A deep learning model to predict student learning outcomes in LMS using CNN and LSTM AS Aljaloud, DM Uliyan, A Alkhalil, M Abd Elrhman, AFM Alogali, ... IEEE Access 10, 85255-85265 , 2022 2022 Citations: 109
Research trends in student response systems: A literature review A Aljaloud, N Gromik, W Billingsley, P Kwan International Journal of Learning Technology 10 (4), 313-325 , 2015 2015 Citations: 108
Automated muzzle detection and biometric identification via few-shot deep transfer learning of mixed breed cattle A Shojaeipour, G Falzon, P Kwan, N Hadavi, FC Cowley, D Paul Agronomy 11 (11), 2365 , 2021 2021 Citations: 95
Improving wheat yield prediction accuracy using LSTM-RF framework based on UAV thermal infrared and multispectral imagery Y Shen, B Mercatoris, Z Cao, P Kwan, L Guo, H Yao, Q Cheng Agriculture 12 (6), 892 , 2022 2022 Citations: 87
Fingerprint matching using a hybrid shape and orientation descriptor J Abraham, P Kwan, J Gao State of the art in Biometrics, 25-56 , 2011 2011 Citations: 79
Livestock vocalisation classification in farm soundscapes JC Bishop, G Falzon, M Trotter, P Kwan, PD Meek Computers and electronics in agriculture 162, 531-542 , 2019 2019 Citations: 72
Confluence: A robust non-IoU alternative to non-maxima suppression in object detection AJ Shepley, G Falzon, P Kwan, L Brankovic IEEE transactions on pattern analysis and machine intelligence 45 (10 … , 2023 2023 Citations: 69
Saudi undergraduate students’ perceptions of the use of smartphone clicker apps on learning performance AS Aljaloud, N Gromik, P Kwan, W Billingsley Australasian Journal of Educational Technology 35 (1) , 2019 2019 Citations: 56
Comparison of grazing behaviour of sheep on pasture with different sward surface heights using an inertial measurement unit sensor L Guo, M Welch, R Dobos, P Kwan, W Wang Computers and electronics in agriculture 150, 394-401 , 2018 2018 Citations: 54
Improving the computational efficiency of an agent-based spatiotemporal model of livestock disease spread and control RA Bradhurst, SE Roche, IJ East, P Kwan, MG Garner Environmental Modelling & Software 77, 1-12 , 2016 2016 Citations: 50
Automated location invariant animal detection in camera trap images using publicly available data sources A Shepley, G Falzon, P Meek, P Kwan Ecology and Evolution 11 (9), 4494-4506 , 2021 2021 Citations: 38
Performance analysis of support vector machine and bayesian classifier for crop and weed classification from digital images F Ahmed, AH Bari, E Hossain, HA Al-Mamun, P Kwan World Applied Sciences Journal 12 (4), 432-440 , 2011 2011 Citations: 38