Date of Award
Spring 4-14-2026
Document Type
Dissertation
Publication Status
Version of Record
Submission Date
April 2026
Department
Computer and Electrical Engineering and Computer Science
College Granting Degree
College of Engineering and Computer Science
Department Granting Degree
Electrical Engineering and Computer Science
Degree Name
Doctor of Philosophy (PhD)
Thesis/Dissertation Advisor [Chair]
Hanqi Zhuang
Abstract
The convergence of artificial intelligence and healthcare represents one of the most transformative developments in modern medicine, with deep learning technologies emerging as powerful tools for addressing complex diagnostic challenges. This dissertation develops and validates machine learning frameworks that address critical challenges in medical diagnosis through innovative approaches to data augmentation, feature learning, and classification, focusing on two fundamental problems: Diabetic Retinopathy (DR) severity classification using multi-model convolutional neural networks (CNNs), and breast cancer stage identification using microRNA (miRNA) gene expression biomarkers. For diabetic retinopathy classification, this work proposes an ensemble deep learning framework that integrates Diffusion-based data augmentation for synthetic data generation with contrastive learning and multi-model CNN architectures. The DDPM-based augmentation addresses the scarcity of annotated fundus images by learning probabilistic latent representations that enable synthesis of realistic samples retaining critical pathological features. Contrastive learning optimizes representation learning by maximizing similarity between augmented views of the same image while minimizing similarity with different images, enhancing discriminative capacity for subtle inter-class variations. The multi-model ensemble leverages complementary strengths of different network architectures through decision-level fusion, providing improved generalization and reduced variance compared to single-model systems. For breast cancer stage classification, this dissertation implements and compares feature selection algorithms including Neighborhood Component Analysis (NCA) and Minimum Redundancy Maximum Relevance (MRMR) to identify optimal miRNA biomarker sets from The Cancer Genome Atlas (TCGA) dataset. NCA preserves local neighborhood structure while MRMR explicitly minimizes redundancy between selected features while maximizing relevance to classification targets. The identified biomarkers are integrated with machine learning classifiers to achieve superior cancer staging accuracy, demonstrating how advanced feature selection methods directly translate to improved clinical prediction performance. Experimental validation demonstrates that the proposed frameworks achieve clinically relevant accuracy levels even with limited training data. The DR classification system achieves competitive performance on benchmark datasets such as APTOS 2019, EyePACS, and Messidor-2, while the miRNA-based breast cancer classifier effectively distinguishes between cancer stages using optimized biomarker panels. Comparative analyses against existing methods and clinical diagnostic techniques quantify the performance gains of both proposed models. These results suggest potential for enabling automated diabetic retinopathy screening in resource-constrained environments and improving molecular-based breast cancer staging through systematic biomarker identification.
Recommended Citation
Abidalkareem, Ali, "PREDICTIVE ANALYTICS IN ONCOLOGY AND OPHTHALMOLOGY: MACHINE LEARNING APPLICATIONS FOR DIABETIC RETINOPATHY AND BREAST CANCER" (2026). Electronic Theses and Dissertations. 240.
https://digitalcommons.fau.edu/etd_general/240