Skip to main content

Showing 1–13 of 13 results for author: Qazi, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2601.17880  [pdf, ps, other

    cs.CV

    Quran-MD: A Fine-Grained Multilingual Multimodal Dataset of the Quran

    Authors: Muhammad Umar Salman, Mohammad Areeb Qazi, Mohammed Talha Alam

    Abstract: We present Quran MD, a comprehensive multimodal dataset of the Quran that integrates textual, linguistic, and audio dimensions at the verse and word levels. For each verse (ayah), the dataset provides its original Arabic text, English translation, and phonetic transliteration. To capture the rich oral tradition of Quranic recitation, we include verse-level audio from 32 distinct reciters, reflecti… ▽ More

    Submitted 25 January, 2026; originally announced January 2026.

    Comments: 6 pages, 2 tables and 2 figures

  2. arXiv:2511.16333  [pdf, ps, other

    cs.LG

    Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning

    Authors: Mohammad Areeb Qazi, Maryam Nadeem, Mohammad Yaqub

    Abstract: Healthcare requires AI that is predictive, reliable, and data-efficient. However, recent generative models lack physical foundation and temporal reasoning required for clinical decision support. As scaling language models show diminishing returns for grounded clinical reasoning, world models are gaining traction because they learn multimodal, temporally coherent, and action-conditioned representat… ▽ More

    Submitted 20 November, 2025; originally announced November 2025.

    Comments: 2 Figures, 1 Table

  3. arXiv:2511.11212  [pdf, ps, other

    cs.CV

    MAFM^3: Modular Adaptation of Foundation Models for Multi-Modal Medical AI

    Authors: Mohammad Areeb Qazi, Munachiso S Nwadike, Ibrahim Almakky, Mohammad Yaqub, Numan Saeed

    Abstract: Foundational models are trained on extensive datasets to capture the general trends of a domain. However, in medical imaging, the scarcity of data makes pre-training for every domain, modality, or task challenging. Instead of building separate models, we propose MAFM^3 (Modular Adaptation of Foundation Models for Multi-Modal Medical AI), a framework that enables a single foundation model to expand… ▽ More

    Submitted 14 November, 2025; originally announced November 2025.

    Comments: 2 figures, 3 tables

  4. arXiv:2508.14024  [pdf, ps, other

    eess.IV cs.CV

    UNICON: UNIfied CONtinual Learning for Medical Foundational Models

    Authors: Mohammad Areeb Qazi, Munachiso S Nwadike, Ibrahim Almakky, Mohammad Yaqub, Numan Saeed

    Abstract: Foundational models are trained on extensive datasets to capture the general trends of a domain. However, in medical imaging, the scarcity of data makes pre-training for every domain, modality, or task challenging. Continual learning offers a solution by fine-tuning a model sequentially on different domains or tasks, enabling it to integrate new knowledge without requiring large datasets for each… ▽ More

    Submitted 19 August, 2025; originally announced August 2025.

    Comments: 10 pages, 1 figure

  5. arXiv:2507.12145  [pdf, ps, other

    cs.LG cs.AI cs.CV

    PRISM: Distributed Inference for Foundation Models at Edge

    Authors: Muhammad Azlan Qazi, Alexandros Iosifidis, Qi Zhang

    Abstract: Foundation models (FMs) have achieved remarkable success across a wide range of applications, from image classification to natural langurage processing, but pose significant challenges for deployment at edge. This has sparked growing interest in developing practical and efficient strategies for bringing foundation models to edge environments. In this work, we propose PRISM, a communication-efficie… ▽ More

    Submitted 16 July, 2025; originally announced July 2025.

  6. arXiv:2409.19806  [pdf, other

    cs.SD cs.AI eess.AS

    PALM: Few-Shot Prompt Learning for Audio Language Models

    Authors: Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki

    Abstract: Audio-Language Models (ALMs) have recently achieved remarkable success in zero-shot audio recognition tasks, which match features of audio waveforms with class-specific text prompt features, inspired by advancements in Vision-Language Models (VLMs). Given the sensitivity of zero-shot performance to the choice of hand-crafted text prompts, many prompt learning techniques have been developed for VLM… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024 (Main)

  7. arXiv:2409.19436  [pdf, other

    cs.CV

    Introducing SDICE: An Index for Assessing Diversity of Synthetic Medical Datasets

    Authors: Mohammed Talha Alam, Raza Imam, Mohammad Areeb Qazi, Asim Ukaye, Karthik Nandakumar

    Abstract: Advancements in generative modeling are pushing the state-of-the-art in synthetic medical image generation. These synthetic images can serve as an effective data augmentation method to aid the development of more accurate machine learning models for medical image analysis. While the fidelity of these synthetic images has progressively increased, the diversity of these images is an understudied phe… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: Accepted at BMVC 2024 - PFATCV

  8. arXiv:2405.13482  [pdf, other

    cs.CV

    Continual Learning in Medical Imaging: A Survey and Practical Analysis

    Authors: Mohammad Areeb Qazi, Anees Ur Rehman Hashmi, Santosh Sanjeev, Ibrahim Almakky, Numan Saeed, Camila Gonzalez, Mohammad Yaqub

    Abstract: Deep Learning has shown great success in reshaping medical imaging, yet it faces numerous challenges hindering widespread application. Issues like catastrophic forgetting and distribution shifts in the continuously evolving data stream increase the gap between research and applications. Continual Learning offers promise in addressing these hurdles by enabling the sequential acquisition of new know… ▽ More

    Submitted 1 October, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

  9. arXiv:2404.14099  [pdf, other

    cs.CV

    DynaMMo: Dynamic Model Merging for Efficient Class Incremental Learning for Medical Images

    Authors: Mohammad Areeb Qazi, Ibrahim Almakky, Anees Ur Rehman Hashmi, Santosh Sanjeev, Mohammad Yaqub

    Abstract: Continual learning, the ability to acquire knowledge from new data while retaining previously learned information, is a fundamental challenge in machine learning. Various approaches, including memory replay, knowledge distillation, model regularization, and dynamic network expansion, have been proposed to address this issue. Thus far, dynamic network expansion methods have achieved state-of-the-ar… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  10. arXiv:2403.13341  [pdf, other

    cs.CV cs.AI

    FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis

    Authors: Santosh Sanjeev, Nuren Zhaksylyk, Ibrahim Almakky, Anees Ur Rehman Hashmi, Mohammad Areeb Qazi, Mohammad Yaqub

    Abstract: The scarcity of well-annotated medical datasets requires leveraging transfer learning from broader datasets like ImageNet or pre-trained models like CLIP. Model soups averages multiple fine-tuned models aiming to improve performance on In-Domain (ID) tasks and enhance robustness against Out-of-Distribution (OOD) datasets. However, applying these methods to the medical imaging domain faces challeng… ▽ More

    Submitted 3 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2403.11646  [pdf, other

    cs.CV

    MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks

    Authors: Ibrahim Almakky, Santosh Sanjeev, Anees Ur Rehman Hashmi, Mohammad Areeb Qazi, Hu Wang, Mohammad Yaqub

    Abstract: Transfer learning has become a powerful tool to initialize deep learning models to achieve faster convergence and higher performance. This is especially useful in the medical imaging analysis domain, where data scarcity limits possible performance gains for deep learning models. Some advancements have been made in boosting the transfer learning performance gain by merging models starting from the… ▽ More

    Submitted 15 April, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

  12. arXiv:2403.09240  [pdf, ps, other

    eess.IV cs.CV

    XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model

    Authors: Anees Ur Rehman Hashmi, Ibrahim Almakky, Mohammad Areeb Qazi, Santosh Sanjeev, Vijay Ram Papineni, Jagalpathy Jagdish, Mohammad Yaqub

    Abstract: Large-scale generative models have demonstrated impressive capabilities in producing visually compelling images, with increasing applications in medical imaging. However, they continue to grapple with hallucination challenges and the generation of anatomically inaccurate outputs. These limitations are mainly due to the reliance on textual inputs and lack of spatial control over the generated image… ▽ More

    Submitted 22 October, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  13. arXiv:2311.09607  [pdf, other

    eess.IV cs.CV

    Multi-Task Learning Approach for Unified Biometric Estimation from Fetal Ultrasound Anomaly Scans

    Authors: Mohammad Areeb Qazi, Mohammed Talha Alam, Ibrahim Almakky, Werner Gerhard Diehl, Leanne Bricker, Mohammad Yaqub

    Abstract: Precise estimation of fetal biometry parameters from ultrasound images is vital for evaluating fetal growth, monitoring health, and identifying potential complications reliably. However, the automated computerized segmentation of the fetal head, abdomen, and femur from ultrasound images, along with the subsequent measurement of fetal biometrics, remains challenging. In this work, we propose a mult… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 Pages, 4 Figures, The 4th International Conference on Medical Imaging and Computer-Aided Diagnosis