Showing 1–50 of 649 results for author: Khan, F

  1. arXiv:2604.14700  [pdf, ps, other]

    cs.AR

    Accelerating CRONet on AMD Versal AIE-ML Engines

    Authors: Kaustubh Mhatre, Vedant Tewari, Aditya Ray, Farhan Khan, Ridwan Olabiyi, Ashif Iquebal, Aman Arora

    Abstract: Topology optimization is a computational method used to determine the optimal material distribution within a prescribed design domain, aiming to minimize structural weight while satisfying load and boundary conditions. For critical infrastructure applications, such as structural health monitoring of bridges and buildings, particularly in digital twin contexts, low-latency energy-efficient topology…

    Submitted 16 April, 2026; originally announced April 2026.

  2. arXiv:2604.12306  [pdf, ps, other]

    cs.LG cs.AI

    GCA Framework: A Gulf-Grounded Dataset and Agentic Pipeline for Climate Decision Support

    Authors: Muhammad Umer Sheikh, Khawar Shehzad, Salman Khan, Fahad Shahbaz Khan, Muhammad Haris Khan

    Abstract: Climate decision-making in the Gulf increasingly demands systems that can translate heterogeneous scientific and policy evidence into actionable guidance, yet general-purpose large language models (LLMs) remain weak both in region-specific climate knowledge and grounded interaction with geospatial and forecasting tools. We present the GCA framework, which unifies (i) GCA-DS, a curated Gulf-focused…

    Submitted 14 April, 2026; originally announced April 2026.

  3. arXiv:2604.06170  [pdf, ps, other]

    cs.CL

    Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework

    Authors: Komal Kumar, Aman Chadha, Salman Khan, Fahad Shahbaz Khan, Hisham Cholakkal

    Abstract: The rapid growth of scientific literature has made it increasingly difficult for researchers to efficiently discover, evaluate, and synthesize relevant work. Recent advances in multi-agent large language models (LLMs) have demonstrated strong potential for understanding user intent and are being trained to utilize various tools. In this paper, we introduce Paper Circle, a multi-agent research disc…

    Submitted 7 April, 2026; originally announced April 2026.

    Comments: 19 pages, 7 figures, 8 tables, ACL main (Oral)

  4. arXiv:2604.03231  [pdf, ps, other]

    cs.CV

    CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

    Authors: Ankan Deria, Komal Kumar, Xilin He, Imran Razzak, Hisham Cholakkal, Fahad Shahbaz Khan, Salman Khan

    Abstract: Recent vision-language models (VLMs) typically rely on a single vision encoder trained with contrastive image-text objectives, such as CLIP-style pretraining. While contrastive encoders are effective for cross-modal alignment and retrieval, self-supervised visual encoders often capture richer dense semantics and exhibit stronger robustness on recognition and understanding tasks. In this work, we i…

    Submitted 3 April, 2026; originally announced April 2026.

    Comments: 16 pages, 10 figures, 5 tables

  5. arXiv:2604.03198  [pdf, ps, other]

    cs.CV

    The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Hang Guo, Yan Shu, Jiaqi Ma, Ziteng Cui, Shuhong Liu, Guofeng Mei, Lei Sun, Zongwei Wu, Fahad Shahbaz Khan, Salman Khan, Radu Timofte, Yawei Li, Hongyuan Yu, Pufan Xu, Chen Wu, Long Peng, Jiaojiao Yi, Siyang Yi, Yuning Cui, Jingyuan Xia, Xing Mou, Keji He, Jinlin Wu, Zongang Gao, et al. (38 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2026 challenge on efficient single-image super-resolution with a focus on the proposed solutions and results. The aim of this challenge is to devise a network that reduces one or several aspects, such as runtime, parameters, and FLOPs, while maintaining PSNR of around 26.90 dB on the DIV2K_LSDIR_valid dataset, and 26.99 dB on the DIV2K_LSDIR_test dataset. The challenge…

    Submitted 3 April, 2026; originally announced April 2026.

    Comments: CVPR 2026 NTIRE Workshop Paper, Efficient Super Resolution Technical Report

  6. arXiv:2603.25938  [pdf, ps, other]

    gr-qc astro-ph.HE

    Narrowband searches for continuous gravitational waves from known pulsars in the first two parts of the fourth LIGO--Virgo--KAGRA observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, A. Adam, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, et al. (1831 additional authors not shown)

    Abstract: Rotating non-axisymmetric neutron stars (NSs) are promising sources for continuous gravitational waves (CWs). Such CWs can, if detected, inform us about the internal structure and equation of state of NSs. Here, we present a narrowband search for CWs from known pulsars, for which an efficient and sensitive matched-filter search can be applied. Narrowband searches are designed to be robust to misma…

    Submitted 26 March, 2026; originally announced March 2026.

    Comments: 30 pages, 6 figures, submitted to ApJ

    Report number: LIGO-P2500612

  7. arXiv:2603.25808  [pdf, ps, other]

    gr-qc astro-ph.HE

    Searches for Continuous Gravitational Waves from Supernova Remnants in the first part of the LIGO-Virgo-KAGRA Fourth Observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1742 additional authors not shown)

    Abstract: We present results from directed searches for continuous gravitational waves from a sample of 15 nearby supernova remnants, likely hosting young neutron star candidates, using data from the first eight months of the fourth observing run (O4) of the LIGO-Virgo-KAGRA Collaboration. The analysis employs five pipelines: four semi-coherent methods -- the Band-Sampled-Data directed pipeline, Weave and t…

    Submitted 2 April, 2026; v1 submitted 26 March, 2026; originally announced March 2026.

  8. arXiv:2603.24371  [pdf]

    physics.optics

    Shape-Dependent, Deep-Learning-Assisted Metamaterial Solid Immersion Lens (mSIL) Super-Resolution Imaging

    Authors: Baidong Wu, Fiza Khan, Lingya Yu, Zengbo Wang

    Abstract: We present the first systematic comparison of three TiO2 metamaterial solid immersion lens geometries - sub-hemispherical, super-hemispherical, and full-spherical - for label-free super-resolution imaging. Using SEM, we characterised both the cap profiles and the nanoparticle-fluid immersion at the lens-sample interface, revealing that super-hemispherical lenses achieve the deepest immersion and c…

    Submitted 25 March, 2026; originally announced March 2026.

  9. arXiv:2603.22286  [pdf, ps, other]

    cs.CV cs.AI cs.CL cs.LG

    WorldCache: Content-Aware Caching for Accelerated Video World Models

    Authors: Umair Nawaz, Ahmed Heakl, Ufaq Khan, Abdelrahman Shaker, Salman Khan, Fahad Shahbaz Khan

    Abstract: Diffusion Transformers (DiTs) power high-fidelity video world models but remain computationally expensive due to sequential denoising and costly spatio-temporal attention. Training-free feature caching accelerates inference by reusing intermediate activations across denoising steps; however, existing methods largely rely on a Zero-Order Hold assumption, i.e., reusing cached features as static snaps…

    Submitted 23 March, 2026; originally announced March 2026.

    Comments: 33 Pages

  10. arXiv:2603.20190  [pdf, ps, other]

    cs.CV

    CoVR-R: Reason-Aware Composed Video Retrieval

    Authors: Omkar Thawakar, Dmitry Demidov, Vaishnav Potlapalli, Sai Prasanna Teja Reddy Bogireddy, Viswanatha Reddy Gajjala, Alaa Mostafa Lasheen, Rao Muhammad Anwer, Fahad Khan

    Abstract: Composed Video Retrieval (CoVR) aims to find a target video given a reference video and a textual modification. Prior work assumes the modification text fully specifies the visual changes, overlooking after-effects and implicit consequences (e.g., motion, state transitions, viewpoint or duration cues) that emerge from the edit. We argue that successful CoVR requires reasoning about these after-eff…

    Submitted 20 March, 2026; originally announced March 2026.

    Comments: CVPR 2026 (findings)

  11. arXiv:2603.19021  [pdf, ps, other]

    gr-qc astro-ph.HE

    GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1757 additional authors not shown)

    Abstract: This is the third paper of the set recording the results of the suite of tests of general relativity (GR) performed on the signals from the fourth Gravitational-Wave Transient Catalog (GWTC-4.0), where we focus on the remnants of the binary mergers. We examine for the first time 42 events from the first part of the fourth observing run of the LIGO-Virgo-KAGRA detectors, alongside events from the p…

    Submitted 19 March, 2026; originally announced March 2026.

    Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

    Report number: LIGO-P2500067

  12. arXiv:2603.19020  [pdf, ps, other]

    gr-qc astro-ph.HE

    GWTC-4.0: Tests of General Relativity. II. Parameterized Tests

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1761 additional authors not shown)

    Abstract: In this second of three papers on tests of general relativity (GR) applied to the compact binary coalescence signals in the fourth Gravitational-Wave Transient Catalog (GWTC-4.0), we present the results of the parameterized tests of GR and constraints on line-of-sight acceleration. We include events up to and including the first part of the fourth observing run (O4a) of the LIGO Virgo KAGRA detect…

    Submitted 19 March, 2026; originally announced March 2026.

    Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

    Report number: LIGO-P2500066

  13. arXiv:2603.19019  [pdf, ps, other]

    gr-qc astro-ph.HE

    GWTC-4.0: Tests of General Relativity. I. Overview and General Tests

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1759 additional authors not shown)

    Abstract: The worldwide LIGO-Virgo-KAGRA network of gravitational-wave (GW) detectors continues to increase in sensitivity, thus increasing the quantity and quality of the detected GW signals from compact binary coalescences. These signals allow us to perform ever-more sensitive tests of general relativity (GR) in the dynamical and strong-field regime of gravity. This paper is the first of three, where we p…

    Submitted 19 March, 2026; originally announced March 2026.

    Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

    Report number: LIGO-P2500065

  14. arXiv:2603.15122  [pdf, ps, other]

    math.NA

    Structure-preserving preconditioning of discrete space-fractional diffusion equations with variable coefficient and θ-Method

    Authors: Muhammad Faisal Khan, Asim Ilyas, Rolf Krause, Stefano Serra-Capizzano, Cristina Tablino-Possio

    Abstract: This paper studies the spectral properties of large matrices and the preconditioning of linear systems, arising from the finite difference discretization of a time-dependent space-fractional diffusion equation with a variable coefficient $a(x)$ defined on $\Omega\subset \mathbb{R}^d$, $d=1,2$. The model involves a one-sided Riemann-Liouville fractional derivative multiplied by the function $a(x)$, disc…

    Submitted 16 March, 2026; originally announced March 2026.

  15. arXiv:2603.14168  [pdf, ps, other]

    gr-qc

    All-sky Searches for Continuous Gravitational Waves from Isolated Neutron Stars in the Data from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, A. Adam, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, et al. (1804 additional authors not shown)

    Abstract: We present results from an all-sky search for continuous gravitational waves, using three different methods applied to the first eight months of LIGO data from the fourth LIGO-Virgo-KAGRA Collaboration's observing run. We aim at signals potentially emitted by rotating, non-axisymmetric isolated neutron stars in the Milky Way. The analysis spans a frequency range from 20 Hz to 2000 Hz and accommodat…

    Submitted 14 March, 2026; originally announced March 2026.

    Comments: 45 pages, 17 figures

    Report number: LIGO-P2500416

  16. arXiv:2603.07294  [pdf, ps, other]

    cs.CV cs.AI

    MAviS: A Multimodal Conversational Assistant For Avian Species

    Authors: Yevheniia Kryklyvets, Mohammed Irfan Kurpath, Sahal Shaji Mullappilly, Jinxing Zhou, Fahad Shahbaz Khan, Rao Anwer, Salman Khan, Hisham Cholakkal

    Abstract: Fine-grained understanding and species-specific multimodal question answering are vital for advancing biodiversity conservation and ecological monitoring. However, existing multimodal large language models face challenges when it comes to specialized topics like avian species, making it harder to provide accurate and contextually relevant information in these areas. To address this limitation, we…

    Submitted 7 March, 2026; originally announced March 2026.

    Comments: EMNLP 2025

  17. arXiv:2602.23363  [pdf, ps, other]

    cs.CV

    MediX-R1: Open Ended Medical Reinforcement Learning

    Authors: Sahal Shaji Mullappilly, Mohammed Irfan Kurpath, Omair Mohamed, Mohamed Zidan, Fahad Khan, Salman Khan, Rao Anwer, Hisham Cholakkal

    Abstract: We introduce MediX-R1, an open-ended Reinforcement Learning (RL) framework for medical multimodal large language models (MLLMs) that enables clinically grounded, free-form answers beyond multiple-choice formats. MediX-R1 fine-tunes a baseline vision-language backbone with Group Based RL and a composite reward tailored for medical reasoning: an LLM-based accuracy reward that judges semantic correct…

    Submitted 26 February, 2026; originally announced February 2026.

  18. arXiv:2602.20161  [pdf, ps, other]

    cs.CV

    Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device

    Authors: Abdelrahman Shaker, Ahmed Heakl, Jaseel Muhammad, Ritesh Thawkar, Omkar Thawakar, Senmao Li, Hisham Cholakkal, Ian Reid, Eric P. Xing, Salman Khan, Fahad Shahbaz Khan

    Abstract: Unified multimodal models can both understand and generate visual content within a single architecture. Existing models, however, remain data-hungry and too heavy for deployment on edge devices. We present Mobile-O, a compact vision-language-diffusion model that brings unified multimodal intelligence to a mobile device. Its core module, the Mobile Conditioning Projector (MCP), fuses vision-languag…

    Submitted 24 February, 2026; v1 submitted 23 February, 2026; originally announced February 2026.

    Comments: Project page: https://amshaker.github.io/Mobile-O/

  19. arXiv:2602.19268  [pdf, ps, other]

    cs.AR cs.AI cs.CV cs.NE eess.IV

    CORVET: A CORDIC-Powered, Resource-Frugal Mixed-Precision Vector Processing Engine for High-Throughput AIoT applications

    Authors: Sonu Kumar, Mohd Faisal Khan, Mukul Lokhande, Santosh Kumar Vishvakarma

    Abstract: This brief presents a runtime-adaptive, performance-enhanced vector engine featuring a low-resource, iterative CORDIC-based MAC unit for edge AI acceleration. The proposed design enables dynamic reconfiguration between approximate and accurate modes, exploiting the latency-accuracy trade-off for a wide range of workloads. Its resource-efficient approach further enables up to 4x throughput improvem…

    Submitted 22 February, 2026; originally announced February 2026.

  20. arXiv:2602.17665  [pdf, ps, other]

    cs.CV

    OpenEarthAgent: A Unified Framework for Tool-Augmented Geospatial Agents

    Authors: Akashah Shabbir, Muhammad Umer Sheikh, Muhammad Akhtar Munir, Hiyam Debary, Mustansar Fiaz, Muhammad Zaigham Zaheer, Paolo Fraccaro, Fahad Shahbaz Khan, Muhammad Haris Khan, Xiao Xiang Zhu, Salman Khan

    Abstract: Recent progress in multimodal reasoning has enabled agents that interpret imagery, connect it with language, and execute structured analytical tasks. Extending these capabilities to remote sensing remains challenging, as models must reason over spatial scale, geographic structures, and multispectral indices while maintaining coherent multi-step logic. To address this gap, we introduce Open…

    Submitted 25 March, 2026; v1 submitted 19 February, 2026; originally announced February 2026.

  21. arXiv:2602.06138  [pdf, ps, other]

    cs.LG

    Flow Matching for Offline Reinforcement Learning with Discrete Actions

    Authors: Fairoz Nower Khan, Nabuat Zaman Nahim, Ruiquan Huang, Haibo Yang, Peizhong Ju

    Abstract: Generative policies based on diffusion models and flow matching have shown strong promise for offline reinforcement learning (RL), but their applicability remains largely confined to continuous action spaces. To address a broader range of offline RL settings, we extend flow matching to a general framework that supports discrete action spaces with multiple objectives. Specifically, we replace conti…

    Submitted 5 February, 2026; originally announced February 2026.

  22. arXiv:2602.05882  [pdf, ps, other]

    cs.CV

    EoCD: Encoder only Remote Sensing Change Detection

    Authors: Mubashir Noman, Mustansar Fiaz, Hiyam Debary, Abdul Hannan, Shah Nawaz, Fahad Shahbaz Khan, Salman Khan

    Abstract: Being a cornerstone of temporal analysis, change detection has been playing a pivotal role in modern earth observation. Existing change detection methods rely on the Siamese encoder to individually extract temporal features followed by temporal fusion. Subsequently, these methods design sophisticated decoders to improve the change detection performance without taking into consideration the complex…

    Submitted 5 February, 2026; originally announced February 2026.

  23. arXiv:2601.15514  [pdf, ps, other]

    stat.AP stat.ML

    Assessing the informative value of macroeconomic indicators for public health forecasting

    Authors: Shome Chakraborty, Fardil Khan, Soutik Ghosal

    Abstract: Macroeconomic conditions influence the environments in which health systems operate, yet their value as leading signals of health system capacity has not been systematically evaluated. In this study, we examine whether selected macroeconomic indicators contain predictive information for several capacity-related public health targets, including employment in the health and social assistance workfor…

    Submitted 21 January, 2026; originally announced January 2026.

    Comments: 16 pages, 6 figures

    MSC Class: 62M10 (Primary); 62H30 (Secondary)

  24. Bio-RV: Low-Power Resource-Efficient RISC-V Processor for Biomedical Applications

    Authors: Vijay Pratap Sharma, Annu Kumar, Mohd Faisal Khan, Mukul Lokhande, Santosh Kumar Vishvakarma

    Abstract: This work presents Bio-RV, a compact and resource-efficient RISC-V processor intended for biomedical control applications, such as accelerator-based biomedical SoCs and implantable pacemaker systems. The proposed Bio-RV is a multi-cycle RV32I core that provides explicit execution control and external instruction loading with capabilities that enable controlled firmware deployment, ASIC bring-up, a…

    Submitted 13 January, 2026; originally announced January 2026.

    Comments: IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI-2026)

  25. arXiv:2601.07595  [pdf, ps, other]

    astro-ph.HE

    Deep Search for Joint Sources of Gravitational Waves and High-Energy Neutrinos with IceCube During the Third Observing Run of LIGO and Virgo

    Authors: The IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, S. Ali, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, S. N. Axani, R. Babu, X. Bai, J. Baines-Holmes, A. Balagopal V., S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, et al. (2193 additional authors not shown)

    Abstract: The discovery of joint sources of high-energy neutrinos and gravitational waves has been a primary target for the LIGO, Virgo, KAGRA, and IceCube observatories. The joint detection of high-energy neutrinos and gravitational waves would provide insight into cosmic processes, from the dynamics of compact object mergers and stellar collapses to the mechanisms driving relativistic outflows. The joint…

    Submitted 28 January, 2026; v1 submitted 12 January, 2026; originally announced January 2026.

    Comments: Data release at: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/34B5AP

  26. arXiv:2601.03009  [pdf]

    cs.SE

    A Dataset of Low-Rated Applications from the Amazon Appstore for User Feedback Analysis

    Authors: Nek Dil Khan, Javed Ali Khan, Darvesh Khan, Jianqiang Li, Mumrez Khan, Shah Fahad Khan

    Abstract: In today's digital landscape, end-user feedback plays a crucial role in the evolution of software applications, particularly in addressing issues that hinder user experience. While much research has focused on high-rated applications, low-rated applications often remain unexplored, despite their potential to reveal valuable insights. This study introduces a novel dataset curated from 64 low-rated a…

    Submitted 6 January, 2026; originally announced January 2026.

  27. arXiv:2512.22878  [pdf, ps, other]

    cs.CV cs.AI

    SwinTF3D: A Lightweight Multimodal Fusion Approach for Text-Guided 3D Medical Image Segmentation

    Authors: Hasan Faraz Khan, Noor Fatima, Muzammil Behzad

    Abstract: The recent integration of artificial intelligence into medical imaging has driven remarkable advances in automated organ segmentation. However, most existing 3D segmentation frameworks rely exclusively on visual learning from large annotated datasets, restricting their adaptability to new domains and clinical tasks. The lack of semantic understanding in these models makes them ineffective in addres…

    Submitted 28 December, 2025; originally announced December 2025.

  28. arXiv:2512.22303  [pdf, ps, other]

    cs.CV cs.AI

    Attack-Aware Deepfake Detection under Counter-Forensic Manipulations

    Authors: Noor Fatima, Hasan Faraz Khan, Muzammil Behzad

    Abstract: This work presents an attack-aware deepfake and image-forensics detector designed for robustness, well-calibrated probabilities, and transparent evidence under realistic deployment conditions. The method combines red-team training with randomized test-time defense in a two-stream architecture, where one stream encodes semantic content using a pretrained backbone and the other extracts forensic res…

    Submitted 25 December, 2025; originally announced December 2025.

  29. arXiv:2512.17990  [pdf, ps, other]

    gr-qc astro-ph.HE

    Constraints on gravitational waves from the 2024 Vela pulsar glitch

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1752 additional authors not shown)

    Abstract: Among known neutron stars, the Vela pulsar is one of the best targets for gravitational-wave searches. It is also one of the most prolific in terms of glitches, sudden frequency changes in a pulsar's rotation. Such glitches could cause a variety of transient gravitational-wave signals. Here we search for signals associated with a Vela glitch on 29 April 2024 in data of the two LIGO detectors from…

    Submitted 21 January, 2026; v1 submitted 19 December, 2025; originally announced December 2025.

    Comments: main paper: 16 pages and 7 figures; total with appendices: 40 pages and 14 figures. Submitted to ApJ. Data release at https://doi.org/10.5281/zenodo.17735648

    Report number: LIGO-P2500086

  30. arXiv:2512.17769  [pdf, ps, other]

    cs.CL cs.AI

    Bangla MedER: Multi-BERT Ensemble Approach for the Recognition of Bangla Medical Entity

    Authors: Tanjim Taharat Aurpa, Farzana Akter, Md. Mehedi Hasan, Shakil Ahmed, Shifat Ara Rafiq, Fatema Khan

    Abstract: Medical Entity Recognition (MedER) is an essential NLP task for extracting meaningful entities from the medical corpus. Nowadays, MedER-based research outcomes can remarkably contribute to the development of automated systems in the medical sector, ultimately enhancing patient care and outcomes. While extensive research has been conducted on MedER in English, low-resource languages like Bangla rem…

    Submitted 19 December, 2025; originally announced December 2025.

  31. arXiv:2512.16978  [pdf, ps, other]

    cs.CV

    A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

    Authors: Mohammed Irfan Kurpath, Jaseel Muhammad Kaithakkodan, Jinxing Zhou, Sahal Shaji Mullappilly, Mohammad Almansoori, Noor Ahsan, Beknur Kalmakhanbet, Sambal Shikhar, Rishabh Lalla, Jean Lahoud, Mariette Awad, Fahad Shahbaz Khan, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal

    Abstract: Long-form multimodal video understanding requires integrating vision, speech, and ambient audio with coherent long-range reasoning. Existing benchmarks emphasize either temporal length or multimodal richness, but rarely both, and while some incorporate open-ended questions and advanced metrics, they mostly rely on single-score accuracy, obscuring failure modes. We introduce LongShOTBench, a diagnos…

    Submitted 18 December, 2025; originally announced December 2025.

  32. arXiv:2512.16483  [pdf, ps, other]

    cs.CV

    StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models

    Authors: Senmao Li, Kai Wang, Salman Khan, Fahad Shahbaz Khan, Jian Yang, Yaxing Wang

    Abstract: Visual Autoregressive (VAR) modeling departs from the next-token prediction paradigm of traditional Autoregressive (AR) models through next-scale prediction, enabling high-quality image generation. However, the VAR paradigm suffers from sharply increased computational complexity and running time at large-scale steps. Although existing acceleration methods reduce runtime for large-scale steps, but…

    Submitted 18 December, 2025; originally announced December 2025.

  33. arXiv:2512.16347  [pdf, ps, other]

    gr-qc astro-ph.CO

    GWTC-4.0: Searches for Gravitational-Wave Lensing Signatures

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1744 additional authors not shown)

    Abstract: Gravitational waves can be gravitationally lensed by massive objects along their path. Depending on the lens mass and the lens--source geometry, this can lead to the observation of a single distorted signal or multiple repeated events with the same frequency evolution. We present the results for gravitational-wave lensing searches on the data from the first part of the fourth LIGO--Virgo--KAGRA ob…

    Submitted 4 February, 2026; v1 submitted 18 December, 2025; originally announced December 2025.

    Comments: 36 pages (including refs), 15 figures

    Report number: LIGO-P2500419

  34. arXiv:2512.14713  [pdf, ps, other]

    cs.LG stat.ML

    A Bayesian latent class reinforcement learning framework to capture adaptive, feedback-driven travel behaviour

    Authors: Georges Sfeir, Stephane Hess, Thomas O. Hancock, Filipe Rodrigues, Jamal Amani Rad, Michiel Bliemer, Matthew Beck, Fayyaz Khan

    Abstract: Many travel decisions involve a degree of experience formation, where individuals learn their preferences over time. At the same time, there is extensive scope for heterogeneity across individual travellers, both in their underlying preferences and in how these evolve. The present paper puts forward a Latent Class Reinforcement Learning (LCRL) model that allows analysts to capture both of these ph…

    Submitted 8 December, 2025; originally announced December 2025.

    Comments: 32 pages, 8 figures, 6 tables

  35. arXiv:2512.14695  [pdf, ps, other]

    astro-ph.GA

    JWST Observations of the Double Nucleus in NGC 4486B: Possible Evidence for a Recent Binary SMBH Merger and Recoil

    Authors: Behzad Tahmasebzadeh, Monica Valluri, Shashank Dattathri, Tatsuya Akiba, Fazeel Mahmood Khan, Matthew A. Taylor, Haruka Yoshino, Solveig Thompson, Ann-Marie Madigan, Frank C. van den Bosch, Kelly Holley-Bockelmann, Patrick Côté, Laura Ferrarese, Michael J. Drinkwater, Holger Baumgardt, Misty C. Bentz, Kristen Dage, Eric W. Peng, Somya Jha, Andrea V. Macciò, Chengze Liu, Tyrone E. Woods

    Abstract: A recent study of the compact elliptical galaxy NGC 4486B using JWST-NIRSpec IFU kinematics confirmed a supermassive black hole (SMBH) of mass $M_{BH}=3.6\pm0.7\times10^8$ (~8% of the stellar mass). In addition to its double nucleus, the nuclear kinematics show pronounced asymmetries: a velocity-dispersion peak displaced by 6 pc from the galaxy center and a ~16 km/s offset in the mean stellar line…

    Submitted 13 March, 2026; v1 submitted 16 December, 2025; originally announced December 2025.

    Comments: Accepted for publication in ApJL

  36. arXiv:2512.12840  [pdf, ps, other]

    cs.LG cs.AI

    PRIVEE: Privacy-Preserving Vertical Federated Learning Against Feature Inference Attacks

    Authors: Sindhuja Madabushi, Ahmad Faraz Khan, Haider Ali, Ananthram Swami, Rui Ning, Hongyi Wu, Jin-Hee Cho

    Abstract: Vertical Federated Learning (VFL) enables collaborative model training across organizations that share common user samples but hold disjoint feature spaces. Despite its potential, VFL is susceptible to feature inference attacks, in which adversarial parties exploit shared confidence scores (i.e., prediction probabilities) during inference to reconstruct private input features of other participants…

    Submitted 14 December, 2025; originally announced December 2025.

    Comments: 12 pages, 3 figures

  37. arXiv:2512.12583  [pdf, ps, other

    cs.CR cs.AI

    Detecting Prompt Injection Attacks Against Application Using Classifiers

    Authors: Safwan Shaheer, G. M. Refatul Islam, Mohammad Rafid Hamid, Md. Abrar Faiaz Khan, Md. Omar Faruk, Yaseen Nur

    Abstract: Prompt injection attacks can compromise the security and stability of critical systems, from infrastructure to large web applications. This work curates and augments a prompt injection dataset based on the HackAPrompt Playground Submissions corpus and trains several classifiers, including LSTM, feed-forward neural networks, Random Forest, and Naive Bayes, to detect malicious prompts in LLM integra…

    Submitted 14 December, 2025; originally announced December 2025.

    Comments: 9 pages, X figures; undergraduate research project on detecting prompt injection attacks against LLM integrated web applications using classical machine learning and neural classifiers

    ACM Class: D.4.6; I.2.7

  38. arXiv:2512.11490  [pdf, ps, other

    cs.CV cs.IR

    VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing

    Authors: Emanuel Sánchez Aimar, Gulnaz Zhambulova, Fahad Shahbaz Khan, Yonghao Xu, Michael Felsberg

    Abstract: Satellite imagery differs fundamentally from natural images: its aerial viewpoint, very high resolution, diverse scale variations, and abundance of small objects demand both region-level spatial reasoning and holistic scene understanding. Current remote-sensing approaches remain fragmented between dual-encoder retrieval models, which excel at large-scale cross-modal search but cannot interleave mo…

    Submitted 12 December, 2025; originally announced December 2025.

    Comments: 21 pages, 7 figures, under review

  39. Intermediate Mass Black Hole Binary Evolution in Nuclear Star Clusters: the effect of the stellar mass black hole population

    Authors: Fazeel Mahmood Khan, Peter Berczik, Margarita Sobolenko, Andreas Just, Rainer Spurzem, Kelly Holley-Bockelmann, Andrea Valerio Macciò

    Abstract: In this study, we investigate the dynamics of Intermediate-Mass Black Hole (IMBH) binaries within Nuclear Star Clusters (NSCs) that contain a population of stellar-mass black holes (BHs). We examine how these stellar and BH populations influence the dynamics of the IMBH binary and, in turn, how the evolving IMBH binary affects the surrounding stellar and BH populations. We conduct high-resolution…

    Submitted 11 December, 2025; originally announced December 2025.

    Comments: Accepted for publication in Astronomy & Astrophysics

    Journal ref: A&A 706, A354 (2026)

  40. arXiv:2512.05802  [pdf, ps, other

    cs.CV

    Bring Your Dreams to Life: Continual Text-to-Video Customization

    Authors: Jiahua Dong, Xudong Wang, Wenqi Liang, Zongyan Han, Meng Cao, Duzhen Zhang, Hanbin Zhao, Zhi Han, Salman Khan, Fahad Shahbaz Khan

    Abstract: Customized text-to-video generation (CTVG) has recently witnessed great progress in generating tailored videos from user-specific text. However, most CTVG methods assume that personalized concepts remain static and do not expand incrementally over time. Additionally, they struggle with forgetting and concept neglect when continuously learning new concepts, including subjects and motions. To resolv…

    Submitted 10 December, 2025; v1 submitted 5 December, 2025; originally announced December 2025.

    Comments: Accepted to AAAI2026

  41. arXiv:2512.03335  [pdf, ps, other

    cs.CV cs.LG

    Step-by-step Layered Design Generation

    Authors: Faizan Farooq Khan, K J Joseph, Koustava Goswami, Mohamed Elhoseiny, Balaji Vasan Srinivasan

    Abstract: Design generation, in its essence, is a step-by-step process where designers progressively refine and enhance their work through careful modifications. Despite this fundamental characteristic, existing approaches mainly treat design synthesis as a single-step generation problem, significantly underestimating the inherent complexity of the creative process. To bridge this gap, we propose a novel pr…

    Submitted 2 December, 2025; originally announced December 2025.

    Journal ref: AAAI 2026

  42. arXiv:2511.23478  [pdf, ps, other

    cs.CV

    Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models

    Authors: Muhammad Maaz, Hanoona Rasheed, Fahad Shahbaz Khan, Salman Khan

    Abstract: Reasoning over dynamic visual content remains a central challenge for multimodal large language models. Recent thinking models generate explicit reasoning traces for interpretability; however, their reasoning often appears convincing while being logically inconsistent or weakly grounded in visual evidence. We identify and formalize these issues through two diagnostic metrics: Think Answer Consiste…

    Submitted 8 December, 2025; v1 submitted 28 November, 2025; originally announced November 2025.

    Comments: Video-R2 Technical Report

  43. arXiv:2511.23477  [pdf, ps, other

    cs.CV

    Video-CoM: Interactive Video Reasoning via Chain of Manipulations

    Authors: Hanoona Rasheed, Mohammed Zumri, Muhammad Maaz, Ming-Hsuan Yang, Fahad Shahbaz Khan, Salman Khan

    Abstract: Recent multimodal large language models (MLLMs) have advanced video understanding, yet most still "think about videos", i.e., once a video is encoded, reasoning unfolds entirely in text, treating visual input as a static context. This passive paradigm creates a semantic bottleneck: models cannot rewatch, refocus, or verify evidence, leading to shallow visual reasoning on tasks requiring fine-grained s…

    Submitted 28 November, 2025; originally announced November 2025.

    Comments: Technical Report

  44. arXiv:2511.20650  [pdf, ps, other

    cs.CV cs.AI

    MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities

    Authors: Tooba Tehreem Sheikh, Jean Lahoud, Rao Muhammad Anwer, Fahad Shahbaz Khan, Salman Khan, Hisham Cholakkal

    Abstract: Traditional object detection models in medical imaging operate within a closed-set paradigm, limiting their ability to detect objects of novel labels. Open-vocabulary object detection (OVOD) addresses this limitation but remains underexplored in medical imaging due to dataset scarcity and weak text-image alignment. To bridge this gap, we introduce MedROV, the first Real-time Open Vocabulary detect…

    Submitted 25 November, 2025; originally announced November 2025.

  45. arXiv:2511.19911  [pdf, ps, other

    gr-qc astro-ph.CO

    Search for planetary-mass ultra-compact binaries using data from the first part of the LIGO--Virgo--KAGRA fourth observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1743 additional authors not shown)

    Abstract: We present a search for gravitational waves from inspiraling, planetary-mass ultra-compact binaries using data from the first part of the fourth observing run of LIGO, Virgo and KAGRA. Finding no evidence of such systems, we determine the maximum distance reach for such objects and their merger rate densities, independently of how they could have formed. Then, we identify classes of primordial bla…

    Submitted 5 December, 2025; v1 submitted 24 November, 2025; originally announced November 2025.

    Comments: 6 pages (main) + 7 pages (appendix) + refs; 8 figures

    Report number: P2500248

  46. arXiv:2511.17074  [pdf, ps, other

    cs.CV

    Diversity Has Always Been There in Your Visual Autoregressive Models

    Authors: Tong Wang, Guanyu Yang, Nian Liu, Kai Wang, Yaxing Wang, Abdelrahman M Shaker, Salman Khan, Fahad Shahbaz Khan, Senmao Li

    Abstract: Visual Autoregressive (VAR) models have recently garnered significant attention for their innovative next-scale prediction paradigm, offering notable advantages in both inference efficiency and image quality compared to traditional multi-step autoregressive (AR) and diffusion models. However, despite their efficiency, VAR models often suffer from diversity collapse, i.e., a reduction in output…

    Submitted 21 November, 2025; originally announced November 2025.

  47. arXiv:2511.16863  [pdf, ps, other

    gr-qc

    All-sky search for continuous gravitational-wave signals from unknown neutron stars in binary systems in the first part of the fourth LIGO-Virgo-KAGRA observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu, et al. (1743 additional authors not shown)

    Abstract: We present the results of a blind all-sky search for continuous gravitational-wave signals from neutron stars in binary systems, using LIGO detector data from the first part of the fourth observing run (O4a). Rapidly rotating, non-axisymmetric neutron stars are expected to emit continuous gravitational waves, whose detection would significantly improve our understanding of the galactic…

    Submitted 4 December, 2025; v1 submitted 20 November, 2025; originally announced November 2025.

    Comments: 24 pages, 18 figures, 6 tables

    Report number: LIGO-P2500437

  48. arXiv:2511.16672  [pdf, ps, other

    cs.CV

    EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards

    Authors: Omkar Thawakar, Shravan Venkatraman, Ritesh Thawkar, Abdelrahman Shaker, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Khan

    Abstract: Recent advances in large multimodal models (LMMs) have enabled impressive reasoning and perception abilities, yet most existing training pipelines still depend on human-curated data or externally verified reward models, limiting their autonomy and scalability. In this work, we strive to improve LMM reasoning capabilities in a purely unsupervised fashion (without any annotated data or reward distil…

    Submitted 13 March, 2026; v1 submitted 20 November, 2025; originally announced November 2025.

    Comments: CVPR 2026 (findings)

  49. arXiv:2511.06312  [pdf, ps, other

    math.NA math-ph

    GLT matrix-sequences and few emblematic applications

    Authors: Muhammad Faisal Khan

    Abstract: This thesis advances the spectral theory of structured matrix-sequences within the framework of Generalized Locally Toeplitz (GLT) $*$-algebras, focusing on the geometric mean of Hermitian positive definite (HPD) GLT sequences and its applications in mathematical physics. For two HPD sequences $\{A_n\}_n \sim_{\mathrm{GLT}} κ$ and $\{B_n\}_n \sim_{\mathrm{GLT}} ξ$ in the same $d$-level, $r$-block…

    Submitted 9 November, 2025; originally announced November 2025.

  50. arXiv:2511.04386  [pdf, ps, other

    physics.ins-det astro-ph.IM

    Mitigating effects of nonlinearities in homodyne quadrature interferometers

    Authors: Johannes Lehmann, Artem Basalaev, Jonathan J. Carter, Matteo Carlassara, Harald Lück, Gabriella Chiarini, Pritam Sarkar, Firoz Khan, Satoru Takano, Sara Al-Kershi, Sina M. Koehlenbeck, Pascal Birckigt, Sarah L. Kranzhoff, Juliane von Wrangel, David S. Wu

    Abstract: Homodyne Quadrature interferometers (HoQI) are an interferometric displacement sensing scheme proven to have excellent noise performance, making them a strong candidate for sensing and control schemes in gravitational wave detector seismic isolation. Like many interferometric schemes, HoQIs are prone to nonlinear effects when measuring displacements. These nonlinearities, if left unsuppressed, wou…

    Submitted 6 November, 2025; originally announced November 2025.

    Comments: 13 pages, 13 figures