Skip to main content

Showing 1–17 of 17 results for author: Bilal, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2511.12869  [pdf, ps, other

    cs.LG cs.AI cs.DC cs.IT cs.MA

    On the Fundamental Limits of LLMs at Scale

    Authors: Muhammad Ahmed Mohsin, Muhammad Umer, Ahsan Bilal, Zeeshan Memon, Muhammad Ibtsaam Qadir, Sagnik Bhattacharya, Hassan Rizwan, Abhiram R. Gorle, Maahe Zehra Kazmi, Ayesha Mohsin, Muhammad Usman Rafique, Zihao He, Pulkit Mehta, Muhammad Ali Jamshed, John M. Cioffi

    Abstract: Large Language Models (LLMs) have benefited enormously from scaling, yet these gains are bounded by five fundamental limitations: (1) hallucination, (2) context compression, (3) reasoning degradation, (4) retrieval fragility, and (5) multimodal misalignment. While existing surveys describe these phenomena empirically, they lack a rigorous theoretical synthesis connecting them to the foundational l… ▽ More

    Submitted 16 November, 2025; originally announced November 2025.

    Comments: Submitted to TMLR 2025

  2. arXiv:2511.01333  [pdf, ps, other

    cs.DC

    Transformer-Based Sparse CSI Estimation for Non-Stationary Channels

    Authors: Muhammad Ahmed Mohsin, Muhammad Umer, Ahsan Bilal, Hassan Rizwan, Sagnik Bhattacharya, Muhammad Ali Jamshed, John M. Cioffi

    Abstract: Accurate and efficient estimation of Channel State Information (CSI) is critical for next-generation wireless systems operating under non-stationary conditions, where user mobility, Doppler spread, and multipath dynamics rapidly alter channel statistics. Conventional pilot aided estimators incur substantial overhead, while deep learning approaches degrade under dynamic pilot patterns and time vary… ▽ More

    Submitted 3 November, 2025; originally announced November 2025.

    Comments: ICC 2026

  3. arXiv:2509.18735  [pdf, ps, other

    cs.DC

    6G Twin: Hybrid Gaussian Radio Fields for Channel Estimation and Non-Linear Precoder Design for Radio Access Networks

    Authors: Muhammad Ahmed Mohsin, Muhammad Umer, Ahsan Bilal, Muhammad Ali Jamshed, Dean F. Hougen, John M. Cioffi

    Abstract: This work introduces 6G Twin, the first end-to-end artificial intelligence (AI)-native radio access network (RAN) design that unifies (i) neural Gaussian Radio Fields (GRF) for compressed channel state information (CSI) acquisition, (ii) continual channel prediction with handover persistence, and (iii) an energy-optimal nonlinear precoder (minPMAC). GRF replaces dense pilots with a sparse Gaussian… ▽ More

    Submitted 23 September, 2025; originally announced September 2025.

    Comments: Submitted to IEEE Transactions on Wireless Communications

  4. arXiv:2509.15192  [pdf, ps, other

    cs.DC

    Channel Prediction under Network Distribution Shift Using Continual Learning-based Loss Regularization

    Authors: Muhammad Ahmed Mohsin, Muhammad Umer, Ahsan Bilal, Muhammad Ibtsaam Qadir, Muhammad Ali Jamshed, Dean F. Hougen, John M. Cioffi

    Abstract: Modern wireless networks face critical challenges when mobile users traverse heterogeneous network configurations with varying antenna layouts, carrier frequencies, and scattering statistics. Traditional predictors degrade under distribution shift, with NMSE rising by 37.5\% during cross-configuration handovers. This work addresses catastrophic forgetting in channel prediction by proposing a conti… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: ICASSP 2026

  5. arXiv:2509.15182  [pdf, ps, other

    cs.DC

    Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models

    Authors: Muhammad Ahmed Mohsin, Ahsan Bilal, Muhammad Umer, Asad Aali, Muhammad Ali Jamshed, Dean F. Hougen, John M. Cioffi

    Abstract: Wireless channels in motion-rich urban microcell (UMi) settings are non-stationary; mobility and scatterer dynamics shift the distribution over time, degrading classical and deep estimators. This work proposes conditional prior diffusion for channel estimation, which learns a history-conditioned score to denoise noisy channel snapshots. A temporal encoder with cross-time attention compresses a sho… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: ICASSP 2026

  6. arXiv:2508.11668  [pdf, ps, other

    eess.SP cs.NI

    Neural Gaussian Radio Fields for Channel Estimation

    Authors: Muhammad Umer, Muhammad Ahmed Mohsin, Ahsan Bilal, John M. Cioffi

    Abstract: Accurate channel state information (CSI) remains the most critical bottleneck in modern wireless networks, with pilot overhead consuming up to 11-21% of transmission bandwidth, increasing latency by 20-40% in massive MIMO systems, and reducing potential spectral efficiency by over 53%. Traditional estimation techniques fundamentally fail under mobility, with feedback delays as small as 4 ms causin… ▽ More

    Submitted 6 August, 2025; originally announced August 2025.

    Comments: This paper has been submitted to NeurIPS 2025

  7. arXiv:2507.10619  [pdf, ps, other

    cs.LG cs.AI cs.NI

    Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks

    Authors: Oluwaseyi Giwa, Tobi Awodunmila, Muhammad Ahmed Mohsin, Ahsan Bilal, Muhammad Ali Jamshed

    Abstract: The dynamic allocation of spectrum in 5G / 6G networks is critical to efficient resource utilization. However, applying traditional deep reinforcement learning (DRL) is often infeasible due to its immense sample complexity and the safety risks associated with unguided exploration, which can cause severe network interference. To address these challenges, we propose a meta-learning framework that en… ▽ More

    Submitted 13 July, 2025; originally announced July 2025.

    Comments: 5 pages, 6 figures, under review at IEEE Wireless Communications Letters

  8. arXiv:2506.22471  [pdf, ps, other

    eess.SP cs.NI

    Continual Learning for Wireless Channel Prediction

    Authors: Muhammad Ahmed Mohsin, Muhammad Umer, Ahsan Bilal, Muhammad Ali Jamshed, John M. Cioffi

    Abstract: Modern 5G/6G deployments routinely face cross-configuration handovers--users traversing cells with different antenna layouts, carrier frequencies, and scattering statistics--which inflate channel-prediction NMSE by $37.5\%$ on average when models are naively fine-tuned. The proposed improvement frames this mismatch as a continual-learning problem and benchmarks three adaptation families: replay wi… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: Accepted at ICML Workshop on ML4Wireless

  9. arXiv:2505.05082  [pdf, other

    cs.LG cs.IT math.PR

    ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model

    Authors: Sagnik Bhattacharya, Abhiram Gorle, Ahsan Bilal, Connor Ding, Amit Kumar Singh Yadav, Tsachy Weissman

    Abstract: Generative modeling of non-negative, discrete data, such as symbolic music, remains challenging due to two persistent limitations in existing methods. Firstly, many approaches rely on modeling continuous embeddings, which is suboptimal for inherently discrete data distributions. Secondly, most models optimize variational bounds rather than exact data likelihood, resulting in inaccurate likelihood… ▽ More

    Submitted 27 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Pre-print

  10. arXiv:2504.14520  [pdf, other

    cs.AI cs.CL

    Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey

    Authors: Ahsan Bilal, Muhammad Ahmed Mohsin, Muhammad Umer, Muhammad Awais Khan Bangash, Muhammad Ali Jamshed

    Abstract: This survey explores the development of meta-thinking capabilities in Large Language Models (LLMs) from a Multi-Agent Reinforcement Learning (MARL) perspective. Meta-thinking self-reflection, assessment, and control of thinking processes is an important next step in enhancing LLM reliability, flexibility, and performance, particularly for complex or high-stakes tasks. The survey begins by analyzin… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: Submitted to IEEE Transactions on Artificial Intelligence

  11. arXiv:2504.02894   

    cs.CL cs.AI

    OnRL-RAG: Real-Time Personalized Mental Health Dialogue System

    Authors: Ahsan Bilal, Beiyu Lin

    Abstract: Large language models (LLMs) have been widely used for various tasks and applications. However, LLMs and fine-tuning are limited to the pre-trained data. For example, ChatGPT's world knowledge until 2021 can be outdated or inaccurate. To enhance the capabilities of LLMs, Retrieval-Augmented Generation (RAG), is proposed to augment LLMs with additional, new, latest details and information to LLMs.… ▽ More

    Submitted 22 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

    Comments: It needs more revisions. I am currently working on it with my co-author

  12. arXiv:2504.00125  [pdf, other

    cs.AI cs.CL

    LLMs for Explainable AI: A Comprehensive Survey

    Authors: Ahsan Bilal, David Ebert, Beiyu Lin

    Abstract: Large Language Models (LLMs) offer a promising approach to enhancing Explainable AI (XAI) by transforming complex machine learning outputs into easy-to-understand narratives, making model predictions more accessible to users, and helping bridge the gap between sophisticated model behavior and human interpretability. AI models, such as state-of-the-art neural networks and deep learning models, are… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Comments: This manuscript is intended for submission to ACM Transactions on Intelligent Systems and Technology

  13. arXiv:2503.07670  [pdf, other

    cs.NI eess.IV

    Retrieval Augmented Generation with Multi-Modal LLM Framework for Wireless Environments

    Authors: Muhammad Ahmed Mohsin, Ahsan Bilal, Sagnik Bhattacharya, John M. Cioffi

    Abstract: Future wireless networks aim to deliver high data rates and lower power consumption while ensuring seamless connectivity, necessitating robust optimization. Large language models (LLMs) have been deployed for generalized optimization scenarios. To take advantage of generative AI (GAI) models, we propose retrieval augmented generation (RAG) for multi-sensor wireless environment perception. Utilizin… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: Accepted @ ICC 2025

  14. arXiv:2501.17879  [pdf, other

    cs.IT cs.AI cs.SD eess.AS eess.SP

    Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels

    Authors: Sagnik Bhattacharya, Muhammad Ahmed Mohsin, Ahsan Bilal, John M. Cioffi

    Abstract: Emerging wireless AR/VR applications require real-time transmission of correlated high-fidelity speech from multiple resource-constrained devices over unreliable, bandwidth-limited channels. Existing autoencoder-based speech source coding methods fail to address the combination of the following - (1) dynamic bitrate adaptation without retraining the model, (2) leveraging correlations among multipl… ▽ More

    Submitted 19 January, 2025; originally announced January 2025.

    Comments: Published at AAAI 2025 Workshop

    Journal ref: Association for the Advancement of Artificial Intelligence (AAAI) 2025 Workshop

  15. arXiv:2302.03156  [pdf, other

    cs.CV cs.AI eess.IV

    Novel Building Detection and Location Intelligence Collection in Aerial Satellite Imagery

    Authors: Sandeep Singh, Christian Wiles, Ahmed Bilal

    Abstract: Building structures detection and information about these buildings in aerial images is an important solution for city planning and management, land use analysis. It can be the center piece to answer important questions such as planning evacuation routes in case of an earthquake, flood management, etc. These applications rely on being able to accurately retrieve up-to-date information. Being able… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 9 pages(5 main pages, 4 auxiliary pages)

  16. arXiv:1901.01462  [pdf, other

    cs.NE

    Rethinking the Artificial Neural Networks: A Mesh of Subnets with a Central Mechanism for Storing and Predicting the Data

    Authors: Usman Ahmad, Hong Song, Awais Bilal, Shahid Mahmood, Asad Ullah, Uzair Saeed

    Abstract: The Artificial Neural Networks (ANNs) have been originally designed to function like a biological neural network, but does an ANN really work in the same way as a biological neural network? As we know, the human brain holds information in its memory cells, so if the ANNs use the same model as our brains, they should store datasets in a similar manner. The most popular type of ANN architecture is b… ▽ More

    Submitted 5 January, 2019; originally announced January 2019.

    Comments: SUBMITTED TO IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

  17. arXiv:1811.11983  [pdf, other

    cs.SI

    Analysing Emergent Users' Text Messages Data and Exploring its Benefits

    Authors: Anas Bilal, Aimal Rextin, Ahmad Kakakhail, Mehwish Nasim

    Abstract: While users in the developed world can choose to adopt the technology that suits their needs, the emergent users cannot afford this luxury, hence, they adapt themselves to the technology that is readily available. When technology is designed, such as the mobile-phone technology, it is an implicit assumption that it would be adopted by the emergent users in due course. However, such user groups hav… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.