Skip to main content

Showing 1–50 of 236 results for author: Ahmed, N

Searching in archive cs. Search in all archives.
.
  1. Sentiment Classification of Gaza War Headlines: A Comparative Analysis of Large Language Models and Arabic Fine-Tuned BERT Models

    Authors: Amr Eleraqi, Hager H. Mustafa, Abdul Hadi N. Ahmed

    Abstract: This study examines how different artificial intelligence architectures interpret sentiment in conflict-related media discourse, using the 2023 Gaza War as a case study. Drawing on a corpus of 10,990 Arabic news headlines (Eleraqi 2026), the research conducts a comparative analysis between three large language models and six fine-tuned Arabic BERT models. Rather than evaluating accuracy against a… ▽ More

    Submitted 18 March, 2026; originally announced April 2026.

    Comments: 45 pages, 6 figures (including diagrams), 8 tables. Dataset available at this https URL . Previously posted at https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/FFENX3

    Journal ref: SSRN (2026)

  2. arXiv:2603.03646  [pdf, ps, other

    cs.CV

    InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

    Authors: Mohamed Elmoghany, Liangbing Zhao, Xiaoqian Shen, Subhojyoti Mukherjee, Yang Zhou, Gang Wu, Viet Dac Lai, Seunghyun Yoon, Ryan Rossi, Abdullah Rashwan, Puneet Mathur, Varun Manjunatha, Daksh Dangi, Chien Nguyen, Nedim Lipka, Trung Bui, Krishna Kumar Singh, Ruiyi Zhang, Xiaolei Huang, Jaemin Cho, Yu Wang, Namyong Park, Zhengzhong Tu, Hongjie Chen, Hoda Eldardiry , et al. (5 additional authors not shown)

    Abstract: Generating long-form storytelling videos with consistent visual narratives remains a significant challenge in video synthesis. We present a novel framework, dataset, and a model that address three critical limitations: background consistency across shots, seamless multi-subject shot-to-shot transitions, and scalability to hour-long narratives. Our approach introduces a background-consistent genera… ▽ More

    Submitted 3 March, 2026; originally announced March 2026.

  3. arXiv:2602.13028  [pdf, ps, other

    cs.CV cs.CL

    Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis

    Authors: Runzhou Liu, Hailey Weingord, Sejal Mittal, Prakhar Dungarwal, Anusha Nandula, Bo Ni, Samyadeep Basu, Hongjie Chen, Nesreen K. Ahmed, Li Li, Jiayi Zhang, Koustava Goswami, Subhojyoti Mukherjee, Branislav Kveton, Puneet Mathur, Franck Dernoncourt, Yue Zhao, Yu Wang, Ryan A. Rossi, Zhengzhong Tu, Hongru Du

    Abstract: Evaluating image editing models remains challenging due to the coarse granularity and limited interpretability of traditional metrics, which often fail to capture aspects important to human perception and intent. Such metrics frequently reward visually plausible outputs while overlooking controllability, edit localization, and faithfulness to user instructions. In this work, we introduce a fine-gr… ▽ More

    Submitted 13 February, 2026; originally announced February 2026.

  4. arXiv:2602.12305  [pdf, ps, other

    cs.LG cs.AI cs.DC cs.MA cs.SE

    OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

    Authors: Arijit Bhattacharjee, Heng Ping, Son Vu Le, Paul Bogdan, Nesreen K. Ahmed, Ali Jannesari

    Abstract: Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinatorial space of low-level transformations under noisy and expensive hardware feedback. Although large language models can synthesize functionally correct CUDA code, achieving competitive performance requires systematic exploration and verification of optimization choices. We present OptiML, an end-to-… ▽ More

    Submitted 11 February, 2026; originally announced February 2026.

  5. arXiv:2602.09319  [pdf, ps, other

    cs.CR

    Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation

    Authors: Zhisheng Qi, Utkarsh Sahu, Li Ma, Haoyu Han, Ryan Rossi, Franck Dernoncourt, Mahantesh Halappanavar, Nesreen Ahmed, Yushun Dong, Yue Zhao, Yu Zhang, Yu Wang

    Abstract: Retrieval-Augmented Generation (RAG) has become a cornerstone of knowledge-intensive applications, including enterprise chatbots, healthcare assistants, and agentic memory management. However, recent studies show that knowledge-extraction attacks can recover sensitive knowledge-base content through maliciously crafted queries, raising serious concerns about intellectual property theft and privacy… ▽ More

    Submitted 11 February, 2026; v1 submitted 9 February, 2026; originally announced February 2026.

  6. arXiv:2602.07673  [pdf, ps, other

    cs.CL

    Blind to the Human Touch: Overlap Bias in LLM-Based Summary Evaluation

    Authors: Jiangnan Fang, Cheng-Tse Liu, Hanieh Deilamsalehy, Nesreen K. Ahmed, Puneet Mathur, Nedim Lipka, Franck Dernoncourt, Ryan A. Rossi

    Abstract: Large language model (LLM) judges have often been used alongside traditional, algorithm-based metrics for tasks like summarization because they better capture semantic information, are better at reasoning, and are more robust to paraphrasing. However, LLM judges show biases for length and order among others, and are vulnerable to various adversarial input prompts. While recent studies have looked… ▽ More

    Submitted 7 February, 2026; originally announced February 2026.

  7. arXiv:2602.03068  [pdf, ps, other

    cs.SI

    From semantic memory to collective creativity: A generative cognitive foundation for social creativity models

    Authors: Mirza Nayeem Ahmed, Raiyan Abdul Baten

    Abstract: Simulation-based theory development has yielded powerful insights into collective performance by linking social structure to emergent outcomes, yet it has struggled to extend to collective creativity. Creativity is hard to capture purely at the social level, as novel ideas are generated through cognitive mechanisms. To address this gap, we introduce a multi-level socio-cognitive agent-based framew… ▽ More

    Submitted 2 February, 2026; originally announced February 2026.

  8. arXiv:2601.17690  [pdf, ps, other

    cs.SD cs.AI cs.IR cs.LG eess.AS

    Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance

    Authors: Ziling Gong, Yunyan Ouyang, Iram Kamdar, Melody Ma, Hongjie Chen, Franck Dernoncourt, Ryan A. Rossi, Nesreen K. Ahmed

    Abstract: Audio fingerprinting provides an identifiable representation of acoustic signals, which can be later used for identification and retrieval systems. To obtain a discriminative representation, the input audio is usually segmented into shorter time intervals, allowing local acoustic features to be extracted and analyzed. Modern neural approaches typically operate on short, fixed-duration audio segmen… ▽ More

    Submitted 24 January, 2026; originally announced January 2026.

  9. arXiv:2601.15399  [pdf, ps, other

    cs.LG

    Attention-Informed Surrogates for Navigating Power-Performance Trade-offs in HPC

    Authors: Ashna Nawar Ahmed, Banooqa Banday, Terry Jones, Tanzima Z. Islam

    Abstract: High-Performance Computing (HPC) schedulers must balance user performance with facility-wide resource constraints. The task boils down to selecting the optimal number of nodes for a given job. We present a surrogate-assisted multi-objective Bayesian optimization (MOBO) framework to automate this complex decision. Our core hypothesis is that surrogate models informed by attention-based embeddings o… ▽ More

    Submitted 21 January, 2026; originally announced January 2026.

    Comments: 13 pages, 6 figures Published in MLForSys workshop in NeurIPS 2025 Link: https://openreview.net/forum?id=R0Vc9lnDd5

  10. arXiv:2601.03124  [pdf

    cs.CV cs.AI cs.LG

    LeafLife: An Explainable Deep Learning Framework with Robustness for Grape Leaf Disease Recognition

    Authors: B. M. Shahria Alam, Md. Nasim Ahmed

    Abstract: Plant disease diagnosis is essential to farmers' management choices because plant diseases frequently lower crop yield and product quality. For harvests to flourish and agricultural productivity to boost, grape leaf disease detection is important. The plant disease dataset contains grape leaf diseases total of 9,032 images of four classes, among them three classes are leaf diseases, and the other… ▽ More

    Submitted 6 January, 2026; originally announced January 2026.

    Comments: 4 pages, 8 figures, 2025 IEEE International Conference on Signal Processing, Information, Communication and Systems (SPICSCON)

  11. arXiv:2511.03866  [pdf, ps, other

    cs.DC cs.AI cs.LG cs.PF cs.PL

    OMPILOT: Harnessing Transformer Models for Auto Parallelization to Shared Memory Computing Paradigms

    Authors: Arijit Bhattacharjee, Ali TehraniJamsaz, Le Chen, Niranjan Hasabnis, Mihai Capota, Nesreen Ahmed, Ali Jannesari

    Abstract: Recent advances in large language models (LLMs) have significantly accelerated progress in code translation, enabling more accurate and efficient transformation across programming languages. While originally developed for natural language processing, LLMs have shown strong capabilities in modeling programming language syntax and semantics, outperforming traditional rule-based systems in both accur… ▽ More

    Submitted 11 November, 2025; v1 submitted 5 November, 2025; originally announced November 2025.

  12. arXiv:2510.27617  [pdf, ps, other

    cs.AI

    VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation

    Authors: Heng Ping, Arijit Bhattacharjee, Peiyu Zhang, Shixuan Li, Wei Yang, Anzhe Cheng, Xiaole Zhang, Jesse Thomason, Ali Jannesari, Nesreen Ahmed, Paul Bogdan

    Abstract: Automation of Register Transfer Level (RTL) design can help developers meet increasing computational demands. Large Language Models (LLMs) show promise for Hardware Description Language (HDL) generation, but face challenges due to limited parametric knowledge and domain-specific constraints. While prompt engineering and fine-tuning have limitations in knowledge coverage and training costs, multi-a… ▽ More

    Submitted 31 October, 2025; originally announced October 2025.

  13. arXiv:2510.24469  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Iterative Critique-Refine Framework for Enhancing LLM Personalization

    Authors: Durga Prasad Maram, Dhruvin Gandhi, Zonghai Yao, Gayathri Akkinapalli, Franck Dernoncourt, Yu Wang, Ryan A. Rossi, Nesreen K. Ahmed

    Abstract: Personalized text generation requires models not only to produce coherent text but also to align with a target user's style, tone, and topical focus. Existing retrieval-augmented approaches such as LaMP and PGraphRAG enrich profiles with user and neighbor histories, but they stop at generation and often yield outputs that drift in tone, topic, or style. We present PerFine, a unified, training-free… ▽ More

    Submitted 28 October, 2025; originally announced October 2025.

  14. arXiv:2510.08624  [pdf, ps, other

    cs.CL

    Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B

    Authors: Nisar Ahmed, Muhammad Imran Zaman, Gulshan Saleem, Ali Hassan

    Abstract: Benchmarks for large language models (LLMs) often rely on rubric-scented prompts that request visible reasoning and strict formatting, whereas real deployments demand terse, contract-bound answers. We investigate whether such "evaluation scent" inflates measured performance without commensurate capability gains. Using a single open-weights model (GPT-OSS-20B), we run six paired A/B scenarios that… ▽ More

    Submitted 8 October, 2025; originally announced October 2025.

  15. arXiv:2510.06244  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Evaluating Embedding Frameworks for Scientific Domain

    Authors: Nouman Ahmed, Ronin Wu, Victor Botev

    Abstract: Finding an optimal word representation algorithm is particularly important in terms of domain specific data, as the same word can have different meanings and hence, different representations depending on the domain and context. While Generative AI and transformer architecture does a great job at generating contextualized embeddings for any given work, they are quite time and compute extensive, esp… ▽ More

    Submitted 3 October, 2025; originally announced October 2025.

  16. arXiv:2509.14851  [pdf, ps, other

    cs.CL cs.AI

    Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support

    Authors: Xianrong Yao, Dong She, Chenxu Zhang, Yimeng Zhang, Yueru Sun, Noman Ahmed, Yang Gao, Zhanpeng Jin

    Abstract: Empathy is critical for effective mental health support, especially when addressing Long Counseling Texts (LCTs). However, existing Large Language Models (LLMs) often generate replies that are semantically fluent but lack the structured reasoning necessary for genuine psychological support, particularly in a Chinese context. To bridge this gap, we introduce Empathy-R1, a novel framework that integ… ▽ More

    Submitted 19 September, 2025; v1 submitted 18 September, 2025; originally announced September 2025.

  17. arXiv:2508.18301  [pdf

    cs.LG cs.CY cs.HC

    A Fast and Minimal System to Identify Depression Using Smartphones: Explainable Machine Learning-Based Approach

    Authors: Md Sabbir Ahmed, Nova Ahmed

    Abstract: Background: Existing robust, pervasive device-based systems developed in recent years to detect depression require data collected over a long period and may not be effective in cases where early detection is crucial. Objective: Our main objective was to develop a minimalistic system to identify depression using data retrieved in the fastest possible time. Methods: We developed a fast tool that… ▽ More

    Submitted 22 August, 2025; originally announced August 2025.

  18. A Minimalistic Approach to Predict and Understand the Relation of App Usage with Students' Academic Performances

    Authors: Md Sabbir Ahmed, Rahat Jahangir Rony, Mohammad Abdul Hadi, Ekram Hossain, Nova Ahmed

    Abstract: Due to usage of self-reported data which may contain biasness, the existing studies may not unveil the exact relation between academic grades and app categories such as Video. Additionally, the existing systems' requirement for data of prolonged period to predict grades may not facilitate early intervention to improve it. Thus, we presented an app that retrieves past 7 days' actual app usage data… ▽ More

    Submitted 22 August, 2025; originally announced August 2025.

  19. arXiv:2508.16078  [pdf, ps, other

    cs.CR cs.NI

    A Survey of Post-Quantum Cryptography Support in Cryptographic Libraries

    Authors: Nadeem Ahmed, Lei Zhang, Aryya Gangopadhyay

    Abstract: The rapid advancement of quantum computing poses a significant threat to modern cryptographic systems, necessitating the transition to Post-Quantum Cryptography (PQC). This study evaluates the support for PQC algorithms within nine widely used open-source cryptographic libraries -- OpenSSL, wolfSSL, BoringSSL, LibreSSL, Bouncy Castle, libsodium, Crypto++, Botan, and MbedTLS -- focusing on their im… ▽ More

    Submitted 22 August, 2025; originally announced August 2025.

    Comments: To be published in IEEE International Conference on Quantum Computing and Engineering (QCE) 2025

  20. arXiv:2508.08821  [pdf, ps, other

    cs.CV

    3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs

    Authors: Noor Ahmed, Cameron Braunstein, Steffen Eger, Eddy Ilg

    Abstract: Recent Multi-Modal Large Language Models (MLLMs) have demonstrated strong capabilities in learning joint representations from text and images. However, their spatial reasoning remains limited. We introduce 3DFroMLLM, a novel framework that enables the generation of 3D object prototypes directly from MLLMs, including geometry and part labels. Our pipeline is agentic, comprising a designer, coder, a… ▽ More

    Submitted 12 August, 2025; originally announced August 2025.

  21. arXiv:2508.05427  [pdf, ps, other

    cs.AI

    Large Language Models Transform Organic Synthesis From Reaction Prediction to Automation

    Authors: Kartar Kumar Lohana Tharwani, Rajesh Kumar, Sumita, Numan Ahmed, Yong Tang

    Abstract: Large language models (LLMs) are beginning to reshape how chemists plan and run reactions in organic synthesis. Trained on millions of reported transformations, these text-based models can propose synthetic routes, forecast reaction outcomes and even instruct robots that execute experiments without human supervision. Here we survey the milestones that turned LLMs from speculative tools into practi… ▽ More

    Submitted 7 August, 2025; originally announced August 2025.

  22. arXiv:2508.01128  [pdf, ps, other

    cs.IR cs.AI cs.LG

    Towards Bridging Review Sparsity in Recommendation with Textual Edge Graph Representation

    Authors: Leyao Wang, Xutao Mao, Xuhui Zhan, Yuying Zhao, Bo Ni, Ryan A. Rossi, Nesreen K. Ahmed, Tyler Derr

    Abstract: Textual reviews enrich recommender systems with fine-grained preference signals and enhanced explainability. However, in real-world scenarios, users rarely leave reviews, resulting in severe sparsity that undermines the effectiveness of existing models. A natural solution is to impute or generate missing reviews to enrich the data. However, conventional imputation techniques -- such as matrix comp… ▽ More

    Submitted 1 August, 2025; originally announced August 2025.

    Comments: 13 pages

  23. arXiv:2507.18565  [pdf

    cs.CV

    Deep Learning-Based Age Estimation and Gender Deep Learning-Based Age Estimation and Gender Classification for Targeted Advertisement

    Authors: Muhammad Imran Zaman, Nisar Ahmed

    Abstract: This paper presents a novel deep learning-based approach for simultaneous age and gender classification from facial images, designed to enhance the effectiveness of targeted advertising campaigns. We propose a custom Convolutional Neural Network (CNN) architecture, optimized for both tasks, which leverages the inherent correlation between age and gender information present in facial features. Unli… ▽ More

    Submitted 24 July, 2025; originally announced July 2025.

    Comments: 6

  24. arXiv:2507.07202  [pdf, ps, other

    cs.CV

    A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

    Authors: Mohamed Elmoghany, Ryan Rossi, Seunghyun Yoon, Subhojyoti Mukherjee, Eslam Bakr, Puneet Mathur, Gang Wu, Viet Dac Lai, Nedim Lipka, Ruiyi Zhang, Varun Manjunatha, Chien Nguyen, Daksh Dangi, Abel Salinas, Mohammad Taesiri, Hongjie Chen, Xiaolei Huang, Joe Barrow, Nesreen Ahmed, Hoda Eldardiry, Namyong Park, Yu Wang, Jaemin Cho, Anh Totti Nguyen, Zhengzhong Tu , et al. (4 additional authors not shown)

    Abstract: Despite the significant progress that has been made in video generative models, existing state-of-the-art methods can only produce videos lasting 5-16 seconds, often labeled "long-form videos". Furthermore, videos exceeding 16 seconds struggle to maintain consistent character appearances and scene layouts throughout the narrative. In particular, multi-subject long videos still fail to preserve cha… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

  25. arXiv:2506.18323  [pdf, ps, other

    eess.IV cs.AI cs.CV

    A Multi-Scale Spatial Attention-Based Zero-Shot Learning Framework for Low-Light Image Enhancement

    Authors: Muhammad Azeem Aslam, Hassan Khalid, Nisar Ahmed

    Abstract: Low-light image enhancement remains a challenging task, particularly in the absence of paired training data. In this study, we present LucentVisionNet, a novel zero-shot learning framework that addresses the limitations of traditional and deep learning-based enhancement methods. The proposed approach integrates multi-scale spatial attention with a deep curve estimation network, enabling fine-grain… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  26. arXiv:2506.18321  [pdf

    cs.CV

    Attention-Based Ensemble Learning for Crop Classification Using Landsat 8-9 Fusion

    Authors: Zeeshan Ramzan, Nisar Ahmed, Qurat-ul-Ain Akram, Shahzad Asif, Muhammad Shahbaz, Rabin Chakrabortty, Ahmed F. Elaksher

    Abstract: Remote sensing offers a highly effective method for obtaining accurate information on total cropped area and crop types. The study focuses on crop cover identification for irrigated regions of Central Punjab. Data collection was executed in two stages: the first involved identifying and geocoding six target crops through field surveys conducted in January and February 2023. The second stage involv… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Under review in Earth Systems and Environment

  27. arXiv:2506.17439  [pdf

    cs.CV

    Enhancing Wireless Device Identification through RF Fingerprinting: Leveraging Transient Energy Spectrum Analysis

    Authors: Nisar Ahmed, Gulshan Saleem, Hafiz Muhammad Shahzad Asif, Muhammad Usman Younus, Kalsoom Safdar

    Abstract: In recent years, the rapid growth of the Internet of Things technologies and the widespread adoption of 5G wireless networks have led to an exponential increase in the number of radiation devices operating in complex electromagnetic environments. A key challenge in managing and securing these devices is accurate identification and classification. To address this challenge, specific emitter identif… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Submitted in Wireless Personal Communications

  28. arXiv:2506.16601  [pdf, ps, other

    cs.CV eess.IV

    MetaQAP - A Meta-Learning Approach for Quality-Aware Pretraining in Image Quality Assessment

    Authors: Nisar Ahmed, Gulshan Saleem, Nazik Alturki, Nada Alasbali

    Abstract: Image Quality Assessment (IQA) is a critical task in a wide range of applications but remains challenging due to the subjective nature of human perception and the complexity of real-world image distortions. This study proposes MetaQAP, a novel no-reference IQA model designed to address these challenges by leveraging quality-aware pre-training and meta-learning. The model performs three key contrib… ▽ More

    Submitted 15 October, 2025; v1 submitted 19 June, 2025; originally announced June 2025.

  29. arXiv:2506.16592  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Hybrid Attention Network for Accurate Breast Tumor Segmentation in Ultrasound Images

    Authors: Muhammad Azeem Aslam, Asim Naveed, Nisar Ahmed

    Abstract: Breast ultrasound imaging is a valuable tool for early breast cancer detection, but automated tumor segmentation is challenging due to inherent noise, variations in scale of lesions, and fuzzy boundaries. To address these challenges, we propose a novel hybrid attention-based network for lesion segmentation. Our proposed architecture integrates a pre-trained DenseNet121 in the encoder part for robu… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  30. arXiv:2506.13116  [pdf

    cs.LG cs.CL

    Crime Hotspot Prediction Using Deep Graph Convolutional Networks

    Authors: Tehreem Zubair, Syeda Kisaa Fatima, Noman Ahmed, Asifullah Khan

    Abstract: Crime hotspot prediction is critical for ensuring urban safety and effective law enforcement, yet it remains challenging due to the complex spatial dependencies inherent in criminal activity. The previous approaches tended to use classical algorithms such as the KDE and SVM to model data distributions and decision boundaries. The methods often fail to capture these spatial relationships, treating… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  31. arXiv:2506.12953  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition

    Authors: Mayank Bumb, Anshul Vemulapalli, Sri Harsha Vardhan Prasad Jella, Anish Gupta, An La, Ryan A. Rossi, Hongjie Chen, Franck Dernoncourt, Nesreen K. Ahmed, Yu Wang

    Abstract: Recent advances in Large Language Models (LLMs) have demonstrated new possibilities for accurate and efficient time series analysis, but prior work often required heavy fine-tuning and/or ignored inter-series correlations. In this work, we explore simple and flexible prompt-based strategies that enable LLMs to perform time series forecasting without extensive retraining or the use of a complex ext… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  32. arXiv:2506.11475  [pdf

    cs.MA cs.CL cs.CV

    AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction

    Authors: Syeda Kisaa Fatima, Tehreem Zubair, Noman Ahmed, Asifullah Khan

    Abstract: This paper introduces LUCID-MA (Learning and Understanding Crime through Dialogue of Multiple Agents), an innovative AI powered framework where multiple AI agents collaboratively analyze and understand crime data. Our system that consists of three core components: an analysis assistant that highlights spatiotemporal crime patterns; a feedback component that reviews and refines analytical results;… ▽ More

    Submitted 20 July, 2025; v1 submitted 13 June, 2025; originally announced June 2025.

  33. arXiv:2506.08140  [pdf, ps, other

    cs.LG cs.CL

    AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

    Authors: Yifei Li, Hanane Nour Moussa, Ziru Chen, Shijie Chen, Botao Yu, Mingyi Xue, Benjamin Burns, Tzu-Yao Chiu, Vishal Dey, Zitong Lu, Chen Wei, Qianheng Zhang, Tianyu Zhang, Song Gao, Xuhui Huang, Xia Ning, Nesreen K. Ahmed, Ali Payani, Huan Sun

    Abstract: Despite long-standing efforts in accelerating scientific discovery with AI, building AI co-scientists remains challenging due to limited high-quality data for training and evaluation. To tackle this data scarcity issue, we present AutoSDT, an automatic pipeline that collects high-quality coding tasks in real-world data-driven discovery workflows. AutoSDT leverages the coding capabilities and param… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  34. arXiv:2505.19286  [pdf, ps, other

    cs.CL cs.LG cs.SI

    A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models

    Authors: Utkarsh Sahu, Zhisheng Qi, Yongjia Lei, Ryan A. Rossi, Franck Dernoncourt, Nesreen K. Ahmed, Mahantesh M Halappanavar, Yao Ma, Yu Wang

    Abstract: Large language models have been extensively studied as neural knowledge bases for their knowledge access, editability, reasoning, and explainability. However, few works focus on the structural patterns of their knowledge. Motivated by this gap, we investigate these structural patterns from a graph perspective. We quantify the knowledge of LLMs at both the triplet and entity levels, and analyze how… ▽ More

    Submitted 27 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  35. arXiv:2505.14106  [pdf, ps, other

    cs.CL cs.AI

    A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

    Authors: Li Li, Peilin Cai, Ryan A. Rossi, Franck Dernoncourt, Branislav Kveton, Junda Wu, Tong Yu, Linxin Song, Tiankai Yang, Yuehan Qin, Nesreen K. Ahmed, Samyadeep Basu, Subhojyoti Mukherjee, Ruiyi Zhang, Zhengmian Hu, Bo Ni, Yuxiao Zhou, Zichao Wang, Yue Huang, Yu Wang, Xiangliang Zhang, Philip S. Yu, Xiyang Hu, Yue Zhao

    Abstract: We present PersonaConvBench, a large-scale benchmark for evaluating personalized reasoning and generation in multi-turn conversations with large language models (LLMs). Unlike existing work that focuses on either personalization or conversational structure in isolation, PersonaConvBench integrates both, offering three core tasks: sentence classification, impact regression, and user-centric text ge… ▽ More

    Submitted 25 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  36. arXiv:2504.06429  [pdf, other

    cs.RO cs.MA

    Extended Version: Multi-Robot Motion Planning with Cooperative Localization

    Authors: Anne Theurkauf, Nisar Ahmed, Morteza Lahijanian

    Abstract: We consider the uncertain multi-robot motion planning (MRMP) problem with cooperative localization (CL-MRMP), under both motion and measurement noise, where each robot can act as a sensor for its nearby teammates. We formalize CL-MRMP as a chance-constrained motion planning problem, and propose a safety-guaranteed algorithm that explicitly accounts for robot-robot correlations. Our approach extend… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: Submitted to IROS 2025

  37. arXiv:2504.02719  [pdf, other

    cs.SE

    The Myth of Immutability: A Multivocal Review on Smart Contract Upgradeability

    Authors: Ilham Qasse, Isra M. Ali, Nafisa Ahmed, Mohammad Hamdaqa, Björn Þór Jónsson

    Abstract: The immutability of smart contracts on blockchain platforms like Ethereum promotes security and trustworthiness but presents challenges for updates, bug fixes, or adding new features post-deployment. These limitations can lead to vulnerabilities and outdated functionality, impeding the evolution and maintenance of decentralized applications. Despite various upgrade mechanisms proposed in academic… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  38. arXiv:2503.12317  [pdf

    cs.AI

    A Transformer-based survival model for prediction of all-cause mortality in heart failure patients: a multi-cohort study

    Authors: Shishir Rao, Nouman Ahmed, Gholamreza Salimi-Khorshidi, Christopher Yau, Huimin Su, Nathalie Conrad, Folkert W Asselbergs, Mark Woodward, Rod Jackson, John GF Cleland, Kazem Rahimi

    Abstract: We developed and validated TRisk, a Transformer-based AI model predicting 36-month mortality in heart failure patients by analysing temporal patient journeys from UK electronic health records (EHR). Our study included 403,534 heart failure patients (ages 40-90) from 1,418 English general practices, with 1,063 practices for model derivation and 355 for external validation. TRisk was compared agains… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  39. arXiv:2502.17843  [pdf, other

    cs.CV

    Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads

    Authors: Istiaq Ahmed Fahad, Abdullah Ibne Hanif Arean, Nazmus Sakib Ahmed, Mahmudul Hasan

    Abstract: Automatic Vehicle Detection (AVD) in diverse driving environments presents unique challenges due to varying lighting conditions, road types, and vehicle types. Traditional methods, such as YOLO and Faster R-CNN, often struggle to cope with these complexities. As computer vision evolves, combining Convolutional Neural Networks (CNNs) with Transformer-based approaches offers promising opportunities… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  40. arXiv:2502.11767  [pdf, ps, other

    cs.LG cs.CL

    From Selection to Generation: A Survey of LLM-based Active Learning

    Authors: Yu Xia, Subhojyoti Mukherjee, Zhouhang Xie, Junda Wu, Xintong Li, Ryan Aponte, Hanjia Lyu, Joe Barrow, Hongjie Chen, Franck Dernoncourt, Branislav Kveton, Tong Yu, Ruiyi Zhang, Jiuxiang Gu, Nesreen K. Ahmed, Yu Wang, Xiang Chen, Hanieh Deilamsalehy, Sungchul Kim, Zhengmian Hu, Yue Zhao, Nedim Lipka, Seunghyun Yoon, Ting-Hao Kenneth Huang, Zichao Wang , et al. (9 additional authors not shown)

    Abstract: Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs) have been employed not only for selection but also for generating entirely new data instances and providing more cost-effective annotations. Motivated by the incre… ▽ More

    Submitted 31 May, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: ACL 2025

  41. arXiv:2502.06872  [pdf, other

    cs.CL cs.AI

    Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey

    Authors: Bo Ni, Zheyuan Liu, Leyao Wang, Yongjia Lei, Yuying Zhao, Xueqi Cheng, Qingkai Zeng, Luna Dong, Yinglong Xia, Krishnaram Kenthapadi, Ryan Rossi, Franck Dernoncourt, Md Mehrab Tanjim, Nesreen Ahmed, Xiaorui Liu, Wenqi Fan, Erik Blasch, Yu Wang, Meng Jiang, Tyler Derr

    Abstract: Retrieval-Augmented Generation (RAG) is an advanced technique designed to address the challenges of Artificial Intelligence-Generated Content (AIGC). By integrating context retrieval into content generation, RAG provides reliable and up-to-date external knowledge, reduces hallucinations, and ensures relevant context across a wide range of tasks. However, despite RAG's success and potential, recent… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  42. arXiv:2501.11849  [pdf, other

    cs.CL cs.AI cs.SI

    Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance

    Authors: Nikos Kanakaris, Heng Ping, Xiongye Xiao, Nesreen K. Ahmed, Luca Luceri, Emilio Ferrara, Paul Bogdan

    Abstract: Detecting organized political campaigns is of paramount importance in fighting against disinformation on social media. Existing approaches for the identification of such organized actions employ techniques mostly from network science, graph machine learning and natural language processing. Their ultimate goal is to analyze the relationships and interactions (e.g. re-posting) among users and the te… ▽ More

    Submitted 17 February, 2025; v1 submitted 20 January, 2025; originally announced January 2025.

  43. arXiv:2501.02157  [pdf, ps, other

    cs.CL

    Personalized Graph-Based Retrieval for Large Language Models

    Authors: Steven Au, Cameron J. Dimacali, Ojasmitha Pedirappagari, Namyong Park, Franck Dernoncourt, Yu Wang, Nikos Kanakaris, Hanieh Deilamsalehy, Ryan A. Rossi, Nesreen K. Ahmed

    Abstract: As large language models (LLMs) evolve, their ability to deliver personalized and context-aware responses offers transformative potential for improving user experiences. Existing personalization approaches, however, often rely solely on user history to augment the prompt, limiting their effectiveness in generating tailored outputs, especially in cold-start scenarios with sparse data. To address th… ▽ More

    Submitted 31 May, 2025; v1 submitted 3 January, 2025; originally announced January 2025.

  44. arXiv:2501.00994  [pdf, other

    cs.OS

    Exploiting Application-to-Architecture Dependencies for Designing Scalable OS

    Authors: Yao Xiao, Nikos Kanakaris, Anzhe Cheng, Chenzhong Yin, Nesreen K. Ahmed, Shahin Nazarian, Andrei Irimia, Paul Bogdan

    Abstract: With the advent of hundreds of cores on a chip to accelerate applications, the operating system (OS) needs to exploit the existing parallelism provided by the underlying hardware resources to determine the right amount of processes to be mapped on the multi-core systems. However, the existing OS is not scalable and is oblivious to applications. We address these issues by adopting a multi-layer net… ▽ More

    Submitted 6 January, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

  45. Spatial Clustering of Citizen Science Data Improves Downstream Species Distribution Models

    Authors: Nahian Ahmed, Mark Roth, Tyler A. Hallman, W. Douglas Robinson, Rebecca A. Hutchinson

    Abstract: Citizen science biodiversity data present great opportunities for ecology and conservation across vast spatial and temporal scales. However, the opportunistic nature of these data lacks the sampling structure required by modeling methodologies that address a pervasive challenge in ecological data collection: imperfect detection, i.e., the likelihood of under-observing species on field surveys. Occ… ▽ More

    Submitted 16 January, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: AAAI 2025

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 39(27), 27775-27783, 2025

  46. arXiv:2412.15487  [pdf, other

    cs.CL

    Multi-LLM Text Summarization

    Authors: Jiangnan Fang, Cheng-Tse Liu, Jieun Kim, Yash Bhedaru, Ethan Liu, Nikhil Singh, Nedim Lipka, Puneet Mathur, Nesreen K. Ahmed, Franck Dernoncourt, Ryan A. Rossi, Hanieh Deilamsalehy

    Abstract: In this work, we propose a Multi-LLM summarization framework, and investigate two different multi-LLM strategies including centralized and decentralized. Our multi-LLM summarization framework has two fundamentally important steps at each round of conversation: generation and evaluation. These steps are different depending on whether our multi-LLM decentralized summarization is used or centralized.… ▽ More

    Submitted 1 April, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

  47. arXiv:2412.13501  [pdf, ps, other

    cs.AI cs.HC

    GUI Agents: A Survey

    Authors: Dang Nguyen, Jian Chen, Yu Wang, Gang Wu, Namyong Park, Zhengmian Hu, Hanjia Lyu, Junda Wu, Ryan Aponte, Yu Xia, Xintong Li, Jing Shi, Hongjie Chen, Viet Dac Lai, Zhouhang Xie, Sungchul Kim, Ruiyi Zhang, Tong Yu, Mehrab Tanjim, Nesreen K. Ahmed, Puneet Mathur, Seunghyun Yoon, Lina Yao, Branislav Kveton, Jihyung Kil , et al. (5 additional authors not shown)

    Abstract: Graphical User Interface (GUI) agents, powered by Large Foundation Models, have emerged as a transformative approach to automating human-computer interaction. These agents autonomously interact with digital systems or software applications via GUIs, emulating human actions such as clicking, typing, and navigating visual elements across diverse platforms. Motivated by the growing interest and funda… ▽ More

    Submitted 26 September, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted to Findings of ACL 2025

  48. arXiv:2412.04183  [pdf

    cs.LG

    Linear Discriminant Analysis in Credit Scoring: A Transparent Hybrid Model Approach

    Authors: Md Shihab Reza, Monirul Islam Mahmud, Ifti Azad Abeer, Nova Ahmed

    Abstract: The development of computing has made credit scoring approaches possible, with various machine learning (ML) and deep learning (DL) techniques becoming more and more valuable. While complex models yield more accurate predictions, their interpretability is often weakened, which is a concern for credit scoring that places importance on decision fairness. As features of the dataset are a crucial fact… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: Accepted on International Conference on Computer and Information Technology (ICCIT) 2024

  49. arXiv:2412.02142  [pdf, other

    cs.CV cs.AI cs.CL cs.IR

    Personalized Multimodal Large Language Models: A Survey

    Authors: Junda Wu, Hanjia Lyu, Yu Xia, Zhehao Zhang, Joe Barrow, Ishita Kumar, Mehrnoosh Mirtaheri, Hongjie Chen, Ryan A. Rossi, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Jiuxiang Gu, Nesreen K. Ahmed, Yu Wang, Xiang Chen, Hanieh Deilamsalehy, Namyong Park, Sungchul Kim, Huanrui Yang, Subrata Mitra, Zhengmian Hu, Nedim Lipka, Dang Nguyen, Yue Zhao , et al. (2 additional authors not shown)

    Abstract: Multimodal Large Language Models (MLLMs) have become increasingly important due to their state-of-the-art performance and ability to integrate multiple data modalities, such as text, images, and audio, to perform complex tasks with high accuracy. This paper presents a comprehensive survey on personalized multimodal large language models, focusing on their architecture, training methods, and applic… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  50. arXiv:2411.00369  [pdf, other

    cs.CL

    GRS-QA -- Graph Reasoning-Structured Question Answering Dataset

    Authors: Anish Pahilajani, Devasha Trivedi, Jincen Shuai, Khin S. Yone, Samyak Rajesh Jain, Namyong Park, Ryan A. Rossi, Nesreen K. Ahmed, Franck Dernoncourt, Yu Wang

    Abstract: Large Language Models (LLMs) have excelled in multi-hop question-answering (M-QA) due to their advanced reasoning abilities. However, the impact of the inherent reasoning structures on LLM M-QA performance remains unclear, largely due to the absence of QA datasets that provide fine-grained reasoning structures. To address this gap, we introduce the Graph Reasoning-Structured Question Answering Dat… ▽ More

    Submitted 7 November, 2024; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: 15 pages, 24 figures, 10 tables