Skip to main content

Showing 1–50 of 135 results for author: Patel, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2604.07911  [pdf, ps, other

    cs.MA cs.AI cs.LG

    Dynamic Attentional Context Scoping: Agent-Triggered Focus Sessions for Isolated Per-Agent Steering in Multi-Agent LLM Orchestration

    Authors: Nickson Patel

    Abstract: Multi-agent LLM orchestration systems suffer from context pollution: when N concurrent agents compete for the orchestrator's context window, each agent's task state, partial outputs, and pending questions contaminate the steering interactions of every other agent, degrading decision quality. We introduce Dynamic Attentional Context Scoping (DACS), a mechanism in which the orchestrator operates in… ▽ More

    Submitted 9 April, 2026; originally announced April 2026.

    Comments: 15 pages, 4 figures, preprint

    ACM Class: I.2.11; I.2.6

  2. arXiv:2604.00292  [pdf, ps, other

    cs.SD cs.LG

    MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control

    Authors: Sahil Kumar, Namrataben Patel, Honggang Wang, Youshan Zhang

    Abstract: MambaVoiceCloning (MVC) asks whether the conditioning path of diffusion-based TTS can be made fully SSM-only at inference, removing all attention and explicit RNN-style recurrence layers across text, rhythm, and prosody, while preserving or improving quality under controlled conditions. MVC combines a gated bidirectional Mamba text encoder, a Temporal Bi-Mamba supervised by a lightweight alignment… ▽ More

    Submitted 31 March, 2026; originally announced April 2026.

    Comments: Accepted at ICLR 2026

  3. arXiv:2603.25248  [pdf, ps, other

    cs.IR

    ColBERT-Att: Late-Interaction Meets Attention for Enhanced Retrieval

    Authors: Raj Nath Patel, Sourav Dutta

    Abstract: Vector embeddings from pre-trained language models form a core component in Neural Information Retrieval systems across a multitude of knowledge extraction tasks. The paradigm of late interaction, introduced in ColBERT, demonstrates high accuracy along with runtime efficiency. However, the current formulation fails to take into account the attention weights of query and document terms, which intui… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

    Comments: 5 pages

  4. arXiv:2602.20659  [pdf, ps, other

    cs.AI

    Recursive Belief Vision Language Action Models

    Authors: Vaidehi Bagaria, Bijo Sebastian, Nirav Kumar Patel

    Abstract: Vision-language-action models must enable agents to execute long-horizon tasks under partial observability. However, most existing approaches remain observation-driven, relying on short context windows or repeated queries to vision-language models (VLMs). This leads to loss of task progress, action repetition under perceptual aliasing, and high inference latency. While semantic grounding is import… ▽ More

    Submitted 25 February, 2026; v1 submitted 24 February, 2026; originally announced February 2026.

  5. arXiv:2602.19510  [pdf, ps, other

    cs.LG math.OC stat.ML

    Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon

    Authors: Rudrajit Das, Neel Patel, Meisam Razaviyayn, Vahab Mirrokni

    Abstract: Data mixing--the strategic reweighting of training domains--is a critical component in training robust machine learning models. This problem is naturally formulated as a bilevel optimization task, where the outer loop optimizes domain weights to minimize validation loss, and the inner loop optimizes model parameters to minimize the weighted training loss. Classical bilevel optimization relies on h… ▽ More

    Submitted 22 February, 2026; originally announced February 2026.

  6. Joint Time Series Chain: Detecting Unusual Evolving Trend across Time Series

    Authors: Li Zhang, Nital Patel, Xiuqi Li, Jessica Lin

    Abstract: Time series chain (TSC) is a recently introduced concept that captures the evolving patterns in large scale time series. Informally, a time series chain is a temporally ordered set of subsequences, in which consecutive subsequences in the chain are similar to one another, but the last and the first subsequences maybe be dissimilar. Time series chain has the great potential to reveal latent unusual… ▽ More

    Submitted 14 February, 2026; originally announced February 2026.

    Journal ref: In Proceedings of the 2022 SIAM International Conference on Data Mining (SDM)

  7. arXiv:2602.10410  [pdf, ps, other

    cs.LG

    LUCID: Attention with Preconditioned Representations

    Authors: Sai Surya Duvvuri, Nirmal Patel, Nilesh Gupta, Inderjit S. Dhillon

    Abstract: Softmax-based dot-product attention is a cornerstone of Transformer architectures, enabling remarkable capabilities such as in-context learning. However, as context lengths increase, a fundamental limitation of the softmax function emerges: it tends to diffuse probability mass to irrelevant tokens degrading performance in long-sequence scenarios. Furthermore, attempts to sharpen focus by lowering… ▽ More

    Submitted 10 February, 2026; originally announced February 2026.

  8. arXiv:2512.19361  [pdf, ps, other

    cs.LG

    Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation

    Authors: Isshaan Singh, Divyansh Chawla, Anshu Garg, Shivin Mangal, Pallavi Gupta, Khushi Agarwal, Nimrat Singh Khalsa, Nandan Patel

    Abstract: The need for an intelligent, real-time spoilage prediction system has become critical in modern IoT-driven food supply chains, where perishable goods are highly susceptible to environmental conditions. Existing methods often lack adaptability to dynamic conditions and fail to optimize decision making in real time. To address these challenges, we propose a hybrid reinforcement learning framework in… ▽ More

    Submitted 22 December, 2025; originally announced December 2025.

  9. arXiv:2511.02371  [pdf, ps, other

    cs.LG

    LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment

    Authors: Rohan Wandre, Yash Gajewar, Namrata Patel, Vivek Dhalkari

    Abstract: Retrieval-Augmented Generation (RAG) has emerged as the dominant paradigm for grounding large language model outputs in verifiable evidence. However, as modern AI agents transition from static knowledge bases to continuous multimodal streams encompassing text, images, video, and audio, two critical challenges arise: maintaining index freshness without prohibitive re-indexing costs, and preserving… ▽ More

    Submitted 4 November, 2025; originally announced November 2025.

  10. arXiv:2510.18819  [pdf, ps, other

    cs.CV cs.AI

    An Explainable Hybrid AI Framework for Enhanced Tuberculosis and Symptom Detection

    Authors: Neel Patel, Alexander Wong, Ashkan Ebadi

    Abstract: Tuberculosis remains a critical global health issue, particularly in resource-limited and remote areas. Early detection is vital for treatment, yet the lack of skilled radiologists underscores the need for artificial intelligence (AI)-driven screening tools. Developing reliable AI models is challenging due to the necessity for large, high-quality datasets, which are costly to obtain. To tackle thi… ▽ More

    Submitted 21 October, 2025; originally announced October 2025.

    Comments: 16 pages, 3 figures

  11. arXiv:2510.08469  [pdf, ps, other

    quant-ph cs.AI cs.SE

    Platform-Agnostic Modular Architecture for Quantum Benchmarking

    Authors: Neer Patel, Anish Giri, Hrushikesh Pramod Patil, Noah Siekierski, Avimita Chatterjee, Sonika Johri, Timothy Proctor, Thomas Lubinski, Siyuan Niu

    Abstract: We present a platform-agnostic modular architecture that addresses the increasingly fragmented landscape of quantum computing benchmarking by decoupling problem generation, circuit execution, and results analysis into independent, interoperable components. Supporting over 20 benchmark variants ranging from simple algorithmic tests like Bernstein-Vazirani to complex Hamiltonian simulation with obse… ▽ More

    Submitted 9 October, 2025; originally announced October 2025.

  12. arXiv:2510.06096  [pdf, ps, other

    cs.LG cs.CL

    The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives

    Authors: Matthieu Bou, Nyal Patel, Arjun Jagota, Satyapriya Krishna, Sonali Parbhoo

    Abstract: The objectives that Large Language Models (LLMs) implicitly optimize remain dangerously opaque, making trustworthy alignment and auditing a grand challenge. While Inverse Reinforcement Learning (IRL) can infer reward functions from behaviour, existing approaches either produce a single, overconfident reward estimate or fail to address the fundamental ambiguity of the task (non-identifiability). Th… ▽ More

    Submitted 8 October, 2025; v1 submitted 7 October, 2025; originally announced October 2025.

    Comments: Preprint

  13. arXiv:2510.06092  [pdf, ps, other

    cs.LG cs.CL

    Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL

    Authors: Nyal Patel, Matthieu Bou, Arjun Jagota, Satyapriya Krishna, Sonali Parbhoo

    Abstract: Reinforcement Learning from Human Feedback (RLHF) aligns Large Language Models (LLMs) with human preferences, yet the underlying reward signals they internalize remain hidden, posing a critical challenge for interpretability and safety. Existing approaches attempt to extract these latent incentives using Inverse Reinforcement Learning (IRL), but treat all preference pairs equally, often overlookin… ▽ More

    Submitted 17 January, 2026; v1 submitted 7 October, 2025; originally announced October 2025.

    Comments: Preprint

  14. arXiv:2508.04711  [pdf, ps, other

    cs.IR cs.LG

    Scaling Generative Recommendations with Context Parallelism on Hierarchical Sequential Transducers

    Authors: Yue Dong, Han Li, Shen Li, Nikhil Patel, Xing Liu, Xiaodong Wang, Chuanhao Zhuge

    Abstract: Large-scale recommendation systems are pivotal to process an immense volume of daily user interactions, requiring the effective modeling of high cardinality and heterogeneous features to ensure accurate predictions. In prior work, we introduced Hierarchical Sequential Transducers (HSTU), an attention-based architecture for modeling high cardinality, non-stationary streaming recommendation data, pr… ▽ More

    Submitted 15 August, 2025; v1 submitted 23 July, 2025; originally announced August 2025.

  15. arXiv:2507.09871  [pdf, ps, other

    cs.LG cs.AI

    Task Priors: Enhancing Model Evaluation by Considering the Entire Space of Downstream Tasks

    Authors: Niket Patel, Randall Balestriero

    Abstract: The grand goal of AI research, and particularly Self Supervised Learning (SSL), is to produce systems that can successfully solve any possible task. In contrast, current evaluation methods available to AI researchers typically rely on a fixed collection of hand-picked downstream benchmarks. Hence, a large amount of effort is put into designing and searching for large collection of evaluation tasks… ▽ More

    Submitted 20 October, 2025; v1 submitted 13 July, 2025; originally announced July 2025.

    Comments: NeurIPS UniReps Workshop 2025

  16. arXiv:2506.06535  [pdf, ps, other

    cs.RO

    MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping

    Authors: Vineet Bhat, Naman Patel, Prashanth Krishnamurthy, Ramesh Karri, Farshad Khorrami

    Abstract: Robotic manipulation of unseen objects via natural language commands remains challenging. Language driven robotic grasping (LDRG) predicts stable grasp poses from natural language queries and RGB-D images. We propose MapleGrasp, a novel framework that leverages mask-guided feature pooling for efficient vision-language driven grasping. Our two-stage training first predicts segmentation masks from C… ▽ More

    Submitted 25 August, 2025; v1 submitted 6 June, 2025; originally announced June 2025.

  17. arXiv:2505.21644  [pdf, ps, other

    cs.CV

    Geometric Feature Prompting of Image Segmentation Models

    Authors: Kenneth Ball, Erin Taylor, Nirav Patel, Andrew Bartels, Gary Koplik, James Polly, Jay Hineman

    Abstract: Advances in machine learning, especially the introduction of transformer architectures and vision transformers, have led to the development of highly capable computer vision foundation models. The segment anything model (known colloquially as SAM and more recently SAM 2), is a highly capable foundation model for segmentation of natural images and has been further applied to medical and scientific… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  18. arXiv:2505.15373  [pdf, ps, other

    cs.CV cs.RO

    RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation

    Authors: Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Mapping and understanding complex 3D environments is fundamental to how autonomous systems perceive and interact with the physical world, requiring both precise geometric reconstruction and rich semantic comprehension. While existing 3D semantic mapping systems excel at reconstructing and identifying predefined object instances, they lack the flexibility to efficiently build semantic maps with ope… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  19. arXiv:2505.09010  [pdf, ps, other

    cs.DS

    Fully Dynamic Euclidean Bi-Chromatic Matching in Sublinear Update Time

    Authors: Gramoz Goranci, Peter Kiss, Neel Patel, Martin P. Seybold, Eva Szilagyi, Da Wei Zheng

    Abstract: We consider the Euclidean bi-chromatic matching problem in the dynamic setting, where the goal is to efficiently process point insertions and deletions while maintaining a high-quality solution. Computing the minimum cost bi-chromatic matching is one of the core problems in geometric optimization that has found many applications, most notably in estimating Wasserstein distance between two distribu… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: accepted at ICML 2025

  20. arXiv:2504.17897  [pdf, ps, other

    cs.CY

    A Walk across Europe: Development of a high-resolution walkability index

    Authors: Nishit Patel, Hoang-Ha Nguyen, Jet van de Geest, Alfred Wagtendonk, Mohan JS Raju, Payam Dadvand, Kees de Hoogh, Marta Cirach, Mark Nieuwenhuijsen, Thao Minh Lam, Jeroen Lakerveld

    Abstract: Physical inactivity significantly contributes to obesity and other non-communicable diseases, yet efforts to increase population-wide physical activity levels have met with limited success. The built environment plays a pivotal role in encouraging active behaviors like walking. Walkability indices, which aggregate various environmental features, provide a valuable tool for promoting healthy, walka… ▽ More

    Submitted 24 July, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  21. arXiv:2504.13882  [pdf, other

    cs.HC cs.CL

    Toward Automated Qualitative Analysis: Leveraging Large Language Models for Tutoring Dialogue Evaluation

    Authors: Megan Gu, Chloe Qianhui Zhao, Claire Liu, Nikhil Patel, Jahnvi Shah, Jionghao Lin, Kenneth R. Koedinger

    Abstract: Our study introduces an automated system leveraging large language models (LLMs) to assess the effectiveness of five key tutoring strategies: 1. giving effective praise, 2. reacting to errors, 3. determining what students know, 4. helping students manage inequity, and 5. responding to negative self-talk. Using a public dataset from the Teacher-Student Chatroom Corpus, our system classifies each tu… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Manuscript accepted to the Workshop on "From Data to Discovery: LLMs for Qualitative Analysis in Education" at LAK25

  22. arXiv:2504.02191  [pdf, ps, other

    cs.CE cs.LG

    A User-Tunable Machine Learning Framework for Step-Wise Synthesis Planning

    Authors: Shivesh Prakash, Nandan Patel, Hans-Arno Jacobsen, Viki Kumar Prasad

    Abstract: We introduce MHNpath, a machine learning-driven retrosynthetic tool designed for computer-aided synthesis planning. Leveraging modern Hopfield networks and novel comparative metrics, MHNpath efficiently prioritizes reaction templates, improving the scalability and accuracy of retrosynthetic predictions. The tool incorporates a tunable scoring system that allows users to prioritize pathways based o… ▽ More

    Submitted 16 December, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

  23. arXiv:2503.09613  [pdf

    cs.CY cs.AI

    Empowering the Future Workforce: Prioritizing Education for the AI-Accelerated Job Market

    Authors: Lisa Amini, Henry F. Korth, Nita Patel, Evan Peck, Ben Zorn

    Abstract: AI's rapid integration into the workplace demands new approaches to workforce education and training and broader AI literacy across disciplines. Coordinated action from government, industry, and educational institutions is necessary to ensure workers can adapt to accelerating technological change.

    Submitted 3 March, 2025; originally announced March 2025.

  24. arXiv:2502.20346  [pdf, ps, other

    cs.GT

    Equilibria and Learning in Modular Marketplaces

    Authors: Kshipra Bhawalkar, Jeff Dean, Christopher Liaw, Aranyak Mehta, Neel Patel

    Abstract: We envision a marketplace where diverse entities offer specialized "modules" through APIs, allowing users to compose the outputs of these modules for complex tasks within a given budget. This paper studies the market design problem in such an ecosystem, where module owners strategically set prices for their APIs (to maximize their profit) and a central platform orchestrates the aggregation of modu… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  25. arXiv:2502.02013  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Layer by Layer: Uncovering Hidden Representations in Language Models

    Authors: Oscar Skean, Md Rifat Arefin, Dan Zhao, Niket Patel, Jalal Naghiyev, Yann LeCun, Ravid Shwartz-Ziv

    Abstract: From extracting features to generating text, the outputs of large language models (LLMs) typically rely on the final layers, following the conventional wisdom that earlier layers capture only low-level cues. However, our analysis shows that intermediate layers can encode even richer representations, often improving performance on a range of downstream tasks. To explain and quantify these hidden-la… ▽ More

    Submitted 15 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: update for ICML2025 camera-ready

  26. arXiv:2412.18283  [pdf, ps, other

    cs.LG

    On the Local Complexity of Linear Regions in Deep ReLU Networks

    Authors: Niket Patel, Guido Montufar

    Abstract: We define the local complexity of a neural network with continuous piecewise linear activations as a measure of the density of linear regions over an input data distribution. We show theoretically that ReLU networks that learn low-dimensional feature representations have a lower local complexity. This allows us to connect recent empirical observations on feature learning at the level of the weight… ▽ More

    Submitted 13 July, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

    Comments: International Conference on Machine Learning (ICML), 2025

  27. arXiv:2412.15246  [pdf, other

    cs.CL cs.AI cs.AR cs.DC cs.IR

    Accelerating Retrieval-Augmented Generation

    Authors: Derrick Quinn, Mohammad Nouri, Neel Patel, John Salihu, Alireza Salemi, Sukhan Lee, Hamed Zamani, Mohammad Alian

    Abstract: An evolving solution to address hallucination and enhance accuracy in large language models (LLMs) is Retrieval-Augmented Generation (RAG), which involves augmenting LLMs with information retrieved from an external knowledge source, such as the web. This paper profiles several RAG execution pipelines and demystifies the complex interplay between their retrieval and generation phases. We demonstrat… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

  28. arXiv:2412.12131  [pdf

    cs.CR cs.CE cs.ET

    Technical Insights on Blockchain's Role in Financial Systems

    Authors: Ishan Patwardhan, Sunil Mane, Nidhi Patel

    Abstract: This research provides a critical analysis regarding the way blockchain is being implemented in the financial industry, highlighting its vital role in promoting green finance, guaranteeing compliance with regulations, improving supply chain finance, boosting decentralized finance (DeFi), and strengthening the Internet of Things (IoT). It discusses how blockchain's inherent attributes could signifi… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: Published in International Journal of Scientific and Engineering Research

  29. arXiv:2412.04782  [pdf, other

    cs.AI cs.CE

    A Survey of Sustainability in Large Language Models: Applications, Economics, and Challenges

    Authors: Aditi Singh, Nirmal Prakashbhai Patel, Abul Ehtesham, Saket Kumar, Tala Talaei Khoei

    Abstract: Large Language Models (LLMs) have transformed numerous domains by providing advanced capabilities in natural language understanding, generation, and reasoning. Despite their groundbreaking applications across industries such as research, healthcare, and creative media, their rapid adoption raises critical concerns regarding sustainability. This survey paper comprehensively examines the environment… ▽ More

    Submitted 18 January, 2025; v1 submitted 6 December, 2024; originally announced December 2024.

  30. Your Interest, Your Summaries: Query-Focused Long Video Summarization

    Authors: Nirav Patel, Payal Prajapati, Maitrik Shah

    Abstract: Generating a concise and informative video summary from a long video is important, yet subjective due to varying scene importance. Users' ability to specify scene importance through text queries enhances the relevance of such summaries. This paper introduces an approach for query-focused video summarization, aiming to align video summaries closely with user queries. To this end, we propose the Ful… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: To appear at the 18th International Conference on Control, Automation, Robotics and Vision (ICARCV), December 2024, Dubai, UAE

  31. arXiv:2410.12491  [pdf, ps, other

    cs.CL

    Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning

    Authors: Jared Joselowitz, Ritam Majumdar, Arjun Jagota, Matthieu Bou, Nyal Patel, Satyapriya Krishna, Sonali Parbhoo

    Abstract: Large language models (LLMs) trained with Reinforcement Learning from Human Feedback (RLHF) have demonstrated remarkable capabilities, but their underlying reward functions and decision-making processes remain opaque. This paper introduces a novel approach to interpreting LLMs by applying inverse reinforcement learning (IRL) to recover their implicit reward functions. We conduct experiments on tox… ▽ More

    Submitted 6 October, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at COLM 2025

  32. arXiv:2410.10299  [pdf, other

    cs.RO

    Preliminary Evaluation of an Ultrasound-Guided Robotic System for Autonomous Percutaneous Intervention

    Authors: Pratima Mohan, Aayush Agrawal, Niravkumar A. Patel

    Abstract: Cancer cases have been rising globally, resulting in nearly 10 million deaths in 2023. Biopsy, crucial for diagnosis, is often performed under ultrasound (US) guidance, demanding precise hand coordination and cognitive decision-making. Robot-assisted interventions have shown improved accuracy in lesion targeting by addressing challenges such as noisy 2D images and maintaining consistent probe-to-s… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 7 pages and 6 figures

  33. arXiv:2410.07687  [pdf, other

    cs.LG cs.IT

    Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

    Authors: Niket Patel, Ravid Shwartz-Ziv

    Abstract: Deep neural networks tend to exhibit a bias toward low-rank solutions during training, implicitly learning low-dimensional feature representations. This paper investigates how deep multilayer perceptrons (MLPs) encode these feature manifolds and connects this behavior to the Information Bottleneck (IB) theory. We introduce the concept of local rank as a measure of feature manifold dimensionality a… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted to Compression Workshop @ NeurIPS 2024

  34. arXiv:2410.06239  [pdf, other

    cs.RO

    OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and Open-Vocabulary Semantic Scene Graphs

    Authors: Venkata Naren Devarakonda, Raktim Gautam Goswami, Ali Umut Kaypak, Naman Patel, Rooholla Khorrambakht, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Enabling robots to autonomously navigate unknown, complex, dynamic environments and perform diverse tasks remains a fundamental challenge in developing robust autonomous physical agents. These agents must effectively perceive their surroundings while leveraging world knowledge for decision-making. Although recent approaches utilize vision-language and large language models for scene understanding… ▽ More

    Submitted 22 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  35. arXiv:2410.00702  [pdf, other

    cs.CV

    FlashMix: Fast Map-Free LiDAR Localization via Feature Mixing and Contrastive-Constrained Accelerated Training

    Authors: Raktim Gautam Goswami, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Map-free LiDAR localization systems accurately localize within known environments by predicting sensor position and orientation directly from raw point clouds, eliminating the need for large maps and descriptors. However, their long training times hinder rapid adaptation to new environments. To address this, we propose FlashMix, which uses a frozen, scene-agnostic backbone to extract local point d… ▽ More

    Submitted 27 September, 2024; originally announced October 2024.

  36. arXiv:2409.13571  [pdf, other

    cs.MA cs.AI

    Scalable Multi-agent Reinforcement Learning for Factory-wide Dynamic Scheduling

    Authors: Jaeyeon Jang, Diego Klabjan, Han Liu, Nital S. Patel, Xiuqi Li, Balakrishnan Ananthanarayanan, Husam Dauod, Tzung-Han Juang

    Abstract: Real-time dynamic scheduling is a crucial but notoriously challenging task in modern manufacturing processes due to its high decision complexity. Recently, reinforcement learning (RL) has been gaining attention as an impactful technique to handle this challenge. However, classical RL methods typically rely on human-made dispatching rules, which are not suitable for large-scale factory-wide schedul… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  37. arXiv:2408.08624  [pdf

    cs.CL cs.AI

    RealMedQA: A pilot biomedical question answering dataset containing realistic clinical questions

    Authors: Gregory Kell, Angus Roberts, Serge Umansky, Yuti Khare, Najma Ahmed, Nikhil Patel, Chloe Simela, Jack Coumbe, Julian Rozario, Ryan-Rhys Griffiths, Iain J. Marshall

    Abstract: Clinical question answering systems have the potential to provide clinicians with relevant and timely answers to their questions. Nonetheless, despite the advances that have been made, adoption of these systems in clinical settings has been slow. One issue is a lack of question-answering datasets which reflect the real-world needs of health professionals. In this work, we present RealMedQA, a data… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Accepted at AMIA Annual Symposium 2024

  38. Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation

    Authors: McKell Woodland, Nihil Patel, Austin Castelo, Mais Al Taie, Mohamed Eltaher, Joshua P. Yung, Tucker J. Netherton, Tiffany L. Calderone, Jessica I. Sanchez, Darrel W. Cleere, Ahmed Elsaiey, Nakul Gupta, David Victor, Laura Beretta, Ankit B. Patel, Kristy K. Brock

    Abstract: Clinically deployed deep learning-based segmentation models are known to fail on data outside of their training distributions. While clinicians review the segmentations, these models tend to perform well in most instances, which could exacerbate automation bias. Therefore, detecting out-of-distribution images at inference is critical to warn the clinicians that the model likely failed. This work a… ▽ More

    Submitted 2 October, 2024; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2024:020. Expansion of "Dimensionality Reduction for Improving Out-of-Distribution Detection in Medical Image Segmentation" arXiv:2308.03723. Code available at https://github.com/mckellwoodland/dimen_reduce_mahal (https://zenodo.org/records/13881989)

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024) 2006

  39. arXiv:2407.14790  [pdf, other

    cs.CL cs.AI

    Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?

    Authors: Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral

    Abstract: Solving grid puzzles involves a significant amount of logical reasoning. Hence, it is a good domain to evaluate the reasoning capability of a model which can then guide us to improve the reasoning ability of models. However, most existing works evaluate only the final predicted answer of a puzzle, without delving into an in-depth analysis of the LLMs' reasoning chains (such as where they falter) o… ▽ More

    Submitted 4 October, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

    Comments: Accepted at EMNLP 2024 Main

  40. arXiv:2407.10785  [pdf, other

    eess.IV cs.CV

    Learning biologically relevant features in a pathology foundation model using sparse autoencoders

    Authors: Nhat Minh Le, Ciyue Shen, Neel Patel, Chintan Shah, Darpan Sanghavi, Blake Martin, Alfred Eng, Daniel Shenker, Harshith Padigela, Raymond Biju, Syed Ashar Javed, Jennifer Hipp, John Abel, Harsha Pokkalla, Sean Grullon, Dinkar Juyal

    Abstract: Pathology plays an important role in disease diagnosis, treatment decision-making and drug development. Previous works on interpretability for machine learning models on pathology images have revolved around methods such as attention value visualization and deriving human-interpretable features from model heatmaps. Mechanistic interpretability is an emerging area of model interpretability that foc… ▽ More

    Submitted 16 December, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  41. arXiv:2407.10316  [pdf, ps, other

    cs.DS cs.GT

    Online Matroid Embeddings

    Authors: Andrés Cristi, Paul Dütting, Robert Kleinberg, Renato Paes Leme, Neel Patel

    Abstract: We introduce the notion of an online matroid embedding, which is an algorithm for mapping an unknown matroid that is revealed in an online fashion to a larger-but-known matroid. We establish the existence of such an embedding for binary matroids, and use it to relate variants of the binary matroid secretary problem to each other, showing that seemingly simpler problems are in fact equivalent to se… ▽ More

    Submitted 16 October, 2025; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 45 pages, 5 figures

  42. arXiv:2407.08260  [pdf, other

    cs.CV cs.RO

    SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition

    Authors: Raktim Gautam Goswami, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Large-scale LiDAR mappings and localization leverage place recognition techniques to mitigate odometry drifts, ensuring accurate mapping. These techniques utilize scene representations from LiDAR point clouds to identify previously visited sites within a database. Local descriptors, assigned to each point within a point cloud, are aggregated to form a scene representation for the point cloud. Thes… ▽ More

    Submitted 30 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  43. arXiv:2407.07277  [pdf, other

    cs.LG cs.AI

    Lifestyle-Informed Personalized Blood Biomarker Prediction via Novel Representation Learning

    Authors: A. Ali Heydari, Naghmeh Rezaei, Javier L. Prieto, Shwetak N. Patel, Ahmed A. Metwally

    Abstract: Blood biomarkers are an essential tool for healthcare providers to diagnose, monitor, and treat a wide range of medical conditions. Current reference values and recommended ranges often rely on population-level statistics, which may not adequately account for the influence of inter-individual variability driven by factors such as lifestyle and genetics. In this work, we introduce a novel framework… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  44. arXiv:2406.17169  [pdf, other

    cs.CL cs.AI

    Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models

    Authors: Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral

    Abstract: As Large Language Models (LLMs) continue to exhibit remarkable performance in natural language understanding tasks, there is a crucial need to measure their ability for human-like multi-step logical reasoning. Existing logical reasoning evaluation benchmarks often focus primarily on simplistic single-step or multi-step reasoning with a limited set of inference rules. Furthermore, the lack of datas… ▽ More

    Submitted 6 October, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted at EMNLP 2024 Main

  45. Empowering Tuberculosis Screening with Explainable Self-Supervised Deep Neural Networks

    Authors: Neel Patel, Alexander Wong, Ashkan Ebadi

    Abstract: Tuberculosis persists as a global health crisis, especially in resource-limited populations and remote regions, with more than 10 million individuals newly infected annually. It stands as a stark symbol of inequity in public health. Tuberculosis impacts roughly a quarter of the global populace, with the majority of cases concentrated in eight countries, accounting for two-thirds of all tuberculosi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 9 pages, 3 figures

    Journal ref: 2024 International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA, 2024, pp. 794-797

  46. arXiv:2406.00010  [pdf, other

    cs.IR cs.AI cs.CL

    EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search

    Authors: Kamalkumar Rathinasamy, Jayarama Nettar, Amit Kumar, Vishal Manchanda, Arun Vijayakumar, Ayush Kataria, Venkateshprasanna Manjunath, Chidambaram GS, Jaskirat Singh Sodhi, Shoeb Shaikh, Wasim Akhtar Khan, Prashant Singh, Tanishq Dattatray Ige, Vipin Tiwari, Rajab Ali Mondal, Harshini K, S Reka, Chetana Amancharla, Faiz ur Rahman, Harikrishnan P A, Indraneel Saha, Bhavya Tiwary, Navin Shankar Patel, Pradeep T S, Balaji A J , et al. (2 additional authors not shown)

    Abstract: Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components.… ▽ More

    Submitted 27 September, 2024; v1 submitted 18 May, 2024; originally announced June 2024.

    ACM Class: I.2.7

  47. arXiv:2405.18510  [pdf, other

    cs.AI

    Improved Emotional Alignment of AI and Humans: Human Ratings of Emotions Expressed by Stable Diffusion v1, DALL-E 2, and DALL-E 3

    Authors: James Derek Lomas, Willem van der Maden, Sohhom Bandyopadhyay, Giovanni Lion, Nirmal Patel, Gyanesh Jain, Yanna Litowsky, Haian Xue, Pieter Desmet

    Abstract: Generative AI systems are increasingly capable of expressing emotions via text and imagery. Effective emotional expression will likely play a major role in the efficacy of AI systems -- particularly those designed to support human mental health and wellbeing. This motivates our present research to better understand the alignment of AI expressed emotions with the human perception of emotions. When… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  48. arXiv:2405.14737  [pdf, other

    cs.CV

    CLIPScope: Enhancing Zero-Shot OOD Detection with Bayesian Scoring

    Authors: Hao Fu, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Detection of out-of-distribution (OOD) samples is crucial for safe real-world deployment of machine learning models. Recent advances in vision language foundation models have made them capable of detecting OOD samples without requiring in-distribution (ID) images. However, these zero-shot methods often underperform as they do not adequately consider ID class likelihoods in their detection confiden… ▽ More

    Submitted 10 November, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  49. arXiv:2405.13271  [pdf, other

    cs.PL cs.LO

    Verifying Lock-free Search Structure Templates

    Authors: Nisarg Patel, Dennis Shasha, Thomas Wies

    Abstract: We present and verify template algorithms for lock-free concurrent search structures that cover a broad range of existing implementations based on lists and skiplists. Our linearizability proofs are fully mechanized in the concurrent separation logic Iris. The proofs are modular and cover the broader design space of the underlying algorithms by parameterizing the verification over aspects such as… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Extended version of an article to appear in ECOOP'24

  50. arXiv:2404.15522  [pdf, other

    cs.CL cs.AI

    LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models

    Authors: Mihir Parmar, Nisarg Patel, Neeraj Varshney, Mutsumi Nakamura, Man Luo, Santosh Mashetty, Arindam Mitra, Chitta Baral

    Abstract: Recently developed large language models (LLMs) have been shown to perform remarkably well on a wide range of language understanding tasks. But, can they really "reason" over the natural language? This question has been receiving significant research attention and many reasoning skills such as commonsense, numerical, and qualitative have been studied. However, the crucial skill pertaining to 'logi… ▽ More

    Submitted 6 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted at ACL(Main) 2024 | First version available @ https://openreview.net/forum?id=7NR2ZVzZxx