Skip to main content

Showing 1–50 of 51 results for author: Chaudhary, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.03824  [pdf, ps, other

    cs.AI cs.CL cs.LG cs.MA

    In-Context Environments Induce Evaluation-Awareness in Language Models

    Authors: Maheep Chaudhary

    Abstract: Humans often become more self-aware under threat, yet can lose self-awareness when absorbed in a task; we hypothesize that language models exhibit environment-dependent \textit{evaluation awareness}. This raises concerns that models could strategically underperform, or \textit{sandbag}, to avoid triggering capability-limiting interventions such as unlearning or shutdown. Prior work demonstrates sa… ▽ More

    Submitted 4 March, 2026; originally announced March 2026.

  2. arXiv:2602.18782  [pdf, ps, other

    cs.CR cs.AI cs.CL cs.LG

    MANATEE: Inference-Time Lightweight Diffusion Based Safety Defense for LLMs

    Authors: Chun Yan Ryan Kan, Tommy Tran, Vedant Yadav, Ava Cai, Kevin Zhu, Ruizhe Li, Maheep Chaudhary

    Abstract: Defending LLMs against adversarial jailbreak attacks remains an open challenge. Existing defenses rely on binary classifiers that fail when adversarial input falls outside the learned decision boundary, and repeated fine-tuning is computationally expensive while potentially degrading model capabilities. We propose MANATEE, an inference-time defense that uses density estimation over a benign repres… ▽ More

    Submitted 21 February, 2026; originally announced February 2026.

  3. arXiv:2602.15195  [pdf, ps, other

    cs.CR cs.AI cs.CL cs.LG

    Weight space Detection of Backdoors in LoRA Adapters

    Authors: David Puertolas Merenciano, Ekaterina Vasyagina, Kevin Zhu, Javier Ferrando, Maheep Chaudhary

    Abstract: LoRA adapters let users fine-tune large language models (LLMs) efficiently. However, LoRA adapters are shared through open repositories like Hugging Face Hub \citep{huggingface_hub_docs}, making them vulnerable to backdoor attacks. Current detection methods require running the model with test input data -- making them impractical for screening thousands of adapters where the trigger for backdoor b… ▽ More

    Submitted 7 April, 2026; v1 submitted 16 February, 2026; originally announced February 2026.

  4. arXiv:2602.14444  [pdf, ps, other

    cs.LG cs.AI

    Broken Chains: The Cost of Incomplete Reasoning in LLMs

    Authors: Ian Su, Gaurav Purushothaman, Jey Narayan, Ruhika Goel, Kevin Zhu, Sunishchal Dev, Yash More, Maheep Chaudhary

    Abstract: Reasoning-specialized models like OpenAI's 5.1 and DeepSeek-V3.2 allocate substantial inference compute to extended chain-of-thought (CoT) traces, yet reasoning tokens incur significant costs. How do different reasoning modalities of code, natural language, hybrid, or none do perform under token constraints? We introduce a framework that constrains models to reason exclusively through code, commen… ▽ More

    Submitted 15 February, 2026; originally announced February 2026.

  5. arXiv:2511.23131  [pdf, ps, other

    cs.GR

    Towards Generalized Position-Based Dynamics

    Authors: Manas Chaudhary, Chandradeep Pokhariya, Rahul Narain

    Abstract: The position-based dynamics (PBD) algorithm is a popular and versatile technique for real-time simulation of deformable bodies, but is only applicable to forces that can be expressed as linearly compliant constraints. In this work, we explore a generalization of PBD that is applicable to arbitrary nonlinear force models. We do this by reformulating the implicit time integration equations in terms… ▽ More

    Submitted 28 November, 2025; originally announced November 2025.

  6. arXiv:2511.07772  [pdf, ps, other

    cs.CR cs.AI cs.CL cs.LG

    SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought

    Authors: Shourya Batra, Pierce Tillman, Samarth Gaggar, Shashank Kesineni, Kevin Zhu, Sunishchal Dev, Ashwinee Panda, Vasu Sharma, Maheep Chaudhary

    Abstract: As Large Language Models (LLMs) evolve into personal assistants with access to sensitive user data, they face a critical privacy challenge: while prior work has addressed output-level privacy, recent findings reveal that LLMs often leak private information through their internal reasoning processes, violating contextual privacy expectations. These leaky thoughts occur when models inadvertently exp… ▽ More

    Submitted 20 November, 2025; v1 submitted 10 November, 2025; originally announced November 2025.

  7. arXiv:2511.07482  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Alignment-Constrained Dynamic Pruning for LLMs: Identifying and Preserving Alignment-Critical Circuits

    Authors: Dev Patel, Gabrielle Gervacio, Diekola Raimi, Kevin Zhu, Ryan Lagasse, Gabriel Grand, Ashwinee Panda, Maheep Chaudhary

    Abstract: Large Language Models require substantial computational resources for inference, posing deployment challenges. While dynamic pruning offers superior efficiency over static methods through adaptive circuit selection, it exacerbates alignment degradation by retaining only input-dependent safety-critical circuit preservation across diverse inputs. As a result, addressing these heightened alignment vu… ▽ More

    Submitted 9 November, 2025; originally announced November 2025.

  8. arXiv:2511.06437  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Optimizing Chain-of-Thought Confidence via Topological and Dirichlet Risk Analysis

    Authors: Abhishek More, Anthony Zhang, Nicole Bonilla, Ashvik Vivekan, Kevin Zhu, Parham Sharafoleslami, Maheep Chaudhary

    Abstract: Chain-of-thought (CoT) prompting enables Large Language Models to solve complex problems, but deploying these models safely requires reliable confidence estimates, a capability where existing methods suffer from poor calibration and severe overconfidence on incorrect predictions. We propose Enhanced Dirichlet and Topology Risk (EDTR), a novel decoding strategy that combines topological analysis wi… ▽ More

    Submitted 9 November, 2025; originally announced November 2025.

  9. arXiv:2509.25238  [pdf, ps, other

    cs.LG cs.AI

    PALADIN: Self-Correcting Language Model Agents to Cure Tool-Failure Cases

    Authors: Sri Vatsa Vuddanti, Aarav Shah, Satwik Kumar Chittiprolu, Tony Song, Sunishchal Dev, Kevin Zhu, Maheep Chaudhary

    Abstract: Tool-augmented language agents frequently fail in real-world deployment due to tool malfunctions--timeouts, API exceptions, or inconsistent outputs--triggering cascading reasoning errors and task abandonment. Existing agent training pipelines optimize only for success trajectories, failing to expose models to the tool failures that dominate real-world usage. We propose \textbf{PALADIN}, a generali… ▽ More

    Submitted 25 September, 2025; originally announced September 2025.

  10. arXiv:2509.18116  [pdf, ps, other

    cs.LG cs.AI

    Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization

    Authors: Nathan Egbuna, Saatvik Gaur, Sunishchal Dev, Ashwinee Panda, Maheep Chaudhary

    Abstract: Test-time optimization remains impractical at scale due to prohibitive inference costs--techniques like iterative refinement and multi-step verification can require $10-100\times$ more compute per query than standard decoding. Latent space test-time optimization methods like LatentSeek offer a more direct approach by steering hidden representations, but still demand expensive per-query optimizatio… ▽ More

    Submitted 7 November, 2025; v1 submitted 10 September, 2025; originally announced September 2025.

  11. arXiv:2509.16254  [pdf, ps, other

    q-bio.TO cs.AI

    Imaging Modalities-Based Classification for Lung Cancer Detection

    Authors: Sajim Ahmed, Muhammad Zain Chaudhary, Muhammad Zohaib Chaudhary, Mahmoud Abbass, Ahmed Sherif, Mohammad Mahbubur Rahman Khan Mamun

    Abstract: Lung cancer continues to be the predominant cause of cancer-related mortality globally. This review analyzes various approaches, including advanced image processing methods, focusing on their efficacy in interpreting CT scans, chest radiographs, and biological markers. Notably, we identify critical gaps in the previous surveys, including the need for robust models that can generalize across divers… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

    Comments: Accepted at ICMI 2025

    MSC Class: 68T07; 92C55

  12. arXiv:2509.13334  [pdf, ps, other

    cs.AI cs.LG

    FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness

    Authors: Anand Swaroop, Akshat Nallani, Saksham Uboweja, Adiliia Uzdenova, Michael Nguyen, Kevin Zhu, Sunishchal Dev, Ashwinee Panda, Vasu Sharma, Maheep Chaudhary

    Abstract: Chain-of-thought (CoT) reasoning has emerged as a powerful tool for improving large language model performance on complex tasks, but recent work shows that reasoning steps often fail to causally influence the final answer, creating brittle and untrustworthy outputs. Prior approaches focus primarily on measuring faithfulness, while methods for systematically improving it remain limited. We introduc… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

  13. arXiv:2509.13333  [pdf, ps, other

    cs.AI

    Evaluation Awareness Scales Predictably in Open-Weights Large Language Models

    Authors: Maheep Chaudhary, Ian Su, Nikhil Hooda, Nishith Shankar, Julia Tan, Kevin Zhu, Ryan Lagasse, Vasu Sharma, Ashwinee Panda

    Abstract: Large language models (LLMs) can internally distinguish between evaluation and deployment contexts, a behaviour known as \emph{evaluation awareness}. This undermines AI safety evaluations, as models may conceal dangerous capabilities during testing. Prior work demonstrated this in a single $70$B model, but the scaling relationship across model sizes remains unknown. We investigate evaluation aware… ▽ More

    Submitted 9 November, 2025; v1 submitted 10 September, 2025; originally announced September 2025.

  14. arXiv:2508.15099  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Hydra: A Modular Architecture for Efficient Long-Context Reasoning

    Authors: Siddharth Chaudhary, Dev Patel, Maheep Chaudhary, Bennett Browning

    Abstract: The quadratic complexity of transformers fundamentally limits reasoning system deployment in resource-constrained and long-context settings. We introduce Hydra, a modular architecture based upon a state-space backbone which adaptively routes between complementary efficiency mechanisms: sparse global attention, mixture-of-experts, and dual memories comprising a reasoning workspace and product key m… ▽ More

    Submitted 16 October, 2025; v1 submitted 20 August, 2025; originally announced August 2025.

    Comments: Updated with the new paper accepted to NeurIPS workshop

  15. arXiv:2508.14067  [pdf, ps, other

    cs.CL cs.LG

    Punctuation and Predicates in Language Models

    Authors: Sonakshi Chauhan, Maheep Chaudhary, Koby Choy, Samuel Nellessen, Nandi Schoots

    Abstract: In this paper we explore where information is collected and how it is propagated throughout layers in large language models (LLMs). We begin by examining the surprising computational importance of punctuation tokens which previous work has identified as attention sinks and memory aids. Using intervention-based techniques, we evaluate the necessity and sufficiency (for preserving model performance)… ▽ More

    Submitted 11 August, 2025; originally announced August 2025.

  16. arXiv:2507.21532  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Automatic Classification of User Requirements from Online Feedback -- A Replication Study

    Authors: Meet Bhatt, Nic Boilard, Muhammad Rehan Chaudhary, Cole Thompson, Jacob Idoko, Aakash Sorathiya, Gouri Ginde

    Abstract: Natural language processing (NLP) techniques have been widely applied in the requirements engineering (RE) field to support tasks such as classification and ambiguity detection. Although RE research is rooted in empirical investigation, it has paid limited attention to replicating NLP for RE (NLP4RE) studies. The rapidly advancing realm of NLP is creating new opportunities for efficient, machine-a… ▽ More

    Submitted 29 July, 2025; originally announced July 2025.

    Comments: 10 pages, 3 figures, Replication package available at https://zenodo.org/records/15626782, Accepted at AIRE 2025 (12th International Workshop on Artificial Intelligence and Requirements Engineering)

  17. arXiv:2507.12367  [pdf, ps, other

    cs.SE cs.AI cs.PL

    GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities

    Authors: Diganta Misra, Nizar Islah, Victor May, Brice Rauby, Zihan Wang, Justine Gehring, Antonio Orvieto, Muawiz Chaudhary, Eilif B. Muller, Irina Rish, Samira Ebrahimi Kahou, Massimo Caccia

    Abstract: The rapid evolution of software libraries poses a considerable hurdle for code generation, necessitating continuous adaptation to frequent version updates while preserving backward compatibility. While existing code evolution benchmarks provide valuable insights, they typically lack execution-based evaluation for generating code compliant with specific library versions. To address this, we introdu… ▽ More

    Submitted 21 July, 2025; v1 submitted 16 July, 2025; originally announced July 2025.

    Comments: Version 2 of the dataset from: arXiv:2411.05830

  18. arXiv:2505.16014  [pdf, ps, other

    cs.CL

    Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains

    Authors: Yash Saxena, Ankur Padia, Mandar S Chaudhary, Kalpa Gunaratna, Srinivasan Parthasarathy, Manas Gaur

    Abstract: In sensitive domains, Retrieval-Augmented Generation (RAG) must be interpretable and robust because errors do not just mislead, they invite lawsuits, undermine scholarly credibility, and breach compliance. Stakeholders require traceable evidence, clear rationales for why specific evidence is selected, and safeguards against poisoned or misleading content. Yet current RAG pipelines rely on similari… ▽ More

    Submitted 18 January, 2026; v1 submitted 21 May, 2025; originally announced May 2025.

  19. arXiv:2505.14376  [pdf, ps, other

    cs.CL

    Graph-Guided Passage Retrieval for Author-Centric Structured Feedback

    Authors: Maitreya Prafulla Chitale, Ketaki Mangesh Shetye, Harshit Gupta, Manav Chaudhary, Manish Shrivastava, Vasudeva Varma

    Abstract: Obtaining high-quality, pre-submission feedback is a critical bottleneck in the academic publication lifecycle for researchers. We introduce AutoRev, an automated author-centric feedback system that generates structured, actionable guidance prior to formal peer review. AutoRev employs a graph-based retrieval-augmented generation framework that models each paper as a hierarchical document graph, in… ▽ More

    Submitted 9 January, 2026; v1 submitted 20 May, 2025; originally announced May 2025.

  20. arXiv:2505.14300  [pdf, other

    cs.AI cs.CL cs.LG

    SafetyNet: Detecting Harmful Outputs in LLMs by Modeling and Monitoring Deceptive Behaviors

    Authors: Maheep Chaudhary, Fazl Barez

    Abstract: High-risk industries like nuclear and aviation use real-time monitoring to detect dangerous system conditions. Similarly, Large Language Models (LLMs) need monitoring safeguards. We propose a real-time framework to predict harmful AI outputs before they occur by using an unsupervised approach that treats normal behavior as the baseline and harmful outputs as outliers. Our study focuses specificall… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  21. arXiv:2505.04759  [pdf, ps, other

    cs.SE cs.AI

    Exploring Zero-Shot App Review Classification with ChatGPT: Challenges and Potential

    Authors: Mohit Chaudhary, Chirag Jain, Preethu Rose Anish

    Abstract: App reviews are a critical source of user feedback, offering valuable insights into an app's performance, features, usability, and overall user experience. Effectively analyzing these reviews is essential for guiding app development, prioritizing feature updates, and enhancing user satisfaction. Classifying reviews into functional and non-functional requirements play a pivotal role in distinguishi… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  22. arXiv:2504.16331  [pdf, ps, other

    cs.SE cs.LG

    Can Code Language Models Learn Clarification-Seeking Behaviors?

    Authors: Jie JW Wu, Manav Chaudhary, Davit Abrahamyan, Arhaan Khaku, Anjiang Wei, Fatemeh H. Fard

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in code generation tasks. However, a gap remains between their output and the problem-solving strategies of human developers. Unlike humans, who spend substantial time disambiguating requirements through iterative dialogue, LLMs often generate code despite ambiguities in natural language requirements, leading to unreliable solu… ▽ More

    Submitted 26 September, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  23. arXiv:2502.02470  [pdf, ps, other

    cs.LG cs.AI

    Studying Cross-cluster Modularity in Neural Networks

    Authors: Satvik Golechha, Maheep Chaudhary, Joan Velja, Alessandro Abate, Nandi Schoots

    Abstract: An approach to improve neural network interpretability is via clusterability, i.e., splitting a model into disjoint clusters that can be studied independently. We define a measure for clusterability and show that pre-trained models form highly enmeshed clusters via spectral graph clustering. We thus train models to be more modular using a "clusterability loss" function that encourages the formatio… ▽ More

    Submitted 25 July, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 8 pages, under review. arXiv admin note: text overlap with arXiv:2409.15747 (author note: this is an extension of that paper but has different authors)

  24. arXiv:2412.09269  [pdf, other

    cs.CL cs.AI

    Towards Understanding the Robustness of LLM-based Evaluations under Perturbations

    Authors: Manav Chaudhary, Harshit Gupta, Savita Bhat, Vasudeva Varma

    Abstract: Traditional evaluation metrics like BLEU and ROUGE fall short when capturing the nuanced qualities of generated text, particularly when there is no single ground truth. In this paper, we explore the potential of Large Language Models (LLMs), specifically Google Gemini 1, to serve as automatic evaluators for non-standardized metrics in summarization and dialog-based tasks. We conduct experiments ac… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: Accepted at ICON 2024

  25. arXiv:2411.07567  [pdf, other

    eess.IV cs.CV cs.LG

    Uncertainty-Aware Test-Time Adaptation for Inverse Consistent Diffeomorphic Lung Image Registration

    Authors: Muhammad F. A. Chaudhary, Stephanie M. Aguilera, Arie Nakhmani, Joseph M. Reinhardt, Surya P. Bhatt, Sandeep Bodduluri

    Abstract: Diffeomorphic deformable image registration ensures smooth invertible transformations across inspiratory and expiratory chest CT scans. Yet, in practice, deep learning-based diffeomorphic methods struggle to capture large deformations between inspiratory and expiratory volumes, and therefore lack inverse consistency. Existing methods also fail to account for model uncertainty, which can be useful… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: 5 pages, 4 figures

  26. arXiv:2409.14703  [pdf, other

    cs.LG cs.CL cs.MM

    MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification

    Authors: Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang

    Abstract: The complexity of text-embedded images presents a formidable challenge in machine learning given the need for multimodal understanding of multiple aspects of expression conveyed by them. While previous research in multimodal analysis has primarily focused on singular aspects such as hate speech and its subclasses, this study expands this focus to encompass multiple aspects of linguistics: hate, ta… ▽ More

    Submitted 27 October, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: Accepted to EMNLP 2024 (Main)

  27. arXiv:2409.04478  [pdf, other

    cs.LG cs.AI cs.NE

    Evaluating Open-Source Sparse Autoencoders on Disentangling Factual Knowledge in GPT-2 Small

    Authors: Maheep Chaudhary, Atticus Geiger

    Abstract: A popular new method in mechanistic interpretability is to train high-dimensional sparse autoencoders (SAEs) on neuron activations and use SAE features as the atomic units of analysis. However, the body of evidence on whether SAE feature spaces are useful for causal analysis is underdeveloped. In this work, we use the RAVEL benchmark to evaluate whether SAEs trained on hidden representations of GP… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  28. arXiv:2405.16129  [pdf, other

    cs.CL

    iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

    Authors: Harshit Gupta, Manav Chaudhary, Tathagata Raha, Shivansh Subramanian, Vasudeva Varma

    Abstract: This paper describes our approach for SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense. The BRAINTEASER task comprises multiple-choice Question Answering designed to evaluate the models' lateral thinking capabilities. It consists of Sentence Puzzle and Word Puzzle subtasks that require models to defy default common-sense associations and exhibit unconventional thinking. We propo… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  29. arXiv:2405.11192  [pdf, other

    cs.CL cs.SI

    BrainStorm @ iREL at #SMM4H 2024: Leveraging Translation and Topical Embeddings for Annotation Detection in Tweets

    Authors: Manav Chaudhary, Harshit Gupta, Vasudeva Varma

    Abstract: The proliferation of LLMs in various NLP tasks has sparked debates regarding their reliability, particularly in annotation tasks where biases and hallucinations may arise. In this shared task, we address the challenge of distinguishing annotations made by LLMs from those made by human domain experts in the context of COVID-19 symptom detection from tweets in Latin American Spanish. This paper pres… ▽ More

    Submitted 20 July, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted at SMM4H, colocated at ACL 2024

  30. arXiv:2402.04958  [pdf, other

    cs.CV

    Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation

    Authors: Pedro Vianna, Muawiz Chaudhary, Paria Mehrbod, An Tang, Guy Cloutier, Guy Wolf, Michael Eickenberg, Eugene Belilovsky

    Abstract: Deep neural networks have useful applications in many different tasks, however their performance can be severely affected by changes in the data distribution. For example, in the biomedical field, their performance can be affected by changes in the data (different machines, populations) between training and test datasets. To ensure robustness and generalization to real-world scenarios, test-time a… ▽ More

    Submitted 29 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted at the Conference on Lifelong Learning Agents (CoLLAs) 2024

  31. arXiv:2308.14969  [pdf, other

    cs.LG cs.CV

    Uncovering the Hidden Cost of Model Compression

    Authors: Diganta Misra, Muawiz Chaudhary, Agam Goyal, Bharat Runwal, Pin Yu Chen

    Abstract: In an age dominated by resource-intensive foundation models, the ability to efficiently adapt to downstream tasks is crucial. Visual Prompting (VP), drawing inspiration from the prompting techniques employed in Large Language Models (LLMs), has emerged as a pivotal method for transfer learning in the realm of computer vision. As the importance of efficiency continues to rise, research into model c… ▽ More

    Submitted 15 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Preprint

  32. arXiv:2307.16851  [pdf, other

    cs.LG cs.AI

    Towards Trustworthy and Aligned Machine Learning: A Data-centric Survey with Causality Perspectives

    Authors: Haoyang Liu, Maheep Chaudhary, Haohan Wang

    Abstract: The trustworthiness of machine learning has emerged as a critical topic in the field, encompassing various applications and research areas such as robustness, security, interpretability, and fairness. The last decade saw the development of numerous methods addressing these challenges. In this survey, we systematically review these advancements from a data-centric perspective, highlighting the shor… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 47 pages, 13 figures

  33. arXiv:2307.00132  [pdf, other

    cs.CL

    iMETRE: Incorporating Markers of Entity Types for Relation Extraction

    Authors: N Harsha Vardhan, Manav Chaudhary

    Abstract: Sentence-level relation extraction (RE) aims to identify the relationship between 2 entities given a contextual sentence. While there have been many attempts to solve this problem, the current solutions have a lot of room to improve. In this paper, we approach the task of relationship extraction in the financial dataset REFinD. Our approach incorporates typed entity markers representations and var… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

  34. Machine Reading Comprehension using Case-based Reasoning

    Authors: Dung Thai, Dhruv Agarwal, Mudit Chaudhary, Wenlong Zhao, Rajarshi Das, Manzil Zaheer, Jay-Yoon Lee, Hannaneh Hajishirzi, Andrew McCallum

    Abstract: We present an accurate and interpretable method for answer extraction in machine reading comprehension that is reminiscent of case-based reasoning (CBR) from classical AI. Our method (CBR-MRC) builds upon the hypothesis that contextualized answers to similar questions share semantic similarities with each other. Given a test question, CBR-MRC first retrieves a set of similar cases from a nonparame… ▽ More

    Submitted 5 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures

  35. arXiv:2304.04858  [pdf, other

    cs.LG cs.CV

    Simulated Annealing in Early Layers Leads to Better Generalization

    Authors: Amirmohammad Sarfi, Zahra Karimpour, Muawiz Chaudhary, Nasir M. Khalid, Mirco Ravanelli, Sudhir Mudur, Eugene Belilovsky

    Abstract: Recently, a number of iterative learning methods have been introduced to improve generalization. These typically rely on training for longer periods of time in exchange for improved generalization. LLF (later-layer-forgetting) is a state-of-the-art method in this category. It strengthens learning in early layers by periodically re-initializing the last few layers of the network. Our principal inno… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  36. arXiv:2301.04709  [pdf, other

    cs.AI

    Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability

    Authors: Atticus Geiger, Duligur Ibeling, Amir Zur, Maheep Chaudhary, Sonakshi Chauhan, Jing Huang, Aryaman Arora, Zhengxuan Wu, Noah Goodman, Christopher Potts, Thomas Icard

    Abstract: Causal abstraction provides a theoretical foundation for mechanistic interpretability, the field concerned with providing intelligible algorithms that are faithful simplifications of the known, but opaque low-level details of black box AI models. Our contributions are (1) generalizing the theory of causal abstraction from mechanism replacement (i.e., hard and soft interventions) to arbitrary mecha… ▽ More

    Submitted 8 May, 2025; v1 submitted 11 January, 2023; originally announced January 2023.

  37. arXiv:2209.06067  [pdf, other

    cs.CV cs.AI

    SeRP: Self-Supervised Representation Learning Using Perturbed Point Clouds

    Authors: Siddhant Garg, Mudit Chaudhary

    Abstract: We present SeRP, a framework for Self-Supervised Learning of 3D point clouds. SeRP consists of encoder-decoder architecture that takes perturbed or corrupted point clouds as inputs and aims to reconstruct the original point cloud without corruption. The encoder learns the high-level latent representations of the points clouds in a low-dimensional subspace and recovers the original structure. In th… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 6 pages

  38. arXiv:2205.01907  [pdf, other

    cs.CL

    Cross-lingual Word Embeddings in Hyperbolic Space

    Authors: Chandni Saxena, Mudit Chaudhary, Helen Meng

    Abstract: Cross-lingual word embeddings can be applied to several natural language processing applications across multiple languages. Unlike prior works that use word embeddings based on the Euclidean space, this short paper presents a simple and effective cross-lingual Word2Vec model that adapts to the Poincaré ball model of hyperbolic space to learn unsupervised cross-lingual word representations from a G… ▽ More

    Submitted 25 June, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 6 pages

  39. arXiv:2204.08554  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

    Authors: Dung Thai, Srinivas Ravishankar, Ibrahim Abdelaziz, Mudit Chaudhary, Nandana Mihindukulasooriya, Tahira Naseem, Rajarshi Das, Pavan Kapanipathi, Achille Fokoue, Andrew McCallum

    Abstract: Knowledge bases (KBs) are often incomplete and constantly changing in practice. Yet, in many question answering applications coupled with knowledge bases, the sparse nature of KBs is often overlooked. To this end, we propose a case-based reasoning approach, CBR-iKB, for knowledge base question answering (KBQA) with incomplete-KB as our main focus. Our method ensembles decisions from multiple reaso… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 8 pages, 3 figurs, 4 tables

  40. arXiv:2110.07878  [pdf, other

    eess.IV cs.CV

    Single volume lung biomechanics from chest computed tomography using a mode preserving generative adversarial network

    Authors: Muhammad F. A. Chaudhary, Sarah E. Gerard, Di Wang, Gary E. Christensen, Christopher B. Cooper, Joyce D. Schroeder, Eric A. Hoffman, Joseph M. Reinhardt

    Abstract: Local tissue expansion of the lungs is typically derived by registering computed tomography (CT) scans acquired at multiple lung volumes. However, acquiring multiple scans incurs increased radiation dose, time, and cost, and may not be possible in many cases, thus restricting the applicability of registration-based biomechanics. We propose a generative adversarial learning approach for estimating… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 5 pages, 5 figures

  41. arXiv:2109.02941  [pdf, other

    cs.CL cs.LG

    Countering Online Hate Speech: An NLP Perspective

    Authors: Mudit Chaudhary, Chandni Saxena, Helen Meng

    Abstract: Online hate speech has caught everyone's attention from the news related to the COVID-19 pandemic, US elections, and worldwide protests. Online toxicity - an umbrella term for online hateful behavior, manifests itself in forms such as online hate speech. Hate speech is a deliberate attack directed towards an individual or a group motivated by the targeted entity's identity or opinions. The rising… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: 12 pages

  42. arXiv:2108.06206  [pdf, other

    cs.IR cs.LG

    An Intelligent Recommendation-cum-Reminder System

    Authors: Rohan Saxena, Maheep Chaudhary, Chandresh Kumar Maurya, Shitala Prasad

    Abstract: Intelligent recommendation and reminder systems are the need of the fast-pacing life. Current intelligent systems such as Siri, Google Assistant, Microsoft Cortona, etc., have limited capability. For example, if you want to wake up at 6 am because you have an upcoming trip, you have to set the alarm manually. Besides, these systems do not recommend or remind what else to carry, such as carrying an… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 9

  43. arXiv:2107.09539  [pdf, other

    cs.LG eess.SP

    Parametric Scattering Networks

    Authors: Shanel Gauthier, Benjamin Thérien, Laurent Alsène-Racicot, Muawiz Chaudhary, Irina Rish, Eugene Belilovsky, Michael Eickenberg, Guy Wolf

    Abstract: The wavelet scattering transform creates geometric invariants and deformation stability. In multiple signal domains, it has been shown to yield more discriminative representations compared to other non-learned representations and to outperform learned representations in certain tasks, particularly on limited labeled data and highly structured signals. The wavelet filters used in the scattering tra… ▽ More

    Submitted 15 August, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    ACM Class: F.2.2; I.2.7

  44. arXiv:2102.09024  [pdf, other

    cs.LG cs.AI

    Deep Learning Approaches for Forecasting Strawberry Yields and Prices Using Satellite Images and Station-Based Soil Parameters

    Authors: Mohita Chaudhary, Mohamed Sadok Gastli, Lobna Nassar, Fakhri Karray

    Abstract: Computational tools for forecasting yields and prices for fresh produce have been based on traditional machine learning approaches or time series modelling. We propose here an alternate approach based on deep learning algorithms for forecasting strawberry yields and prices in Santa Barbara county, California. Building the proposed forecasting model comprises three stages: first, the station-based… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Paper Accepted in Association for the Advancement of Artificial Intelligence (AAAI) Spring Symposium on 21st Jan, 2021

    Journal ref: AAAI 2021 Spring Symposium on Combining Machine Learning and Knowledge Engineering (AAAI-MAKE 2021)

  45. arXiv:2101.06066  [pdf, other

    cs.CL cs.AI

    Unstructured Knowledge Access in Task-oriented Dialog Modeling using Language Inference, Knowledge Retrieval and Knowledge-Integrative Response Generation

    Authors: Mudit Chaudhary, Borislav Dzodzo, Sida Huang, Chun Hei Lo, Mingzhi Lyu, Lun Yiu Nie, Jinbo Xing, Tianhua Zhang, Xiaoying Zhang, Jingyan Zhou, Hong Cheng, Wai Lam, Helen Meng

    Abstract: Dialog systems enriched with external knowledge can handle user queries that are outside the scope of the supporting databases/APIs. In this paper, we follow the baseline provided in DSTC9 Track 1 and propose three subsystems, KDEAK, KnowleDgEFactor, and Ens-GPT, which form the pipeline for a task-oriented dialog system capable of accessing unstructured knowledge. Specifically, KDEAK performs know… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

  46. arXiv:2008.11643  [pdf, other

    cs.LG stat.ML

    HydaLearn: Highly Dynamic Task Weighting for Multi-task Learning with Auxiliary Tasks

    Authors: Sam Verboven, Muhammad Hafeez Chaudhary, Jeroen Berrevoets, Wouter Verbeke

    Abstract: Multi-task learning (MTL) can improve performance on a task by sharing representations with one or more related auxiliary-tasks. Usually, MTL-networks are trained on a composite loss function formed by a constant weighted combination of the separate task losses. In practice, constant loss weights lead to poor results for two reasons: (i) the relevance of the auxiliary tasks can gradually drift thr… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

  47. arXiv:2008.07092  [pdf, other

    cs.LG q-bio.NC stat.ML

    Understanding Brain Dynamics for Color Perception using Wearable EEG headband

    Authors: Mahima Chaudhary, Sumona Mukhopadhyay, Marin Litoiu, Lauren E Sergio, Meaghan S Adams

    Abstract: The perception of color is an important cognitive feature of the human brain. The variety of colors that impinge upon the human eye can trigger changes in brain activity which can be captured using electroencephalography (EEG). In this work, we have designed a multiclass classification model to detect the primary colors from the features of raw EEG signals. In contrast to previous research, our me… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: 10 pages,10 figures, Conference- EVOKE CASCON 2020

    Journal ref: Proceedings of 30th Annual International Conference on Computer Science and Software Engineering 2020

  48. arXiv:2007.09181  [pdf, other

    cs.CY cs.LG cs.SI stat.AP

    Network Learning Approaches to study World Happiness

    Authors: Siddharth Dixit, Meghna Chaudhary, Niteesh Sahni

    Abstract: The United Nations in its 2011 resolution declared the pursuit of happiness a fundamental human goal and proposed public and economic policies centered around happiness. In this paper we used 2 types of computational strategies viz. Predictive Modelling and Bayesian Networks (BNs) to model the processed historical happiness index data of 156 nations published by UN since 2012. We attacked the prob… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 13 Pages, 8 figures

  49. arXiv:1812.11214  [pdf, ps, other

    cs.LG cs.CV cs.SD eess.AS stat.ML

    Kymatio: Scattering Transforms in Python

    Authors: Mathieu Andreux, Tomás Angles, Georgios Exarchakis, Roberto Leonarduzzi, Gaspar Rochette, Louis Thiry, John Zarka, Stéphane Mallat, Joakim andén, Eugene Belilovsky, Joan Bruna, Vincent Lostanlen, Muawiz Chaudhary, Matthew J. Hirn, Edouard Oyallon, Sixin Zhang, Carmine Cella, Michael Eickenberg

    Abstract: The wavelet scattering transform is an invariant signal representation suitable for many signal processing and machine learning applications. We present the Kymatio software package, an easy-to-use, high-performance Python implementation of the scattering transform in 1D, 2D, and 3D that is compatible with modern deep learning frameworks. All transforms may be executed on a GPU (in addition to CPU… ▽ More

    Submitted 31 May, 2022; v1 submitted 28 December, 2018; originally announced December 2018.

  50. arXiv:1805.02679  [pdf

    cs.CV

    Multichannel Distributed Local Pattern for Content Based Indexing and Retrieval

    Authors: Sonakshi Mathur, Mallika Chaudhary, Hemant Verma, Murari Mandal, S. K. Vipparthi, Subrahmanyam Murala

    Abstract: A novel color feature descriptor, Multichannel Distributed Local Pattern (MDLP) is proposed in this manuscript. The MDLP combines the salient features of both local binary and local mesh patterns in the neighborhood. The multi-distance information computed by the MDLP aids in robust extraction of the texture arrangement. Further, MDLP features are extracted for each color channel of an image. The… ▽ More

    Submitted 7 May, 2018; originally announced May 2018.

    Comments: Accepted in INDICON-2017