Skip to main content

Showing 1–50 of 263 results for author: Kulkarni, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.18308  [pdf, ps, other

    cs.RO

    Proprioceptive-only State Estimation for Legged Robots with Set-Coverage Measurements of Learned Dynamics

    Authors: Abhijeet M. Kulkarni, Ioannis Poulakakis, Guoquan Huang

    Abstract: Proprioceptive-only state estimation is attractive for legged robots since it is computationally cheaper and is unaffected by perceptually degraded conditions. The history of joint-level measurements contains rich information that can be used to infer the dynamics of the system and subsequently produce navigational measurements. Recent approaches produce these estimates with learned measurement mo… ▽ More

    Submitted 18 March, 2026; originally announced March 2026.

  2. arXiv:2603.10465  [pdf, ps, other

    cs.SD cs.CV cs.HC

    MoXaRt: Audio-Visual Object-Guided Sound Interaction for XR

    Authors: Tianyu Xu, Sieun Kim, Qianhui Zheng, Ruoyu Xu, Tejasvi Ravi, Anuva Kulkarni, Katrina Passarella-Ward, Junyi Zhu, Adarsh Kowdle

    Abstract: In Extended Reality (XR), complex acoustic environments often overwhelm users, compromising both scene awareness and social engagement due to entangled sound sources. We introduce MoXaRt, a real-time XR system that uses audio-visual cues to separate these sources and enable fine-grained sound interaction. MoXaRt's core is a cascaded architecture that performs coarse, audio-only separation in paral… ▽ More

    Submitted 11 March, 2026; originally announced March 2026.

    ACM Class: H.5.1; H.5.2; H.5.5; I.2.7; I.2.10

  3. arXiv:2603.06164  [pdf, ps, other

    cs.SD cs.AI cs.CL

    Do Compact SSL Backbones Matter for Audio Deepfake Detection? A Controlled Study with RAPTOR

    Authors: Ajinkya Kulkarni, Sandipana Dowerah, Atharva Kulkarni, Tanel Alumäe, Mathew Magimai Doss

    Abstract: Self-supervised learning (SSL) underpins modern audio deepfake detection, yet most prior work centers on a single large wav2vec2-XLSR backbone, leaving compact under studied. We present RAPTOR, Representation Aware Pairwise-gated Transformer for Out-of-domain Recognition a controlled study of compact SSL backbones from the HuBERT and WavLM within a unified pairwise-gated fusion detector, evaluated… ▽ More

    Submitted 6 March, 2026; originally announced March 2026.

    Comments: Submitted to Interspeech 2026, 4 pages, 2 figures

  4. arXiv:2603.02268  [pdf, ps, other

    cs.LG cs.AI

    PRISM: Exploring Heterogeneous Pretrained EEG Foundation Model Transfer to Clinical Differential Diagnosis

    Authors: Jeet Bandhu Lahiri, Parshva Runwal, Arvasu Kulkarni, Mahir Jain, Aditya Ray Mishra, Siddharth Panwar, Sandeep Singh

    Abstract: EEG foundation models are typically pretrained on narrow-source clinical archives and evaluated on benchmarks from the same ecosystem, leaving unclear whether representations encode neural physiology or recording-distribution artifacts. We introduce PRISM (Population Representative Invariant Signal Model), a masked autoencoder ablated along two axes -- pretraining population and downstream adaptat… ▽ More

    Submitted 28 February, 2026; originally announced March 2026.

    Comments: 14 pages, 1 figure, 5 tables

  5. arXiv:2602.24036  [pdf, ps, other

    cs.HC

    Designing AI Tutors for Interest-Based Learning: Insights from Human Instructors

    Authors: Abhishek Kulkarni, Sharon Lynn Chu

    Abstract: Interest-based learning (IBL) is a paradigm of instruction in which educational content is contextualized using learners' interests to enhance content relevance. IBL has been shown to result in improved learning outcomes. Unfortunately, high effort is needed for instructors to design and deliver IBL content for individual students. LLMs in the form of AI tutors may allow for IBL to scale across ma… ▽ More

    Submitted 27 February, 2026; originally announced February 2026.

  6. arXiv:2602.22362  [pdf

    cs.HC

    E3VA: Enhancing Emotional Expressiveness in Virtual Conversational Agents

    Authors: Abhishek Kulkarni, Alexander Barquero, Pavitra Lahari, Aryaan Shaikh, Sarah Brown

    Abstract: With the advent of generative AI and large language models, embodied conversational agents are becoming synonymous with online interactions. These agents possess vast amounts of knowledge but suffer from exhibiting limited emotional expressiveness. Without adequate expressions, agents might fail to adapt to users' emotions, which may result in a sub-optimal user experience and engagement. Most cur… ▽ More

    Submitted 25 February, 2026; originally announced February 2026.

    Comments: 5 pages

  7. arXiv:2602.20433  [pdf, ps, other

    cs.CL

    Disentangling Geometry, Performance, and Training in Language Models

    Authors: Atharva Kulkarni, Jacob Mitchell Springer, Arjun Subramonian, Swabha Swayamdipta

    Abstract: Geometric properties of Transformer weights, particularly the unembedding matrix, have been widely useful in language model interpretability research. Yet, their utility for estimating downstream performance remains unclear. In this work, we systematically investigate the relationship between model performance and the unembedding matrix geometry, particularly its effective rank. Our experiments, i… ▽ More

    Submitted 23 February, 2026; originally announced February 2026.

  8. arXiv:2602.16430  [pdf, ps, other

    cs.CV cs.AI

    Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems

    Authors: Ali Faraz, Raja Kolla, Ashish Kulkarni, Shubham Agarwal

    Abstract: Designing Optical Character Recognition (OCR) systems for India requires balancing linguistic diversity, document heterogeneity, and deployment constraints. In this paper, we study two training strategies for building multilingual OCR systems with Vision-Language Models through the Chitrapathak series. We first follow a popular multimodal approach, pairing a generic vision encoder with a strong mu… ▽ More

    Submitted 18 February, 2026; originally announced February 2026.

  9. arXiv:2602.07190  [pdf, ps, other

    cs.CL cs.AI

    Long-Context Long-Form Question Answering for Legal Domain

    Authors: Anagha Kulkarni, Parin Rajesh Jhaveri, Prasha Shrestha, Yu Tong Han, Reza Amini, Behrouz Madahian

    Abstract: Legal documents have complex document layouts involving multiple nested sections, lengthy footnotes and further use specialized linguistic devices like intricate syntax and domain-specific vocabulary to ensure precision and authority. These inherent characteristics of legal documents make question answering challenging, and particularly so when the answer to the question spans several pages (i.e.… ▽ More

    Submitted 6 February, 2026; originally announced February 2026.

    Comments: EACL 2026

  10. arXiv:2602.05059  [pdf, ps, other

    cs.AI

    Evaluating Large Language Models on Solved and Unsolved Problems in Graph Theory: Implications for Computing Education

    Authors: Adithya Kulkarni, Mohna Chakraborty, Jay Bagga

    Abstract: Large Language Models are increasingly used by students to explore advanced material in computer science, including graph theory. As these tools become integrated into undergraduate and graduate coursework, it is important to understand how reliably they support mathematically rigorous thinking. This study examines the performance of a LLM on two related graph theoretic problems: a solved problem… ▽ More

    Submitted 4 February, 2026; originally announced February 2026.

  11. arXiv:2601.01101  [pdf, ps, other

    cs.CY

    An Agentic Software Framework for Data Governance under DPDP

    Authors: Apurva Kulkarni, Chandrashekar Ramanathan

    Abstract: Despite the rise of data-driven software systems in the modern digital landscape, data governance under a legal framework remains a critical challenge. In India, the Digital Personal Data Protection (DPDP) Act mandates rigorous data privacy and compliance requirements, necessitating software frameworks that are both ethical and regulation-aware. From a software development perspective, traditional… ▽ More

    Submitted 3 January, 2026; originally announced January 2026.

  12. arXiv:2512.23684  [pdf, ps, other

    cs.CL cs.AI

    Multilingual Hidden Prompt Injection Attacks on LLM-Based Academic Reviewing

    Authors: Panagiotis Theocharopoulos, Ajinkya Kulkarni, Mathew Magimai. -Doss

    Abstract: Large language models (LLMs) are increasingly considered for use in high-impact workflows, including academic peer review. However, LLMs are vulnerable to document-level hidden prompt injection attacks. In this work, we construct a dataset of approximately 500 real academic papers accepted to ICML and evaluate the effect of embedding hidden adversarial prompts within these documents. Each paper is… ▽ More

    Submitted 29 December, 2025; originally announced December 2025.

  13. arXiv:2512.10805  [pdf, ps, other

    cs.LG cs.CV

    Interpretable and Steerable Concept Bottleneck Sparse Autoencoders

    Authors: Akshay Kulkarni, Tsui-Wei Weng, Vivek Narayanaswamy, Shusen Liu, Wesam A. Sakla, Kowshik Thopalli

    Abstract: Sparse autoencoders (SAEs) promise a unified approach for mechanistic interpretability, concept discovery, and model steering in LLMs and LVLMs. However, realizing this potential requires learned features to be both interpretable and steerable. To that end, we introduce two new computationally inexpensive interpretability and steerability metrics for a systematic analysis of LVLM SAEs. This uncove… ▽ More

    Submitted 30 March, 2026; v1 submitted 11 December, 2025; originally announced December 2025.

    Comments: CVPR 2026

  14. arXiv:2512.07819  [pdf, ps, other

    cs.RO

    Efficient and Compliant Control Framework for Versatile Human-Humanoid Collaborative Transportation

    Authors: Shubham S. Kumbhar, Abhijeet M. Kulkarni, Panagiotis Artemiadis

    Abstract: We present a control framework that enables humanoid robots to perform collaborative transportation tasks with a human partner. The framework supports both translational and rotational motions, which are fundamental to co-transport scenarios. It comprises three components: a high-level planner, a low-level controller, and a stiffness modulation mechanism. At the planning level, we introduce the In… ▽ More

    Submitted 8 December, 2025; originally announced December 2025.

  15. arXiv:2512.03073  [pdf, ps, other

    cs.CY cs.AI

    Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem

    Authors: Shayne Longpre, Christopher Akiki, Campbell Lund, Atharva Kulkarni, Emily Chen, Irene Solaiman, Avijit Ghosh, Yacine Jernite, Lucie-Aimée Kaffee

    Abstract: Since 2019, the Hugging Face Model Hub has been the primary global platform for sharing open weight AI models. By releasing a dataset of the complete history of weekly model downloads (June 2020-August 2025) alongside model metadata, we provide the most rigorous examination to-date of concentration dynamics and evolving characteristics in the open model economy. Our analysis spans 851,000 models,… ▽ More

    Submitted 27 November, 2025; originally announced December 2025.

  16. arXiv:2511.23088  [pdf, ps, other

    cs.CL

    Accent Placement Models for Rigvedic Sanskrit Text

    Authors: Akhil Rajeev P, Annarao Kulkarni

    Abstract: The Rigveda, among the oldest Indian texts in Vedic Sanskrit, employs a distinctive pitch-accent system : udātta, anudātta, svarita whose marks encode melodic and interpretive cues but are often absent from modern e-texts. This work develops a parallel corpus of accented-unaccented ślokas and conducts a controlled comparison of three strategies for automatic accent placement in Rigvedic verse: (i)… ▽ More

    Submitted 28 November, 2025; originally announced November 2025.

    Comments: Submitted to AACL-IJCNLP 2025

  17. arXiv:2511.10338  [pdf, ps, other

    cs.CL cs.AI

    BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages

    Authors: Guduru Manoj, Neel Prabhanjan Rachamalla, Ashish Kulkarni, Gautam Rajeev, Jay Piplodiya, Arul Menezes, Shaharukh Khan, Souvik Rana, Manya Sah, Chandra Khatri, Shubham Agarwal

    Abstract: In the context of pretraining of Large Language Models (LLMs), synthetic data has emerged as an alternative for generating high-quality pretraining data at scale. This is particularly beneficial in low-resource language settings where the benefits of recent LLMs have been unevenly distributed across languages. In this work, we present a systematic study on the generation and evaluation of syntheti… ▽ More

    Submitted 16 November, 2025; v1 submitted 13 November, 2025; originally announced November 2025.

  18. arXiv:2511.09955  [pdf, ps, other

    cs.CV

    Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching

    Authors: Uday Bhaskar, Rishabh Bhattacharya, Avinash Patel, Sarthak Khoche, Praveen Anil Kulkarni, Naresh Manwani

    Abstract: Foundation models, especially vision-language models (VLMs), offer compelling zero-shot object detection for applications like autonomous driving, a domain where manual labelling is prohibitively expensive. However, their detection latency and tendency to hallucinate predictions render them unsuitable for direct deployment. This work introduces a novel pipeline that addresses this challenge by lev… ▽ More

    Submitted 12 November, 2025; originally announced November 2025.

  19. arXiv:2511.05810  [pdf, ps, other

    cs.AI cs.CL cs.LG

    DiagnoLLM: A Hybrid Bayesian Neural Language Framework for Interpretable Disease Diagnosis

    Authors: Bowen Xu, Xinyue Zeng, Jiazhen Hu, Tuo Wang, Adithya Kulkarni

    Abstract: Building trustworthy clinical AI systems requires not only accurate predictions but also transparent, biologically grounded explanations. We present \texttt{DiagnoLLM}, a hybrid framework that integrates Bayesian deconvolution, eQTL-guided deep learning, and LLM-based narrative generation for interpretable disease diagnosis. DiagnoLLM begins with GP-unmix, a Gaussian Process-based hierarchical mod… ▽ More

    Submitted 16 November, 2025; v1 submitted 7 November, 2025; originally announced November 2025.

  20. arXiv:2511.03237  [pdf, ps, other

    cs.CL

    MUTANT: A Recipe for Multilingual Tokenizer Design

    Authors: Souvik Rana, Arul Menezes, Ashish Kulkarni, Chandra Khatri, Shubham Agarwal

    Abstract: Tokenizers play a crucial role in determining the performance, training efficiency, and the inference cost of Large Language Models (LLMs). Designing effective tokenizers for multilingual LLMs is particularly challenging due to diverse scripts and rich morphological variation. While subword methods like Byte Pair Encoding (BPE) are widely adopted, their effectiveness in multilingual settings remai… ▽ More

    Submitted 22 March, 2026; v1 submitted 5 November, 2025; originally announced November 2025.

  21. arXiv:2510.22789  [pdf, ps, other

    cs.RO

    Learning Neural Observer-Predictor Models for Limb-level Sampling-based Locomotion Planning

    Authors: Abhijeet M. Kulkarni, Ioannis Poulakakis, Guoquan Huang

    Abstract: Accurate full-body motion prediction is essential for the safe, autonomous navigation of legged robots, enabling critical capabilities like limb-level collision checking in cluttered environments. Simplified kinematic models often fail to capture the complex, closed-loop dynamics of the robot and its low-level controller, limiting their predictions to simple planar motion. To address this, we pres… ▽ More

    Submitted 26 October, 2025; originally announced October 2025.

  22. arXiv:2510.13670  [pdf, ps, other

    cs.CV

    NTIRE 2025 Challenge on Low Light Image Enhancement: Methods and Results

    Authors: Xiaoning Liu, Zongwei Wu, Florin-Alexandru Vasluianu, Hailong Yan, Bin Ren, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Kangbiao Shi, Yixu Feng, Tao Hu, Yu Cao, Peng Wu, Yijin Liang, Yanning Zhang, Qingsen Yan, Han Zhou, Wei Dong, Yan Min, Mohab Kishawy, Jun Chen, Pengpeng Yu, Anjin Park , et al. (80 additional authors not shown)

    Abstract: This paper presents a comprehensive review of the NTIRE 2025 Low-Light Image Enhancement (LLIE) Challenge, highlighting the proposed solutions and final outcomes. The objective of the challenge is to identify effective networks capable of producing brighter, clearer, and visually compelling images under diverse and challenging conditions. A remarkable total of 762 participants registered for the c… ▽ More

    Submitted 15 October, 2025; originally announced October 2025.

    Comments: CVPR NTIRE 2025 Workshop, please refer to https://openaccess.thecvf.com/CVPR2025_workshops/NTIRE

  23. arXiv:2510.13485  [pdf, ps, other

    cs.IT

    Non-Linear Precoding via Dirty Paper Coding for Near-Field Downlink MISO Communications

    Authors: Akash Kulkarni, Rajshekhar V Bhat

    Abstract: In 6G systems, extremely large-scale antenna arrays operating at terahertz frequencies extend the near-field region to typical user distances from the base station, enabling near-field communication (NFC) with fine spatial resolution through beamfocusing. Existing multiuser NFC systems predominantly employ linear precoding techniques such as zero-forcing (ZF), which suffer from performance degrada… ▽ More

    Submitted 20 November, 2025; v1 submitted 15 October, 2025; originally announced October 2025.

  24. arXiv:2510.09062  [pdf, ps, other

    cs.CL

    ReFIne: A Framework for Trustworthy Large Reasoning Models with Reliability, Faithfulness, and Interpretability

    Authors: Chung-En Sun, Ge Yan, Akshay Kulkarni, Tsui-Wei Weng

    Abstract: Recent advances in long chain-of-thought (CoT) reasoning have largely prioritized answer accuracy and token efficiency, while overlooking aspects critical to trustworthiness. We argue that usable reasoning systems must be trustworthy, characterized by three properties: interpretability, faithfulness, and reliability. To this end, we propose ReFIne, a new training framework that integrates supervis… ▽ More

    Submitted 10 October, 2025; originally announced October 2025.

  25. arXiv:2510.08571  [pdf, ps, other

    cs.RO cs.CV

    Scalable Offline Metrics for Autonomous Driving

    Authors: Animikh Aich, Adwait Kulkarni, Eshed Ohn-Bar

    Abstract: Real-world evaluation of perception-based planning models for robotic systems, such as autonomous vehicles, can be safely and inexpensively conducted offline, i.e. by computing model prediction error over a pre-collected validation dataset with ground-truth annotations. However, extrapolating from offline model performance to online settings remains a challenge. In these settings, seemingly minor… ▽ More

    Submitted 9 November, 2025; v1 submitted 9 October, 2025; originally announced October 2025.

    Comments: Accepted at IROS 2025 (IEEE/RSJ International Conference on Intelligent Robots and Systems); typos corrected

  26. arXiv:2510.07978  [pdf, ps, other

    cs.AI cs.CL cs.LG

    VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

    Authors: Dhruv Jain, Harshit Shukla, Gautam Rajeev, Ashish Kulkarni, Chandra Khatri, Shubham Agarwal

    Abstract: Large scale Speech Language Models have enabled voice assistants capable of understanding natural spoken queries and performing complex tasks. However, existing speech benchmarks largely focus on isolated capabilities such as transcription or question answering and do not systematically evaluate agentic behavior or adversarial robustness. To address this, we introduce VoiceAgentBench, a comprehens… ▽ More

    Submitted 13 February, 2026; v1 submitted 9 October, 2025; originally announced October 2025.

  27. arXiv:2510.07000  [pdf, ps, other

    cs.CL cs.AI

    Pragyaan: Designing and Curating High-Quality Cultural Post-Training Datasets for Indian Languages

    Authors: Neel Prabhanjan Rachamalla, Aravind Konakalla, Gautam Rajeev, Ashish Kulkarni, Chandra Khatri, Shubham Agarwal

    Abstract: The effectiveness of Large Language Models (LLMs) depends heavily on the availability of high-quality post-training data, particularly instruction-tuning and preference-based examples. Existing open-source datasets, however, often lack multilingual coverage, cultural grounding, and suffer from task diversity gaps that are especially pronounced for Indian languages. We introduce a human-in-the-loop… ▽ More

    Submitted 8 October, 2025; originally announced October 2025.

    Comments: EMNLP 2025

  28. arXiv:2510.04983   

    cs.CL cs.AI cs.CY cs.LG

    AWARE, Beyond Sentence Boundaries: A Contextual Transformer Framework for Identifying Cultural Capital in STEM Narratives

    Authors: Khalid Mehtab Khan, Anagha Kulkarni

    Abstract: Identifying cultural capital (CC) themes in student reflections can offer valuable insights that help foster equitable learning environments in classrooms. However, themes such as aspirational goals or family support are often woven into narratives, rather than appearing as direct keywords. This makes them difficult to detect for standard NLP models that process sentences in isolation. The core ch… ▽ More

    Submitted 3 November, 2025; v1 submitted 6 October, 2025; originally announced October 2025.

    Comments: The authors are withdrawing this version to correct issues identified in the experimental design and analysis. A revised and validated version will be submitted after further review

  29. arXiv:2509.22937  [pdf, ps, other

    cs.RO

    DBF-MA: A Differential Bayesian Filtering Planner for Multi-Agent Autonomous Racing Overtakes

    Authors: Trent Weiss, Amar Kulkarni, Madhur Behl

    Abstract: A significant challenge in autonomous racing is to generate overtaking maneuvers. Racing agents must execute these maneuvers on complex racetracks with little room for error. Optimization techniques and graph-based methods have been proposed, but these methods often rely on oversimplified assumptions for collision-avoidance and dynamic constraints. In this work, we present an approach to trajector… ▽ More

    Submitted 1 October, 2025; v1 submitted 26 September, 2025; originally announced September 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  30. arXiv:2509.19941  [pdf, ps, other

    cs.CL cs.AI

    CorIL: Towards Enriching Indian Language to Indian Language Parallel Corpora and Machine Translation Systems

    Authors: Soham Bhattacharjee, Mukund K Roy, Yathish Poojary, Bhargav Dave, Mihir Raj, Vandan Mujadia, Baban Gain, Pruthwik Mishra, Arafat Ahsan, Parameswari Krishnamurthy, Ashwath Rao, Gurpreet Singh Josan, Preeti Dubey, Aadil Amin Kak, Anna Rao Kulkarni, Narendra VG, Sunita Arora, Rakesh Balbantray, Prasenjit Majumdar, Karunesh K Arora, Asif Ekbal, Dipti Mishra Sharma

    Abstract: India's linguistic landscape is one of the most diverse in the world, comprising over 120 major languages and approximately 1,600 additional languages, with 22 officially recognized as scheduled languages in the Indian Constitution. Despite recent progress in multilingual neural machine translation (NMT), high-quality parallel corpora for Indian languages remain scarce, especially across varied do… ▽ More

    Submitted 24 September, 2025; originally announced September 2025.

  31. arXiv:2509.16648  [pdf, ps, other

    cs.AI cs.CL cs.LG

    FESTA: Functionally Equivalent Sampling for Trust Assessment of Multimodal LLMs

    Authors: Debarpan Bhattacharya, Apoorva Kulkarni, Sriram Ganapathy

    Abstract: The accurate trust assessment of multimodal large language models (MLLMs) generated predictions, which can enable selective prediction and improve user confidence, is challenging due to the diverse multi-modal input paradigms. We propose Functionally Equivalent Sampling for Trust Assessment (FESTA), a multimodal input sampling technique for MLLMs, that generates an uncertainty measure based on the… ▽ More

    Submitted 30 January, 2026; v1 submitted 20 September, 2025; originally announced September 2025.

    Comments: Accepted in the Findings of EMNLP, 2025

    Journal ref: EMNLP 2025

  32. arXiv:2509.13721  [pdf

    cs.NE

    Snail Homing and Mating Search Algorithm for Weight Optimization of Stepped-Transmission Shaft

    Authors: Kaustav Saha, Ishaan R Kale, Vivek Patel, Anand J Kulkarni, Puskaraj D Sonawwanay

    Abstract: In this paper, the steeped-transmission shaft design problem is proposed for weight optimization. The bio-inspired search-based Snail Homing and Mating Search (SHMS) algorithm is utilized to solve the problem. It is inspired by the social behaviour of snails and their inherent nature of finding better homes, and mate. The proposed steeped-transmission shaft design problem is modelled considering t… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  33. arXiv:2509.11123  [pdf, ps, other

    cs.CR

    ODoQ: Oblivious DNS-over-QUIC

    Authors: Aditya Kulkarni, Tamal Das, Vivek Balachandran

    Abstract: The Domain Name System (DNS), which converts domain names to their respective IP addresses, has advanced enhancements aimed at safeguarding DNS data and users' identity from attackers. The recent privacy-focused advancements have enabled the IETF to standardize several protocols. Nevertheless, these protocols tend to focus on either strengthening user privacy (like Oblivious DNS and Oblivious DNS-… ▽ More

    Submitted 8 December, 2025; v1 submitted 14 September, 2025; originally announced September 2025.

  34. arXiv:2509.09592  [pdf, ps, other

    cs.CR

    Bridging the Gap in Phishing Detection: A Comprehensive Phishing Dataset Collector

    Authors: Aditya Kulkarni, Shahil Manishbhai Patel, Shivam Pradip Tirmare, Vivek Balachandran, Tamal Das

    Abstract: To combat phishing attacks -- aimed at luring web users to divulge their sensitive information -- various phishing detection approaches have been proposed. As attackers focus on devising new tactics to bypass existing detection solutions, researchers have adapted by integrating machine learning and deep learning into phishing detection. Phishing dataset collection is vital to developing effective… ▽ More

    Submitted 11 September, 2025; originally announced September 2025.

  35. arXiv:2509.08424  [pdf, ps, other

    cs.CR

    Phishing Webpage Detection: Unveiling the Threat Landscape and Investigating Detection Techniques

    Authors: Aditya Kulkarni, Vivek Balachandran, Tamal Das

    Abstract: In the realm of cybersecurity, phishing stands as a prevalent cyber attack, where attackers employ various tactics to deceive users into gathering their sensitive information, potentially leading to identity theft or financial gain. Researchers have been actively working on advancing phishing webpage detection approaches to detect new phishing URLs, bolstering user protection. Nonetheless, the eve… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

  36. arXiv:2509.08375  [pdf, ps, other

    cs.CR

    Phish-Blitz: Advancing Phishing Detection with Comprehensive Webpage Resource Collection and Visual Integrity Preservation

    Authors: Duddu Hriday, Aditya Kulkarni, Vivek Balachandran, Tamal Das

    Abstract: Phishing attacks are increasingly prevalent, with adversaries creating deceptive webpages to steal sensitive information. Despite advancements in machine learning and deep learning for phishing detection, attackers constantly develop new tactics to bypass detection models. As a result, phishing webpages continue to reach users, particularly those unable to recognize phishing indicators. To improve… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

  37. arXiv:2509.08364  [pdf, ps, other

    cs.CR

    Overcoming DNSSEC Islands of Security: A TLS and IP-Based Certificate Solution

    Authors: Aduma Rishith, Aditya Kulkarni, Tamal Das, Vivek Balachandran

    Abstract: The Domain Name System (DNS) serves as the backbone of the Internet, primarily translating domain names to IP addresses. Over time, various enhancements have been introduced to strengthen the integrity of DNS. Among these, DNSSEC stands out as a leading cryptographic solution. It protects against attacks (such as DNS spoofing) by establishing a chain of trust throughout the DNS nameserver hierarch… ▽ More

    Submitted 8 December, 2025; v1 submitted 10 September, 2025; originally announced September 2025.

  38. arXiv:2509.07925  [pdf, ps, other

    cs.CL cs.AI cs.LG

    GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models

    Authors: Tuo Wang, Adithya Kulkarni, Tyler Cody, Peter A. Beling, Yujun Yan, Dawei Zhou

    Abstract: Uncertainty estimation is essential for enhancing the reliability of Large Language Models (LLMs), particularly in high-stakes applications. Existing methods often overlook semantic dependencies, relying on token-level probability measures that fail to capture structural relationships within the generated text. We propose GENUINE: Graph ENhanced mUlti-level uncertaINty Estimation for Large Languag… ▽ More

    Submitted 9 September, 2025; originally announced September 2025.

    Comments: Accepted by EMNLP 2025

  39. arXiv:2509.02859  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models

    Authors: Sandipana Dowerah, Atharva Kulkarni, Ajinkya Kulkarni, Hoan My Tran, Joonas Kalda, Artem Fedorchenko, Benoit Fauve, Damien Lolive, Tanel Alumäe, Matthew Magimai Doss

    Abstract: Parallel to the development of advanced deepfake audio generation, audio deepfake detection has also seen significant progress. However, a standardized and comprehensive benchmark is still missing. To address this, we introduce Speech DeepFake (DF) Arena, the first comprehensive benchmark for audio deepfake detection. Speech DF Arena provides a toolkit to uniformly evaluate detection systems, curr… ▽ More

    Submitted 2 September, 2025; originally announced September 2025.

  40. arXiv:2508.20543  [pdf, ps, other

    cs.IR cs.CY

    Enhancing Semantic Document Retrieval- Employing Group Steiner Tree Algorithm with Domain Knowledge Enrichment

    Authors: Apurva Kulkarni, Chandrashekar Ramanathan, Vinu E Venugopal

    Abstract: Retrieving pertinent documents from various data sources with diverse characteristics poses a significant challenge for Document Retrieval Systems. The complexity of this challenge is further compounded when accounting for the semantic relationship between data and domain knowledge. While existing retrieval systems using semantics (usually represented as Knowledge Graphs created from open-access r… ▽ More

    Submitted 28 August, 2025; originally announced August 2025.

  41. arXiv:2507.14758  [pdf, ps, other

    cs.CL cs.AI cs.IR

    GRACE: Generative Recommendation via Journey-Aware Sparse Attention on Chain-of-Thought Tokenization

    Authors: Luyi Ma, Wanjia Zhang, Kai Zhao, Abhishek Kulkarni, Lalitesh Morishetti, Anjana Ganesh, Ashish Ranjan, Aashika Padmanabhan, Jianpeng Xu, Jason Cho, Praveen Kanumala, Kaushiki Nag, Sumit Dutta, Kamiya Motwani, Malay Patel, Evren Korpeoglu, Sushant Kumar, Kannan Achan

    Abstract: Generative models have recently demonstrated strong potential in multi-behavior recommendation systems, leveraging the expressive power of transformers and tokenization to generate personalized item sequences. However, their adoption is hindered by (1) the lack of explicit information for token reasoning, (2) high computational costs due to quadratic attention complexity and dense sequence represe… ▽ More

    Submitted 19 July, 2025; originally announced July 2025.

    Comments: 10 pages, 5 figures, The ACM Conference on Recommender Systems (RecSys) 2025

  42. arXiv:2507.07741  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review

    Authors: Maha Tufail Agro, Atharva Kulkarni, Karima Kadaoui, Zeerak Talat, Hanan Aldarmaki

    Abstract: Motivated by a growing research interest into automatic speech recognition (ASR), and the growing body of work for languages in which code-switching (CS) often occurs, we present a systematic literature review of code-switching in end-to-end ASR models. We collect and manually annotate papers published in peer reviewed venues. We document the languages considered, datasets, metrics, model choices,… ▽ More

    Submitted 10 July, 2025; originally announced July 2025.

  43. arXiv:2507.02883  [pdf, ps, other

    q-bio.BM cs.LG

    DISPROTBENCH: Uncovering the Functional Limits of Protein Structure Prediction Models in Intrinsically Disordered Regions

    Authors: Xinyue Zeng, Tuo Wang, Adithya Kulkarni, Alexander Lu, Alexandra Ni, Phoebe Xing, Junhan Zhao, Siwei Chen, Dawei Zhou

    Abstract: Intrinsically disordered regions (IDRs) play central roles in cellular function, yet remain poorly evaluated by existing protein structure prediction benchmarks. Current evaluations largely focus on well-folded domains, overlooking three fundamental challenges in realistic biological settings: the structural complexity of proteins, the resulting low availability of reliable ground truth, and predi… ▽ More

    Submitted 10 February, 2026; v1 submitted 18 June, 2025; originally announced July 2025.

  44. Non-exchangeable Conformal Prediction for Temporal Graph Neural Networks

    Authors: Tuo Wang, Jian Kang, Yujun Yan, Adithya Kulkarni, Dawei Zhou

    Abstract: Conformal prediction for graph neural networks (GNNs) offers a promising framework for quantifying uncertainty, enhancing GNN reliability in high-stakes applications. However, existing methods predominantly focus on static graphs, neglecting the evolving nature of real-world graphs. Temporal dependencies in graph structure, node attributes, and ground truth labels violate the fundamental exchangea… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: accepted by KDD 2025

    ACM Class: H.1.0; I.2.0

  45. arXiv:2507.00330  [pdf, ps, other

    cs.CL cs.IR

    Modeling Data Diversity for Joint Instance and Verbalizer Selection in Cold-Start Scenarios

    Authors: Mohna Chakraborty, Adithya Kulkarni, Qi Li

    Abstract: Prompt-based methods leverage the knowledge of pre-trained language models (PLMs) trained with a masked language modeling (MLM) objective; however, these methods are sensitive to template, verbalizer, and few-shot instance selection, particularly in cold-start settings with no labeled data. Existing studies overlook the dependency between instances and verbalizers, where instance-label probabiliti… ▽ More

    Submitted 30 June, 2025; originally announced July 2025.

  46. arXiv:2506.07985  [pdf, ps, other

    cs.CV cs.LG

    Beyond Top Activations: Efficient and Reliable Crowdsourced Evaluation of Automated Interpretability

    Authors: Tuomas Oikarinen, Ge Yan, Akshay Kulkarni, Tsui-Wei Weng

    Abstract: Interpreting individual neurons or directions in activation space is an important topic in mechanistic interpretability. Numerous automated interpretability methods have been proposed to generate such explanations, but it remains unclear how reliable these explanations are, and which methods produce the most accurate descriptions. While crowd-sourced evaluations are commonly used, existing pipelin… ▽ More

    Submitted 2 December, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  47. arXiv:2506.06093  [pdf, ps, other

    cs.CL

    Reinforcing Code Generation: Improving Text-to-SQL with Execution-Based Learning

    Authors: Atharv Kulkarni, Vivek Srikumar

    Abstract: In this work, we study the problem of code generation with a large language model (LLM), with a focus on generating SQL queries from natural language questions. We ask: Instead of using supervised fine tuning with text-code pairs, can we tune a model by having it interact with a database engine? We frame this problem as a reinforcement learning problem where the model receives execution-based feed… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: Under review at EMNLP 2025

  48. arXiv:2506.05746  [pdf, ps, other

    cs.CL

    LLM-Symbolic Integration for Robust Temporal Tabular Reasoning

    Authors: Atharv Kulkarni, Kushagra Dixit, Vivek Srikumar, Dan Roth, Vivek Gupta

    Abstract: Temporal tabular question answering presents a significant challenge for Large Language Models (LLMs), requiring robust reasoning over structured data, which is a task where traditional prompting methods often fall short. These methods face challenges such as memorization, sensitivity to table size, and reduced performance on complex queries. To overcome these limitations, we introduce TempTabQA-C… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: Accepted to ACL Findings 2025

  49. arXiv:2506.02085  [pdf, ps, other

    cs.SD cs.AI cs.CL eess.AS

    Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion

    Authors: Ajinkya Kulkarni, Sandipana Dowerah, Tanel Alumae, Mathew Magimai. -Doss

    Abstract: Audio deepfakes are acquiring an unprecedented level of realism with advanced AI. While current research focuses on discerning real speech from spoofed speech, tracing the source system is equally crucial. This work proposes a novel audio source tracing system combining deep metric multi-class N-pair loss with Real Emphasis and Fake Dispersion framework, a Conformer classification network, and ens… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted at Interspeech 2025, Netherlands

  50. arXiv:2506.00815  [pdf, ps, other

    cs.CL

    Chandomitra: Towards Generating Structured Sanskrit Poetry from Natural Language Inputs

    Authors: Manoj Balaji Jagadeeshan, Samarth Bhatia, Pretam Ray, Harshul Raj Surana, Akhil Rajeev P, Priya Mishra, Annarao Kulkarni, Ganesh Ramakrishnan, Prathosh AP, Pawan Goyal

    Abstract: Text Generation has achieved remarkable performance using large language models. It has also been recently well-studied that these large language models are capable of creative generation tasks but prominently for high-resource languages. This prompts a fundamental question: Is there a way to utilize these (large) language models for structured poetry generation in a low-resource language, such as… ▽ More

    Submitted 16 January, 2026; v1 submitted 31 May, 2025; originally announced June 2025.