Showing 1–50 of 99 results for author: Desai, S

Searching in archive cs.
  1. arXiv:2604.07930  [pdf, ps, other]

    cs.IR

    Unified Supervision for Walmart's Sponsored Search Retrieval via Joint Semantic Relevance and Behavioral Engagement Modeling

    Authors: Shasvat Desai, Md Omar Faruk Rokon, Jhalak Nilesh Acharya, Isha Shah, Hong Yao, Utkarsh Porwal, Kuang-chih Lee

    Abstract: Modern search systems rely on a fast first stage retriever to fetch relevant items from a massive catalog of items. Deployed search systems often use user engagement signals to supervise bi-encoder retriever training at scale, because these signals are continuously logged from real traffic and require no additional annotation effort. However, engagement is an imperfect proxy for semantic relevance…

    Submitted 15 April, 2026; v1 submitted 9 April, 2026; originally announced April 2026.

    Comments: Accepted to SIGIR 2026, Industry Track

  2. "Who wants to be nagged by AI?": Investigating the Effects of Agreeableness on Older Adults' Perception of LLM-Based Voice Assistants' Explanations

    Authors: Niharika Mathur, Hasibur Rahman, Smit Desai

    Abstract: LLM-based voice assistants (VAs) increasingly support older adults aging in place, yet how an assistant's agreeableness shapes explanation perception remains underexplored. We conducted a study (N=70) examining how VA agreeableness influences older adults' perceptions of explanations across routine and emergency home scenarios. High-agreeableness assistants were perceived as more trustworthy, empat…

    Submitted 9 March, 2026; originally announced March 2026.

    Comments: To be published as a poster extended abstract at CHI 2026

  3. arXiv:2603.08164  [pdf, ps, other]

    cs.HC

    The Differential Effects of Agreeableness and Extraversion on Older Adults' Perceptions of Conversational AI Explanations in Assistive Settings

    Authors: Niharika Mathur, Hasibur Rahman, Smit Desai

    Abstract: Large Language Model-based Voice Assistants (LLM-VAs) are increasingly deployed in assistive settings for older adults, yet little is known about how an agent's personality shapes user perceptions of its explanations. This paper presents a mixed factorial experiment (N=140) examining how agreeableness and extraversion in an LLM-VA ("Robin") influence older adults' perceptions across seven measures…

    Submitted 9 March, 2026; originally announced March 2026.

  4. arXiv:2602.22340  [pdf, ps, other]

    cs.HC

    Conversational Successes and Breakdowns in Everyday Smart Glasses Use

    Authors: Xiuqi Tommy Zhu, Xiaoan Liu, Casper Harteveld, Smit Desai, Eileen McGivney

    Abstract: Non-Display Smart Glasses hold the potential to support everyday activities by combining continuous environmental sensing with voice-only interaction powered by large language models (LLMs). Understanding how conversational successes and breakdowns arise in everyday contexts can better inform the design of future voice-only interfaces. To investigate this, we conducted a month-long collaborative a…

    Submitted 2 April, 2026; v1 submitted 25 February, 2026; originally announced February 2026.

  5. arXiv:2602.09162  [pdf, ps, other]

    cs.LG cond-mat.mtrl-sci

    Boltzmann Reinforcement Learning for Noise resilience in Analog Ising Machines

    Authors: Aditya Choudhary, Saaketh Desai, Prasad Iyer

    Abstract: Analog Ising machines (AIMs) have emerged as a promising paradigm for combinatorial optimization, utilizing physical dynamics to solve Ising problems with high energy efficiency. However, the performance of traditional optimization and sampling algorithms on these platforms is often limited by inherent measurement noise. We introduce BRAIN (Boltzmann Reinforcement for Analog Ising Networks), a dis…

    Submitted 9 February, 2026; originally announced February 2026.

  6. arXiv:2602.02917  [pdf, ps, other]

    cs.LG

    Weighted Temporal Decay Loss for Learning Wearable PPG Data with Sparse Clinical Labels

    Authors: Yunsung Chung, Keum San Chun, Migyeong Gwak, Han Feng, Yingshuo Liu, Chanho Lim, Viswam Nathan, Nassir Marrouche, Sharanya Arcot Desai

    Abstract: Advances in wearable computing and AI have increased interest in leveraging PPG for health monitoring over the past decade. One of the biggest challenges in developing health algorithms based on such biosignals is the sparsity of clinical labels, which makes biosignals temporally distant from lab draws less reliable for supervision. To address this problem, we introduce a simple training strategy…

    Submitted 2 February, 2026; originally announced February 2026.

    Comments: ICASSP 2026

  7. arXiv:2601.12215  [pdf, ps, other]

    cs.LG cs.AI

    Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

    Authors: Megha Thukral, Cyrus Tanade, Simon A. Lee, Juhyeon Lee, Hao Zhou, Keum San Chun, Migyeong Gwak, Viswam Nathan, Md Mahbubur Rahman, Li Zhu, Mehrab Bin Morshed, Subramaniam Venkatraman, Sharanya Arcot Desai

    Abstract: Wearable foundation models have the potential to transform digital health by learning transferable representations from large-scale biosignals collected in everyday settings. While recent progress has been made in large-scale pretraining, most approaches overlook the spectral structure of photoplethysmography (PPG) signals, wherein physiological rhythms unfold across multiple frequency bands. Moti…

    Submitted 17 January, 2026; originally announced January 2026.

  8. arXiv:2511.12404  [pdf, ps, other]

    cs.MM cs.AI cs.SD

    SynthGuard: An Open Platform for Detecting AI-Generated Multimedia with Multimodal LLMs

    Authors: Shail Desai, Aditya Pawar, Li Lin, Xin Wang, Shu Hu

    Abstract: Artificial Intelligence (AI) has made it possible for anyone to create images, audio, and video with unprecedented ease, enriching education, communication, and creative expression. At the same time, the rapid rise of AI-generated media has introduced serious risks, including misinformation, identity misuse, and the erosion of public trust as synthetic content becomes increasingly indistinguishabl…

    Submitted 15 November, 2025; originally announced November 2025.

  9. arXiv:2511.04681  [pdf, ps, other]

    astro-ph.CO cs.LG

    Dark Energy Survey Year 3 results: Simulation-based $w$CDM inference from weak lensing and galaxy clustering maps with deep learning: Analysis design

    Authors: A. Thomsen, J. Bucko, T. Kacprzak, V. Ajani, J. Fluri, A. Refregier, D. Anbajagane, F. J. Castander, A. Ferté, M. Gatti, N. Jeffrey, A. Alarcon, A. Amon, K. Bechtol, M. R. Becker, G. M. Bernstein, A. Campos, A. Carnero Rosell, C. Chang, R. Chen, A. Choi, M. Crocce, C. Davis, J. DeRose, S. Dodelson, et al. (77 additional authors not shown)

    Abstract: Data-driven approaches using deep learning are emerging as powerful techniques to extract non-Gaussian information from cosmological large-scale structure. This work presents the first simulation-based inference (SBI) pipeline that combines weak lensing and galaxy clustering maps in a realistic Dark Energy Survey Year 3 (DES Y3) configuration and serves as preparation for a forthcoming analysis of…

    Submitted 18 February, 2026; v1 submitted 6 November, 2025; originally announced November 2025.

    Comments: 39 pages, 14 figures

  10. arXiv:2510.25785  [pdf, ps, other]

    cs.LG cs.AI eess.SP

    HiMAE: Hierarchical Masked Autoencoders Discover Resolution-Specific Structure in Wearable Time Series

    Authors: Simon A. Lee, Cyrus Tanade, Hao Zhou, Juhyeon Lee, Megha Thukral, Minji Han, Rachel Choi, Md Sazzad Hissain Khan, Baiying Lu, Migyeong Gwak, Mehrab Bin Morshed, Viswam Nathan, Md Mahbubur Rahman, Li Zhu, Subramaniam Venkatraman, Sharanya Arcot Desai

    Abstract: Wearable sensors provide abundant physiological time series, yet the principles governing their predictive utility remain unclear. We hypothesize that temporal resolution is a fundamental axis of representation learning, with different clinical and behavioral outcomes relying on structure at distinct scales. To test this resolution hypothesis, we introduce HiMAE (Hierarchical Masked Autoencoder),…

    Submitted 28 October, 2025; originally announced October 2025.

  11. arXiv:2510.22503  [pdf, ps, other]

    cs.LG cond-mat.mtrl-sci cs.AI cs.NE

    LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

    Authors: Nikhil Abhyankar, Sanchit Kabra, Saaketh Desai, Chandan K. Reddy

    Abstract: Materials discovery requires navigating vast chemical and structural spaces while satisfying multiple, often conflicting, objectives. We present LLM-guided Evolution for MAterials discovery (LLEMA), a unified framework that couples the scientific knowledge embedded in large language models with chemistry-informed evolutionary rules and memory-based refinement. At each iteration, an LLM proposes cr…

    Submitted 5 March, 2026; v1 submitted 25 October, 2025; originally announced October 2025.

    Comments: ICLR 2026

  12. arXiv:2509.19147  [pdf, ps, other]

    cs.CY cs.AI cs.SI

    Generative Propaganda

    Authors: Madeleine I. G. Daepp, Alejandro Cuevas, Robert Osazuwa Ness, Vickie Yu-Ping Wang, Bharat Kumar Nayak, Dibyendu Mishra, Ti-Chung Cheng, Shaily Desai, Joyojeet Pal

    Abstract: Generative propaganda is the use of generative artificial intelligence (AI) to shape public opinion. To characterize its use in real-world settings, we conducted interviews with defenders (e.g., factcheckers, journalists, officials) in Taiwan and creators (e.g., influencers, political consultants, advertisers) as well as defenders in India, centering two places characterized by high levels of onli…

    Submitted 23 September, 2025; originally announced September 2025.

    Comments: Working Paper

    ACM Class: K.4.2

  13. arXiv:2509.12979  [pdf, ps, other]

    cs.CR

    Universal share based quantum multi secret image sharing scheme

    Authors: Dipak K. Rabari, Yogesh K. Meghrajani, Laxmi S. Desai

    Abstract: Image security for information has become increasingly critical as internet become more prevalent due to hacking and unauthorized access. To ensure the security of confidential image data, image encryption using visual cryptography plays a crucial role. To share multiple images using visual cryptography, the company organizer utilizes the concept of a universal or common share. Likewise, quantum c…

    Submitted 16 September, 2025; originally announced September 2025.

  14. arXiv:2509.09870  [pdf, ps, other]

    cs.HC cs.AI cs.CL

    Vibe Check: Understanding the Effects of LLM-Based Conversational Agents' Personality and Alignment on User Perceptions in Goal-Oriented Tasks

    Authors: Hasibur Rahman, Smit Desai

    Abstract: Large language models (LLMs) enable conversational agents (CAs) to express distinctive personalities, raising new questions about how such designs shape user perceptions. This study investigates how personality expression levels and user-agent personality alignment influence perceptions in goal-oriented tasks. In a between-subjects experiment (N=150), participants completed travel planning with CA…

    Submitted 11 September, 2025; originally announced September 2025.

  15. arXiv:2508.21098  [pdf, ps, other]

    cs.CL cs.AI

    TrInk: Ink Generation with Transformer Network

    Authors: Zezhong Jin, Shubhang Desai, Xu Chen, Biyi Fang, Zhuoyi Huang, Zhe Li, Chong-Xin Gan, Xiao Tu, Man-Wai Mak, Yan Lu, Shujie Liu

    Abstract: In this paper, we propose TrInk, a Transformer-based model for ink generation, which effectively captures global dependencies. To better facilitate the alignment between the input text and generated stroke points, we introduce scaled positional embeddings and a Gaussian memory mask in the cross-attention module. Additionally, we design both subjective and objective evaluation pipelines to comprehe…

    Submitted 27 August, 2025; originally announced August 2025.

    Comments: Accepted to EMNLP 2025 Main Conference

  16. arXiv:2508.14318  [pdf, ps, other]

    cs.AR cs.AI cs.DC

    Power Stabilization for AI Training Datacenters

    Authors: Esha Choukse, Brijesh Warrier, Scot Heath, Luz Belmont, April Zhao, Hassan Ali Khan, Brian Harry, Matthew Kappel, Russell J. Hewett, Kushal Datta, Yu Pei, Caroline Lichtenberger, John Siegler, David Lukofsky, Zaid Kahn, Gurpreet Sahota, Andy Sullivan, Charles Frederick, Hien Thai, Rebecca Naughton, Daniel Jurnove, Justin Harp, Reid Carper, Nithish Mahalingam, Srini Varkala, et al. (32 additional authors not shown)

    Abstract: Large Artificial Intelligence (AI) training workloads spanning several tens of thousands of GPUs present unique power management challenges. These arise due to the high variability in power consumption during the training. Given the synchronous nature of these jobs, during every iteration there is a computation-heavy phase, where each GPU works on the local data, and a communication-heavy phase wh…

    Submitted 21 August, 2025; v1 submitted 19 August, 2025; originally announced August 2025.

  17. arXiv:2506.07400  [pdf, ps, other]

    cs.MA cs.AI cs.CV cs.LG

    MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models

    Authors: Philip R. Liu, Sparsh Bansal, Jimmy Dinh, Aditya Pawar, Ramani Satishkumar, Shail Desai, Neeraj Gupta, Xin Wang, Shu Hu

    Abstract: The integration of deep learning-based glaucoma detection with large language models (LLMs) presents an automated strategy to mitigate ophthalmologist shortages and improve clinical reporting efficiency. However, applying general LLMs to medical imaging remains challenging due to hallucinations, limited interpretability, and insufficient domain-specific medical knowledge, which can potentially red…

    Submitted 16 December, 2025; v1 submitted 8 June, 2025; originally announced June 2025.

    Journal ref: Proc. 2025 IEEE 8th International Conference on Multimedia Information Processing and Retrieval (MIPR), pp. 456-462, 2025

  18. arXiv:2506.04659  [pdf, ps, other]

    cs.HC

    Multi-Tool Analysis of User Interface & Accessibility in Deployed Web-Based Chatbots

    Authors: Mukesh Rajmohan, Smit Desai, Sanchari Das

    Abstract: In this work, we present a multi-tool evaluation of 106 deployed web-based chatbots, across domains like healthcare, education and customer service, comprising both standalone applications and embedded widgets using automated tools (Google Lighthouse, PageSpeed Insights, SiteImprove Accessibility Checker) and manual audits (Microsoft Accessibility Insights). Our analysis reveals that over 80% of c…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 9 pages, 6 figures. Submitted to ACM Conversational User Interfaces (CUI) 2025

    ACM Class: H.5.2; K.4.2

  19. arXiv:2506.02539  [pdf, ps, other]

    cs.LG

    VerificAgent: Domain-Specific Memory Verification for Scalable Oversight of Aligned Computer-Use Agents

    Authors: Thong Q. Nguyen, Shubhang Desai, Raja Hasnain Anwar, Firoz Shaik, Vishwas Suryanarayanan, Vishal Chowdhary

    Abstract: Continual memory augmentation lets computer-using agents (CUAs) learn from prior interactions, but unvetted memories can encode domain-inappropriate or unsafe heuristics--spurious rules that drift from user intent and safety constraints. We introduce VerificAgent, a scalable oversight framework that treats persistent memory as an explicit alignment surface. VerificAgent combines (1) an expert-cura…

    Submitted 7 August, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  20. Balancing Efficiency and Empathy: Healthcare Providers' Perspectives on AI-Supported Workflows for Serious Illness Conversations in the Emergency Department

    Authors: Menglin Zhao, Zhuorui Yong, Ruijia Guan, Kai-Wei Chang, Adrian Haimovich, Kei Ouchi, Timothy Bickmore, Zhan Zhang, Bingsheng Yao, Dakuo Wang, Smit Desai

    Abstract: Serious Illness Conversations (SICs), discussions about values and care preferences for patients with life-threatening illness, rarely occur in Emergency Departments (EDs), despite evidence that early conversations improve care alignment and reduce unnecessary interventions. We interviewed 11 ED providers to identify challenges in SICs and opportunities for technology support, with a focus on AI.…

    Submitted 31 March, 2026; v1 submitted 30 May, 2025; originally announced June 2025.

    Comments: To appear at ACM CHI'26

  21. arXiv:2504.00698  [pdf]

    cs.CL cs.AI cs.LG

    Command A: An Enterprise-Ready Large Language Model

    Authors: Team Cohere: Aakanksha, Arash Ahmadian, Marwan Ahmed, Jay Alammar, Milad Alizadeh, Yazeed Alnumay, Sophia Althammer, Arkady Arkhangorodsky, Viraat Aryabumi, Dennis Aumiller, Raphaël Avalos, Zahara Aviv, Sammie Bae, Saurabh Baji, Alexandre Barbet, Max Bartolo, Björn Bebensee, Neeral Beladia, Walter Beller-Morales, Alexandre Bérard, Andrew Berneshawi, Anna Bialas, Phil Blunsom, et al. (205 additional authors not shown)

    Abstract: In this report we describe the development of Command A, a powerful large language model purpose-built to excel at real-world enterprise use cases. Command A is an agent-optimised and multilingual-capable model, with support for 23 languages of global business, and a novel hybrid architecture balancing efficiency with top of the range performance. It offers best-in-class Retrieval Augmented Genera…

    Submitted 14 April, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

    Comments: 55 pages

  22. arXiv:2503.14603  [pdf, other]

    cs.CL cs.LG

    Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM

    Authors: Yazeed Alnumay, Alexandre Barbet, Anna Bialas, William Darling, Shaan Desai, Joan Devassy, Kyle Duffy, Stephanie Howe, Olivia Lasche, Justin Lee, Anirudh Shrinivason, Jennifer Tracey

    Abstract: Building high-quality large language models (LLMs) for enterprise Arabic applications remains challenging due to the limited availability of digitized Arabic data. In this work, we present a data synthesis and refinement strategy to help address this problem, namely, by leveraging synthetic data generation and human-in-the-loop annotation to expand our Arabic training corpus. We further present ou…

    Submitted 18 March, 2025; originally announced March 2025.

  23. arXiv:2502.20513  [pdf, ps, other]

    cs.HC cs.AI cs.LG

    Personas Evolved: Designing Ethical LLM-Based Conversational Agent Personalities

    Authors: Smit Desai, Mateusz Dubiel, Nima Zargham, Thomas Mildner, Laura Spillner

    Abstract: The emergence of Large Language Models (LLMs) has revolutionized Conversational User Interfaces (CUIs), enabling more dynamic, context-aware, and human-like interactions across diverse domains, from social sciences to healthcare. However, the rapid adoption of LLM-based personas raises critical ethical and practical concerns, including bias, manipulation, and unforeseen social consequences. Unlike…

    Submitted 27 February, 2025; originally announced February 2025.

  24. arXiv:2502.16030  [pdf, other]

    cs.CV cs.AI

    Real Time Offside Detection using a Single Camera in Soccer

    Authors: Shounak Desai

    Abstract: Technological advancements in soccer have surged over the past decade, transforming aspects of the sport. Unlike binary rules, many soccer regulations, such as the "Offside Rule," rely on subjective interpretation rather than straightforward True or False criteria. The on-field referee holds ultimate authority in adjudicating these nuanced decisions. A significant breakthrough in soccer officiatin…

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 5 pages, 11 figures

  25. arXiv:2502.11554  [pdf, ps, other]

    cs.HC cs.AI cs.CL cs.CY cs.ET

    Toward Metaphor-Fluid Conversation Design for Voice User Interfaces

    Authors: Smit Desai, Jessie Chin, Dakuo Wang, Benjamin Cowan, Michael Twidale

    Abstract: Metaphors play a critical role in shaping user experiences with Voice User Interfaces (VUIs), yet existing designs often rely on static, human-centric metaphors that fail to adapt to diverse contexts and user needs. This paper introduces Metaphor-Fluid Design, a novel approach that dynamically adjusts metaphorical representations based on conversational use-contexts. We compare this approach to a…

    Submitted 23 October, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  26. arXiv:2502.08134  [pdf, other]

    cs.CV

    A Survey on Data Curation for Visual Contrastive Learning: Why Crafting Effective Positive and Negative Pairs Matters

    Authors: Shasvat Desai, Debasmita Ghose, Deep Chakraborty

    Abstract: Visual contrastive learning aims to learn representations by contrasting similar (positive) and dissimilar (negative) pairs of data samples. The design of these pairs significantly impacts representation quality, training efficiency, and computational cost. A well-curated set of pairs leads to stronger representations and faster convergence. As contrastive pre-training sees wider adoption for solv…

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 9 pages, 3 figures

  27. arXiv:2502.05115  [pdf, ps, other]

    cs.HC cs.AI

    "It Felt Like I Was Left in the Dark": Exploring Information Needs and Design Opportunities for Family Caregivers of Older Adult Patients in Critical Care Settings

    Authors: Shihan Fu, Bingsheng Yao, Smit Desai, Yuqi Hu, Yuling Sun, Samantha Stonbraker, Yanjun Gao, Elizabeth M. Goldberg, Dakuo Wang

    Abstract: Older adult patients constitute a rapidly growing subgroup of Intensive Care Unit (ICU) patients. In these situations, their family caregivers are expected to represent the unconscious patients to access and interpret patients' medical information. However, caregivers currently have to rely on overloaded clinicians for information updates and typically lack the health literacy to understand comple…

    Submitted 18 September, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

  28. arXiv:2412.12347  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.optics

    AutoSciLab: A Self-Driving Laboratory For Interpretable Scientific Discovery

    Authors: Saaketh Desai, Sadhvikas Addamane, Jeffrey Y. Tsao, Igal Brener, Laura P. Swiler, Remi Dingreville, Prasad P. Iyer

    Abstract: Advances in robotic control and sensing have propelled the rise of automated scientific laboratories capable of high-throughput experiments. However, automated scientific laboratories are currently limited by human intuition in their ability to efficiently design and interpret experiments in high-dimensional spaces, throttling scientific discovery. We present AutoSciLab, a machine learning framewo…

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: Pre-print for paper accepted in AAAI

  29. arXiv:2411.03140  [pdf]

    cs.CY econ.GN

    The Impact of Medicaid Expansion on Medicare Quality Measures

    Authors: Hala Algrain, Elizabeth Cardosa, Shekha Desai, Eugene Fong, Tanguy Ringoir, Huthaifa I. Ashqar

    Abstract: The Affordable Care Act was signed into law in 2010, expanding Medicaid and improving access to care for millions of low-income Americans. Fewer uninsured individuals reduced the cost of uncompensated care, consequently improving the financial health of hospitals. We hypothesize that this amelioration in hospital finances resulted in a marked improvement of quality measures in states that chose to…

    Submitted 5 November, 2024; originally announced November 2024.

  30. arXiv:2410.22744  [pdf, ps, other]

    cs.HC cs.AI

    Designing AI Personalities: Enhancing Human-Agent Interaction Through Thoughtful Persona Design

    Authors: Nima Zargham, Mateusz Dubiel, Smit Desai, Thomas Mildner, Hanz-Joachim Belz

    Abstract: In the rapidly evolving field of artificial intelligence (AI) agents, designing the agent's characteristics is crucial for shaping user experience. This workshop aims to establish a research community focused on AI agent persona design for various contexts, such as in-car assistants, educational tools, and smart home environments. We will explore critical aspects of persona design, such as voice,…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 8 pages, the workshop accepted at the 23rd International Conference on Mobile and Ubiquitous Multimedia (MUM 2024)

  31. arXiv:2410.03342  [pdf, other]

    astro-ph.IM cs.DL physics.soc-ph

    A meta-analysis of impact factors of astrophysics journals

    Authors: Rayani Venkat Sai Rithvik, Shantanu Desai

    Abstract: We calculate the 2024 impact factors for the 38 most widely used journals in Astrophysics, using the citations collated by NASA/ADS (Astrophysics Data System) and compare them to the official impact factors. This includes journals which publish papers outside of astrophysics such as PRD, EPJC, Nature, etc. We also propose a new metric to gauge the impact factor based on the median number of citati…

    Submitted 4 May, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: 10 pages, 2 figures. More journals added. Also added miscellaneous publication statistics as well as APC details. Accepted for publication in EPJP

  32. arXiv:2409.08449  [pdf, other]

    cs.HC

    Beyond Functionality: Co-Designing Voice User Interfaces for Older Adults' Well-being

    Authors: Xinhui Hu, Smit Desai, Morgan Lundy, Jessie Chin

    Abstract: The global population is rapidly aging, necessitating technologies that promote healthy aging. Voice User Interfaces (VUIs), leveraging natural language interaction, offer a promising solution for older adults due to their ease of use. However, current design practices often overemphasize functionality, neglecting older adults' complex aspirations, psychological well-being, and social connectednes…

    Submitted 12 September, 2024; originally announced September 2024.

  33. arXiv:2409.00018  [pdf, ps, other]

    cs.CE math.NA

    Analysis of nonlocal smart beams following fractional-order constitutive relations

    Authors: Shubham Desai, Sai Sidhardh

    Abstract: In this study, we develop a fractional-calculus based constitutive model for capturing nonlocal interactions over the multiphysics response in solids. More specifically, we develop constitutive relations for nonlocal piezoelectricity incorporating fractional-order kinematic relations to capture the long-range interactions over electrical and mechanical field variables. This study breaks new ground…

    Submitted 16 August, 2024; originally announced September 2024.

    Comments: 36 pages, 21 figures

  34. arXiv:2408.16465  [pdf, other]

    cs.HC

    Human and LLM-Based Voice Assistant Interaction: An Analytical Framework for User Verbal and Nonverbal Behaviors

    Authors: Szeyi Chan, Shihan Fu, Jiachen Li, Bingsheng Yao, Smit Desai, Mirjana Prpa, Dakuo Wang

    Abstract: Recent progress in large language model (LLM) technology has significantly enhanced the interaction experience between humans and voice assistants (VAs). This project aims to explore a user's continuous interaction with LLM-based VA (LLM-VA) during a complex task. We recruited 12 participants to interact with an LLM-VA during a cooking task, selected for its complexity and the requirement for cont…

    Submitted 3 September, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

  35. arXiv:2407.16083  [pdf]

    physics.optics cond-mat.mtrl-sci cs.LG

    Self-driving lab discovers principles for steering spontaneous emission

    Authors: Saaketh Desai, Sadhvikas Addamane, Jeffrey Y. Tsao, Igal Brener, Remi Dingreville, Prasad P. Iyer

    Abstract: We developed an autonomous experimentation platform to accelerate interpretable scientific discovery in ultrafast nanophotonics, targeting a novel method to steer spontaneous emission from reconfigurable semiconductor metasurfaces. Controlling spontaneous emission is crucial for clean-energy solutions in illumination, thermal radiation engineering, and remote sensing. Despite the potential of reco…

    Submitted 24 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: 25 pages, 4 figures in main text, 5 figures in supplementary information

  36. arXiv:2406.10290  [pdf, other]

    cs.CL cs.AI cs.LG

    MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

    Authors: Rithesh Murthy, Liangwei Yang, Juntao Tan, Tulika Manoj Awalgaonkar, Yilun Zhou, Shelby Heinecke, Sachin Desai, Jason Wu, Ran Xu, Sarah Tan, Jianguo Zhang, Zhiwei Liu, Shirley Kokane, Zuxin Liu, Ming Zhu, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: The deployment of Large Language Models (LLMs) and Large Multimodal Models (LMMs) on mobile devices has gained significant attention due to the benefits of enhanced privacy, stability, and personalization. However, the hardware constraints of mobile devices necessitate the use of models with fewer parameters and model compression techniques like quantization. Currently, there is limited understand…

    Submitted 12 June, 2024; originally announced June 2024.

  37. arXiv:2405.07458  [pdf, other]

    cs.HC

    Examining Humanness as a Metaphor to Design Voice User Interfaces

    Authors: Smit Desai, Mateusz Dubiel, Luis A. Leiva

    Abstract: Voice User Interfaces (VUIs) increasingly leverage 'humanness' as a foundational design metaphor, adopting roles like 'assistants,' 'teachers,' and 'secretaries' to foster natural interactions. Yet, this approach can sometimes misalign user trust and reinforce societal stereotypes, leading to socio-technical challenges that might impede long-term engagement. This paper explores an alternative appr…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted to appear in the proceedings of CUI 2024

  38. CUI@CHI 2024: Building Trust in CUIs-From Design to Deployment

    Authors: Smit Desai, Christina Wei, Jaisie Sin, Mateusz Dubiel, Nima Zargham, Shashank Ahire, Martin Porcheron, Anastasia Kuzminykh, Minha Lee, Heloisa Candello, Joel Fischer, Cosmin Munteanu, Benjamin R Cowan

    Abstract: Conversational user interfaces (CUIs) have become an everyday technology for people the world over, as well as a booming area of research. Advances in voice synthesis and the emergence of chatbots powered by large language models (LLMs), notably ChatGPT, have pushed CUIs to the forefront of human-computer interaction (HCI) research and practice. Now that these technologies enable an elemental leve…

    Submitted 25 January, 2024; originally announced January 2024.

  39. arXiv:2312.05410  [pdf, other]

    cs.LG physics.comp-ph

    Rethinking materials simulations: Blending direct numerical simulations with neural operators

    Authors: Vivek Oommen, Khemraj Shukla, Saaketh Desai, Remi Dingreville, George Em Karniadakis

    Abstract: Direct numerical simulations (DNS) are accurate but computationally expensive for predicting materials evolution across timescales, due to the complexity of the underlying evolution equations, the nature of multiscale spatio-temporal interactions, and the need to reach long-time integration. We develop a new method that blends numerical solvers with neural operators to accelerate such simulations.…

    Submitted 8 December, 2023; originally announced December 2023.

  40. arXiv:2310.20608  [pdf, other]

    cs.LG cs.AI cs.RO

    Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback

    Authors: Max Balsells, Marcel Torne, Zihan Wang, Samedh Desai, Pulkit Agrawal, Abhishek Gupta

    Abstract: Ideally, we would place a robot in a real-world environment and leave it there improving on its own by gathering more experience autonomously. However, algorithms for autonomous robotic learning have been challenging to realize in the real world. While this has often been attributed to the challenge of sample complexity, even sample-efficient techniques are hampered by two major challenges - the d…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Project website https://guided-exploration-autonomous-rl.github.io/GEAR/

  41. AI-Dentify: Deep learning for proximal caries detection on bitewing x-ray -- HUNT4 Oral Health Study

    Authors: Javier Pérez de Frutos, Ragnhild Holden Helland, Shreya Desai, Line Cathrine Nymoen, Thomas Langø, Theodor Remman, Abhijit Sen

    Abstract: Background: Dental caries diagnosis requires the manual inspection of diagnostic bitewing images of the patient, followed by a visual inspection and probing of the identified dental pieces with potential lesions. Yet the use of artificial intelligence, and in particular deep-learning, has the potential to aid in the diagnosis by providing a quick and informative analysis of the bitewing images.…

    Submitted 22 March, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: 24 pages, 5 figures, 7 tables

    ACM Class: I.2.10; I.2.1

    Journal ref: BMC Oral Health 24, 344 (2024)

  42. Using ChatGPT in HCI Research -- A Trioethnography

    Authors: Smit Desai, Tanusree Sharma, Pratyasha Saha

    Abstract: This paper explores the lived experience of using ChatGPT in HCI research through a month-long trioethnography. Our approach combines the expertise of three HCI researchers with diverse research interests to reflect on our daily experience of living and working with ChatGPT. Our findings are presented as three provocations grounded in our collective experiences and HCI theories. Specifically, we e…

    Submitted 21 September, 2023; originally announced September 2023.

  43. Like My Aunt Dorothy: Effects of Conversational Styles on Perceptions, Acceptance and Metaphorical Descriptions of Voice Assistants during Later Adulthood

    Authors: Jessie Chin, Smit Desai, Sheny Lin, Shannon Mejia

    Abstract: Little research has investigated the design of conversational styles of voice assistants (VA) for adults in their later adulthood with varying personalities. In this Wizard of Oz experiment, 34 middle-aged (50 to 64 years old) and 24 older adults (65 to 80 years old) participated in a user study at a simulated home, interacting with a VA using either formal or informal language. Older adults with…

    Submitted 20 September, 2023; originally announced September 2023.

  44. arXiv:2308.10714  [pdf, other]

    cs.DC

    CXL Memory as Persistent Memory for Disaggregated HPC: A Practical Approach

    Authors: Yehonatan Fridman, Suprasad Mutalik Desai, Navneet Singh, Thomas Willhalm, Gal Oren

    Abstract: In the landscape of High-Performance Computing (HPC), the quest for efficient and scalable memory solutions remains paramount. The advent of Compute Express Link (CXL) introduces a promising avenue with its potential to function as a Persistent Memory (PMem) solution in the context of disaggregated HPC systems. This paper presents a comprehensive exploration of CXL memory's viability as a candidat…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 12 pages, 9 figures

  45. arXiv:2307.11049  [pdf, other]

    cs.LG cs.AI cs.RO

    Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback

    Authors: Marcel Torne, Max Balsells, Zihan Wang, Samedh Desai, Tao Chen, Pulkit Agrawal, Abhishek Gupta

    Abstract: Exploration and reward specification are fundamental and intertwined challenges for reinforcement learning. Solving sequential decision-making tasks requiring expansive exploration requires either careful design of reward functions or the use of novelty-seeking exploration bonuses. Human supervisors can provide effective guidance in the loop to direct the exploration process, but prior methods to…

    Submitted 20 July, 2023; originally announced July 2023.

  46. arXiv:2305.15534  [pdf, other]

    cs.IR cs.CY cs.LG

    Representation Online Matters: Practical End-to-End Diversification in Search and Recommender Systems

    Authors: Pedro Silva, Bhawna Juneja, Shloka Desai, Ashudeep Singh, Nadia Fawaz

    Abstract: As the use of online platforms continues to grow across all demographics, users often express a desire to feel represented in the content. To improve representation in search results and recommendations, we introduce end-to-end diversification, ensuring that diverse content flows throughout the various stages of these systems, from retrieval to ranking. We develop, experiment, and deploy scalable…

    Submitted 26 May, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23), June 12--15, 2023, Chicago, IL, USA

  47. arXiv:2305.13776  [pdf, other]

    cs.CL cs.AI

    Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generation

    Authors: Rishabh Gupta, Shaily Desai, Manvi Goel, Anil Bandhakavi, Tanmoy Chakraborty, Md. Shad Akhtar

    Abstract: Counterspeech has been demonstrated to be an efficacious approach for combating hate speech. While various conventional and controlled approaches have been studied in recent years to generate counterspeech, a counterspeech with a certain intent may not be sufficient in every scenario. Due to the complex and multifaceted nature of hate speech, utilizing multiple forms of counter-narratives with var…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  48. arXiv:2305.04506  [pdf, other]

    cs.CV cs.AI

    Pedestrian Behavior Maps for Safety Advisories: CHAMP Framework and Real-World Data Analysis

    Authors: Ross Greer, Samveed Desai, Lulua Rakla, Akshay Gopalkrishnan, Afnan Alofi, Mohan Trivedi

    Abstract: It is critical for vehicles to prevent any collisions with pedestrians. Current methods for pedestrian collision prevention focus on integrating visual pedestrian detectors with Automatic Emergency Braking (AEB) systems which can trigger warnings and apply brakes as a pedestrian enters a vehicle's path. Unfortunately, pedestrian-detection-based systems can be hindered in certain situations such as…

    Submitted 8 May, 2023; originally announced May 2023.

  49. arXiv:2303.17971  [pdf, other]

    cs.MA cs.GT cs.LG

    Rule Enforcing Through Ordering

    Authors: David Sychrovský, Sameer Desai, Martin Loebl

    Abstract: In many real world situations, like minor traffic offenses in big cities, a central authority is tasked with periodic administering punishments to a large number of individuals. Common practice is to give each individual a chance to suffer a smaller fine and be guaranteed to avoid the legal process with probable considerably larger punishment. However, thanks to the large number of offenders and a…

    Submitted 24 October, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted at the 14th Conference on Decision and Game Theory for Security (GameSec-23)

  50. arXiv:2303.06008  [pdf]

    cs.CR

    A detailed review of blockchain and cryptocurrency

    Authors: Nayak Bhatia, Sanchit Bansal, Smit Desai

    Abstract: Cryptocurrency is something that we have all heard about recently, most likely preceded by bitcoin, and how much its prices have boomed over the decade. These cryptocurrencies are actually based on blockchain, a secure datatype, and recently popular form of technology. This paper gives a detailed review about the concept of blockchain and its potential applications, especially elaborating on crypt…

    Submitted 10 March, 2023; originally announced March 2023.