Skip to main content

Showing 1–12 of 12 results for author: Khurana, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2512.11315  [pdf, ps, other

    cs.LG

    Benchmarking the Generality of Vision-Language-Action Models

    Authors: Pranav Guruprasad, Sudipta Chowdhury, Harsh Sikka, Mridul Sharma, Helen Lu, Sean Rivera, Aryan Khurana, Hangliang Ren, Yangyue Wang

    Abstract: Generalist multimodal agents are expected to unify perception, language, and control - operating robustly across diverse real world domains. However, current evaluation practices remain fragmented across isolated benchmarks, making it difficult to assess whether today's foundation models truly generalize beyond their training distributions. We introduce MultiNet v1.0, a unified benchmark for measu… ▽ More

    Submitted 12 December, 2025; originally announced December 2025.

    Comments: 23 pages, 7 figures, and 1 table

  2. arXiv:2508.02527  [pdf, ps, other

    cs.CL cs.LG

    I Have No Mouth, and I Must Rhyme: Uncovering Internal Phonetic Representations in LLaMA 3.2

    Authors: Oliver McLaughlin, Arjun Khurana, Jack Merullo

    Abstract: Large language models demonstrate proficiency on phonetic tasks, such as rhyming, without explicit phonetic or auditory grounding. In this work, we investigate how \verb|Llama-3.2-1B-Instruct| represents token-level phonetic information. Our results suggest that Llama uses a rich internal model of phonemes to complete phonetic tasks. We provide evidence for high-level organization of phoneme repre… ▽ More

    Submitted 15 October, 2025; v1 submitted 4 August, 2025; originally announced August 2025.

  3. arXiv:2504.15549  [pdf, other

    cs.HC cs.AI cs.LG

    Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software

    Authors: Anjali Khurana, Xiaotian Su, April Yi Wang, Parmit K Chilana

    Abstract: Large Language Model (LLM)-based in-application assistants, or copilots, can automate software tasks, but users often prefer learning by doing, raising questions about the optimal level of automation for an effective user experience. We investigated two automation paradigms by designing and implementing a fully automated copilot (AutoCopilot) and a semi-automated copilot (GuidedCopilot) that autom… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted for publication in the CHI Conference on Human Factors in Computing Systems (CHI 2025), April 26 - May 1, 2025, Yokohama, Japan

  4. arXiv:2402.08030  [pdf, other

    cs.HC cs.AI cs.LG

    Why and When LLM-Based Assistants Can Go Wrong: Investigating the Effectiveness of Prompt-Based Interactions for Software Help-Seeking

    Authors: Anjali Khurana, Hari Subramonyam, Parmit K Chilana

    Abstract: Large Language Model (LLM) assistants, such as ChatGPT, have emerged as potential alternatives to search methods for helping users navigate complex, feature-rich software. LLMs use vast training data from domain-specific texts, software manuals, and code repositories to mimic human-like interactions, offering tailored assistance, including step-by-step instructions. In this work, we investigated L… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in the Proceedings of the 29th International Conference on Intelligent User Interfaces (IUI'24), March 18--21, 2024, in Greenville, SC, USA

  5. arXiv:2306.09604  [pdf, other

    cs.IR

    Personalized Summarization of Scientific Scholarly Texts

    Authors: Alka Khurana, Vasudha Bhatnagar, Vikas Kumar

    Abstract: In this paper, we present a proposal for an unsupervised algorithm, P-Summ, that generates an extractive summary of scientific scholarly text to meet the personal knowledge needs of the user. The method delves into the latent semantic space of the document exposed by Weighted Non-negative Matrix Factorization, and scores sentences in consonance with the knowledge needs of the user. The novelty of… ▽ More

    Submitted 27 October, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  6. arXiv:2202.01340  [pdf, other

    cs.LG

    An Artificial Intelligence Dataset for Solar Energy Locations in India

    Authors: Anthony Ortiz, Dhaval Negandhi, Sagar R Mysorekar, Joseph Kiesecker, Shivaprakash K Nagaraju, Caleb Robinson, Priyal Bhatia, Aditi Khurana, Jane Wang, Felipe Oviedo, Juan Lavista Ferres

    Abstract: Rapid development of renewable energy sources, particularly solar photovoltaics (PV), is critical to mitigate climate change. As a result, India has set ambitious goals to install 500 gigawatts of solar energy capacity by 2030. Given the large footprint projected to meet renewables energy targets, the potential for land use conflicts over environmental values is high. To expedite development of so… ▽ More

    Submitted 30 June, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Accepted for publication in Nature Scientific Data

  7. arXiv:2112.02359  [pdf, other

    cs.CV

    Unsupervised Adaptation of Semantic Segmentation Models without Source Data

    Authors: Sujoy Paul, Ansh Khurana, Gaurav Aggarwal

    Abstract: We consider the novel problem of unsupervised domain adaptation of source models, without access to the source data for semantic segmentation. Unsupervised domain adaptation aims to adapt a model learned on the labeled source data, to a new unlabeled target dataset. Existing methods assume that the source data is available along with the target data during adaptation. However, in practical scenari… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

  8. arXiv:2112.02355  [pdf, other

    cs.CV

    SITA: Single Image Test-time Adaptation

    Authors: Ansh Khurana, Sujoy Paul, Piyush Rai, Soma Biswas, Gaurav Aggarwal

    Abstract: In Test-time Adaptation (TTA), given a source model, the goal is to adapt it to make better predictions for test instances from a different distribution than the source. Crucially, TTA assumes no access to the source data or even any additional labeled/unlabeled samples from the target distribution to finetune the source model. In this work, we consider TTA in a more pragmatic setting which we ref… ▽ More

    Submitted 7 September, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

  9. Investigating Entropy for Extractive Document Summarization

    Authors: Alka Khurana, Vasudha Bhatnagar

    Abstract: Automatic text summarization aims to cut down readers time and cognitive effort by reducing the content of a text document without compromising on its essence. Ergo, informativeness is the prime attribute of document summary generated by an algorithm, and selecting sentences that capture the essence of a document is the primary goal of extractive document summarization. In this paper, we employ Sh… ▽ More

    Submitted 29 September, 2021; v1 submitted 22 September, 2021; originally announced September 2021.

  10. arXiv:2010.09672  [pdf, other

    cs.CV

    Multi-Stage Fusion for One-Click Segmentation

    Authors: Soumajit Majumder, Ansh Khurana, Abhinav Rai, Angela Yao

    Abstract: Segmenting objects of interest in an image is an essential building block of applications such as photo-editing and image analysis. Under interactive settings, one should achieve good segmentations while minimizing user input. Current deep learning-based interactive segmentation approaches use early fusion and incorporate user cues at the image input layer. Since segmentation CNNs have many layers… ▽ More

    Submitted 20 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: A preprint of the accepted paper at GCPR 2020

  11. arXiv:2008.00329  [pdf, other

    cs.AR

    CuttleSys: Data-Driven Resource Management forInteractive Applications on Reconfigurable Multicores

    Authors: Neeraj Kulkarni, Gonzalo Gonzalez-Pumariega, Amulya Khurana, Christine Shoemaker, Christina Delimitrou, David Albonesi

    Abstract: Multi-tenancy for latency-critical applications leads to re-source interference and unpredictable performance. Core reconfiguration opens up more opportunities for colocation,as it allows the hardware to adjust to the dynamic performance and power needs of a specific mix of co-scheduled applications. However, reconfigurability also introduces challenges, as even for a small number of reconfigurabl… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

  12. arXiv:2006.10260  [pdf, other

    cs.CV cs.MM

    Video Moment Localization using Object Evidence and Reverse Captioning

    Authors: Madhawa Vidanapathirana, Supriya Pandhre, Sonia Raychaudhuri, Anjali Khurana

    Abstract: We address the problem of language-based temporal localization of moments in untrimmed videos. Compared to temporal localization with fixed categories, this problem is more challenging as the language-based queries have no predefined activity classes and may also contain complex descriptions. Current state-of-the-art model MAC addresses it by mining activity concepts from both video and language m… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 7 pages. 6 figures. For source code, refer https://github.com/madhawav/MML