Skip to main content

Showing 1–12 of 12 results for author: Kedia, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2511.04671  [pdf, ps, other

    cs.RO cs.AI cs.CV

    X-Diffusion: Training Diffusion Policies on Cross-Embodiment Human Demonstrations

    Authors: Maximus A. Pace, Prithwish Dan, Chuanruo Ning, Atiksh Bhardwaj, Audrey Du, Edward W. Duan, Wei-Chiu Ma, Kushal Kedia

    Abstract: Human videos can be recorded quickly and at scale, making them an appealing source of training data for robot learning. However, humans and robots differ fundamentally in embodiment, resulting in mismatched action execution. Direct kinematic retargeting of human hand motion can therefore produce actions that are physically infeasible for robots. Despite these low-level differences, human demonstra… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

  2. arXiv:2505.07096  [pdf, ps, other

    cs.RO cs.AI cs.LG

    X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real

    Authors: Prithwish Dan, Kushal Kedia, Angela Chao, Edward Weiyi Duan, Maximus Adrian Pace, Wei-Chiu Ma, Sanjiban Choudhury

    Abstract: Human videos offer a scalable way to train robot manipulation policies, but lack the action labels needed by standard imitation learning algorithms. Existing cross-embodiment approaches try to map human motion to robot actions, but often fail when the embodiments differ significantly. We propose X-Sim, a real-to-sim-to-real framework that uses object motion as a dense and transferable signal for l… ▽ More

    Submitted 8 November, 2025; v1 submitted 11 May, 2025; originally announced May 2025.

  3. arXiv:2412.06162  [pdf, other

    cs.AI cs.CL

    Query-Efficient Planning with Language Models

    Authors: Gonzalo Gonzalez-Pumariega, Wayne Chen, Kushal Kedia, Sanjiban Choudhury

    Abstract: Planning in complex environments requires an agent to efficiently query a world model to find a feasible sequence of actions from start to goal. Recent work has shown that Large Language Models (LLMs), with their rich prior knowledge and reasoning capabilities, can potentially help with planning by searching over promising states and adapting to feedback from the world. In this paper, we propose a… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 11 pages (not including references or appendix); 13 figures (9 main paper, 4 appendix); (v1) preprint

  4. arXiv:2409.06615  [pdf, other

    cs.RO cs.AI cs.LG

    One-Shot Imitation under Mismatched Execution

    Authors: Kushal Kedia, Prithwish Dan, Angela Chao, Maximus Adrian Pace, Sanjiban Choudhury

    Abstract: Human demonstrations as prompts are a powerful way to program robots to do long-horizon manipulation tasks. However, translating these demonstrations into robot-executable actions presents significant challenges due to execution mismatches in movement styles and physical capabilities. Existing methods for human-robot translation either depend on paired data, which is infeasible to scale, or rely h… ▽ More

    Submitted 28 March, 2025; v1 submitted 10 September, 2024; originally announced September 2024.

  5. arXiv:2402.18796  [pdf, ps, other

    cs.RO

    MOSAIC: Modular Foundation Models for Assistive and Interactive Cooking

    Authors: Huaxiaoyue Wang, Kushal Kedia, Juntao Ren, Rahma Abdullah, Atiksh Bhardwaj, Angela Chao, Kelly Y Chen, Nathaniel Chin, Prithwish Dan, Xinyi Fan, Gonzalo Gonzalez-Pumariega, Aditya Kompella, Maximus Adrian Pace, Yash Sharma, Xiangwan Sun, Neha Sunkara, Sanjiban Choudhury

    Abstract: We present MOSAIC, a modular architecture for coordinating multiple robots to (a) interact with users using natural language and (b) manipulate an open vocabulary of everyday objects. MOSAIC employs modularity at several levels: it leverages multiple large-scale pre-trained models for high-level tasks like language and image recognition, while using streamlined modules designed for low-level task-… ▽ More

    Submitted 25 October, 2025; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 22 pages, 13 figures; CoRL 2024

  6. arXiv:2311.12943  [pdf, other

    cs.RO cs.AI cs.LG cs.MA

    InteRACT: Transformer Models for Human Intent Prediction Conditioned on Robot Actions

    Authors: Kushal Kedia, Atiksh Bhardwaj, Prithwish Dan, Sanjiban Choudhury

    Abstract: In collaborative human-robot manipulation, a robot must predict human intents and adapt its actions accordingly to smoothly execute tasks. However, the human's intent in turn depends on actions the robot takes, creating a chicken-or-egg problem. Prior methods ignore such inter-dependency and instead train marginal intent prediction models independent of robot actions. This is because training cond… ▽ More

    Submitted 2 June, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: We release our code and datasets at https://portal-cornell.github.io/interact/

  7. arXiv:2310.13258  [pdf, other

    cs.RO cs.AI cs.LG

    ManiCast: Collaborative Manipulation with Cost-Aware Human Forecasting

    Authors: Kushal Kedia, Prithwish Dan, Atiksh Bhardwaj, Sanjiban Choudhury

    Abstract: Seamless human-robot manipulation in close proximity relies on accurate forecasts of human motion. While there has been significant progress in learning forecast models at scale, when applied to manipulation tasks, these models accrue high errors at critical transition points leading to degradation in downstream planning performance. Our key insight is that instead of predicting the most likely hu… ▽ More

    Submitted 27 November, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: CoRL 2023

  8. arXiv:2308.06137  [pdf, other

    cs.AI

    A Game-Theoretic Framework for Joint Forecasting and Planning

    Authors: Kushal Kedia, Prithwish Dan, Sanjiban Choudhury

    Abstract: Planning safe robot motions in the presence of humans requires reliable forecasts of future human motion. However, simply predicting the most likely motion from prior interactions does not guarantee safety. Such forecasts fail to model the long tail of possible events, which are rarely observed in limited datasets. On the other hand, planning for worst-case motions leads to overtly conservative be… ▽ More

    Submitted 19 October, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: IROS 2023

  9. arXiv:2211.17046  [pdf, other

    cs.CL cs.CY

    Rationale-Guided Few-Shot Classification to Detect Abusive Language

    Authors: Punyajoy Saha, Divyanshu Sheth, Kushal Kedia, Binny Mathew, Animesh Mukherjee

    Abstract: Abusive language is a concerning problem in online social media. Past research on detecting abusive language covers different platforms, languages, demographies, etc. However, models trained using these datasets do not perform well in cross-domain evaluation settings. To overcome this, a common strategy is to use a few samples from the target domain to train models to get better performance in tha… ▽ More

    Submitted 27 July, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: 11 pages, 14 tables, 3 figures, The code repository is https://github.com/punyajoy/RGFS_ECAI

  10. arXiv:2202.10449  [pdf, other

    cs.MA cs.AI

    Optimal Multi-Agent Path Finding for Precedence Constrained Planning Tasks

    Authors: Kushal Kedia, Rajat Kumar Jenamani, Aritra Hazra, Partha Pratim Chakrabarti

    Abstract: Multi-Agent Path Finding (MAPF) is the problem of finding collision-free paths for multiple agents from their start locations to end locations. We consider an extension to this problem, Precedence Constrained Multi-Agent Path Finding (PC-MAPF), wherein agents are assigned a sequence of planning tasks that contain precedence constraints between them. PC-MAPF has various applications, for example in… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  11. arXiv:2102.07150  [pdf, other

    cs.CL

    indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages

    Authors: Kushal Kedia, Abhilash Nandy

    Abstract: The paper presents the submission of the team indicnlp@kgp to the EACL 2021 shared task "Offensive Language Identification in Dravidian Languages." The task aimed to classify different offensive content types in 3 code-mixed Dravidian language datasets. The work leverages existing state of the art approaches in text classification by incorporating additional data and transfer learning on pre-train… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

  12. arXiv:2006.04194  [pdf, other

    cs.RO

    Robotic Motion Planning using Learned Critical Sources and Local Sampling

    Authors: Rajat Kumar Jenamani, Rahul Kumar, Parth Mall, Kushal Kedia

    Abstract: Sampling based methods are widely used for robotic motion planning. Traditionally, these samples are drawn from probabilistic ( or deterministic ) distributions to cover the state space uniformly. Despite being probabilistically complete, they fail to find a feasible path in a reasonable amount of time in constrained environments where it is essential to go through narrow passages (bottleneck regi… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

    Comments: Accepted at Fourth Machine Learning in Planning and Control of Robot Motion Workshop, 2020 IEEE International Conference on Robotics and Automation