Showing 1–50 of 57 results for author: Lin, M C

  1. arXiv:2603.28757  [pdf, ps, other]

    cs.CV cs.MM cs.SD

    SonoWorld: From One Image to a 3D Audio-Visual Scene

    Authors: Derong Jin, Xiyi Chen, Ming C. Lin, Ruohan Gao

    Abstract: Tremendous progress in visual scene generation now turns a single image into an explorable 3D world, yet immersion remains incomplete without sound. We introduce Image2AVScene, the task of generating a 3D audio-visual scene from a single image, and present SonoWorld, the first framework to tackle this challenge. From one image, our pipeline outpaints a 360° panorama, lifts it into a navigable 3D s…

    Submitted 30 March, 2026; originally announced March 2026.

    Comments: Accepted by CVPR 2026, project page: https://humathe.github.io/sonoworld/

  2. arXiv:2603.06533  [pdf, ps, other]

    cs.CV

    NEGATE: Constrained Semantic Guidance for Linguistic Negation in Text-to-Video Diffusion

    Authors: Taewon Kang, Ming C. Lin

    Abstract: Negation is a fundamental linguistic operator, yet it remains inadequately modeled in diffusion-based generative systems. In this work, we present a formal treatment of linguistic negation in diffusion-based generative models by modeling it as a structured feasibility constraint on semantic guidance within diffusion dynamics. Rather than introducing heuristics or retraining model parameters, we re…

    Submitted 6 March, 2026; originally announced March 2026.

    Comments: 50 pages, 32 figures

  3. arXiv:2602.19035  [pdf, ps, other]

    cs.CV

    OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness

    Authors: Phuc D. A. Nguyen, Anh N. Nhu, Ming C. Lin

    Abstract: We introduce OpenVO, a novel framework for Open-world Visual Odometry (VO) with temporal awareness under limited input conditions. OpenVO effectively estimates real-world-scale ego-motion from monocular dashcam footage with varying observation rates and uncalibrated cameras, enabling robust trajectory dataset construction from rare driving events recorded in dashcam. Existing VO methods are traine…

    Submitted 21 February, 2026; originally announced February 2026.

    Comments: Main paper CVPR 2026

  4. arXiv:2512.17151  [pdf, ps, other]

    cs.CV

    Text-Conditioned Background Generation for Editable Multi-Layer Documents

    Authors: Taewon Kang, Joseph K J, Chris Tensmeyer, Jihyung Kil, Wanrong Zhu, Ming C. Lin, Vlad I. Morariu

    Abstract: We present a framework for document-centric background generation with multi-page editing and thematic continuity. To ensure text regions remain readable, we employ a \emph{latent masking} formulation that softly attenuates updates in the diffusion space, inspired by smooth barrier functions in physics and numerical optimization. In addition, we introduce \emph{Automated Readability Optimization (…

    Submitted 18 December, 2025; originally announced December 2025.

  5. arXiv:2512.06818  [pdf, ps, other]

    cs.CV

    MeshSplatting: Differentiable Rendering with Opaque Meshes

    Authors: Jan Held, Sanghyun Son, Renaud Vandeghen, Daniel Rebain, Matheus Gadelha, Yi Zhou, Anthony Cioppa, Ming C. Lin, Marc Van Droogenbroeck, Andrea Tagliasacchi

    Abstract: Primitive-based splatting methods like 3D Gaussian Splatting have revolutionized novel view synthesis with real-time rendering. However, their point-based representations remain incompatible with mesh-based pipelines that power AR/VR and game engines. We present MeshSplatting, a mesh-based reconstruction approach that jointly optimizes geometry and appearance through differentiable rendering. By e…

    Submitted 7 December, 2025; originally announced December 2025.

  6. arXiv:2511.01259  [pdf, ps, other]

    cs.GR physics.flu-dyn

    An Adjoint Method for Differentiable Fluid Simulation on Flow Maps

    Authors: Zhiqi Li, Jinjin He, Barnabás Börcsök, Taiyuan Zhang, Duowen Chen, Tao Du, Ming C. Lin, Greg Turk, Bo Zhu

    Abstract: This paper presents a novel adjoint solver for differentiable fluid simulation based on bidirectional flow maps. Our key observation is that the forward fluid solver and its corresponding backward, adjoint solver share the same flow map as the forward simulation. In the forward pass, this map transports fluid impulse variables from the initial frame to the current frame to simulate vortical dynami…

    Submitted 3 November, 2025; originally announced November 2025.

    Comments: 15 pages, 16 figures

    Journal ref: ACM SIGGRAPH Asia Conference Proceedings (2025)

  7. arXiv:2510.17665  [pdf, ps, other]

    cs.DM

    Efficient recognition algorithms for $(1,2)$-, $(2,1)$- and $(2,2)$-graphs

    Authors: Flavia Bonomo-Braberman, Min Chih Lin, Ignacio Maqueda

    Abstract: A graph $G$ is said to be a $(k,\ell)$-graph if its vertex set can be partitioned into $k$ independent sets and $\ell$ cliques. It is well established that the recognition problem for $(k,\ell)$-graphs is NP-complete whenever $k \geq 3$ or $\ell \geq 3$, while it is solvable in polynomial time otherwise. In particular, for the case $k+\ell \leq 2$, recognition can be carried out in linear time, si…

    Submitted 20 October, 2025; originally announced October 2025.

    Comments: 14 pages, 0 figures

    MSC Class: 05C85

  8. arXiv:2510.07807  [pdf, ps, other]

    cs.RO

    GM3: A General Physical Model for Micro-Mobility Vehicles

    Authors: Grace Cai, Nithin Parepally, Laura Zheng, Ming C. Lin

    Abstract: Modeling the dynamics of micro-mobility vehicles (MMV) is becoming increasingly important for training autonomous vehicle systems and building urban traffic simulations. However, mainstream tools rely on variants of the Kinematic Bicycle Model (KBM) or mode-specific physics that miss tire slip, load transfer, and rider/vehicle lean. To our knowledge, no unified, physics-based model captures these…

    Submitted 14 March, 2026; v1 submitted 9 October, 2025; originally announced October 2025.

  9. arXiv:2509.25122  [pdf, ps, other]

    cs.CV

    Triangle Splatting+: Differentiable Rendering with Opaque Triangles

    Authors: Jan Held, Renaud Vandeghen, Sanghyun Son, Daniel Rebain, Matheus Gadelha, Yi Zhou, Ming C. Lin, Marc Van Droogenbroeck, Andrea Tagliasacchi

    Abstract: Reconstructing 3D scenes and synthesizing novel views has seen rapid progress in recent years. Neural Radiance Fields demonstrated that continuous volumetric radiance fields can achieve high-quality image synthesis, but their long training and rendering times limit practicality. 3D Gaussian Splatting (3DGS) addressed these issues by representing scenes with millions of Gaussians, enabling real-tim…

    Submitted 29 September, 2025; originally announced September 2025.

    Comments: 9 pages, 6 figures, 2 tables

  10. arXiv:2506.01167  [pdf, ps, other]

    cs.LG cs.RO

    Accelerated Learning with Linear Temporal Logic using Differentiable Simulation

    Authors: Alper Kamil Bozkurt, Calin Belta, Ming C. Lin

    Abstract: Ensuring that reinforcement learning (RL) controllers satisfy safety and reliability constraints in real-world settings remains challenging: state-avoidance and constrained Markov decision processes often fail to capture trajectory-level requirements or induce overly conservative behavior. Formal specification languages such as linear temporal logic (LTL) offer correct-by-construction objectives,…

    Submitted 2 April, 2026; v1 submitted 1 June, 2025; originally announced June 2025.

  11. arXiv:2505.21880  [pdf]

    cs.MA cs.AI cs.CL cs.CY

    Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation

    Authors: Yu-Lun Song, Chung-En Tsern, Che-Cheng Wu, Yu-Ming Chang, Syuan-Bo Huang, Wei-Chu Chen, Michael Chia-Liang Lin, Yu-Ta Lin

    Abstract: This study presents an innovative approach to urban mobility simulation by integrating a Large Language Model (LLM) with Agent-Based Modeling (ABM). Unlike traditional rule-based ABM, the proposed framework leverages LLM to enhance agent diversity and realism by generating synthetic population profiles, allocating routine and occasional locations, and simulating personalized routes. Using real-wor…

    Submitted 3 July, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 8 pages, 8 figures. This paper is reviewed and accepted by the CUPUM (Computational Urban Planning and Urban Management) Conference held by University College London (UCL) in 2025

  12. arXiv:2505.16819  [pdf, ps, other]

    cs.CV

    Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts

    Authors: Taewon Kang, Ming C. Lin

    Abstract: Recent advances in scene-based video generation have enabled systems to synthesize coherent visual narratives from structured prompts. However, a crucial dimension of storytelling -- character-driven dialogue and speech -- remains underexplored. In this paper, we present a modular pipeline that transforms action-level prompts into visually and auditorily grounded narrative dialogue, enriching visu…

    Submitted 27 September, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 22 pages, 5 figures; revised supplementary document, etc

  13. arXiv:2503.06310  [pdf, ps, other]

    cs.CV

    Text2Story: Advancing Video Storytelling with Text Guidance

    Authors: Taewon Kang, Divya Kothandaraman, Ming C. Lin

    Abstract: Generating coherent long-form video sequences from discrete input using only text prompts is a critical task in content creation. While diffusion-based models excel at short video synthesis, long-form storytelling from text remains largely unexplored and a challenge due to difficulties in temporal coherency, preserving semantic meaning, and maintaining both scene context and action continuity acro…

    Submitted 27 September, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: 16 pages, 4 figures; revised method figure, etc

  14. arXiv:2503.04994  [pdf, other]

    cs.RO

    Quantifying and Modeling Driving Styles in Trajectory Forecasting

    Authors: Laura Zheng, Hamidreza Yaghoubi Araghi, Tony Wu, Sandeep Thalapanane, Tianyi Zhou, Ming C. Lin

    Abstract: Trajectory forecasting has become a popular deep learning task due to its relevance for scenario simulation for autonomous driving. Specifically, trajectory forecasting predicts the trajectory of a short-horizon future for specific human drivers in a particular traffic scenario. Robust and accurate future predictions can enable autonomous driving planners to optimize for low-risk and predictable o…

    Submitted 6 March, 2025; originally announced March 2025.

  15. arXiv:2502.00050  [pdf, other]

    cs.RO cs.LG

    DISC: Dataset for Analyzing Driving Styles In Simulated Crashes for Mixed Autonomy

    Authors: Sandip Sharan Senthil Kumar, Sandeep Thalapanane, Guru Nandhan Appiya Dilipkumar Peethambari, Sourang SriHari, Laura Zheng, Ming C. Lin

    Abstract: Handling pre-crash scenarios is still a major challenge for self-driving cars due to limited practical data and human-driving behavior datasets. We introduce DISC (Driving Styles In Simulated Crashes), one of the first datasets designed to capture various driving styles and behaviors in pre-crash scenarios for mixed autonomy analysis. DISC includes over 8 classes of driving styles/behaviors from h…

    Submitted 28 January, 2025; originally announced February 2025.

  16. arXiv:2412.16776  [pdf, ps, other]

    cs.CV cs.GR cs.LG

    DMesh++: An Efficient Differentiable Mesh for Complex Shapes

    Authors: Sanghyun Son, Matheus Gadelha, Yang Zhou, Matthew Fisher, Zexiang Xu, Yi-Ling Qiao, Ming C. Lin, Yi Zhou

    Abstract: Recent probabilistic methods for 3D triangular meshes capture diverse shapes by differentiable mesh connectivity, but face high computational costs with increased shape details. We introduce a new differentiable mesh processing method that addresses this challenge and efficiently handles meshes with intricate structures. Our method reduces time complexity from O(N) to O(log N) and requires signifi…

    Submitted 6 July, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: 20 pages, 24 figures, 6 tables

  17. arXiv:2412.16750  [pdf, other]

    cs.RO cs.DC cs.LG

    Gradient-based Trajectory Optimization with Parallelized Differentiable Traffic Simulation

    Authors: Sanghyun Son, Laura Zheng, Brian Clipp, Connor Greenwell, Sujin Philip, Ming C. Lin

    Abstract: We present a parallelized differentiable traffic simulator based on the Intelligent Driver Model (IDM), a car-following framework that incorporates driver behavior as key variables. Our vehicle simulator efficiently models vehicle motion, generating trajectories that can be supervised to fit real-world data. By leveraging its differentiable nature, IDM parameters are optimized using gradient-based…

    Submitted 17 February, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: 9 pages, 6 figures, 3 tables

  18. arXiv:2411.01391  [pdf, other]

    quant-ph cs.ET cs.LG math.NA math.OC

    Differentiable Quantum Computing for Large-scale Linear Control

    Authors: Connor Clayton, Jiaqi Leng, Gengzhi Yang, Yi-Ling Qiao, Ming C. Lin, Xiaodi Wu

    Abstract: As industrial models and designs grow increasingly complex, the demand for optimal control of large-scale dynamical systems has significantly increased. However, traditional methods for optimal control incur significant overhead as problem dimensions grow. In this paper, we introduce an end-to-end quantum algorithm for linear-quadratic control with provable speedups. Our algorithm, based on a poli…

    Submitted 2 November, 2024; originally announced November 2024.

  19. arXiv:2410.00292  [pdf, other]

    cs.CL cs.CV

    Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis

    Authors: Chun-Hsiao Yeh, Jiayun Wang, Andrew D. Graham, Andrea J. Liu, Bo Tan, Yubei Chen, Yi Ma, Meng C. Lin

    Abstract: Accurate diagnosis of ocular surface diseases is critical in optometry and ophthalmology, which hinge on integrating clinical data sources (e.g., meibography imaging and clinical metadata). Traditional human assessments lack precision in quantifying clinical observations, while current machine-based methods often treat diagnoses as multi-class classification problems, limiting the diagnoses to a p…

    Submitted 30 September, 2024; originally announced October 2024.

    Comments: Accepted to MICCAI 2024. Project Webpage: https://danielchyeh.github.io/MDPipe/

  20. arXiv:2408.06157  [pdf, ps, other]

    cs.CV

    3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

    Authors: Taewon Kang, Divya Kothandaraman, Dinesh Manocha, Ming C. Lin

    Abstract: Recent 3D novel view synthesis (NVS) methods often require extensive 3D data for training, and also typically lack generalization beyond the training distribution. Moreover, they tend to be object centric and struggle with complex and intricate scenes. Conversely, 3D-free methods can generate text-controlled views of complex, in-the-wild scenes using a pretrained stable diffusion model without the…

    Submitted 15 November, 2025; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: Accepted to The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26), AAAI 2026 Workshop on AI for Environmental Science (AI4ES). Due to arXiv's 1,920-character limit, the abstract here is shortened. Please refer to the paper (View PDF) to read the full abstract. 14 pages, 13 figures, v5: AAAI-26 camera-ready version

  21. arXiv:2407.09466  [pdf, other]

    cs.RO cs.GR

    TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety

    Authors: Sandeep Thalapanane, Sandip Sharan Senthil Kumar, Guru Nandhan Appiya Dilipkumar Peethambari, Sourang SriHari, Laura Zheng, Julio Poveda, Ming C. Lin

    Abstract: Data for training learning-enabled self-driving cars in the physical world are typically collected in a safe, normal environment. Such data distribution often engenders a strong bias towards safe driving, making self-driving cars unprepared when encountering adversarial scenarios like unexpected accidents. Due to a dearth of such adverse data that is unrealistic for drivers to collect, autonomous…

    Submitted 12 July, 2024; originally announced July 2024.

  22. arXiv:2406.09648  [pdf, other]

    cs.LG cs.CV

    An Intrinsic Vector Heat Network

    Authors: Alexander Gao, Maurice Chu, Mubbasir Kapadia, Ming C. Lin, Hsueh-Ti Derek Liu

    Abstract: Vector fields are widely used to represent and model flows for many science and engineering applications. This paper introduces a novel neural network architecture for learning tangent vector fields that are intrinsically defined on manifold surfaces embedded in 3D. Previous approaches to learning vector fields on surfaces treat vectors as multi-dimensional scalar fields, using traditional scalar-…

    Submitted 18 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  23. arXiv:2406.01431  [pdf, other]

    cs.RO

    Deep Stochastic Kinematic Models for Probabilistic Motion Forecasting in Traffic

    Authors: Laura Zheng, Sanghyun Son, Jing Liang, Xijun Wang, Brian Clipp, Ming C. Lin

    Abstract: In trajectory forecasting tasks for traffic, future output trajectories can be computed by advancing the ego vehicle's state with predicted actions according to a kinematics model. By unrolling predicted trajectories via time integration and models of kinematic dynamics, predicted trajectories should not only be kinematically feasible but also relate uncertainty from one timestep to the next. Whil…

    Submitted 6 September, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 8 pages

  24. arXiv:2406.01425  [pdf, ps, other]

    cs.CV

    Adaptive Sensitivity Analysis for Robust Augmentation against Natural Corruptions in Image Segmentation

    Authors: Laura Zheng, Wenjie Wei, Tony Wu, Jacob Clements, Shreelekha Revankar, Andre Harrison, Yu Shen, Ming C. Lin

    Abstract: Achieving robustness in image segmentation models is challenging due to the fine-grained nature of pixel-level classification. These models, which are crucial for many real-time perception applications, particularly struggle when faced with natural corruptions in the wild for autonomous systems. While sensitivity analysis can help us understand how input variables influence model outputs, its appl…

    Submitted 16 June, 2025; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages

  25. arXiv:2404.13445  [pdf, ps, other]

    cs.CV cs.GR

    DMesh: A Differentiable Mesh Representation

    Authors: Sanghyun Son, Matheus Gadelha, Yang Zhou, Zexiang Xu, Ming C. Lin, Yi Zhou

    Abstract: We present a differentiable representation, DMesh, for general 3D triangular meshes. DMesh considers both the geometry and connectivity information of a mesh. In our design, we first get a set of convex tetrahedra that compactly tessellates the domain based on Weighted Delaunay Triangulation (WDT), and select triangular faces on the tetrahedra to define the final mesh. We formulate probability of…

    Submitted 5 July, 2025; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: 36 pages, 24 figures. Updated with camera-ready version

  26. arXiv:2312.08710  [pdf, other]

    cs.LG cs.AI

    Gradient Informed Proximal Policy Optimization

    Authors: Sanghyun Son, Laura Yu Zheng, Ryan Sullivan, Yi-Ling Qiao, Ming C. Lin

    Abstract: We introduce a novel policy learning method that integrates analytical gradients from differentiable environments with the Proximal Policy Optimization (PPO) algorithm. To incorporate analytical gradients into the PPO framework, we introduce the concept of an α-policy that stands as a locally superior policy. By adaptively modifying the α value, we can effectively manage the influence of analytica…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 27 pages, NeurIPS 2023 Conference

  27. arXiv:2310.20491  [pdf, other]

    cs.RO

    Collaborative Decision-Making Using Spatiotemporal Graphs in Connected Autonomy

    Authors: Peng Gao, Yu Shen, Ming C. Lin

    Abstract: Collaborative decision-making is an essential capability for multi-robot systems, such as connected vehicles, to collaboratively control autonomous vehicles in accident-prone scenarios. Under limited communication bandwidth, capturing comprehensive situational awareness by integrating connected agents' observation is very challenging. In this paper, we propose a novel collaborative decision-making…

    Submitted 31 October, 2023; originally announced October 2023.

  28. arXiv:2309.08214  [pdf, other]

    cs.RO

    MTG: Mapless Trajectory Generator with Traversability Coverage for Outdoor Navigation

    Authors: Jing Liang, Peng Gao, Xuesu Xiao, Adarsh Jagan Sathyamoorthy, Mohamed Elnoor, Ming C. Lin, Dinesh Manocha

    Abstract: We present a novel learning-based trajectory generation algorithm for outdoor robot navigation. Our goal is to compute collision-free paths that also satisfy the environment-specific traversability constraints. Our approach is designed for global planning using limited onboard robot perception in mapless environments while ensuring comprehensive coverage of all traversable directions. Our formulat…

    Submitted 4 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 9

  29. arXiv:2309.04581  [pdf, other]

    cs.GR cs.CV cs.LG

    Dynamic Mesh-Aware Radiance Fields

    Authors: Yi-Ling Qiao, Alexander Gao, Yiran Xu, Yue Feng, Jia-Bin Huang, Ming C. Lin

    Abstract: Embedding polygonal mesh assets within photorealistic Neural Radiance Fields (NeRF) volumes, such that they can be rendered and their dynamics simulated in a physically consistent manner with the NeRF, is under-explored from the system perspective of integrating NeRF into the traditional graphics pipeline. This paper designs a two-way coupling between mesh and NeRF during rendering and simulation.…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  30. arXiv:2303.04901  [pdf, other]

    cs.RO cs.HC

    Towards Driving Policies with Personality: Modeling Behavior and Style in Risky Scenarios via Data Collection in Virtual Reality

    Authors: Laura Zheng, Julio Poveda, James Mullen, Shreelekha Revankar, Ming C. Lin

    Abstract: Autonomous driving research currently faces data sparsity in representation of risky scenarios. Such data is both difficult to obtain ethically in the real world, and unreliable to obtain via simulation. Recent advances in virtual reality (VR) driving simulators lower barriers to tackling this problem in simulation. We propose the first data collection framework for risky scenario driving data fro…

    Submitted 8 March, 2023; originally announced March 2023.

  31. arXiv:2301.05311  [pdf, other]

    cs.RO cs.GR

    A Framework for Active Haptic Guidance Using Robotic Haptic Proxies

    Authors: Niall L. Williams, Nicholas Rewkowski, Jiasheng Li, Ming C. Lin

    Abstract: Haptic feedback is an important component of creating an immersive mixed reality experience. Traditionally, haptic forces are rendered in response to the user's interactions with the virtual environment. In this work, we explore the idea of rendering haptic forces in a proactive manner, with the explicit intention to influence the user's behavior through compelling haptic forces. To this end, we p…

    Submitted 27 February, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

  32. arXiv:2212.03450  [pdf, other]

    cs.CV

    Tracking the Dynamics of the Tear Film Lipid Layer

    Authors: Tejasvi Kothapalli, Charlie Shou, Jennifer Ding, Jiayun Wang, Andrew D. Graham, Tatyana Svitova, Stella X. Yu, Meng C. Lin

    Abstract: Dry Eye Disease (DED) is one of the most common ocular diseases: over five percent of US adults suffer from DED. Tear film instability is a known factor for DED, and is thought to be regulated in large part by the thin lipid layer that covers and stabilizes the tear film. In order to aid eye related disease diagnosis, this work proposes a novel paradigm in using computer vision techniques to numer…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: NeurIPS Medical Imaging Workshop

  33. arXiv:2210.12352  [pdf, other]

    cs.CV cs.GR cs.LG

    NeuPhysics: Editable Neural Geometry and Physics from Monocular Videos

    Authors: Yi-Ling Qiao, Alexander Gao, Ming C. Lin

    Abstract: We present a method for learning 3D geometry and physics parameters of a dynamic scene from only a monocular RGB video input. To decouple the learning of underlying scene geometry from dynamic motion, we represent the scene as a time-invariant signed distance function (SDF) which serves as a reference frame, along with a time-conditioned deformation field. We further bridge this neural geometry re…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  34. arXiv:2210.08046  [pdf, other]

    cs.GR cs.LG cs.MA

    Differentiable Hybrid Traffic Simulation

    Authors: Sanghyun Son, Yi-Ling Qiao, Jason Sewall, Ming C. Lin

    Abstract: We introduce a novel differentiable hybrid traffic simulator, which simulates traffic using a hybrid model of both macroscopic and microscopic models and can be directly integrated into a neural network for traffic control and flow optimization. This is the first differentiable traffic simulator for macroscopic and hybrid models that can compute gradients for traffic states across time steps and i…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 13 pages, Siggraph Asia 2022 Journal Paper

    ACM Class: I.6.1; I.6.3

  35. arXiv:2210.05667  [pdf, other]

    cs.CV cs.AI cs.LG

    Human Body Measurement Estimation with Adversarial Augmentation

    Authors: Nataniel Ruiz, Miriam Bellver, Timo Bolkart, Ambuj Arora, Ming C. Lin, Javier Romero, Raja Bala

    Abstract: We present a Body Measurement network (BMnet) for estimating 3D anthropomorphic measurements of the human body shape from silhouette images. Training of BMnet is performed on data from real human subjects, and augmented with a novel adversarial body simulator (ABS) that finds and synthesizes challenging body shapes. ABS is based on the skinned multiperson linear (SMPL) body model, and aims to maxi…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Published at the International Conference on 3D Vision (3DV) 2022

  36. arXiv:2210.03772  [pdf, other]

    cs.RO cs.MA

    Traffic-Aware Autonomous Driving with Differentiable Traffic Simulation

    Authors: Laura Zheng, Sanghyun Son, Ming C. Lin

    Abstract: While there have been advancements in autonomous driving control and traffic simulation, there have been little to no works exploring their unification with deep learning. Works in both areas seem to focus on entirely different exclusive problems, yet traffic and driving are inherently related in the real world. In this paper, we present Traffic-Aware Autonomous Driving (TrAAD), a generalizable di…

    Submitted 6 April, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

  37. arXiv:2205.01758  [pdf, other]

    cs.LG cs.GR cs.RO

    Differentiable Simulation of Soft Multi-body Systems

    Authors: Yi-Ling Qiao, Junbang Liang, Vladlen Koltun, Ming C. Lin

    Abstract: We present a method for differentiable simulation of soft articulated bodies. Our work enables the integration of differentiable physical dynamics into gradient-based pipelines. We develop a top-down matrix assembly algorithm within Projective Dynamics and derive a generalized dry friction model for soft continuum using a new matrix splitting strategy. We derive a differentiable control framework…

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2021

  38. arXiv:2204.11785  [pdf, ps, other]

    cs.DM math.CO

    Graphs whose vertices of degree at least 2 lie in a triangle

    Authors: Vinicius L. do Forte, Min Chih Lin, Abilio Lucena, Nelson Maculan, Veronica A. Moyano, Jayme L. Szwarcfiter

    Abstract: A pendant vertex is one of degree one and an isolated vertex has degree zero. A neighborhood star-free (NSF for short) graph is one in which every vertex is contained in a triangle except pendant vertices and isolated vertices. This class has been considered before for several contexts. In the present paper, we study the complexity of the dominating induced matching (DIM) problem and the perfect e…

    Submitted 7 April, 2024; v1 submitted 25 April, 2022; originally announced April 2022.

    MSC Class: 05C70; 05C85; 68R07; 68R10; 68Q25

    ACM Class: G.2.2; F.2.m

  39. arXiv:2110.02411  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Voice Aging with Audio-Visual Style Transfer

    Authors: Justin Wilson, Sunyeong Park, Seunghye J. Wilson, Ming C. Lin

    Abstract: Face aging techniques have used generative adversarial networks (GANs) and style transfer learning to transform one's appearance to look younger/older. Identity is maintained by conditioning these generative networks on a learned vector representation of the source content. In this work, we apply a similar approach to age a speaker's voice, referred to as voice aging. We first analyze the classifi…

    Submitted 5 October, 2021; originally announced October 2021.

  40. arXiv:2110.02405  [pdf, other]

    cs.CV cs.SD eess.AS

    Echo-Reconstruction: Audio-Augmented 3D Scene Reconstruction

    Authors: Justin Wilson, Nicholas Rewkowski, Ming C. Lin, Henry Fuchs

    Abstract: Reflective and textureless surfaces such as windows, mirrors, and walls can be a challenge for object and scene reconstruction. These surfaces are often poorly reconstructed and filled with depth discontinuities and holes, making it difficult to cohesively reconstruct scenes that contain these planar discontinuities. We propose Echoreconstruction, an audio-visual method that uses the reflections o…

    Submitted 5 October, 2021; originally announced October 2021.

  41. arXiv:2110.02404  [pdf, other]

    cs.CV cs.SD eess.AS

    3D-MOV: Audio-Visual LSTM Autoencoder for 3D Reconstruction of Multiple Objects from Video

    Authors: Justin Wilson, Ming C. Lin

    Abstract: 3D object reconstructions of transparent and concave structured objects, with inferred material properties, remains an open research problem for robot navigation in unstructured environments. In this paper, we propose a multimodal single- and multi-frame neural network for 3D reconstructions using audio-visual inputs. Our trained reconstruction LSTM autoencoder 3D-MOV accepts multiple inputs to ac…

    Submitted 5 October, 2021; originally announced October 2021.

  42. arXiv:2109.07719  [pdf, other]

    cs.LG cs.GR cs.RO

    Efficient Differentiable Simulation of Articulated Bodies

    Authors: Yi-Ling Qiao, Junbang Liang, Vladlen Koltun, Ming C. Lin

    Abstract: We present a method for efficient differentiable simulation of articulated bodies. This enables integration of articulated body dynamics into deep learning frameworks, and gradient-based optimization of neural networks that operate on articulated bodies. We derive the gradients of the forward dynamics using spatial algebra and the adjoint method. Our approach is an order of magnitude faster than a…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: ICML 2021

  43. arXiv:2102.13262  [pdf, other]

    cs.CV cs.LG cs.RO

    Improving Robustness of Learning-based Autonomous Steering Using Adversarial Images

    Authors: Yu Shen, Laura Zheng, Manli Shu, Weizi Li, Tom Goldstein, Ming C. Lin

    Abstract: For safety of autonomous driving, vehicles need to be able to drive under various lighting, weather, and visibility conditions in different environments. These external and environmental factors, along with internal factors associated with sensors, can pose significant challenges to perceptual data processing, hence affecting the decision-making and control of the vehicle. In this work, we address…

    Submitted 25 February, 2021; originally announced February 2021.

  44. arXiv:2008.07436  [pdf, other]

    cs.RO

    Multi-Agent Coverage in Urban Environments

    Authors: Shivang Patel, Senthil Hariharan, Pranav Dhulipala, Ming C Lin, Dinesh Manocha, Huan Xu, Michael Otte

    Abstract: We study multi-agent coverage algorithms for autonomous monitoring and patrol in urban environments. We consider scenarios in which a team of flying agents uses downward facing cameras (or similar sensors) to observe the environment outside of buildings at street-level. Buildings are considered obstacles that impede movement, and cameras are assumed to be ineffective above a maximum altitude. We s…

    Submitted 17 August, 2020; originally announced August 2020.

  45. arXiv:2007.02168  [pdf, other]

    cs.LG cs.GR stat.ML

    Scalable Differentiable Physics for Learning and Control

    Authors: Yi-Ling Qiao, Junbang Liang, Vladlen Koltun, Ming C. Lin

    Abstract: Differentiable physics is a powerful approach to learning and control problems that involve physical objects and environments. While notable progress has been made, the capabilities of differentiable physics solvers remain limited. We develop a scalable framework for differentiable physics that can support a large number of objects and their interactions. To accommodate objects with arbitrary geom…

    Submitted 4 July, 2020; originally announced July 2020.

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, ICML 2020

  46. arXiv:1908.09464  [pdf, other]

    cs.CV

    Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images

    Authors: Junbang Liang, Ming C. Lin

    Abstract: We propose a scalable neural network framework to reconstruct the 3D mesh of a human body from multi-view images, in the subspace of the SMPL model. Use of multi-view images can significantly reduce the projection ambiguity of the problem, increasing the reconstruction accuracy of the 3D human body under clothing. Our experiments show that this method benefits from the synthetic dataset generated…

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: To be published to ICCV 2019

  47. arXiv:1907.08874  [pdf, other]

    cs.RO

    ADAPS: Autonomous Driving Via Principled Simulations

    Authors: Weizi Li, David Wolinski, Ming C. Lin

    Abstract: Autonomous driving has gained significant advancements in recent years. However, obtaining a robust control policy for driving remains challenging as it requires training data from a variety of scenarios, including rare situations (e.g., accidents), an effective policy architecture, and an efficient learning mechanism. We propose ADAPS for producing robust control policies for autonomous vehicles.…

    Submitted 20 July, 2019; originally announced July 2019.

    Comments: Accepted to ICRA2019

  48. arXiv:1902.08379  [pdf, other]

    cs.RO

    LSwarm: Efficient Collision Avoidance for Large Swarms with Coverage Constraints in Complex Urban Scenes

    Authors: Senthil Hariharan Arul, Adarsh Jagan Sathyamoorthy, Shivang Patel, Michael Otte, Huan Xu, Ming C Lin, Dinesh Manocha

    Abstract: In this paper, we address the problem of collision avoidance for a swarm of UAVs used for continuous surveillance of an urban environment. Our method, LSwarm, efficiently avoids collisions with static obstacles, dynamic obstacles and other agents in 3-D urban environments while considering coverage constraints. LSwarm computes collision avoiding velocities that (i) maximize the conformity of an ag…

    Submitted 26 May, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: 11 pages

  49. arXiv:1810.12295  [pdf]

    cs.OH

    Estimating Traffic Conditions At Metropolitan Scale Using Traffic Flow Theory

    Authors: Weizi Li, Meilei Jiang, Yaoyu Chen, Ming C. Lin

    Abstract: The rapid urbanization and increasing traffic have serious social, economic, and environmental impact on metropolitan areas worldwide. It is of a great importance to understand the complex interplay of road networks and traffic conditions. The authors propose a novel framework to estimate traffic conditions at the metropolitan scale using GPS traces. Their approach begins with an initial estimatio…

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: Accepted in Transportation Research Board 97th Annual Meeting, 2018

  50. arXiv:1705.08379  [pdf, ps, other]

    cs.DM

    Perfect Edge Domination: Hard and Solvable Cases

    Authors: Min Chih Lin, Vadim Lozin, Veronica A. Moyano, Jayme L. Szwarcfiter

    Abstract: Let $G$ be an undirected graph. An edge of $G$ dominates itself and all edges adjacent to it. A subset $E'$ of edges of $G$ is an edge dominating set of $G$, if every edge of the graph is dominated by some edge of $E'$. We say that $E'$ is a perfect edge dominating set of $G$, if every edge not in $E'$ is dominated by exactly one edge of $E'$. The perfect edge dominating problem is to determine a…

    Submitted 23 May, 2017; originally announced May 2017.