Skip to main content

Showing 1–50 of 190 results for author: Ke, L

.
  1. arXiv:2604.03785  [pdf, ps, other

    cs.AI cs.MA

    Decomposing Communication Gain and Delay Cost Under Cross-Timestep Delays in Cooperative Multi-Agent Reinforcement Learning

    Authors: Zihong Gao, Hongjian Liang, Lei Hao, Liangjun Ke

    Abstract: Communication is essential for coordination in \emph{cooperative} multi-agent reinforcement learning under partial observability, yet \emph{cross-timestep} delays cause messages to arrive multiple timesteps after generation, inducing temporal misalignment and making information stale when consumed. We formalize this setting as a delayed-communication partially observable Markov game (DeComm-POMG… ▽ More

    Submitted 4 April, 2026; originally announced April 2026.

  2. arXiv:2603.06569  [pdf, ps, other

    cs.CV

    Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

    Authors: Boqiang Zhang, Lei Ke, Ruihan Yang, Qi Gao, Tianyuan Qu, Rossell Chen, Dong Yu, Leoweiliang

    Abstract: Vision Language Model (VLM) development has largely relied on scaling model size, which hinders deployment on compute-constrained mobile and edge devices such as smartphones and robots. In this work, we explore the performance limits of compact (e.g., 2B and 8B) VLMs. We challenge the prevailing practice that state-of-the-art VLMs must rely on vision encoders initialized via massive contrastive pr… ▽ More

    Submitted 14 March, 2026; v1 submitted 6 March, 2026; originally announced March 2026.

    Comments: Penguin-VL demonstrates that text-only initialized vision encoders can achieve superior performance in multimodal understanding tasks; Code: https://github.com/tencent-ailab/Penguin-VL

  3. arXiv:2602.18527  [pdf, ps, other

    cs.CV cs.AI cs.SD

    JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments

    Authors: Zhan Liu, Changli Tang, Yuxin Wang, Zhiyuan Zhu, Youjun Chen, Yiwen Shao, Tianzi Wang, Lei Ke, Zengrui Jin, Chao Zhang

    Abstract: Current audio-visual large language models (AV-LLMs) are predominantly restricted to 2D perception, relying on RGB video and monaural audio. This design choice introduces a fundamental dimensionality mismatch that precludes reliable source localization and spatial reasoning in complex 3D environments. We address this limitation by presenting JAEGER, a framework that extends AV-LLMs to 3D space, to… ▽ More

    Submitted 19 February, 2026; originally announced February 2026.

  4. arXiv:2602.14571  [pdf, ps, other

    cs.LG hep-ex

    DCTracks: An Open Dataset for Machine Learning-Based Drift Chamber Track Reconstruction

    Authors: Qian Liyan, Zhang Yao, Yuan Ye, Zhang Zhaoke, Fang Jin, Jiang Shimiao, Zhang Jin, Li Ke, Liu Beijiang, Xu Chenglin, Zhang Yifan, Jia Xiaoqian, Qin Xiaoshuai, Huang Xingtao

    Abstract: We introduce a Monte Carlo (MC) dataset of single- and two-track drift chamber events to advance Machine Learning (ML)-based track reconstruction. To enable standardized and comparable evaluation, we define track reconstruction specific metrics and report results for traditional track reconstruction algorithms and a Graph Neural Networks (GNNs) method, facilitating rigorous, reproducible validatio… ▽ More

    Submitted 16 February, 2026; originally announced February 2026.

  5. arXiv:2602.12013  [pdf, ps, other

    cs.AI

    InjectRBP: Steering Large Language Model Reasoning Behavior via Pattern Injection

    Authors: Xiuping Wu, Zhao Yu, Yuxin Cheng, Ngai Wong, Liangjun Ke, Tapas Mishra, Konstantinos V. Katsikopoulos

    Abstract: Reasoning can significantly enhance the performance of Large Language Models. While recent studies have exploited behavior-related prompts adjustment to enhance reasoning, these designs remain largely intuitive and lack a systematic analysis of the underlying behavioral patterns. Motivated by this, we investigate how models' reasoning behaviors shape reasoning from the perspective of behavioral pa… ▽ More

    Submitted 12 February, 2026; originally announced February 2026.

  6. arXiv:2602.11146  [pdf, ps, other

    cs.CV cs.AI

    Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling

    Authors: Gongye Liu, Bo Yang, Yida Zhi, Zhizhou Zhong, Lei Ke, Didan Deng, Han Gao, Yongxiang Huang, Kaihao Zhang, Hongbo Fu, Wenhan Luo

    Abstract: Preference optimization for diffusion and flow-matching models relies on reward functions that are both discriminatively robust and computationally efficient. Vision-Language Models (VLMs) have emerged as the primary reward provider, leveraging their rich multimodal priors to guide alignment. However, their computation and memory cost can be substantial, and optimizing a latent diffusion generator… ▽ More

    Submitted 11 February, 2026; originally announced February 2026.

    Comments: Code: https://github.com/HKUST-C4G/diffusion-rm

  7. arXiv:2602.08900  [pdf, ps, other

    cond-mat.str-el

    Anisotropy, frustration and saddle point in the twisted Kagome antiferromagnet ErPdPb

    Authors: Resham Babu Regmi, Sk Jamaluddin, Y. Lee, Hari Bhandari, Po-Hao Chang, Peter E. Siegfried, Abhijeet Nayak, Mohamed El Gazzah, Bence G. Márkus, Anna Nyáry, Zachary T. Messegee, Miya P. Zhao, Xiaoyan Tan, László Forró, Liqin Ke, Igor I. Mazin, Nirmal J. Ghimire

    Abstract: The kagome lattice, with its inherent geometric frustration, provides a rich platform for exploring intriguing magnetic phenomena and topological electronic structures. In reduced-symmetry structures, such as twisted kagome systems involving rare earth elements, additional anisotropy can arise, enabling intriguing properties including spin-ice states, magnetocaloric effects, noncollinear magnetic… ▽ More

    Submitted 9 February, 2026; originally announced February 2026.

  8. arXiv:2602.06992  [pdf

    cs.CY cs.AI cs.HC

    A New Mode of Teaching Chinese as a Foreign Language from the Perspective of Smart System Studied by Using Rongzhixue

    Authors: Xiaohui Zou, Lijun Ke, Shunpeng Zou

    Abstract: The purpose of this study is to introduce a new model of teaching Chinese as a foreign language from the perspective of integrating wisdom. Its characteristics are as follows: focusing on the butterfly model of interpretation before translation, highlighting the new method of bilingual thinking training, on the one hand, applying the new theory of Chinese characters, the theory of the relationship… ▽ More

    Submitted 28 January, 2026; originally announced February 2026.

    Comments: 11 pages, in Chinese language, 22 figures

  9. arXiv:2601.05697  [pdf, ps, other

    hep-ex

    Reconstruction of atmospheric neutrinos in DUNE's horizontal-drift far-detector module

    Authors: DUNE Collaboration, S. Abbaslu, F. Abd Alrahman, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1325 additional authors not shown)

    Abstract: This paper reports on the capabilities in reconstructing and identifying atmospheric neutrino interactions in one of the Deep Underground Neutrino Experiment's (DUNE) far detector modules, a liquid argon time projection chamber (LArTPC) with horizontal drift (FD-HD) of ionization electrons. The reconstruction is based upon the workflow developed for DUNE's long-baseline oscillation analysis, with… ▽ More

    Submitted 9 January, 2026; originally announced January 2026.

    Comments: 43 pages, 23 figures

    Report number: FERMILAB-PUB-25-0961-LBNF

  10. arXiv:2512.18215  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.CV

    Stable and Efficient Single-Rollout RL for Multimodal Reasoning

    Authors: Rui Liu, Dian Yu, Lei Ke, Haolin Liu, Yujun Zhou, Zhenwen Liang, Haitao Mi, Pratap Tokekar, Dong Yu

    Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has become a key paradigm to improve the reasoning capabilities of Multimodal Large Language Models (MLLMs). However, prevalent group-based algorithms such as GRPO require multi-rollout sampling for each prompt. While more efficient single-rollout variants have recently been explored in text-only settings, we find that they suffer from severe i… ▽ More

    Submitted 20 December, 2025; originally announced December 2025.

  11. arXiv:2512.16864  [pdf, ps, other

    cs.CV

    RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing

    Authors: Tianyuan Qu, Lei Ke, Xiaohang Zhan, Longxiang Tang, Yuqi Liu, Bohao Peng, Bei Yu, Dong Yu, Jiaya Jia

    Abstract: Instruction-based image editing enables natural-language control over visual modifications, yet existing models falter under Instruction-Visual Complexity (IV-Complexity), where intricate instructions meet cluttered or ambiguous scenes. We introduce RePlan (Region-aligned Planning), a plan-then-execute framework that couples a vision-language planner with a diffusion editor. The planner decomposes… ▽ More

    Submitted 18 December, 2025; originally announced December 2025.

    Comments: Precise region control and planning for instruction-based image editing. Our project page: https://replan-iv-edit.github.io

  12. arXiv:2512.16561  [pdf, ps, other

    cs.CV

    N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

    Authors: Yuxin Wang, Lei Ke, Boqiang Zhang, Tianyuan Qu, Hanxun Yu, Zhenpeng Huang, Meng Yu, Dan Xu, Dong Yu

    Abstract: While current multimodal models can answer questions based on 2D images, they lack intrinsic 3D object perception, limiting their ability to comprehend spatial relationships and depth cues in 3D scenes. In this work, we propose N3D-VLM, a novel unified framework that seamlessly integrates native 3D object perception with 3D-aware visual reasoning, enabling both precise 3D grounding and interpretab… ▽ More

    Submitted 18 December, 2025; originally announced December 2025.

    Comments: Project Page: https://n3d-vlm.github.io

  13. arXiv:2512.10284  [pdf, ps, other

    cs.CV cs.AI cs.CL

    MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

    Authors: Yixin Wan, Lei Ke, Wenhao Yu, Kai-Wei Chang, Dong Yu

    Abstract: We introduce MotionEdit, a novel dataset for motion-centric image editing-the task of modifying subject actions and interactions while preserving identity, structure, and physical plausibility. Unlike existing image editing datasets that focus on static appearance changes or contain only sparse, low-quality motion edits, MotionEdit provides high-fidelity image pairs depicting realistic motion tran… ▽ More

    Submitted 13 December, 2025; v1 submitted 10 December, 2025; originally announced December 2025.

    Comments: Technical Report. We propose MotionEdit, a dataset and benchmark for motion-centric image editing. We also introduce MotionNFT, a reward training framework to improve existing models with motion-aware guidance. Github: https://github.com/elainew728/motion-edit/

  14. arXiv:2512.09518  [pdf, ps, other

    cond-mat.mes-hall

    Controlling Skyrmion Lattices via Strain: Elongation, Tilting, and Collapse Mechanisms

    Authors: Haijun Zhao, Tae-Hoon Kim, Lin Zhou, Liqin Ke

    Abstract: This study establishes a comprehensive framework for the three-dimensional strain control of magnetic skyrmion strings. We integrate analytical modeling, micromagnetic simulations, and \textit{in situ} Lorentz transmission electron microscopy experiments to demonstrate that externally applied strain is a powerful stimuli for manipulating three-dimensional magnetic skyrmion strings. Analytical mode… ▽ More

    Submitted 10 December, 2025; originally announced December 2025.

    Journal ref: Phys. Rev. B 112, 214417 (2025)

  15. Topological Defect Mediated Helical Phase Reorientation by Uniaxial Stress

    Authors: Tae-Hoon Kim, Haijun Zhao, Brandt A. Jensen, Liqin Ke, Lin Zhou

    Abstract: Strain engineering enables precise, energy-efficient control of nanoscale magnetism. However, unlike well-studied strain-dislocation interactions in mechanical deformation, the spatial evolution of strain-induced spin rearrangement remains poorly understood. Using \emph{in situ} Lorentz transmission electron microscopy, we manipulate and observe helical domain reorientation under quantitatively ap… ▽ More

    Submitted 6 December, 2025; originally announced December 2025.

    Journal ref: Phys. Rev. Lett. 134, 136704 (2025)

  16. arXiv:2512.06465  [pdf, ps, other

    cond-mat.mes-hall physics.app-ph

    Phase-Factor-Controlled Surface Spirals in the Magnetic Conical Phase: The Role of In-Plane Directionality

    Authors: Haijun Zhao, Tae-Hoon Kim, Lin Zhou, Liqin Ke

    Abstract: In chiral magnets, the magnetic textures surrounding domain walls exhibit a rich variety of structures, offering insights into fundamental physics and potential applications in spintronic devices. Conical spirals and related structures possess intrinsic in-plane directionalities governed by phase factors $φ_0$, which are often obscured in long spirals due to cylindrical symmetry but become promine… ▽ More

    Submitted 6 December, 2025; originally announced December 2025.

    Journal ref: Phys. Rev. Applied 24, 064023 (2025)

  17. arXiv:2511.18834  [pdf, ps, other

    cs.CV cs.AI

    FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories

    Authors: Lei Ke, Hubery Yin, Gongye Liu, Zhengyao Lv, Jingcai Guo, Chen Li, Wenhan Luo, Yujiu Yang, Jing Lyu

    Abstract: With the success of flow matching in visual generation, sampling efficiency remains a critical bottleneck for its practical application. Among flow models' accelerating methods, ReFlow has been somehow overlooked although it has theoretical consistency with flow matching. This is primarily due to its suboptimal performance in practical scenarios compared to consistency distillation and score disti… ▽ More

    Submitted 24 November, 2025; originally announced November 2025.

    Comments: Few-Step Image Synthesis

  18. arXiv:2511.14759  [pdf, ps, other

    cs.LG cs.RO

    $π^{*}_{0.6}$: a VLA That Learns From Experience

    Authors: Physical Intelligence, Ali Amin, Raichelle Aniceto, Ashwin Balakrishna, Kevin Black, Ken Conley, Grace Connors, James Darpinian, Karan Dhabalia, Jared DiCarlo, Danny Driess, Michael Equi, Adnan Esmail, Yunhao Fang, Chelsea Finn, Catherine Glossop, Thomas Godden, Ivan Goryachev, Lachy Groom, Hunter Hancock, Karol Hausman, Gashon Hussein, Brian Ichter, Szymon Jakubczak, Rowan Jen , et al. (31 additional authors not shown)

    Abstract: We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL). We present a general-purpose method, RL with Experience and Corrections via Advantage-conditioned Policies (RECAP), that provides for RL training of VLAs via advantage conditioning. Our method incorporates heterogeneous data into the self-improvement process, including demon… ▽ More

    Submitted 18 November, 2025; v1 submitted 18 November, 2025; originally announced November 2025.

  19. arXiv:2511.13462  [pdf, ps, other

    hep-ex

    Measurement of Exclusive $π^+$--argon Interactions Using ProtoDUNE-SP

    Authors: DUNE Collaboration, S. Abbaslu, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, M. Andreotti , et al. (1304 additional authors not shown)

    Abstract: We present the measurement of $π^{+}$--argon inelastic cross sections using the ProtoDUNE Single-Phase liquid argon time projection chamber in the incident $π^+$ kinetic energy range of 500 -- 800 MeV in multiple exclusive channels (absorption, charge exchange, and the remaining inelastic interactions). The results of this analysis are important inputs to simulations of liquid argon neutrino exper… ▽ More

    Submitted 17 November, 2025; originally announced November 2025.

    Report number: CERN-EP-2025-268; FERMILAB-PUB-25-0732-LBNF

  20. arXiv:2511.11925  [pdf, ps, other

    nucl-ex hep-ex

    First Measurement of $π^+$-Ar and $p$-Ar Total Inelastic Cross Sections in the Sub-GeV Energy Regime with ProtoDUNE-SP Data

    Authors: DUNE Collaboration, S. Abbaslu, F. Abd Alrahman, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, L. Aliaga Soplin, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1327 additional authors not shown)

    Abstract: The ProtoDUNE-SP detector, a kiloton-scale prototype for the Deep Underground Neutrino Experiment (DUNE), is the largest liquid argon time projection chamber built to date. Operated at CERN from 2018 to 2020, it collected both cosmic-ray data and a beam consisting of positively-charged particles with discrete momentum settings across a range of 0.3 GeV/$c$ to 7 GeV/$c$. In this letter, we report t… ▽ More

    Submitted 14 November, 2025; originally announced November 2025.

    Report number: FERMILAB-PUB-25-0814-LBNF, CERN-EP-2025-266

  21. arXiv:2511.08469  [pdf, ps, other

    cs.NE eess.SP

    Spatio-Temporal Cluster-Triggered Encoding for Spiking Neural Networks

    Authors: Lingyun Ke, Minchi Hu

    Abstract: Encoding static images into spike trains is a crucial step for enabling Spiking Neural Networks (SNNs) to process visual information efficiently. However, existing schemes such as rate coding, Poisson encoding, and time-to-first-spike (TTFS) often ignore spatial relationships and yield temporally inconsistent spike patterns. In this article, a novel cluster-based encoding approach is proposed, whi… ▽ More

    Submitted 11 November, 2025; originally announced November 2025.

    Comments: 7 pages, 3 figures at present

  22. arXiv:2511.07904  [pdf, ps, other

    cs.LG cs.AI

    Test-driven Reinforcement Learning in Continuous Control

    Authors: Zhao Yu, Xiuping Wu, Liangjun Ke

    Abstract: Reinforcement learning (RL) has been recognized as a powerful tool for robot control tasks. RL typically employs reward functions to define task objectives and guide agent learning. However, since the reward function serves the dual purpose of defining the optimal goal and guiding learning, it is challenging to design the reward function manually, which often results in a suboptimal task represent… ▽ More

    Submitted 9 December, 2025; v1 submitted 11 November, 2025; originally announced November 2025.

    Comments: AAAI 2026 oral

  23. arXiv:2511.07606  [pdf, ps, other

    cond-mat.str-el

    Hidden symmetry-breaking in a kagome Ising ferromagnet

    Authors: Tianxiong Han, Tyler J. Slade, Liqin Ke, Qing-Ping Ding, Minseong Lee, Ryan D. McKenzie, Bing Li, Durba R. Jaishi, Yongbin Lee, Daniel M. Pajerowski, Qiang Zhang, Tao Hong, Paul C. Canfield, Yuji Furukawa, Komalavalli Thirunavukkuarasu, Aashish Sapkota, Rebecca Flint, Robert J. McQueeney

    Abstract: Kagome metals can host unconventional electronic phenomena that emerge from their frustrated lattice geometry and associated band topology. Correlated electronic orders, such as charge-density waves and superconductivity, are observed to intertwine with subtle time-reversal symmetry breaking whose microscopic origin is not currently understood. Here, we provide evidence for such time-reversal symm… ▽ More

    Submitted 10 November, 2025; originally announced November 2025.

    Comments: 7 pages, 4 figures

  24. arXiv:2511.03088  [pdf, ps, other

    econ.GN

    A Computer Vision Based Proxy for Political Polarization in Religious Countries: A Turkiye Case Study

    Authors: Liangze Ke

    Abstract: This paper examines a novel proxy for political polarization, initially proposed by Caliskan et al., which estimates intergroup distances using computer vision. Analyzing 1,400+ YouTube videos with advanced object detection, their study quantifies demographic and religious divides in Turkiye, a deeply polarized nation. Our findings reveal strong correlations between intergroup distances and electo… ▽ More

    Submitted 4 November, 2025; originally announced November 2025.

    Comments: Preprint. 44 pages, 12 figures, 7 tables. Data were obtained while the first author was at the Caliskan Lab and are not public. Access may be granted upon reasonable request with provider approval; contact cantay.caliskan@gmail.com. Special thanks to Dr. Cantay Caliskan; this work is dedicated to him

    MSC Class: D72; Z12; C55

  25. arXiv:2510.23571  [pdf, ps, other

    cs.RO cs.AI cs.CV cs.LG

    RobotArena $\infty$: Scalable Robot Benchmarking via Real-to-Sim Translation

    Authors: Yash Jangir, Yidi Zhang, Pang-Chi Lo, Kashu Yamazaki, Chenyu Zhang, Kuan-Hsun Tu, Tsung-Wei Ke, Lei Ke, Yonatan Bisk, Katerina Fragkiadaki

    Abstract: The pursuit of robot generalists, agents capable of performing diverse tasks across diverse environments, demands rigorous and scalable evaluation. Yet real-world testing of robot policies remains fundamentally constrained: it is labor-intensive, slow, unsafe at scale, and difficult to reproduce. As policies expand in scope and complexity, these barriers only intensify, since defining "success" in… ▽ More

    Submitted 19 March, 2026; v1 submitted 27 October, 2025; originally announced October 2025.

    Comments: Website: https://robotarenainf.github.io

  26. arXiv:2510.08380  [pdf, ps, other

    hep-ex

    Identification of low-energy kaons in the ProtoDUNE-SP detector

    Authors: DUNE Collaboration, S. Abbaslu, F. Abd Alrahman, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1325 additional authors not shown)

    Abstract: The Deep Underground Neutrino Experiment (DUNE) is a next-generation neutrino experiment with a rich physics program that includes searches for the hypothetical phenomenon of proton decay. Utilizing liquid-argon time-projection chamber technology, DUNE is expected to achieve world-leading sensitivity in the proton decay channels that involve charged kaons in their final states. The first DUNE demo… ▽ More

    Submitted 9 October, 2025; originally announced October 2025.

    Report number: CERN-EP-2025-231, FERMILAB-PUB-25-0717-LBNF

  27. arXiv:2510.03345  [pdf

    cs.LG cs.AI

    Pilot selection in the era of Virtual reality: algorithms for accurate and interpretable machine learning models

    Authors: Luoma Ke, Guangpeng Zhang, Jibo He, Yajing Li, Yan Li, Xufeng Liu, Peng Fang

    Abstract: With the rapid growth of the aviation industry, there is a need for a large number of flight crew. How to select the right pilots in a cost-efficient manner has become an important research question. In the current study, twenty-three pilots were recruited from China Eastern Airlines, and 23 novices were from the community of Tsinghua University. A novel approach incorporating machine learning and… ▽ More

    Submitted 2 October, 2025; originally announced October 2025.

  28. arXiv:2509.07664  [pdf, ps, other

    hep-ex

    Towards mono-energetic virtual $ν$ beam cross-section measurements: A feasibility study of $ν$-Ar interaction analysis with DUNE-PRISM

    Authors: DUNE Collaboration, S. Abbaslu, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, M. Andreotti , et al. (1302 additional authors not shown)

    Abstract: Neutrino-nucleus cross-section measurements are critical for future neutrino oscillation analyses. However, our models to describe them require further refinement, and a deeper understanding of the underlying physics is essential for future neutrino oscillation experiments to realize their ambitious physics goals. Current neutrino cross-section measurements provide clear deficiencies in neutrino i… ▽ More

    Submitted 9 September, 2025; originally announced September 2025.

    Report number: FERMILAB-PUB-25-0627-LBNF

  29. arXiv:2509.07012  [pdf, ps, other

    physics.ins-det hep-ex

    Operation of a Modular 3D-Pixelated Liquid Argon Time-Projection Chamber in a Neutrino Beam

    Authors: DUNE Collaboration, S. Abbaslu, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, M. Andreotti , et al. (1299 additional authors not shown)

    Abstract: The 2x2 Demonstrator, a prototype for the Deep Underground Neutrino Experiment (DUNE) liquid argon (LAr) Near Detector, was exposed to the Neutrinos from the Main Injector (NuMI) neutrino beam at Fermi National Accelerator Laboratory (Fermilab). This detector prototypes a new modular design for a liquid argon time-projection chamber (LArTPC), comprised of a two-by-two array of four modules, each f… ▽ More

    Submitted 6 September, 2025; originally announced September 2025.

    Report number: FERMILAB-PUB-25-0537-LBNF

  30. arXiv:2508.21060  [pdf, ps, other

    cs.CV

    Multi-View 3D Point Tracking

    Authors: Frano Rajič, Haofei Xu, Marko Mihajlovic, Siyuan Li, Irem Demir, Emircan Gündoğdu, Lei Ke, Sergey Prokudin, Marc Pollefeys, Siyu Tang

    Abstract: We introduce the first data-driven multi-view 3D point tracker, designed to track arbitrary points in dynamic scenes using multiple camera views. Unlike existing monocular trackers, which struggle with depth ambiguities and occlusion, or prior multi-camera methods that require over 20 cameras and tedious per-sequence optimization, our feed-forward model directly predicts 3D correspondences using a… ▽ More

    Submitted 28 August, 2025; originally announced August 2025.

    Comments: ICCV 2025, Oral. Project page: https://ethz-vlg.github.io/mvtracker

  31. arXiv:2508.19496  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.str-el quant-ph

    Accurate calculation of light rare-earth magnetic anisotropy with density functional theory

    Authors: Liqin Ke, R. Flint, Y. Lee

    Abstract: Density functional theory (DFT) has long struggled to treat light rare-earth magnetism. We show that this difficulty arises from an overestimate of the $4f$ charge asphericity, and thus the magnetic anisotropy energy, due to the inadequacy of single Slater-determinant representations. We propose an effective solution by combining constrained DFT+U with crystal field theory and a systematic many-bo… ▽ More

    Submitted 26 August, 2025; originally announced August 2025.

    Comments: 6 pages, 3 figures

  32. Generative Video Matting

    Authors: Yongtao Ge, Kangyang Xie, Guangkai Xu, Mingyu Liu, Li Ke, Longtao Huang, Hui Xue, Hao Chen, Chunhua Shen

    Abstract: Video matting has traditionally been limited by the lack of high-quality ground-truth data. Most existing video matting datasets provide only human-annotated imperfect alpha and foreground annotations, which must be composited to background images or videos during the training stage. Thus, the generalization capability of previous methods in real-world scenarios is typically poor. In this work, we… ▽ More

    Submitted 11 August, 2025; originally announced August 2025.

    Journal ref: SIGGRAPH Conference Papers 2025

  33. arXiv:2507.08586  [pdf, ps, other

    physics.ins-det hep-ex

    Spatial and Temporal Evaluations of the Liquid Argon Purity in ProtoDUNE-SP

    Authors: DUNE Collaboration, S. Abbaslu, A. Abed Abud, R. Acciarri, L. P. Accorsi, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, C. Adriano, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, M. Andreotti , et al. (1301 additional authors not shown)

    Abstract: Liquid argon time projection chambers (LArTPCs) rely on highly pure argon to ensure that ionization electrons produced by charged particles reach readout arrays. ProtoDUNE Single-Phase (ProtoDUNE-SP) was an approximately 700-ton liquid argon detector intended to prototype the Deep Underground Neutrino Experiment (DUNE) Far Detector Horizontal Drift module. It contains two drift volumes bisected by… ▽ More

    Submitted 27 August, 2025; v1 submitted 11 July, 2025; originally announced July 2025.

    Report number: CERN-EP-2025-157, FERMILAB-PUB-25-0445-V

    Journal ref: JINST (2025) 20 P09008

  34. arXiv:2507.04456  [pdf, ps, other

    cs.CV

    BiVM: Accurate Binarized Neural Network for Efficient Video Matting

    Authors: Haotong Qin, Xianglong Liu, Xudong Ma, Lei Ke, Yulun Zhang, Jie Luo, Michele Magno

    Abstract: Deep neural networks for real-time video matting suffer significant computational limitations on edge devices, hindering their adoption in widespread applications such as online conferences and short-form video production. Binarization emerges as one of the most common compression approaches with compact 1-bit parameters and efficient bitwise operations. However, accuracy and efficiency limitation… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  35. arXiv:2506.13867  [pdf, ps, other

    cs.RO

    ATK: Automatic Task-driven Keypoint Selection for Robust Policy Learning

    Authors: Yunchu Zhang, Shubham Mittal, Zhengyu Zhang, Liyiming Ke, Siddhartha Srinivasa, Abhishek Gupta

    Abstract: Visuomotor policies often suffer from perceptual challenges, where visual differences between training and evaluation environments degrade policy performance. Policies relying on state estimations, like 6D pose, require task-specific tracking and are difficult to scale, while raw sensor-based policies may lack robustness to small visual disturbances. In this work, we leverage 2D keypoints--spatial… ▽ More

    Submitted 4 October, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

  36. arXiv:2506.12716  [pdf, ps, other

    cs.CV

    Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors

    Authors: Wen-Hsuan Chu, Lei Ke, Jianmeng Liu, Mingxiao Huo, Pavel Tokmakov, Katerina Fragkiadaki

    Abstract: We tackle the challenge of generating dynamic 4D scenes from monocular, multi-object videos with heavy occlusions, and introduce GenMOJO, a novel approach that integrates rendering-based deformable 3D Gaussian optimization with generative priors for view synthesis. While existing models perform well on novel view synthesis for isolated objects, they struggle to generalize to complex, cluttered sce… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: This is an updated and extended version of our CVPR paper "Robust Multi-Object 4D Generation in Complex Video Scenarios"

  37. arXiv:2505.23309  [pdf, other

    cs.LG cs.AI

    Score-based Generative Modeling for Conditional Independence Testing

    Authors: Yixin Ren, Chenghou Jin, Yewei Xia, Li Ke, Longtao Huang, Hui Xue, Hao Zhang, Jihong Guan, Shuigeng Zhou

    Abstract: Determining conditional independence (CI) relationships between random variables is a fundamental yet challenging task in machine learning and statistics, especially in high-dimensional settings. Existing generative model-based CI testing methods, such as those utilizing generative adversarial networks (GANs), often struggle with undesirable modeling of conditional distributions and training insta… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Accepted by KDD2025

  38. arXiv:2505.23054  [pdf, ps, other

    cs.CV

    Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object

    Authors: Yuxuan Lin, Ruihang Chu, Zhenyu Chen, Xiao Tang, Lei Ke, Haoling Li, Yingji Zhong, Zhihao Li, Shiyong Liu, Xiaofei Wu, Jianzhuang Liu, Yujiu Yang

    Abstract: Generative 3D reconstruction shows strong potential in incomplete observations. While sparse-view and single-image reconstruction are well-researched, partial observation remains underexplored. In this context, dense views are accessible only from a specific angular range, with other perspectives remaining inaccessible. This task presents two main challenges: (i) limited View Range: observations c… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  39. arXiv:2505.02936  [pdf, other

    cond-mat.mtrl-sci

    Doping-induced Spin Reorientation in Kagome Magnet TmMn6Sn6

    Authors: Mohamed El Gazzah, Po-Hao Chang, Y. Lee, Hari Bhandari, Resham Regmi, Xiuqan Zhou, John F. Mitchell, Liqin Ke, Igor I. Mazin, Nirmal J. Ghimire

    Abstract: The kagome-lattice compounds RMn6Sn6 (R is a rare earth element), where the Mn atoms form a kagome net in the basal plane, are currently attracting a great deal of attention as they have been shown to host complex magnetic textures and electronic topological states strongly sensitive to the choice of the R atom. Among the magnetic R atoms, TmMn6Sn6 orders with the easy-plane magnetization forming… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  40. arXiv:2504.16054  [pdf, other

    cs.LG cs.RO

    $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization

    Authors: Physical Intelligence, Kevin Black, Noah Brown, James Darpinian, Karan Dhabalia, Danny Driess, Adnan Esmail, Michael Equi, Chelsea Finn, Niccolo Fusai, Manuel Y. Galliker, Dibya Ghosh, Lachy Groom, Karol Hausman, Brian Ichter, Szymon Jakubczak, Tim Jones, Liyiming Ke, Devin LeBlanc, Sergey Levine, Adrian Li-Bell, Mohith Mothukuri, Suraj Nair, Karl Pertsch, Allen Z. Ren , et al. (11 additional authors not shown)

    Abstract: In order for robots to be useful, they must perform practically relevant tasks in the real world, outside of the lab. While vision-language-action (VLA) models have demonstrated impressive results for end-to-end robot control, it remains an open question how far such models can generalize in the wild. We describe $π_{0.5}$, a new model based on $π_{0}$ that uses co-training on heterogeneous tasks… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  41. arXiv:2504.14717  [pdf, ps, other

    cs.CV cs.LG

    TAPIP3D: Tracking Any Point in Persistent 3D Geometry

    Authors: Bowei Zhang, Lei Ke, Adam W. Harley, Katerina Fragkiadaki

    Abstract: We introduce TAPIP3D, a novel approach for long-term 3D point tracking in monocular RGB and RGB-D videos. TAPIP3D represents videos as camera-stabilized spatio-temporal feature clouds, leveraging depth and camera motion information to lift 2D video features into a 3D world space where camera movement is effectively canceled out. Within this stabilized 3D representation, TAPIP3D iteratively refines… ▽ More

    Submitted 14 November, 2025; v1 submitted 20 April, 2025; originally announced April 2025.

    Comments: NeurIPS 2025. Long-term feed-forward 3D point tracking in persistent 3D point maps. Code:https://github.com/zbw001/TAPIP3D

  42. arXiv:2503.23744  [pdf, other

    physics.acc-ph hep-ex physics.ins-det

    European Contributions to Fermilab Accelerator Upgrades and Facilities for the DUNE Experiment

    Authors: DUNE Collaboration, A. Abed Abud, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1322 additional authors not shown)

    Abstract: The Proton Improvement Plan (PIP-II) to the FNAL accelerator chain and the Long-Baseline Neutrino Facility (LBNF) will provide the world's most intense neutrino beam to the Deep Underground Neutrino Experiment (DUNE) enabling a wide-ranging physics program. This document outlines the significant contributions made by European national laboratories and institutes towards realizing the first phase o… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: Submitted to the 2026 Update of the European Strategy for Particle Physics

  43. arXiv:2503.23743  [pdf, other

    physics.data-an hep-ex physics.ins-det

    DUNE Software and Computing Research and Development

    Authors: DUNE Collaboration, A. Abed Abud, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1322 additional authors not shown)

    Abstract: The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The ambitious physics program of Phase I and Phase II of DUNE is dependent upon deployment and utilization of significant computing res… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: Submitted to the 2026 Update of the European Strategy for Particle Physics

  44. arXiv:2503.23293  [pdf, other

    physics.ins-det

    The DUNE Phase II Detectors

    Authors: DUNE Collaboration, A. Abed Abud, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1322 additional authors not shown)

    Abstract: The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy for the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

    Comments: Submitted to the 2026 Update of the European Strategy for Particle Physics

  45. arXiv:2503.23291  [pdf, other

    hep-ex

    The DUNE Science Program

    Authors: DUNE Collaboration, A. Abed Abud, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1322 additional authors not shown)

    Abstract: The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy for the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

    Comments: Submitted to the 2026 Update of the European Strategy of Particle Physics

  46. arXiv:2503.14956  [pdf, other

    nlin.CD q-bio.BM

    Machine learning predictions from unpredictable chaos

    Authors: Jian Jiang, Long Chen, Lu ke, Bozheng Dou, Yueying Zhu, Yazhou Shi, Huahai Qiu, Bengong Zhang, Tianshou Zhou, Guo-Wei Wei

    Abstract: Chaos is omnipresent in nature, and its understanding provides enormous social and economic benefits. However, the unpredictability of chaotic systems is a textbook concept due to their sensitivity to initial conditions, aperiodic behavior, fractal dimensions, nonlinearity, and strange attractors. In this work, we introduce, for the first time, chaotic learning, a novel multiscale topological para… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  47. arXiv:2503.04824  [pdf, other

    cs.GR cs.AI cs.CV

    ProReflow: Progressive Reflow with Decomposed Velocity

    Authors: Lei Ke, Haohang Xu, Xuefei Ning, Yu Li, Jiajun Li, Haoling Li, Yuxuan Lin, Dongsheng Jiang, Yujiu Yang, Linfeng Zhang

    Abstract: Diffusion models have achieved significant progress in both image and video generation while still suffering from huge computation costs. As an effective solution, flow matching aims to reflow the diffusion process of diffusion models into a straight line for a few-step and even one-step generation. However, in this paper, we suggest that the original training pipeline of flow matching is not opti… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: Our codes will be released at Github

  48. arXiv:2502.19417  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

    Authors: Lucy Xiaoyang Shi, Brian Ichter, Michael Equi, Liyiming Ke, Karl Pertsch, Quan Vuong, James Tanner, Anna Walling, Haohuan Wang, Niccolo Fusai, Adrian Li-Bell, Danny Driess, Lachy Groom, Sergey Levine, Chelsea Finn

    Abstract: Generalist robots that can perform a range of different tasks in open-world settings must be able to not only reason about the steps needed to accomplish their goals, but also process complex instructions, prompts, and even feedback during task execution. Intricate instructions (e.g., "Could you make me a vegetarian sandwich?" or "I don't like that one") require not just the ability to physically… ▽ More

    Submitted 15 July, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: ICML 2025

  49. arXiv:2502.18480  [pdf, other

    cs.IR cs.AI cs.CL

    QExplorer: Large Language Model Based Query Extraction for Toxic Content Exploration

    Authors: Shaola Ren, Li Ke, Longtao Huang, Dehong Gao, Hui Xue

    Abstract: Automatically extracting effective queries is challenging in information retrieval, especially in toxic content exploration, as such content is likely to be disguised. With the recent achievements in generative Large Language Model (LLM), we are able to leverage the capabilities of LLMs to extract effective queries for similar content exploration directly. This study proposes QExplorer, an approac… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  50. Neutrino Interaction Vertex Reconstruction in DUNE with Pandora Deep Learning

    Authors: DUNE Collaboration, A. Abed Abud, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, F. Alemanno, N. S. Alex, K. Allison, M. Alrashed, A. Alton, R. Alvarez, T. Alves, A. Aman, H. Amar, P. Amedo, J. Anderson, C. Andreopoulos , et al. (1313 additional authors not shown)

    Abstract: The Pandora Software Development Kit and algorithm libraries perform reconstruction of neutrino interactions in liquid argon time projection chamber detectors. Pandora is the primary event reconstruction software used at the Deep Underground Neutrino Experiment, which will operate four large-scale liquid argon time projection chambers at the far detector site in South Dakota, producing high-resolu… ▽ More

    Submitted 26 June, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 32 pages, 18 figures

    Report number: FERMILAB-PUB-25-0037-LBNF