
Showing 1–50 of 2,692 results for author: Fu, Y

  1. arXiv:2604.12946  [pdf, ps, other]

    cs.LG

    Parcae: Scaling Laws For Stable Looped Language Models

    Authors: Hayden Prairie, Zachary Novack, Taylor Berg-Kirkpatrick, Daniel Y. Fu

    Abstract: Traditional fixed-depth architectures scale quality by increasing training FLOPs, typically through increased parameterization, at the expense of a higher memory footprint, or data. A potential alternative is looped architectures, which instead increase FLOPs by sending activations through a block of layers in a loop. While promising, existing recipes for training looped architectures can be unsta…

    Submitted 14 April, 2026; originally announced April 2026.

  2. arXiv:2604.12944  [pdf, ps, other]

    cs.CV cs.AI

    Distorted or Fabricated? A Survey on Hallucination in Video LLMs

    Authors: Yiyang Huang, Yitian Zhang, Yizhou Wang, Mingyuan Zhang, Liang Shi, Huimin Zeng, Yun Fu

    Abstract: Despite significant progress in video-language modeling, hallucinations remain a persistent challenge in Video Large Language Models (Vid-LLMs), referring to outputs that appear plausible yet contradict the content of the input video. This survey presents a comprehensive analysis of hallucinations in Vid-LLMs and introduces a systematic taxonomy that categorizes them into two core types: dynamic d…

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: ACL 2026 findings

  3. arXiv:2604.12889  [pdf, ps, other]

    physics.optics

    Building reliable 3D photonic integrated circuits and cavities at the wafer scale

    Authors: Yuhao Huang, Yunqi Fu, Yu Xia, Yuemin Li, Zheng Li, Yaoran Huang, Zhaoting Geng, Mingfei Liu, Chao Xiang

    Abstract: Three-dimensional (3D) photonic integrated circuits (PIC) are emerging as an indispensable scheme for high density and multifunctional photonic systems. However, the wafer-scale scaling of PICs towards a 3D configuration is constrained by two key factors: (i) the trade-off between inter-layer taper efficiency and footprint, and (ii) wafer-scale uniformity of inter-layer transition loss. In this wo…

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: 10 pages, 4 figures

  4. arXiv:2604.12566  [pdf, ps, other]

    physics.optics

    Scalable 3D silicon nitride photonic interposer for high-density optical interconnects

    Authors: Yu Xia, Yuhao Huang, Yuemin Li, Jie Wang, Yunqi Fu, Yaoran Huang, Hongjie Liang, Hao Fang, Zheng Li, Mingfei Liu, Yitian Tong, Di Yu, Chao Xiang

    Abstract: Modern computing workloads demand energy-efficient, high-bandwidth interconnects, motivating photonic interposers as an alternative to electrical links. Here we demonstrate a compact 3D silicon nitride (SiN) photonic interposer prototype comprising two routing layers, with the 3D routing scheme optimized by a global optimization algorithm. The 3D interposer realizes a fully connected 12-node optic…

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: 5 pages, 3 figures

  5. arXiv:2604.12524  [pdf, ps, other]

    hep-ex

    Observation of the Exotic State $π_{1}(1600)$ in $ψ(2S)\rightarrowγχ_{c1},χ_{c1}\rightarrowπ^{+}π^{-}η'$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (728 additional authors not shown)

    Abstract: A partial wave analysis of the process $ψ(2S)\rightarrowγχ_{c1}, χ_{c1}\rightarrowπ^+π^-η^{\prime}$ is performed using $(2712.4\pm14.3)\times10^{6}$ $ψ(2S)$ events collected with the BESIII detector. An isovector state with exotic quantum numbers $J^{PC}=1^{-+}$, denoted as $π_{1}(1600)$, is observed for the first time in the charmonium decay of $χ_{c1}\rightarrowπ_{1}^{\pm}(1600)π^{\mp}$,…

    Submitted 14 April, 2026; originally announced April 2026.

  6. arXiv:2604.12455  [pdf, ps, other]

    eess.AS

    Sky-Ear: An Unmanned Aerial Vehicle-Enabled Victim Sound Detection and Localization System

    Authors: Yi Hong, Mingyang Wang, Yalin Liu, Yaru Fu, Kevin Hung

    Abstract: Unmanned Aerial Vehicles (UAVs) are increasingly deployed in search-and-rescue (SAR) missions, yet continuous and reliable victim detection and localization remain challenging due to on-board hardware constraints. This paper designs a UAV-Enabled Victim Sound Detection and Localization System (called ``Sky-Ear'' for brevity) to achieve energy-efficient acoustic sensing and sound detection for SAR…

    Submitted 14 April, 2026; originally announced April 2026.

  7. arXiv:2604.12450  [pdf, ps, other]

    quant-ph cond-mat.other

    $\mathbb{Z}_{2}$ Skin Channels and Scale-Dependent Dynamical Quantum Phase Transitions

    Authors: Yongxu Fu

    Abstract: We analytically describe the dynamically separated $\mathbb{Z}_{2}$ skin channels (wavepacket evolutions) under periodic boundary condition (PBC) in non-Hermitian systems with anomalous time-reversal symmetry (ATRS), by combining the semiclassical worldline perspective with an enhanced understanding of skin effects. These channels, tied to the initial state and relevant symmetries, exhibit individ…

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: 6 pages, 2 figures. Supplemental Material is in preparation

  8. arXiv:2604.12374  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

    Authors: NVIDIA, Aakshita Chandiramani, Aaron Blakeman, Abdullahi Olaoye, Abhibha Gupta, Abhilash Somasamudramath, Abhinav Khattar, Adeola Adesoba, Adi Renduchintala, Adil Asif, Aditya Agrawal, Aditya Vavre, Ahmad Kiswani, Aishwarya Padmakumar, Ajay Hotchandani, Akanksha Shukla, Akhiad Bercovich, Aleksander Ficek, Aleksandr Shaposhnikov, Alex Gronskiy, Alex Kondratenko, Alex Neefus, Alex Steiner, Alex Yang, et al. (522 additional authors not shown)

    Abstract: We describe the pre-training, post-training, and quantization of Nemotron 3 Super, a 120 billion (active 12 billion) parameter hybrid Mamba-Attention Mixture-of-Experts model. Nemotron 3 Super is the first model in the Nemotron 3 family to 1) be pre-trained in NVFP4, 2) leverage LatentMoE, a new Mixture-of-Experts architecture that optimizes for both accuracy per FLOP and accuracy per parameter, a…

    Submitted 14 April, 2026; originally announced April 2026.

  9. arXiv:2604.11998  [pdf, ps, other]

    cs.CV cs.AI

    The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results

    Authors: Xingyu Qiu, Yuqian Fu, Jiawei Geng, Bin Ren, Jiancheng Pan, Zongwei Wu, Hao Tang, Yanwei Fu, Radu Timofte, Nicu Sebe, Mohamed Elhoseiny, Lingyi Hong, Mingxi Cheng, Xingqi He, Runze Li, Xingdong Sheng, Wenqiang Zhang, Jiacong Liu, Shu Luo, Yikai Qin, Yaze Zhao, Yongwei Jiang, Yixiong Zou, Zhe Zhang, Yang Yang, et al. (49 additional authors not shown)

    Abstract: Cross-domain few-shot object detection (CD-FSOD) remains a challenging problem for existing object detectors and few-shot learning approaches, particularly when generalizing across distinct domains. As part of NTIRE 2026, we hosted the second CD-FSOD Challenge to systematically evaluate and promote progress in detecting objects in unseen target domains under limited annotation conditions. The chal…

    Submitted 13 April, 2026; originally announced April 2026.

    Comments: accepted by CVPRW 26 @ NTIRE

  10. arXiv:2604.10655  [pdf, ps, other]

    cs.CV cs.AI cs.MM

    LoViF 2026 The First Challenge on Weather Removal in Videos

    Authors: Chenghao Qian, Xin Li, Yeying Jin, Shangguan Sun, Yilian Zhong, Yuxiang Chen, Shibo Yin, Yushun Fang, Xilei Zhu, Yahui Wang, Chen Lu, Ying Fu, Jianan Tian, Jifan Zhang, Chen Zhou, Junyang Jiang, Yuping Sun, Zhuohang Shi, Xiaojing Liu, Jiao Liu, Yatong Zhou, Shuai Liu, Qiang Deng, Jiajia Mi, Qianhao Luo, et al. (1 additional author not shown)

    Abstract: This paper presents a review of the LoViF 2026 Challenge on Weather Removal in Videos. The challenge encourages the development of methods for restoring clean videos from inputs degraded by adverse weather conditions such as rain and snow, with an emphasis on achieving visually plausible and temporally consistent results while preserving scene structure and motion dynamics. To support this task, w…

    Submitted 14 April, 2026; v1 submitted 12 April, 2026; originally announced April 2026.

    Comments: CVPR Workshop Challenge Report

  11. arXiv:2604.10523  [pdf, ps, other]

    hep-ex

    Measurement of the branching fractions of $χ_{cJ} \to π^{+}π^{-}π^{0}π^{0}$ via $ψ(3686) \to γχ_{cJ}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (741 additional authors not shown)

    Abstract: Using $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector operating at BEPCII, the branching fractions of $χ_{cJ}\toπ^+π^-π^0π^0$ ($J=0,~1,~2$) are measured via the radiative transition $ψ(3686)\toγχ_{cJ}$. The results are $\mathcal{B}(χ_{c0} \to π^{+}π^{-}π^{0}π^{0}) = (3.10 \pm 0.01 \pm 0.14) \times 10^{-2}$,…

    Submitted 12 April, 2026; originally announced April 2026.

  12. arXiv:2604.10444  [pdf, ps, other]

    hep-ex

    First Observation of $D^+ \to a_0(980)ρ$ and $D^+ \to a_0(980)^+ f_0(500)$ in $D^+ \to π^+π^+π^-η$ and $D^+ \to π^+π^0π^0η$ Decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (734 additional authors not shown)

    Abstract: We perform the first amplitude analysis of the singly Cabibbo-suppressed decays $D^+ \to π^+ π^{+(0)} π^{-(0)} η$, using $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV, corresponding to an integrated luminosity of 20.3 $\rm{fb}^{-1}$. The absolute branching fractions of the $D^+ \to π^+ π^+ π^- η$ and $D^+ \to π^+ π^0 π^0 η$ decays are measure…

    Submitted 11 April, 2026; originally announced April 2026.

  13. arXiv:2604.10065  [pdf, ps, other]

    cs.CL cs.AI cs.SD eess.AS

    ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models

    Authors: Chi-Yuan Hsiao, Ke-Han Lu, Yu-Kuan Fu, Guan-Ting Lin, Hsiao-Tsung Hung, Hung-yi Lee

    Abstract: End-to-end full-duplex Speech Language Models (SLMs) require precise turn-taking for natural interaction. However, optimizing temporal dynamics via standard raw-token reinforcement learning (RL) degrades semantic quality, causing severe generative collapse and repetition. We propose ASPIRin, an interactivity-optimized RL framework that explicitly decouples when to speak from what to say. Using Act…

    Submitted 11 April, 2026; originally announced April 2026.

  14. arXiv:2604.08905  [pdf, ps, other]

    cs.AI cs.LG

    StaRPO: Stability-Augmented Reinforcement Policy Optimization

    Authors: Jinghan Zhang, Fengran Mo, Tharindu Cyril Weerasooriya, Ruimin Dai, Xiaoyan Han, Yanjie Fu, Dakuo Wang, Kunpeng Liu

    Abstract: Reinforcement learning (RL) is effective in enhancing the accuracy of large language models in complex reasoning tasks. Existing RL policy optimization frameworks rely on final-answer correctness as feedback signals and rarely capture the internal logical structure of the reasoning process. Consequently, the models would generate fluent and semantically relevant responses but logically inconsisten…

    Submitted 9 April, 2026; originally announced April 2026.

  15. arXiv:2604.08453  [pdf, ps, other]

    math.NA physics.comp-ph

    Hard-constrained Physics-informed Neural Networks for Interface Problems

    Authors: Seung Whan Chung, Stephen Castonguay, Sumanta Roy, Michael Penwarden, Yucheng Fu, Pratanu Roy

    Abstract: Physics-informed neural networks (PINNs) have emerged as a flexible framework for solving partial differential equations, but their performance on interface problems remains challenging because continuity and flux conditions are typically imposed through soft penalty terms. The standard soft-constraint formulation leads to imperfect interface enforcement and degraded accuracy near interfaces. We i…

    Submitted 9 April, 2026; originally announced April 2026.

    Comments: 53 pages, 14 figures

    Report number: 25-ERD-052, LLNL-JRNL-2010925

    MSC Class: 68T07; 35J25

  16. arXiv:2604.07734  [pdf, ps, other]

    astro-ph.HE

    Resolving the 2024 Outburst of Magnetar 1E 1841-045 from its host Supernova Remnant with EP-FXT

    Authors: Yu-Cong Fu, Lin Lin, Yu-Jia Zheng, Ming-Yu Ge, Han-Long Peng, Dong-Ming Li, Francesco Coti Zelati, Ersin Göǧüş, Nanda Rea, Bing Zhang, Wei-Wei Zhu, Ke-Jia Lee, Teruaki Enoto, Chryssa Kouveliotou

    Abstract: The magnetar 1E 1841-045 exhibited a new active episode starting on August 20, 2024, marked by X-ray bursts and enhanced persistent emission. Using data from the Einstein Probe (EP), we report on the timing and spectral results following the onset of this outburst. The pulse profile displays a multi-peaked structure, with notable phase shifts in the secondary peak. Energy-resolved pulse profile an…

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: Accepted by ApJ

  17. arXiv:2604.06832  [pdf, ps, other]

    cs.CL

    Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM

    Authors: Chengyue Wu, Shiyi Lan, Yonggan Fu, Sensen Gao, Jin Wang, Jincheng Yu, Jose M. Alvarez, Pavlo Molchanov, Ping Luo, Song Han, Ligeng Zhu, Enze Xie

    Abstract: Vision-language models (VLMs) predominantly rely on autoregressive decoding, which generates tokens one at a time and fundamentally limits inference throughput. This limitation is especially acute in physical AI scenarios such as robotics and autonomous driving, where VLMs are deployed on edge devices at batch size one, making AR decoding memory-bandwidth-bound and leaving hardware parallelism und…

    Submitted 10 April, 2026; v1 submitted 8 April, 2026; originally announced April 2026.

  18. arXiv:2604.05712  [pdf, ps, other]

    hep-ex

    Precise measurement of the CKM angle $γ$ with a novel approach

    Authors: The BESIII, LHCb Collaborations, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, et al. (1936 additional authors not shown)

    Abstract: A measurement of the CKM angle $γ$ is performed by applying a novel, unbinned, model-independent approach to datasets of electron-positron collisions collected by the BESIII experiment and proton-proton collisions by the LHCb experiment, corresponding to integrated luminosities of 8 fb$^{-1}$ and 9 fb$^{-1}$, respectively. The $C\!P$-violating phase $γ$ is determined from…

    Submitted 7 April, 2026; originally announced April 2026.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/5991/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-064, CERN-EP-2026-068

  19. arXiv:2604.05701  [pdf, ps, other]

    hep-ex

    Measurement of the CKM angle $γ$ in $B^{\pm} \rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-})h^{\pm}$ decays with a novel approach

    Authors: The BESIII, LHCb Collaborations, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, et al. (1936 additional authors not shown)

    Abstract: A measurement of the CKM angle $γ$ and related strong-phase parameters is performed using a novel, model-independent approach in ${B^{\pm}\rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays, where $h^{(\prime)} \equiv π, K$. The analysis uses a joint data sample of electron-positron collisions collected by the BESIII experiment at the Beijing Electron-Positron Collider…

    Submitted 7 April, 2026; originally announced April 2026.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3989/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-063, CERN-EP-2026-067

  20. arXiv:2604.04765  [pdf, ps, other]

    hep-ph

    Precision QCD with the Electron-Ion Collider

    Authors: C. Alexandrou, M. Arratia, E. C. Aschenauer, A. Avkhadiev, P. V. Balachandran, V. Bertone, I. Borsa, M. Cerutti, X. Chu, W. Cosyn, D. de Florian, A. Dumitru, M. Engelhardt, R. Fatemi, S. Forte, Y. Fu, L. Gamberg, H. Gao, T. Gehrmann, A. Gehrmann-De Ridder, Y. Go, Y. Guo, Y. Hatta, J. Haug, T. J. Hobbs, et al. (44 additional authors not shown)

    Abstract: This document summarizes the discussions at the program "Precision QCD with the Electron Ion Collider", held from May to June 2025 at the Institute for Nuclear Theory (INT) at the University of Washington. The program was co-sponsored by the INT and by the Center for Frontiers in Nuclear Science (CFNS, Stony Brook University). Over its five-week duration it brought together about 70 theorists, exp…

    Submitted 6 April, 2026; originally announced April 2026.

    Comments: Summary of the 2025 joint CFNS-INT program: Precision QCD with the Electron-Ion Collider. 165 pages, 35 figures

    Report number: INT-PUB-26-011

  21. arXiv:2604.04693  [pdf, ps, other]

    cs.CV

    3D Gaussian Splatting for Annular Dark Field Scanning Transmission Electron Microscopy Tomography Reconstruction

    Authors: Beiyuan Zhang, Hesong Li, Ruiwen Shao, Ying Fu

    Abstract: Annular Dark Field Scanning Transmission Electron Microscopy (ADF-STEM) tomography reconstructs nanoscale materials in 3D by integrating multi-view tilt-series images, enabling precise analysis of their structural and compositional features. Although integrating more tilt views improves 3D reconstruction, it requires extended electron exposure that risks damaging dose-sensitive materials and in…

    Submitted 6 April, 2026; originally announced April 2026.

  22. arXiv:2604.04496  [pdf, ps, other]

    cs.CV

    The Indra Representation Hypothesis for Multimodal Alignment

    Authors: Jianglin Lu, Hailing Wang, Kuo Yang, Yitian Zhang, Simon Jenni, Yun Fu

    Abstract: Recent studies have uncovered an interesting phenomenon: unimodal foundation models tend to learn convergent representations, regardless of differences in architecture, training objectives, or data modalities. However, these representations are essentially internal abstractions of samples that characterize samples independently, leading to limited expressiveness. In this paper, we propose The Indr…

    Submitted 6 April, 2026; originally announced April 2026.

  23. arXiv:2604.04135  [pdf, ps, other]

    cs.CV

    NTIRE 2026 3D Restoration and Reconstruction in Real-world Adverse Conditions: RealX3D Challenge Results

    Authors: Shuhong Liu, Chenyu Bao, Ziteng Cui, Xuangeng Chu, Bin Ren, Lin Gu, Xiang Chen, Mingrui Li, Long Ma, Marcos V. Conde, Radu Timofte, Yun Liu, Ryo Umagami, Tomohiro Hashimoto, Zijian Hu, Yuan Gan, Tianhan Xu, Yusuke Kurose, Tatsuya Harada, Junwei Yuan, Gengjia Chang, Xining Ge, Mache You, Qida Cao, Zeliang Li, et al. (81 additional authors not shown)

    Abstract: This paper presents a comprehensive review of the NTIRE 2026 3D Restoration and Reconstruction (3DRR) Challenge, detailing the proposed methods and results. The challenge seeks to identify reconstruction pipelines that are robust under real-world adverse conditions, specifically extreme low-light and smoke-degraded environments, as captured by our RealX3D benchmark. A total of 279 participa…

    Submitted 5 April, 2026; originally announced April 2026.

  24. arXiv:2604.03477  [pdf, ps, other]

    math.LO

    Towards Trans-Exponential O-minimal Expansion of $(\mathbb{R},+,\cdot, 0, 1, <)$

    Authors: Yayi Fu

    Abstract: We add an analytic trans-exponential function $\varphi$ to $\mathbb{R}_{an,\exp}$. We reduce the o-minimality of $\mathbb{R}_{an,\exp,\varphi}$ to the existence of "many" regular values for some definable systems of functions, which is a necessary condition for the o-minimality of $\mathbb{R}_{an,\exp,\varphi}$.

    Submitted 3 April, 2026; originally announced April 2026.

  25. arXiv:2604.02486  [pdf, ps, other]

    cs.CV cs.CL

    VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors

    Authors: Haz Sameen Shahgir, Xiaofu Chen, Yu Fu, Erfan Shayegani, Nael Abu-Ghazaleh, Yova Kementchedjhieva, Yue Dong

    Abstract: Vision-language models (VLMs) have achieved impressive performance across a wide range of multimodal tasks. However, they often fail on tasks that require fine-grained visual perception, even when the required information is still present in their internal representations. Prior work has attributed this ``hidden-in-plain-sight'' gap to the language model, but the cause remains unexplained. In this…

    Submitted 15 April, 2026; v1 submitted 2 April, 2026; originally announced April 2026.

  26. arXiv:2604.02029  [pdf, ps, other]

    cs.AI

    The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

    Authors: Xinlei Yu, Zhangquan Chen, Yongbo He, Tianyu Fu, Cheng Yang, Chengming Xu, Yue Ma, Xiaobin Hu, Zhe Cao, Jie Xu, Guibin Zhang, Jiale Tao, Jiayi Zhang, Siyuan Ma, Kaituo Feng, Haojie Huang, Youxing Li, Ronghao Chen, Huacan Wang, Chenglin Wu, Zikun Su, Xiaogang Xu, Kelu Yao, Kun Wang, Chen Gao, et al. (12 additional authors not shown)

    Abstract: Latent space is rapidly emerging as a native substrate for language-based models. While modern systems are still commonly understood through explicit token-level generation, an increasing body of work shows that many critical internal processes are more naturally carried out in continuous latent space than in human-readable verbal traces. This shift is driven by the structural limitations of expli…

    Submitted 2 April, 2026; originally announced April 2026.

  27. arXiv:2604.02022  [pdf, ps, other]

    cs.AI

    ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis

    Authors: Yu Li, Haoyu Luo, Yuejin Xie, Yuqian Fu, Zhonghao Yang, Shuai Shao, Qihan Ren, Wanying Qu, Yanwei Fu, Yujiu Yang, Jing Shao, Xia Hu, Dongrui Liu

    Abstract: Evaluating the safety of LLM-based agents is increasingly important because risks in realistic deployments often emerge over multi-step interactions rather than isolated prompts or final responses. Existing trajectory-level benchmarks remain limited by insufficient interaction diversity, coarse observability of safety failures, and weak long-horizon realism. We introduce ATBench, a trajectory-leve…

    Submitted 8 April, 2026; v1 submitted 2 April, 2026; originally announced April 2026.

  28. arXiv:2604.01934  [pdf, ps, other]

    cs.CV

    Rethinking Representations for Cross-Domain Infrared Small Target Detection: A Generalizable Perspective from the Frequency Domain

    Authors: Yimin Fu, Songbo Wang, Feiyan Wu, Jialin Lyu, Zhunga Liu, Michael K. Ng

    Abstract: The accurate target-background separation in infrared small target detection (IRSTD) highly depends on the discriminability of extracted representations. However, most existing methods are confined to domain-consistent settings, while overlooking whether such discriminability can generalize to unseen domains. In practice, distribution shifts between training and testing data are inevitable due to…

    Submitted 2 April, 2026; originally announced April 2026.

    Comments: The code will be released at https://github.com/fuyimin96/S2CPNet upon acceptance

  29. arXiv:2604.00453  [pdf, ps, other]

    physics.bio-ph cond-mat.stat-mech

    In-vivo entropy production of A. subaru

    Authors: Yu Fu, Emmy Dobson, Benjamin B. Machta, Michael C. Abbott

    Abstract: Entropy production is often used as a proxy for energy consumption of a non-equilibrium system. Lower bounds can be estimated from coarse-grained observations, and this has been done for various biological systems. Here, we apply these tools to a more macroscopic system whose true energy consumption is also known. We find that while entropy production does give a lower bound, it is some 25 orders…

    Submitted 1 April, 2026; originally announced April 2026.

    Comments: 9 pages, 6 figures

  30. arXiv:2604.00368  [pdf, ps, other]

    cs.DC

    TENT: A Declarative Slice Spraying Engine for Performant and Resilient Data Movement in Disaggregated LLM Serving

    Authors: Feng Ren, Ruoyu Qin, Teng Ma, Shangming Cai, Zheng Liu, Chao Lei, Dejiang Zhu, Ke Yang, Zheming Li, Jialei Cui, Weixiao Huang, Yikai Zhao, Yineng Zhang, Hao Wu, Xiang Gao, Yuhao Fu, Jinlei Jiang, Yongwei Wu, Mingxing Zhang

    Abstract: Modern GPU clusters are built upon a complex hierarchy of heterogeneous interconnects, ranging from multi-rail RDMA to proprietary fabrics such as Multi-Node NVLink and Ascend UB. Orchestrating these diverse links effectively remains a critical challenge in disaggregated LLM serving. Operating Mooncake TE on thousands of GPUs exposed a critical limitation shared by existing frameworks: imperative,…

    Submitted 31 March, 2026; originally announced April 2026.

  31. arXiv:2603.29854  [pdf, ps, other]

    hep-ex

    First energy scan measurement of $e^{+}e^{-}\to K^{+}K^{-}$ around the $ψ(2S)$ resonance

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (683 additional authors not shown)

    Abstract: We report the first measurement of the $e^{+}e^{-}\to K^{+}K^{-}$ cross sections around the $ψ(2S)$ resonance using the energy scan method. The analysis is based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 495 pb$^{-1}$ collected with the BESIII detector at BEPCII. By analyzing the cross section line-shape, we extract the relative phase $Φ$ between the strong and el…

    Submitted 31 March, 2026; originally announced March 2026.

    Comments: 9 pages, 4 figures

  32. arXiv:2603.28232  [pdf, ps, other]

    hep-ex hep-ph

    Observation of $Λ^+_c\to nπ^+η$ and search for $Λ^+_c\to na_0(980)^+$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (722 additional authors not shown)

    Abstract: By analysing 6.1 ${\rm fb}^{-1}$ of data collected at center-of-mass energies between $\sqrt{s}=4.600$ and 4.843 $\rm GeV$ with the BESIII detector at the BEPCII collider, we observe the decay $Λ_c^+\to nπ^+η$ for the first time with a statistical significance of $9.5σ$. The ratio of branching fractions $\mathcal{B}(Λ_c^+\to nπ^+η)/\mathcal{B}(Λ_c^+\to Λπ^+η)$ is measured to be…

    Submitted 30 March, 2026; originally announced March 2026.

    Comments: 25 pages, 6 figures

  33. arXiv:2603.28020  [pdf, ps, other]

    cs.CV

    Physically Inspired Gaussian Splatting for HDR Novel View Synthesis

    Authors: Huimin Zeng, Yue Bai, Hailing Wang, Yun Fu

    Abstract: High dynamic range novel view synthesis (HDR-NVS) reconstructs scenes with dynamic details by fusing multi-exposure low dynamic range (LDR) views, yet it struggles to capture ambient illumination-dependent appearance. Implicitly supervising HDR content by constraining tone-mapped results fails in correcting abnormal HDR values, and results in limited gradients for Gaussians in under/over-exposed r…

    Submitted 30 March, 2026; originally announced March 2026.

    Comments: Accepted to CVPR 2026

  34. arXiv:2603.27965  [pdf, ps, other]

    cs.CV

    ExFusion: Efficient Transformer Training via Multi-Experts Fusion

    Authors: Jiacheng Ruan, Daize Dong, Xiaoye Qu, Tong Zhu, Ting Liu, Yuzhuo Fu, Yu Cheng, Suncheng Xiang

    Abstract: Mixture-of-Experts (MoE) models substantially improve performance by increasing the capacity of dense architectures. However, directly training MoE models requires considerable computational resources and introduces extra overhead in parameter storage and deployment. Therefore, it is critical to develop an approach that leverages the multi-expert capability of MoE to enhance performance while incu…

    Submitted 29 March, 2026; originally announced March 2026.

    Comments: Accepted by IEEE TMM 2026

  35. arXiv:2603.25698  [pdf, ps, other]

    hep-th

    Notes on Diagrammatic Coaction for Cosmological Wavefunction Coefficients: A Two-Site Prelude

    Authors: Yuhan Fu, Jiahao Liu

    Abstract: We study the coaction of cosmological wavefunction coefficients of conformally coupled scalars in FRW background of a two-site example, which turns out to have an elegant diagrammatic interpretation. We show how the coaction acts on the twisted integrals for wavefunction coefficients, decomposing them into contributions associated with subtopologies and cuts, with the subtopologies admitting an in…

    Submitted 26 March, 2026; originally announced March 2026.

    Comments: 18 pages

  36. arXiv:2603.25649  [pdf, ps, other]

    hep-ex

    Amplitude analysis and branching fraction measurement of the decay $D^0 \to K^+K^-π^0π^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, M. S. Anderson, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, et al. (749 additional authors not shown)

    Abstract: An amplitude analysis of the singly Cabibbo-suppressed decay $D^0 \to K^+ K^- π^0 π^0$ is performed, for the first time, to determine the relative magnitudes and phases of different intermediate processes. The analysis uses $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy 3.773 GeV corresponding to an integrated luminosity of 20.3 $\rm fb^{-1}$. The absolute…

    Submitted 30 March, 2026; v1 submitted 26 March, 2026; originally announced March 2026.

  37. arXiv:2603.25633  [pdf, ps, other]

    cs.AI

    Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?

    Authors: Liang Zhang, Yu Fu, Xinyi Jin

    Abstract: Large Language Models (LLMs) are increasingly used in math education not only as problem solvers but also as assessors of learners' reasoning. However, it remains unclear whether stronger math problem-solving ability is associated with stronger step-level assessment performance. This study examines that relationship using the GSM8K and MATH subsets of PROCESSBENCH, a human-annotated benchmark for…

    Submitted 26 March, 2026; originally announced March 2026.

  38. arXiv:2603.25562  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

    Authors: Yuqian Fu, Haohuan Huang, Kaiwen Jiang, Yuanheng Zhu, Dongbin Zhao

    Abstract: On-policy distillation (OPD) is appealing for large language model (LLM) post-training because it evaluates teacher feedback on student-generated rollouts rather than fixed teacher traces. In long-horizon settings, however, the common sampled-token variant is fragile: it reduces distribution matching to a one-token signal and becomes increasingly unreliable as rollouts drift away from prefixes the…

    Submitted 26 March, 2026; originally announced March 2026.

  39. arXiv:2603.25085  [pdf, ps, other]

    physics.ins-det

    Beam Test Characterization of Silicon Microstrip Detector Flight-Model Ladders for the AMS-02 Upgrade

    Authors: Dexing Miao, Giovanni Ambrosi, Mattia Barbanera, Baasansuren Batsukh, Hengyi Cai, Mengke Cai, Xudong Cai, Yuman Cai, Yuan-Hann Chang, Shanzhen Chen, Hsin-Yi Chou, Xingzhu Cui, Mingyi Dong, Matteo Duranti, Ke Gong, Mingjie Feng, Valerio Formato, Yisheng Fu, Daojin Hong, Maria Ionica, Xiaojie Jiang, Yaozu Jiang, Liangchenglong Jin, Shengjie Jin, Vladimir Koutsenko, et al. (34 additional authors not shown)

    Abstract: The AMS-02 experiment plans to install a new silicon microstrip tracker layer (Layer-0) on top of the existing detector, increasing the cosmic-ray acceptance by a factor of 3. Layer-0 employs a design in which multiple silicon microstrip detectors (SSDs) are connected in series to form long detector ladders. We present a detailed performance study of the flight-model ladders using a 350~GeV mixed…

    Submitted 26 March, 2026; originally announced March 2026.

  40. arXiv:2603.24652  [pdf, ps, other]

    cs.CL cs.LG

    Demystifying When Pruning Works via Representation Hierarchies

    Authors: Shwai He, Guoheng Sun, Haichao Zhang, Yun Fu, Ang Li

    Abstract: Network pruning, which removes less important parameters or architectures, is often expected to improve efficiency while preserving performance. However, this expectation does not consistently hold across language tasks: pruned models can perform well on non-generative tasks but frequently fail in generative settings. To understand this discrepancy, we analyze network pruning from a representation…

    Submitted 6 April, 2026; v1 submitted 25 March, 2026; originally announced March 2026.

    Comments: 27 pages, 21 figures, and 3 tables. Includes appendix with supplementary experiments and derivations

  41. arXiv:2603.24272  [pdf, ps, other]

    hep-ex

    Cross Section Measurements of $\bar{n}p \rightarrow K^{+}K^{-}π^{+}(π^{0})$ via Antineutrons Produced by $J/ψ\to p π^{-} \bar{n}$ Decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, Y. Ban, H.-R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (737 additional authors not shown)

    Abstract: Based on a novel method for producing antineutrons via $J/ψ$ decays, we report a study of $\bar{n}p$ inelastic scattering into final states containing kaons. The analysis uses $(10087\pm44)\times 10^6$ $J/ψ$ events collected at the BESIII detector operating at the BEPCII storage ring. Antineutrons are produced via $J/ψ\to p π^{-} \bar{n}$ decays and tagged by the detected protons and pions, result…

    Submitted 25 March, 2026; originally announced March 2026.

  42. arXiv:2603.24176  [pdf, ps, other]

    eess.IV cs.CV q-bio.NC

    Modeling Spatiotemporal Neural Frames for High Resolution Brain Dynamic

    Authors: Wanying Qu, Jianxiong Gao, Wei Wang, Yanwei Fu

    Abstract: Capturing dynamic spatiotemporal neural activity is essential for understanding large-scale brain mechanisms. Functional magnetic resonance imaging (fMRI) provides high-resolution cortical representations that form a strong basis for characterizing fine-grained brain activity patterns. The high acquisition cost of fMRI limits large-scale applications, therefore making high-quality fMRI reconstruct…

    Submitted 31 March, 2026; v1 submitted 25 March, 2026; originally announced March 2026.

    Comments: CVPR 2026

  43. arXiv:2603.23081  [pdf, ps, other]

    hep-ex

    Amplitude Analysis of the Isospin-Violating Decay $J/ψ\rightarrowγηπ^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H.-R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (736 additional authors not shown)

    Abstract: Using $(10087 \pm 44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, we perform the first amplitude analysis of the process $J/ψ\toγηπ^0$. The decay is dominated by the intermediate processes $J/ψ\toπ^0 \bo \left( \toγη\right)$, $J/ψ\toπ^0ρ(1450)^0 \left( \toγη\right)$ and $J/ψ\toηh_1(1170) \left( \toγπ^0\right)$. Contributions from $J/ψ\toγa_0(980)^0(\toηπ^0)$,…

    Submitted 24 March, 2026; originally announced March 2026.

    Comments: 14 pages, 4 figures

  44. arXiv:2603.22804  [pdf, ps, other]

    hep-ex

    Search for the radiative decays $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (678 additional authors not shown)

    Abstract: A search for the radiative decays $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$ is conducted using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and upper limits on the branching fractions of $D^0\to γ\bar K_1(1270)^0$ and…

    Submitted 24 March, 2026; originally announced March 2026.

    Comments: 11 pages, 5 figures, 4 tables

  45. arXiv:2603.22529  [pdf, ps, other]

    cs.CV cs.AI cs.CL

    Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

    Authors: Shoubin Yu, Lei Shu, Antoine Yang, Yao Fu, Srinivas Sunkara, Maria Wang, Jindong Chen, Mohit Bansal, Boqing Gong

    Abstract: Multimodal AI agents are increasingly automating complex real-world workflows that involve online web execution. However, current web-agent benchmarks suffer from a critical limitation: they focus entirely on web-based interaction and perception, lacking grounding in the user's real-world physical surroundings. This limitation prevents evaluation in crucial scenarios, such as when an agent must us…

    Submitted 23 March, 2026; originally announced March 2026.

    Comments: CVPR 2026. Project page: https://ego2web.github.io/

  46. arXiv:2603.22293  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

    Authors: Yutao Xie, Nathaniel Thomas, Nicklas Hansen, Yang Fu, Li Erran Li, Xiaolong Wang

    Abstract: Search-augmented large language models (LLMs) trained with reinforcement learning (RL) have achieved strong results on open-domain question answering (QA), but training still remains a significant challenge. The optimization is often unstable due to sparse rewards and difficult credit assignments across reasoning and tool calls. To address this, we introduce Turn-Level Information Potential Reward…

    Submitted 11 March, 2026; originally announced March 2026.

    Comments: Code: https://github.com/ucsd-wang-lab-lm/tips

  47. arXiv:2603.22281  [pdf, ps, other]

    cs.CV cs.AI cs.CL cs.LG cs.RO

    ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

    Authors: Haichao Zhang, Yijiang Li, Shwai He, Tushar Nagarajan, Mingfei Chen, Jianglin Lu, Ang Li, Yun Fu

    Abstract: Recent progress in latent world models (e.g., V-JEPA2) has shown promising capability in forecasting future world states from video observations. Nevertheless, dense prediction from a short observation window limits temporal context and can bias predictors toward local, low-level extrapolation, making it difficult to capture long-horizon semantics and reducing downstream utility. Vision-language…

    Submitted 23 March, 2026; originally announced March 2026.

    Comments: 10 pages, 5 figures

    MSC Class: 68T45; 68T07; 68U10; 68T10; 93C85; 93B40; 70Q05 ACM Class: I.2.9; I.2.10; I.2.6; I.2.7; I.4.8; I.4.9; I.2.8

  48. arXiv:2603.22004  [pdf, ps, other]

    hep-ph

    A Fast Method for Correlated Updates of Proton PDFs and the Strong Coupling $α_s$

    Authors: Yao Fu, Carl Schmidt, C.-P. Yuan

    Abstract: We present an extended version of the ePump framework that enables the simultaneous profiling of proton parton distribution functions (PDFs) and the strong coupling $α_s$ using new experimental data. By promoting $α_s$ to a fit parameter within the Hessian updating formalism, the method performs coherent updates of $\{\text{PDFs},α_s\}$ while preserving parameter correlations and the full…

    Submitted 23 March, 2026; originally announced March 2026.

  49. arXiv:2603.21621  [pdf, ps, other]

    cs.LG

    Proximal Policy Optimization in Path Space: A Schrödinger Bridge Perspective

    Authors: Yuehu Gong, Zeyuan Wang, Yulin Chen, Yanwei Fu

    Abstract: On-policy reinforcement learning with generative policies is promising but remains underexplored. A central challenge is that proximal policy optimization (PPO) is traditionally formulated in terms of action-space probability ratios, whereas diffusion- and flow-based policies are more naturally represented as trajectory-level generative processes. In this work, we propose GSB-PPO, a path-space for…

    Submitted 23 March, 2026; originally announced March 2026.

    Comments: 12 pages, 3 figures

  50. arXiv:2603.20930  [pdf, ps, other]

    cs.LG cs.AI cs.IT

    Causally-Guided Diffusion for Stable Feature Selection

    Authors: Arun Vignesh Malarkkan, Xinyuan Wang, Kunpeng Liu, Denghui Zhang, Yanjie Fu

    Abstract: Feature selection is fundamental to robust data-centric AI, but most existing methods optimize predictive performance under a single data distribution. This often selects spurious features that fail under distribution shifts. Motivated by principles from causal invariance, we study feature selection from a stability perspective and introduce Causally-Guided Diffusion for Stable Feature Selection (…

    Submitted 21 March, 2026; originally announced March 2026.

    Comments: 8 pages + references + appendix