Skip to main content

Showing 1–50 of 4,826 results for author: Xue, C

.
  1. arXiv:2604.15063  [pdf, ps, other

    cs.LG cs.AI cs.CR

    No More Guessing: a Verifiable Gradient Inversion Attack in Federated Learning

    Authors: Francesco Diana, Chuan Xu, André Nusser, Giovanni Neglia

    Abstract: Gradient inversion attacks threaten client privacy in federated learning by reconstructing training samples from clients' shared gradients. Gradients aggregate contributions from multiple records and existing attacks may fail to disentangle them, yielding incorrect reconstructions with no intrinsic way to certify success. In vision and language, attackers may fall back on human inspection to judge… ▽ More

    Submitted 16 April, 2026; originally announced April 2026.

  2. arXiv:2604.14297  [pdf, ps, other

    astro-ph.HE astro-ph.IM

    Rapid-response 1.3 mm Observations of GRB 260127A with the Submillimeter Array

    Authors: Garrett K. Keating, Tanmoy Laskar, Anna Y. Q. Ho, Peter K. Blanchard, Kate D. Alexander, Edo Berger, Mark Gurwell, Tarraneh Eftekhari, Chloe T. Xu, Joshua Bennett Lovell, Ramprasad Rao, Peter K. G. Williams

    Abstract: We present the results from rapid-response 1.3 mm observations of GRB 260127A using the Submillimeter Array (SMA). SMA arrived on-source 12.6 minutes after the initial detection by the Neil Gehrels Swift Observatory, representing the earliest millimeter/submillimeter observations of a GRB to date. From these observations, we find a source with flux density $6.9\pm1.7$ mJy, consistent with the X-ra… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

    Comments: Accepted for publication in ApJL; 7 pages, 3 figures

  3. arXiv:2604.14125  [pdf, ps, other

    cs.CV cs.AI cs.RO

    HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

    Authors: Tianshuo Yang, Guanyu Chen, Yutian Chen, Zhixuan Liang, Yitian Liu, Zanxin Chen, Chunpu Xu, Haotian Liang, Jiangmiao Pang, Yao Mu, Ping Luo

    Abstract: While end-to-end Vision-Language-Action (VLA) models offer a promising paradigm for robotic manipulation, fine-tuning them on narrow control data often compromises the profound reasoning capabilities inherited from their base Vision-Language Models (VLMs). To resolve this fundamental trade-off, we propose HiVLA, a visual-grounded-centric hierarchical framework that explicitly decouples high-level… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

    Comments: Project Page: https://tianshuoy.github.io/HiVLA-page/

  4. arXiv:2604.14010  [pdf, ps, other

    cs.LG cs.CL

    Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning

    Authors: Zekai Lin, Chao Xue, Di Liang, Xingsheng Han, Peiyang Liu, Xianjie Wu, Lei Jiang, Yu Lu, Haibo Shi, Shuang Liang, Minlong Peng

    Abstract: Supervised Fine-Tuning (SFT) of large language models often suffers from task interference and catastrophic forgetting. Recent approaches alleviate this issue by isolating task-critical parameters during training. However, these methods represent a static solution to a dynamic problem, assuming that parameter importance remains fixed once identified. In this work, we empirically demonstrate that p… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

  5. arXiv:2604.13905  [pdf, ps, other

    cs.CV

    Rethinking Image-to-3D Generation with Sparse Queries: Efficiency, Capacity, and Input-View Bias

    Authors: Zhiyuan Xu, Jiuming Liu, Yuxin Chen, Masayoshi Tomizuka, Chenfeng Xu, Chensheng Peng

    Abstract: We present SparseGen, a novel framework for efficient image-to-3D generation, which exhibits low input-view bias while being significantly faster. Unlike traditional approaches that rely on dense volumetric grids, triplanes, or pixel-aligned primitives, we model scenes with a compact sparse set of learned 3D anchor queries and a learned expansion operator that decodes each transformed query into a… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

    Comments: Code is available at https://github.com/Pixtella/SparseGen

  6. arXiv:2604.13592  [pdf, ps, other

    cs.CL

    Foresight Optimization for Strategic Reasoning in Large Language Models

    Authors: Jiashuo Wang, Jiawen Duan, Jian Wang, Kaitao Song, Chunpu Xu, Johnny K. W. Ho, Fenggang Yu, Wenjie Li, Johan F. Hoorn

    Abstract: Reasoning capabilities in large language models (LLMs) have generally advanced significantly. However, it is still challenging for existing reasoning-based LLMs to perform effective decision-making abilities in multi-agent environments, due to the absence of explicit foresight modeling. To this end, strategic reasoning, the most fundamental capability to anticipate the counterpart's behaviors and… ▽ More

    Submitted 16 April, 2026; v1 submitted 15 April, 2026; originally announced April 2026.

    Comments: ACL 2026 Main Conference

  7. arXiv:2604.13436  [pdf, ps, other

    quant-ph

    Coherent Rydberg excitation of single atoms using a pulsed fiber amplifier

    Authors: Ying-Wen Zhang, Yang Wang, Chen-Long Xu, Yi-Bo Wang, Peng Xu

    Abstract: In recent years, the growing scale of programmable neutral-atom arrays has led to an increasing demand for higher-power Rydberg excitation light. Although pulsed amplifiers deliver higher peak power than continuous-wave lasers, their use for efficient coherent Rydberg excitation of single atoms in arrays has been limited by challenges such as pulse distortion, synchronization with excitation seque… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: 10 pages, 8 figures

  8. arXiv:2604.12968  [pdf, ps, other

    cs.LG cs.CV

    Evolution of Optimization Methods: Algorithms, Scenarios, and Evaluations

    Authors: Tong Zhang, Jiangning Zhang, Zhucun Xue, Juntao Jiang, Yicheng Xu, Chengming Xu, Teng Hu, Xingyu Xie, Xiaobin Hu, Yabiao Wang, Yong Liu, Shuicheng Yan

    Abstract: Balancing convergence speed, generalization capability, and computational efficiency remains a core challenge in deep learning optimization. First-order gradient descent methods, epitomized by stochastic gradient descent (SGD) and Adam, serve as the cornerstone of modern training pipelines. However, large-scale model training, stringent differential privacy requirements, and distributed learning p… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

  9. arXiv:2604.12524  [pdf, ps, other

    hep-ex

    Observation of the Exotic State $π_{1}(1600)$ in $ψ(2S)\rightarrowγχ_{c1},χ_{c1}\rightarrowπ^{+}π^{-}η'$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (728 additional authors not shown)

    Abstract: A partial wave analysis of the process $ψ(2S)\rightarrowγχ_{c1}, χ_{c1}\rightarrowπ^+π^-η^{\prime}$ is performed using $(2712.4\pm14.3)\times10^{6}$ $ψ(2S)$ events collected with the BESIII detector. An isovector state with exotic quantum numbers $J^{PC}=1^{-+}$, denoted as $π_{1}(1600)$, is observed for the first time in the charmonium decay of $χ_{c1}\rightarrowπ_{1}^{\pm}(1600)π^{\mp}$,… ▽ More

    Submitted 14 April, 2026; v1 submitted 14 April, 2026; originally announced April 2026.

  10. arXiv:2604.12286  [pdf, ps, other

    cs.CV

    LiveMoments: Reselected Key Photo Restoration in Live Photos via Reference-guided Diffusion

    Authors: Clara Xue, Zizheng Yan, Zhenning Shi, Yuhang Yu, Jingyu Zhuang, Qi Zhang, Jinwei Chen, Qingnan Fan

    Abstract: Live Photo captures both a high-quality key photo and a short video clip to preserve the precious dynamics around the captured moment. While users may choose alternative frames as the key photo to capture better expressions or timing, these frames often exhibit noticeable quality degradation, as the photo capture ISP pipeline delivers significantly higher image quality than the video pipeline. Thi… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: Accepted by ICLR 2026

  11. arXiv:2604.12056  [pdf, ps, other

    cs.CL cs.LG

    LoSA: Locality Aware Sparse Attention for Block-Wise Diffusion Language Models

    Authors: Haocheng Xi, Harman Singh, Yuezhou Hu, Coleman Hooper, Rishabh Tiwari, Aditya Tomar, Minjae Lee, Wonjun Kang, Michael Mahoney, Chenfeng Xu, Kurt Keutzer, Amir Gholami

    Abstract: Block-wise diffusion language models (DLMs) generate multiple tokens in any order, offering a promising alternative to the autoregressive decoding pipeline. However, they still remain bottlenecked by memory-bound attention in long-context scenarios. Naive sparse attention fails on DLMs due to a KV Inflation problem, where different queries select different prefix positions, making the union of acc… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

    Comments: 16 pages, 11 figures, 6 tables

  12. arXiv:2604.11913  [pdf, ps, other

    cs.CV

    V-Nutri: Dish-Level Nutrition Estimation from Egocentric Cooking Videos

    Authors: Chengkun Yue, Chuanzhi Xu, Jiangpeng He

    Abstract: Nutrition estimation of meals from visual data is an important problem for dietary monitoring and computational health, but existing approaches largely rely on single images of the finally completed dish. This setting is fundamentally limited because many nutritionally relevant ingredients and transformations, such as oils, sauces, and mixed components, become visually ambiguous after cooking, mak… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

    Comments: Accepted to the 3rd MetaFood Workshop at CVPR 2026

  13. arXiv:2604.11414  [pdf, ps, other

    physics.flu-dyn

    Compressible turbulent boundary layers over two-dimensional square-rib roughness

    Authors: Youtian Su, Wei-Xi Huang, Chunxiao Xu

    Abstract: Direct numerical simulations are performed to investigate the combined effects of surface roughness and wall heat transfer on spatially developing compressible turbulent boundary layers at $Ma=2.5$. The roughness consists of transverse square bars with $λ_x/k=8$ and $k^+ \approx 35$, under adiabatic and wall-cooling ($T_w/T_r = 0.5$) conditions. Dynamically, the conventional zero-moment method fai… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

    Comments: 26 pages, 16 figures

  14. arXiv:2604.11040  [pdf, ps, other

    cs.AI

    Intelligent Approval of Access Control Flow in Office Automation Systems via Relational Modeling

    Authors: Dugang Liu, Zulong Chen, Chuanfei Xu, Jiaxuan He, Yunlu Ma, Jia Xu

    Abstract: Office automation (OA) systems play a crucial role in enterprise operations and management, with access control flow approval (ACFA) being a key component that manages the accessibility of various resources. However, traditional ACFA requires approval from the person in charge at each step, which consumes a significant amount of manpower and time. Its intelligence is a crucial issue that needs to… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  15. arXiv:2604.11035  [pdf, ps, other

    cs.AI

    Introspective Diffusion Language Models

    Authors: Yifan Yu, Yuqing Jian, Junxiong Wang, Zhongzhu Zhou, Donglin Zhuang, Xinyu Fang, Sri Yanamandra, Xiaoxia Wu, Qingyang Wu, Shuaiwen Leon Song, Tri Dao, Ben Athiwaratkun, James Zou, Fan Lai, Chenfeng Xu

    Abstract: Diffusion language models promise parallel generation, yet still lag behind autoregressive (AR) models in quality. We stem this gap to a failure of introspective consistency: AR models agree with their own generations, while DLMs often do not. We define the introspective acceptance rate, which measures whether a model accepts its previously generated tokens. This reveals why AR training has a stru… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  16. arXiv:2604.10647  [pdf, ps, other

    cs.RO

    OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

    Authors: Shaqi Luo, Yuanyuan Li, Youhao Hu, Chenhao Yu, Chaoran Xu, Jiachen Zhang, Guocai Yao, Tiejun Huang, Ran He, Zhongyuan Wang

    Abstract: UMI-style interfaces enable scalable robot learning, but existing systems remain largely visuomotor, relying primarily on RGB observations and trajectory while providing only limited access to physical interaction signals. This becomes a fundamental limitation in contact-rich manipulation, where success depends on contact dynamics such as tactile interaction, internal grasping force, and external… ▽ More

    Submitted 12 April, 2026; originally announced April 2026.

  17. arXiv:2604.10578  [pdf, ps, other

    cs.CV

    Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models

    Authors: Dehui Wang, Congsheng Xu, Rong Wei, Yue Shi, Shoufa Chen, Dingxiang Luo, Tianshuo Yang, Xiaokang Yang, Wei Sui, Yusen Qin, Rui Tang, Yao Mu

    Abstract: The growing demand for Embodied AI and VR applications has highlighted the need for synthesizing high-quality 3D indoor scenes from sparse inputs. However, existing approaches struggle to infer massive amounts of missing geometry in large unseen areas while maintaining global consistency, often producing locally plausible but globally inconsistent reconstructions. We present Rein3D, a framework th… ▽ More

    Submitted 14 April, 2026; v1 submitted 12 April, 2026; originally announced April 2026.

  18. arXiv:2604.10523  [pdf, ps, other

    hep-ex

    Measurement of the branching fractions of $χ_{cJ} \to π^{+}π^{-}π^{0}π^{0}$ via $ψ(3686) \to γχ_{cJ}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (741 additional authors not shown)

    Abstract: Using $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector operating at BEPCII, the branching fractions of $χ_{cJ}\toπ^+π^-π^0π^0$ ($J=0,~1,~2$) are measured via the radiative transition $ψ(3686)\toγχ_{cJ}$. The results are $\mathcal{B}(χ_{c0} \to π^{+}π^{-}π^{0}π^{0}) = (3.10 \pm 0.01 \pm 0.14) \times 10^{-2}$,… ▽ More

    Submitted 12 April, 2026; originally announced April 2026.

  19. arXiv:2604.10444  [pdf, ps, other

    hep-ex

    First Observation of \boldmath{$D^+ \to a_0(980)ρ$ and $D^+ \to a_0(980)^+ f_0(500)$} in \boldmath{$D^+ \to π^+π^+π^-η$ and $D^+ \to π^+π^0π^0η$} Decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, X. L. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (734 additional authors not shown)

    Abstract: We perform the first amplitude analysis of the singly Cabibbo-suppressed decays $D^+ \to π^+ π^{+(0)} π^{-(0)} η$, using $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773\,GeV, corresponding to an integrated luminosity of 20.3 $\rm{fb}^{-1}$. The absolute branching fractions of the $D^+ \to π^+ π^+ π^- η$ and $D^+ \to π^+ π^0 π^0 η$ decays are measure… ▽ More

    Submitted 15 April, 2026; v1 submitted 11 April, 2026; originally announced April 2026.

  20. arXiv:2604.10126  [pdf, ps, other

    cs.SE cs.AI

    MR-Coupler: Automated Metamorphic Test Generation via Functional Coupling Analysis

    Authors: Congying Xu, Hengcheng Zhu, Songqiang Chen, Jiarong Wu, Valerio Terragni, Shing-Chi Cheung

    Abstract: Metamorphic testing (MT) is a widely recognized technique for alleviating the oracle problem in software testing. However, its adoption is hindered by the difficulty of constructing effective metamorphic relations (MRs), which often require domain-specific or hard-to-obtain knowledge. In this work, we propose a novel approach that leverages the functional coupling between methods, which is readily… ▽ More

    Submitted 11 April, 2026; originally announced April 2026.

    Comments: Note: Accepted on ACM International Conference on the Foundations of Software Engineering (FSE) 2026

    Journal ref: Proceedings of the ACM on Software Engineering, Volume 3, Article FSE206 (FSE 2026)

  21. arXiv:2604.10079  [pdf, ps, other

    cs.CL

    Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models

    Authors: Chao Xue, Yao Wang, Mengqiao Liu, Di Liang, Xingsheng Han, Peiyang Liu, Xianjie Wu, Chenyao Lu, Lei Jiang, Yu Lu, Haibo Shi, Shuang Liang, Minlong Peng, Flora D. Salim

    Abstract: Supervised Fine-Tuning (SFT) is the standard approach for adapting large language models (LLMs) to downstream tasks. However, we observe a persistent failure mode: even after convergence, models often fail to correctly reproduce a subset of their own supervised training data. We refer to this behavior as the Incomplete Learning Phenomenon(ILP). This paper presents the first systematic study of ILP… ▽ More

    Submitted 16 April, 2026; v1 submitted 11 April, 2026; originally announced April 2026.

    Comments: Accepted by ACL 2026 Main

  22. arXiv:2604.10072  [pdf, ps, other

    cs.CL

    Reason Only When Needed: Efficient Generative Reward Modeling via Model-Internal Uncertainty

    Authors: Chao Xue, Yao Wang, Mengqiao Liu, Di Liang, Xingsheng Han, Peiyang Liu, Xianjie Wu, Chenyao Lu, Lei Jiang, Yu Lu, Haibo Shi, Shuang Liang, Minlong Peng, Flora D. Salim

    Abstract: Recent advancements in the Generative Reward Model (GRM) have demonstrated its potential to enhance the reasoning abilities of LLMs through Chain-of-Thought (CoT) prompting. Despite these gains, existing implementations of GRM suffer from two critical limitations. First, CoT prompting is applied indiscriminately to all inputs regardless of their inherent complexity. This introduces unnecessary com… ▽ More

    Submitted 16 April, 2026; v1 submitted 11 April, 2026; originally announced April 2026.

    Comments: accepted by ACL 2026 Findings

  23. arXiv:2604.10052  [pdf, ps, other

    cs.CR cs.NI

    Impact of Intelligent Technologies on IoV Security: Integrating Edge Computing and AI

    Authors: Awais Bilal, Kashif Sharif, Liehuang Zhu, Chang Xu, Fan Li, Sadaf Bukhari, Sujit Biswas

    Abstract: The rapid development and integration of intelligent technologies in the Internet of Vehicles (IoV) have revolutionized transportation systems by enhancing connectivity, automation, and safety. However, the complexity and connectivity of IoV networks also introduce security challenges, including data privacy concerns, cyber threats, and system vulnerabilities. This paper surveys the role of Edge C… ▽ More

    Submitted 11 April, 2026; originally announced April 2026.

  24. arXiv:2604.09750  [pdf, ps, other

    cs.CR cs.AI

    Conflicts Make Large Reasoning Models Vulnerable to Attacks

    Authors: Honghao Liu, Chengjin Xu, Xuhui Jiang, Cehao Yang, Shengming Yin, Zhengwu Ma, Lionel Ni, Jian Guo

    Abstract: Large Reasoning Models (LRMs) have achieved remarkable performance across diverse domains, yet their decision-making under conflicting objectives remains insufficiently understood. This work investigates how LRMs respond to harmful queries when confronted with two categories of conflicts: internal conflicts that pit alignment values against each other and dilemmas, which impose mutually contradict… ▽ More

    Submitted 10 April, 2026; originally announced April 2026.

  25. arXiv:2604.09668  [pdf, ps, other

    cs.IR cs.CV

    Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval

    Authors: Yin Wu, Gangjian Zhang, Jiayu Chen, Chang Xu, Yuyu Luo, Nan Tang, Hui Xiong

    Abstract: Understanding humanity's earliest writing systems is crucial for reconstructing civilization's origins, yet many ancient scripts remain undeciphered. Oracle Bone Script (OBS) from China's Shang dynasty exemplifies this challenge: only approximately 1,500 of roughly 4,600 characters have been decoded, and a substantial portion of these 3,000-year-old inscriptions remains only partially understood.… ▽ More

    Submitted 1 April, 2026; originally announced April 2026.

    Comments: 19 pages, 4 figures. Under review at Nature Machine Intelligence

  26. arXiv:2604.09288  [pdf, ps, other

    cs.LG

    Are Independently Estimated View Uncertainties Comparable? Unified Routing for Trusted Multi-View Classification

    Authors: Yilin Zhang, Cai Xu, Haishun Chen, Ziyu Guan, Wei Zhao

    Abstract: Trusted multi-view classification typically relies on a view-wise evidential fusion process: each view independently produces class evidence and uncertainty, and the final prediction is obtained by aggregating these independent opinions. While this design is modular and uncertainty-aware, it implicitly assumes that evidence from different views is numerically comparable. In practice, however, this… ▽ More

    Submitted 10 April, 2026; originally announced April 2026.

    Comments: 14pages, Under Review

  27. arXiv:2604.08229  [pdf, ps, other

    nucl-ex

    Ground State Decay of the Three-Proton Emitter $^{17}$Na Reveals Isospin Symmetry Breaking

    Authors: X. -D. Xu, I. Mukha, Z. C. Xu, S. M. Wang, K. Y. Zhang, L. Acosta, E. Casarejos, D. Cortina-Gil, J. M. Espino, A. Fomichev, H. Geissel, J. Gómez-Camacho, L. V. Grigorenko, O. Kiselev, A. A. Korsheninnikov, N. Kurz, Yu. A. Litvinov, I. Martel, C. Nociforo, M. Pfützner, C. Rodríguez-Tajes, C. Scheidenberger, M. Stanoiu, K. Sümmerer, H. Weick , et al. (2 additional authors not shown)

    Abstract: The spectrum of the exotic three-proton (3p) emitter $^{17}$Na has been studied by detecting all in-flight decay products. Derived from the measured angular correlations $^{14}$O+p+p+p, a resonant peak has been discovered at the 3p-decay energy of 2.24($^{+0.17}_{-0.25}$) MeV, which likely corresponds to the $^{17}$Na ground state. This decay energy value is significantly smaller than the previous… ▽ More

    Submitted 9 April, 2026; originally announced April 2026.

    Comments: 10 pages, 8 figures

  28. arXiv:2604.08044  [pdf, ps, other

    cs.AR

    A Full-Stack Performance Evaluation Infrastructure for 3D-DRAM-based LLM Accelerators

    Authors: Cong Li, Chenhao Xue, Yi Ren, Xiping Dong, Yu Cheng, Yinbo Hu, Fujun Bai, Yixin Guo, Xiping Jiang, Qiang Wu, Zhi Yang, Zhe Cheng, Yuan Xie, Guangyu Sun

    Abstract: Large language models (LLMs) exhibit memory-intensive behavior during decoding, making it a key bottleneck in LLM inference. To accelerate decoding execution, hybrid-bonding-based 3D-DRAM has been adopted in LLM accelerators. While this emerging technology provides strong performance gains over existing hardware, current 3D-DRAM accelerators (3D-Accelerators) rely on closed-source evaluation tools… ▽ More

    Submitted 9 April, 2026; originally announced April 2026.

  29. arXiv:2604.07963  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Rethinking Data Mixing from the Perspective of Large Language Models

    Authors: Yuanjian Xu, Tianze Sun, Changwei Xu, XinLong Zhao, Jianing Hao, Ran Chen, Yang Liu, Ruijie Xu, Stephen Chen, Guang Zhang

    Abstract: Data mixing strategy is essential for large language model (LLM) training. Empirical evidence shows that inappropriate strategies can significantly reduce generalization. Although recent methods have improved empirical performance, several fundamental questions remain open: what constitutes a domain, whether human and model perceptions of domains are aligned, and how domain weighting influences ge… ▽ More

    Submitted 9 April, 2026; originally announced April 2026.

  30. arXiv:2604.07941  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Large Language Model Post-Training: A Unified View of Off-Policy and On-Policy Learning

    Authors: Shiwan Zhao, Zhihu Wang, Xuyang Zhao, Jiaming Zhou, Caiyue Xu, Chenfei Liu, Liting Zhang, Yuhang Jia, Yanzhe Zhang, Hualong Yu, Zichen Xu, Qicheng Li, Yong Qin

    Abstract: Post-training has become central to turning pretrained large language models (LLMs) into aligned, capable, and deployable systems. Recent progress spans supervised fine-tuning (SFT), preference optimization, reinforcement learning (RL), process supervision, verifier-guided methods, distillation, and multi-stage pipelines. Yet these methods are often discussed in fragmented ways, organized by label… ▽ More

    Submitted 16 April, 2026; v1 submitted 9 April, 2026; originally announced April 2026.

    Comments: 38 pages, 1 figure, 8 tables

  31. arXiv:2604.07725  [pdf, ps, other

    cs.AI cs.CL

    Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

    Authors: Monishwaran Maheswaran, Leon Lakhani, Zhongzhu Zhou, Shijia Yang, Junxiong Wang, Coleman Hooper, Yuezhou Hu, Rishabh Tiwari, Jue Wang, Harman Singh, Qingyang Wu, Yuqing Jian, Ce Zhang, Kurt Keutzer, Tri Dao, Xiaoxia Wu, Ben Athiwaratkun, James Zou, Chenfeng Xu

    Abstract: We show that verifier-free evolution is bottlenecked by both diversity and efficiency: without external correction, repeated evolution accelerates collapse toward narrow modes, while the uniform use of a high-cost model wastes compute and quickly becomes economically impractical. We introduce Squeeze Evolve, a unified multi-model orchestration framework for verifier-free evolutionary inference. Ou… ▽ More

    Submitted 10 April, 2026; v1 submitted 8 April, 2026; originally announced April 2026.

    Comments: 40 Pages, Project Page: https://squeeze-evolve.github.io/

  32. arXiv:2604.07409  [pdf, ps, other

    cs.LG eess.IV

    GAN-based Domain Adaptation for Image-aware Layout Generation in Advertising Poster Design

    Authors: Chenchen Xu, Min Zhou, Tiezheng Ge, Weiwei Xu

    Abstract: Layout plays a crucial role in graphic design and poster generation. Recently, the application of deep learning models for layout generation has gained significant attention. This paper focuses on using a GAN-based model conditioned on images to generate advertising poster graphic layouts, requiring a dataset of paired product images and layouts. To address this task, we introduce the Content-awar… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: arXiv admin note: text overlap with arXiv:2303.14377

  33. arXiv:2604.07378  [pdf, ps, other

    cs.RO

    Evaluation as Evolution: Transforming Adversarial Diffusion into Closed-Loop Curricula for Autonomous Vehicles

    Authors: Yicheng Guo, Jiaqi Liu, Chengkai Xu, Peng Hang, Jian Sun

    Abstract: Autonomous vehicles in interactive traffic environments are often limited by the scarcity of safety-critical tail events in static datasets, which biases learned policies toward average-case behaviors and reduces robustness. Existing evaluation methods attempt to address this through adversarial stress testing, but are predominantly open-loop and post-hoc, making it difficult to incorporate discov… ▽ More

    Submitted 7 April, 2026; originally announced April 2026.

  34. arXiv:2604.06959  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Microscopic evidence of spin-driven multiferroicity and topological spin textures in monolayer NiI2

    Authors: Haitao Wang, Tianxing Jiang, Weiyi Pan, Xu Wang, Hongyu Wang, Junchao Tian, Lianchuang Li, Dongming Zhao, Qingle Zhang, Chenxi Wang, Ying Yang, Hongjun Xiang, Changsong Xu, Donglai Feng, Tong Zhang

    Abstract: In type II multiferroics, noncollinear spin textures are expected to induce electric polarization directly, leading to strong magnetoelectric coupling. Realizing such spin driven multiferroicity in two-dimensional systems, and elucidating the interplay between local spins and electric polarization, are of both fundamental and technological importance. Here, using vectorial spin polarized scanning… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: 26 pages, 20 figures, supplementary materials included

    Journal ref: Phys. Rev. Lett. 136, 026402 (2026)

  35. arXiv:2604.06912  [pdf, ps, other

    cs.CV cs.AI

    Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models

    Authors: Yuheng Shi, Xiaohuan Pei, Linfeng Wen, Minjing Dong, Chang Xu

    Abstract: MLLMs require high-resolution visual inputs for fine-grained tasks like document understanding and dense scene perception. However, current global resolution scaling paradigms indiscriminately flood the quadratic self-attention mechanism with visually redundant tokens, severely bottlenecking inference throughput while ignoring spatial sparsity and query intent. To overcome this, we propose Q-Zoom,… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: 16 pages, 9 figures

  36. arXiv:2604.06121  [pdf, ps, other

    physics.flu-dyn

    Free Surface Enhancement of Droplet Rupture by Cavitation Bubble Collapse

    Authors: Chenghao Xu, Zhengyu Yang, Jie Feng

    Abstract: The interaction between cavitation bubbles and surrounding droplets plays a central role in applications such as surface cleaning, ultrasonic emulsification, and therapeutic delivery. These processes depend on bubble-driven microjets that drive the deformation and breakup of the droplets, which are significantly influenced by geometric confinements. Here, we investigate the hydrodynamic interactio… ▽ More

    Submitted 7 April, 2026; originally announced April 2026.

  37. arXiv:2604.05712  [pdf, ps, other

    hep-ex

    Precise measurement of the CKM angle $γ$ with a novel approach

    Authors: The BESIII, LHCb Collaborations, :, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco , et al. (1936 additional authors not shown)

    Abstract: A measurement of the CKM angle $γ$ is performed by applying a novel, unbinned, model-independent approach to datasets of electron-positron collisions collected by the BESIII experiment and proton-proton collisions by the LHCb experiment, corresponding to integrated luminosities of 8 fb$^{-1}$ and 9 fb$^{-1}$, respectively. The $C\!P$-violating phase $γ$ is determined from… ▽ More

    Submitted 7 April, 2026; originally announced April 2026.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/5991/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-064, CERN-EP-2026-068

  38. arXiv:2604.05701  [pdf, ps, other

    hep-ex

    Measurement of the CKM angle $γ$ in $B^{\pm} \rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-})h^{\pm}$ decays with a novel approach

    Authors: The BESIII, LHCb Collaborations, :, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Aliberti, A. Amoroso, Q. An, Y. H. An, Y. Bai, O. Bakina, H. R. Bao, X. L. Bao, M. Barbagiovanni, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco , et al. (1936 additional authors not shown)

    Abstract: A measurement of the CKM angle $γ$ and related strong-phase parameters is performed using a novel, model-independent approach in ${B^{\pm}\rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays, where $h^{(\prime)} \equiv π, K$. The analysis uses a joint data sample of electron-positron collisions collected by the BESIII experiment at the Beijing Electron-Positron Collider… ▽ More

    Submitted 7 April, 2026; originally announced April 2026.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3989/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-063, CERN-EP-2026-067

  39. arXiv:2604.05620  [pdf, ps, other

    cs.CV cs.AI

    Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening

    Authors: Chenyu Xue, Yiran Liu, Mian Zhou, Jionglong Su, Zhixiang Lu

    Abstract: Medical image segmentation driven by free-text clinical instructions is a critical frontier in computer-aided diagnosis. However, existing multimodal and foundation models struggle with the semantic ambiguity of clinical reports and fail to disambiguate complex anatomical overlaps in low-contrast scans. Furthermore, fully fine-tuning these massive architectures on limited medical datasets invariab… ▽ More

    Submitted 7 April, 2026; originally announced April 2026.

  40. arXiv:2604.05353  [pdf, ps, other

    eess.SP

    Quasi-stationary Slice Detection-Based Robust Respiration Rate Estimation under Large-scale Random Body Movement

    Authors: Chendong Xu, Shuai Yao, Haoying Bao, Chiyuan Ma, Qisong Wu

    Abstract: Radar-based non-contact respiration rate (RR) measurement has become increasingly popular due to its convenience, non-intrusiveness, and low cost. However, it is still quite challenging to accurately acquire vital signs estimation in complex measurement scenarios with large-scale random body movements (RBM), particularly for RR estimation due to strong low-frequency interferences. To cope with the… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

  41. arXiv:2604.04839  [pdf, ps, other

    cs.CL

    MERIT: Multilingual Expert-Reward Informed Tuning for Chinese-Centric Low-Resource Machine Translation

    Authors: Zhixiang Lu, Chong Zhang, Chenyu Xue, Angelos Stefanidis, Chong Li, Jionglong Su, Zhengyong Jiang

    Abstract: Neural machine translation (NMT) from Chinese to low-resource Southeast Asian languages remains severely constrained by the extreme scarcity of clean parallel corpora and the pervasive noise in existing mined data. This chronic shortage not only impedes effective model training but also sustains a large performance gap with high-resource directions, leaving millions of speakers of languages such a… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

  42. arXiv:2604.04815  [pdf, ps, other

    cs.CL cs.AI

    LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

    Authors: Cheng Xu, Changhong Jin, Yingjie Niu, Nan Yan, Yuke Mei, Shuhao Guan, Liming Chen, M-Tahar Kechadi

    Abstract: The rapid development of Large Language Models (LLMs) has transformed fake news detection and fact-checking tasks from simple classification to complex reasoning. However, evaluation frameworks have not kept pace. Current benchmarks are static, making them vulnerable to benchmark data contamination (BDC) and ineffective at assessing reasoning under temporal uncertainty. To address this, we introdu… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

    Comments: ACL 2026 Main

  43. arXiv:2604.04771  [pdf, ps, other

    cs.CV cs.CL

    MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

    Authors: Bin Wang, Tianyao He, Linke Ouyang, Fan Wu, Zhiyuan Zhao, Tao Chu, Yuan Qu, Zhenjiang Jin, Weijun Zeng, Ziyang Miao, Bangrui Xu, Junbo Niu, Mengzhang Cai, Jiantao Qiu, Qintong Zhang, Dongsheng Ma, Yuefeng Sun, Hejun Dong, Wenzheng Zhang, Jutao Xiao, Jiayong Shi, Pengyu Liao, Xiaomeng Zhao, Huaping Zhong, Liqun Wei , et al. (18 additional authors not shown)

    Abstract: Current document parsing methods advance primarily through model architecture innovation, while systematic engineering of training data remains underexplored. Yet state-of-the-art models spanning diverse architectures and parameter scales exhibit highly consistent failure patterns on the same set of hard samples, suggesting that the performance bottleneck stems from shared deficiencies in training… ▽ More

    Submitted 9 April, 2026; v1 submitted 6 April, 2026; originally announced April 2026.

    Comments: Technical Report

  44. Tighter entropic uncertainty relations in the presence of quantum memories for complete sets of mutually unbiased bases

    Authors: Qing-Hua Zhang, Cong Xu, Jing-Feng Wu, Shao-Ming Fei

    Abstract: Entropic uncertainty relations provide an information-theoretic framework for quantifying the fundamental indeterminacy inherent in quantum mechanics. We propose more stringent quantum-memory-assisted entropic uncertainty relations for complete sets of mutually unbiased bases in multipartite scenarios. We present lower and upper bounds of the quantum uncertainties based on the complementarity of t… ▽ More

    Submitted 5 April, 2026; originally announced April 2026.

    Journal ref: Advanced Quantum Technologies, 2026; 9:e00761

  45. arXiv:2604.03929  [pdf

    cond-mat.mtrl-sci physics.optics

    Direct Photocurrent Detection of Optical Vortex Based on the Orbital Photo Galvanic Effect: Progress, Challenge and Perspective

    Authors: Jinluo Cheng, Dehong Yang, Weiming Wang, Chang Xu, Zipu Fan, Dong Sun

    Abstract: A photodetector that can directly distinguish the orbital angular momentum (OAM) of light is highly desirable for integrated on-chip OAM detection and focal plane array devices. The recent development of OAM detectors based on the intrinsic orbital photo galvanic effects (OPGE) of materials provide a new route for direct OAM detection that is on-chip scalable with high resolution and speed. In thi… ▽ More

    Submitted 4 April, 2026; originally announced April 2026.

    Comments: 28 pages, 5 figures, 3 tables; Accepted by Advanced Science

  46. arXiv:2604.03893  [pdf, ps, other

    cs.AI

    FeynmanBench: Benchmarking Multimodal LLMs on Diagrammatic Physics Reasoning

    Authors: Zeyu Wang, Xiaogang Li, Peiyao Xiao, Qinhao Kong, Ben Wang, Chengliang Xu, Zichao Chen, Bing Zhao, Hu Wei

    Abstract: Breakthroughs in frontier theory often depend on the combination of concrete diagrammatic notations with rigorous logic. While multimodal large language models (MLLMs) show promise in general scientific tasks, current benchmarks often focus on local information extraction rather than the global structural logic inherent in formal scientific notations. In this work, we introduce FeynmanBench, the f… ▽ More

    Submitted 4 April, 2026; originally announced April 2026.

    Comments: 10 pages, 5 figures

  47. arXiv:2604.03699  [pdf, ps, other

    cs.IT eess.SP

    Region-Based Constellation Designs for Constructive Interference Precoding in MU-MIMO

    Authors: Yupeng Zheng, Chunmei Xu, Jinfei Wang, Yi Ma, Rahim Tafazolli

    Abstract: The performance of constructive interference precoding (CIP) for multi-user multi-antenna (MU-MIMO) systems is governed by the structure of the constructive interference (CI) regions, yet this is overlooked in conventional constellation design. This work proposes the region-based constellation (RBC) model to lay the foundation for CIP constellation design. An RBC directly defines the mapping betwe… ▽ More

    Submitted 4 April, 2026; originally announced April 2026.

    Comments: 12 pages, 10 figures, submitted to IEEE Transactions on Signal Processing

  48. arXiv:2604.03339  [pdf, ps, other

    cs.CV

    Hierarchical Awareness Adapters with Hybrid Pyramid Feature Fusion for Dense Depth Prediction

    Authors: Wuqi Su, Huilun Song, Chen Zhao, Chi Xu

    Abstract: Monocular depth estimation from a single RGB image remains a fundamental challenge in computer vision due to inherent scale ambiguity and the absence of explicit geometric cues. Existing approaches typically rely on increasingly complex network architectures to regress depth maps, which escalates training costs and computational overhead without fully exploiting inter-pixel spatial dependencies. W… ▽ More

    Submitted 3 April, 2026; originally announced April 2026.

  49. arXiv:2604.03114  [pdf, ps, other

    cs.CV cs.AI

    Can VLMs Truly Forget? Benchmarking Training-Free Visual Concept Unlearning

    Authors: Zhangyun Tan, Zeliang Zhang, Susan Liang, Yolo Yunlong Tang, Lisha Chen, Chenliang Xu

    Abstract: VLMs trained on web-scale data retain sensitive and copyrighted visual concepts that deployment may require removing. Training-based unlearning methods share a structural flaw: fine-tuning on a narrow forget set degrades general capabilities before unlearning begins, making it impossible to attribute subsequent performance drops to the unlearning procedure itself. Training-free approaches sidestep… ▽ More

    Submitted 3 April, 2026; originally announced April 2026.

  50. arXiv:2604.03044  [pdf, ps, other

    cs.CL cs.AI

    JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency

    Authors: Aichen Cai, Anmeng Zhang, Anyu Li, Bo Zhang, Bohua Cai, Chang Li, Changjian Jiang, Changkai Lu, Chao Xue, Chaocai Liang, Cheng Zhang, Dongkai Liu, Fei Wang, Guoqiang Huang, Haijian Ke, Han Lin, Hao Wang, Ji Miao, Jiacheng Zhang, Jialong Shi, Jifeng Zhu, Jingjing Qian, Junhui Luo, Junwu Xiong, Lam So , et al. (44 additional authors not shown)

    Abstract: We introduce JoyAI-LLM Flash, an efficient Mixture-of-Experts (MoE) language model designed to redefine the trade-off between strong performance and token efficiency in the sub-50B parameter regime. JoyAI-LLM Flash is pretrained on a massive corpus of 20 trillion tokens and further optimized through a rigorous post-training pipeline, including supervised fine-tuning (SFT), Direct Preference Optimi… ▽ More

    Submitted 8 April, 2026; v1 submitted 3 April, 2026; originally announced April 2026.

    Comments: Xiaodong He is the corresponding author