Skip to main content

Showing 1–50 of 4,177 results for author: Xu, W

.
  1. arXiv:2604.14113  [pdf, ps, other

    cs.CV cs.AI cs.CL

    UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

    Authors: Fei Tang, Bofan Chen, Zhengxi Lu, Tongbo Chen, Songqin Nong, Tao Jiang, Wenhao Xu, Weiming Lu, Jun Xiao, Yueting Zhuang, Yongliang Shen

    Abstract: GUI grounding, which localizes interface elements from screenshots given natural language queries, remains challenging for small icons and dense layouts. Test-time zoom-in methods improve localization by cropping and re-running inference at higher resolution, but apply cropping uniformly across all instances with fixed crop sizes, ignoring whether the model is actually uncertain on each case. We p… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

    Comments: Project Page: https://zju-real.github.io/UI-Zoomer Code: https://github.com/ZJU-REAL/UI-Zoomer

  2. arXiv:2604.13594  [pdf

    physics.flu-dyn cs.LG

    Data-driven Learning of Probabilistic Model of Binary Droplet Collision for Spray Simulation

    Authors: Weiming Xu, Tao Yang, Peng Zhang

    Abstract: Binary droplet collisions are ubiquitous in dense sprays. Traditional deterministic models cannot adequately represent transitional and stochastic behaviors of binary droplet collision. To bridge this gap, we developed a probabilistic model by using a machine learning approach, the Light Gradient-Boosting Machine (LightGBM). The model was trained on a comprehensive dataset of 33,540 experimental c… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

    Comments: 28 pages, 11 figures, research paper

  3. arXiv:2604.13029  [pdf, ps, other

    cs.CV cs.AI

    Visual Preference Optimization with Rubric Rewards

    Authors: Ya-Qi Yu, Fangyu Hong, Xiangyang Qu, Hao Wang, Gaojie Wu, Qiaoyu Luo, Nuo Xu, Huixin Wang, Wuheng Xu, Yongxin Liao, Zihao Chen, Haonan Li, Ziming Li, Dezhi Peng, Minghui Liao, Jihao Wu, Haoyu Ren, Dandan Tu

    Abstract: The effectiveness of Direct Preference Optimization (DPO) depends on preference data that reflect the quality differences that matter in multimodal tasks. Existing pipelines often rely on off-policy perturbations or coarse outcome-based signals, which are not well suited to fine-grained visual reasoning. We propose rDPO, a preference optimization framework based on instance-specific rubrics. For e… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

  4. arXiv:2604.12834  [pdf, ps, other

    eess.SP cs.CR cs.LG

    Rapid LoRA Aggregation for Wireless Channel Adaptation in Open-Set Radio Frequency Fingerprinting

    Authors: Mingxi Zhang, Renjie Xie, Jincheng Wang, Guyue Li, Wei Xu

    Abstract: Radio frequency fingerprints (RFFs) enable secure wireless authentication but struggle in open-set scenarios with unknown devices and varying channels. Existing methods face challenges in generalization and incur high computational costs. We propose a lightweight, self-adaptive RFF extraction framework using Low-Rank Adaptation (LoRA). By pretraining LoRA modules per environment, our method enable… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: 6 pages

  5. arXiv:2604.12313  [pdf

    cond-mat.supr-con cond-mat.mes-hall physics.app-ph

    Nanoscale electrothermal-switch superconducting diode for electrically programmable superconducting circuits

    Authors: Tianyu Li, Jiong Li, Chong Li, Peiyuan Huang, Nuo-Zhou Yang, Wuyue Xu, Wen-Cheng Yue, Yang-Yang Lyu, Yihuang Xiong, Xuecou Tu, Tao Tao, Xiaoqing Jia, Qing-Hu Chen, Huabing Wang, Peiheng Wu, Yong-Lei Wang

    Abstract: Superconducting diodes enable dissipationless directional transport, yet achieving electrical tunability and scalability remains a major challenge for circuit-level integration. Here, we demonstrate an electrothermal-switch superconducting diode in which a gate-controlled nanoscale hotspot dynamically breaks inversion symmetry in a superconducting nanowire. This mechanism gives rise to two coexist… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: To appear in Nano Letters

  6. arXiv:2604.12285  [pdf, ps, other

    cs.AI

    GAM: Hierarchical Graph-based Agentic Memory for LLM Agents

    Authors: Zhaofen Wu, Hanrong Zhang, Fulin Lin, Wujiang Xu, Xinran Xu, Yankai Chen, Henry Peng Zou, Shaowen Chen, Weizhi Zhang, Xue Liu, Philip S. Yu, Hongwei Wang

    Abstract: To sustain coherent long-term interactions, Large Language Model (LLM) agents must navigate the tension between acquiring new information and retaining prior knowledge. Current unified stream-based memory systems facilitate context updates but remain vulnerable to interference from transient noise. Conversely, discrete structured memory architectures provide robust knowledge retention but often st… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: 18 pages, 6 figures

  7. arXiv:2604.11540  [pdf

    cs.AI

    A collaborative agent with two lightweight synergistic models for autonomous crystal materials research

    Authors: Tongyu Shi, Yutang Li, Zhanyuan Li, Qian Liu, Jie Zhou, Wenhe Xu, Yang Li, Dawei Dai, Rui He, Wenhua Zhou, Jiahong Wang, Xue-Feng Yu

    Abstract: Current large language models require hundreds of billions of parameters yet struggle with domain-specific reasoning and tool coordination in materials science. Here, we present MatBrain, a lightweight collaborative agent system with two synergistic models specialization for crystal materials research. MatBrain employs a dual-model architecture: Mat-R1 (30B parameters) as the analytical model prov… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  8. arXiv:2604.11344  [pdf, ps, other

    cs.CR cs.CL

    Geometry-Aware Localized Watermarking for Copyright Protection in Embedding-as-a-Service

    Authors: Zhimin Chen, Xiaojie Liang, Wenbo Xu, Yuxuan Liu, Wei Lu

    Abstract: Embedding-as-a-Service (EaaS) has become an important semantic infrastructure for natural language and multimedia applications, but it is highly vulnerable to model stealing and copyright infringement. Existing EaaS watermarking methods face a fundamental robustness--utility--verifiability tension: trigger-based methods are fragile to paraphrasing, transformation-based methods are sensitive to dim… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  9. arXiv:2604.11103  [pdf, ps, other

    cs.SD cs.AI

    ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing

    Authors: Xi Chen, Wei Xue, Yike Guo

    Abstract: Role-playing has garnered rising attention as it provides a strong foundation for human-machine interaction and facilitates sociological research. However, current work is confined to textual modalities, neglecting speech, which plays a predominant role in daily life, thus limiting genuine role-playing. To bridge this gap, we conceptualize and benchmark speech role-playing through ActorMindBench,… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  10. arXiv:2604.10708  [pdf, ps, other

    cs.SD cs.AI cs.CV cs.MM

    Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing

    Authors: Zeyue Tian, Binxin Yang, Zhaoyang Liu, Jiexuan Zhang, Ruibin Yuan, Hubery Yin, Qifeng Chen, Chen Li, Jing Lv, Wei Xue, Yike Guo

    Abstract: Recent progress in multimodal models has spurred rapid advances in audio understanding, generation, and editing. However, these capabilities are typically addressed by specialized models, leaving the development of a truly unified framework that can seamlessly integrate all three tasks underexplored. While some pioneering works have explored unifying audio understanding and generation, they often… ▽ More

    Submitted 12 April, 2026; originally announced April 2026.

  11. arXiv:2604.10189  [pdf, ps, other

    cs.CL

    FAITH: Factuality Alignment through Integrating Trustworthiness and Honestness

    Authors: Xiaoning Dong, Chengyan Wu, Yajie Wen, Yu Chen, Yun Xue, Jing Zhang, Wei Xu, Bolei Ma

    Abstract: Large Language Models (LLMs) can generate factually inaccurate content even if they have corresponding knowledge, which critically undermines their reliability. Existing approaches attempt to mitigate this by incorporating uncertainty in QA prompt during training, but these numerical scores lack the semantic richness for LLM to properly understand its internal states of trustworthiness and honestn… ▽ More

    Submitted 11 April, 2026; originally announced April 2026.

    Comments: ACL 2026 Findings

  12. Joint Observation of SGR J1935+2154 with \textit{Insight}-HXMT and KM40m during the active episode of October 2022

    Authors: Wang-Chen Xue, Wen-Jun Tan, Yu-Xiang Huang, Xiao-Bo Li, Long-Fei Hao, Shao-Lin Xiong, Ce Cai, Chen-Wei Wang, Yue Wang, Ke-Jia Lee, Heng Xu, Peng Zhang, Ming-Yu Ge, Hao-Xuan Guo, Yue Huang, Cheng-Kui Li, Jia-Cong Liu, Yang-Zhao Ren, Shuo Xiao, Sheng-Lun Xie, Shu-Xu Yi, Zheng-Hang Yu, Jin-Peng Zhang, Yan-Qiu Zhang, Chao Zheng , et al. (10 additional authors not shown)

    Abstract: SGR J1935+2154 is the unique magnetar so far from which fast radio bursts have been detected. In October 2022, it resumed its burst activity, and we implemented a dedicated target-of-opportunity (ToO) observation on it from Oct. 13th to Nov. 1st, 2022 (about 940 ks in total) with \textit{Insight}-HXMT, while the KM40m radio telescope observed this source for about 1400 hours since Oct. 15th. We se… ▽ More

    Submitted 11 April, 2026; originally announced April 2026.

    Comments: Accepted for publication in ApJL

  13. arXiv:2604.10128  [pdf, ps, other

    quant-ph cond-mat.stat-mech cond-mat.str-el

    A Framework for Predicting Entanglement Spectra of Gapless Symmetry-Protected Topological States in One Dimension

    Authors: Wen-Tao Xu, Frank Pollmann, Michael Knap

    Abstract: The concept of gapped symmetry-protected topological (SPT) states has been generalized to gapless SPT (gSPT) states. Similar to gapped SPT states, gSPT states in one dimension exhibit universal degeneracies in their entanglement spectra. The entanglement spectra of gSPT states are further described by boundary conformal field theories, whose systematic prediction is a key open question. To address… ▽ More

    Submitted 11 April, 2026; originally announced April 2026.

    Comments: Main text contains 12 pages, and 10 figures

  14. arXiv:2604.10073  [pdf, ps, other

    cs.LG cs.AI

    Graph-RHO: Critical-path-aware Heterogeneous Graph Network for Long-Horizon Flexible Job-Shop Scheduling

    Authors: Yujie Li, Jiuniu Wang, Mugen Peng, Guangzuo Li, Wenjia Xu

    Abstract: Long-horizon Flexible Job-Shop Scheduling~(FJSP) presents a formidable combinatorial challenge due to complex, interdependent decisions spanning extended time horizons. While learning-based Rolling Horizon Optimization~(RHO) has emerged as a promising paradigm to accelerate solving by identifying and fixing invariant operations, its effectiveness is hindered by the structural complexity of FJSP. E… ▽ More

    Submitted 11 April, 2026; originally announced April 2026.

    Comments: 8 pages, 3 figures; Accepted by IJCNN 2026

  15. arXiv:2604.08614  [pdf

    physics.flu-dyn astro-ph.GA math.AP nlin.CD

    Inverse Energy Cascade in Turbulent Taylor-Couette Flows

    Authors: Changquan Zhou, Hua-Shu Dou, Lin Niu, Wenqian Xu

    Abstract: The inverse energy cascade in turbulent Taylor-Couette flow is studied in line with the results of the large eddy simulation. The simulation results show that the inverse energy cascade first occurs within the core region of the flow channel of the Taylor-Couette flow at higher Reynolds number. It is uncovered that this phenomenon is induced by the pulsed zero shear stress resulting from the singu… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: 24 pages; 7 figures

    MSC Class: 76D05; 76F06

    Journal ref: Physics of Fluids, 37, 014110 (2025)

  16. arXiv:2604.08523  [pdf, ps, other

    cs.CL cs.AI

    ClawBench: Can AI Agents Complete Everyday Online Tasks?

    Authors: Yuxuan Zhang, Yubo Wang, Yipeng Zhu, Penghui Du, Junwen Miao, Xuan Lu, Wendong Xu, Yunzhuo Hao, Songcheng Cai, Xiaochen Wang, Huaisong Zhang, Xian Wu, Yi Lu, Minyi Lei, Kai Zou, Huifeng Yin, Ping Nie, Liang Chen, Dongfu Jiang, Wenhu Chen, Kelsey R. Allen

    Abstract: AI agents may be able to automate your inbox, but can they automate other routine aspects of your life? Everyday online tasks offer a realistic yet unsolved testbed for evaluating the next generation of AI agents. To this end, we introduce ClawBench, an evaluation framework of 153 simple tasks that people need to accomplish regularly in their lives and work, spanning 144 live platforms across 15 c… ▽ More

    Submitted 9 April, 2026; originally announced April 2026.

    Comments: Project page: https://claw-bench.com

  17. arXiv:2604.07409  [pdf, ps, other

    cs.LG eess.IV

    GAN-based Domain Adaptation for Image-aware Layout Generation in Advertising Poster Design

    Authors: Chenchen Xu, Min Zhou, Tiezheng Ge, Weiwei Xu

    Abstract: Layout plays a crucial role in graphic design and poster generation. Recently, the application of deep learning models for layout generation has gained significant attention. This paper focuses on using a GAN-based model conditioned on images to generate advertising poster graphic layouts, requiring a dataset of paired product images and layouts. To address this task, we introduce the Content-awar… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: arXiv admin note: text overlap with arXiv:2303.14377

  18. arXiv:2604.07150  [pdf, ps, other

    eess.SP

    CRB-Based Waveform Optimization for MIMO ISAC Systems With One-Bit ADCs

    Authors: Qi Lin, Hong Shen, Wei Xu, Chunming Zhao

    Abstract: This paper studies the transmit waveform optimization for a quantized multiple-input multiple-output (MIMO) integrated sensing and communication (ISAC) system, where one-bit analog-to-digital converters (ADCs) are employed to enable a low-cost and power-efficient hardware implementation. Focusing on the parameter estimation task, we propose two novel Cramér-Rao bounds (CRBs) for both point-like ta… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: This work has been submitted to the IEEE for possible publication

  19. arXiv:2604.07026  [pdf, ps, other

    cs.CV

    Not all tokens contribute equally to diffusion learning

    Authors: Guoqing Zhang, Lu Shi, Wanru Xu, Linna Zhang, Sen Wang, Fangfang Wang, Yigang Cen

    Abstract: With the rapid development of conditional diffusion models, significant progress has been made in text-to-video generation. However, we observe that these models often neglect semantically important tokens during inference, leading to biased or incomplete generations under classifier-free guidance. We attribute this issue to two key factors: distributional bias caused by the long-tailed token freq… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

  20. arXiv:2604.06965  [pdf

    physics.flu-dyn astro-ph.GA math.AP nlin.CD physics.ao-ph

    Solitary wave structure of transitional flow in the wake of a sphere

    Authors: Lin Niu, Hua-Shu Dou, Changquan Zhou, Wenqian Xu

    Abstract: The soliton-like coherent structure (SCS), which has been verified to exist in both transitional and turbulent boundary layers1-4, still poses a challenge in the understanding of its formation and behavior. In our previous study (Niu et al.5), the SCS was also found to exist in the transitional wake flow behind a sphere. In present study, the formation and evolution of the SCS is further investiga… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

    Comments: 26 pages; 21 figures

    MSC Class: 76D05; 76F06

    Journal ref: Physics of Fluids, 37, 014111(2025)

  21. A Multi-Agent Framework for Automated Exploit Generation with Constraint-Guided Comprehension and Reflection

    Authors: Siyi Chen, Tianhan Luo, Shijian Wu, Xiangyu Liu, Yilin Zhou, Qi Li, Wenyuan Xu

    Abstract: Open-source libraries are widely used in modern software development, introducing significant security vulnerabilities. While static analysis tools can identify potential vulnerabilities at scale, they often generate overwhelming reports with high false positive rates. Automated Exploit Generation (AEG) emerges as a promising solution to confirm vulnerability authenticity by generating an exploit.… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

    Journal ref: 34th IEEE/ACM International Conference on Program Comprehension (ICPC '26), April 12--13, 2026, Rio de Janeiro, Brazil

  22. arXiv:2604.05051  [pdf, ps, other

    cs.CL cs.AI

    This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA

    Authors: Hye Sun Yun, Geetika Kapoor, Michael Mackert, Ramez Kouzy, Wei Xu, Junyi Jessy Li, Byron C. Wallace

    Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and difficult to articulate clearly. However, LLMs are sensitive to prompt phrasings and can be influenced by the way questions are worded. Ideally, LLMs should respond consistently regardless of phrasing, particularly when grounded in the same underlying evidence. We investigate this through… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

    Comments: 31 pages, 4 tables, 19 figures

  23. arXiv:2604.04976  [pdf, ps, other

    cs.IR

    Tencent Advertising Algorithm Challenge 2025: All-Modality Generative Recommendation

    Authors: Junwei Pan, Wei Xue, Chao Zhou, Xing Zhou, Lunan Fan, Yanbo Wang, Haoran Xin, Zhiyu Hu, Yaozheng Wang, Fengye Xu, Yurong Yang, Xiaotian Li, Junbang Huo, Wentao Ning, Yuliang Sun, Chengguo Yin, Jun Zhang, Shudong Huang, Lei Xiao, Huan Yu, Irwin King, Haijie Gu, Jie Jiang

    Abstract: Generative recommender systems are rapidly emerging as a new paradigm for recommendation, where collaborative identifiers and/or multi-modal content are mapped into discrete token spaces and user behavior is modelled with autoregressive sequence models. Despite progress on multi-modal recommendation datasets, there is still a lack of public benchmarks that jointly offer large-scale, realistic and… ▽ More

    Submitted 4 April, 2026; originally announced April 2026.

  24. arXiv:2604.02756  [pdf, ps, other

    cs.LG

    STDDN: A Physics-Guided Deep Learning Framework for Crowd Simulation

    Authors: Zijin Liu, Xu Geng, Wenshuai Xu, Xiang Zhao, Yan Xia, You Song

    Abstract: Accurate crowd simulation is crucial for public safety management, emergency evacuation planning, and intelligent transportation systems. However, existing methods, which typically model crowds as a collection of independent individual trajectories, are limited in their ability to capture macroscopic physical laws. This microscopic approach often leads to error accumulation and compromises simulat… ▽ More

    Submitted 3 April, 2026; originally announced April 2026.

    Journal ref: International Conference on Learning Representations (ICLR), 2026

  25. arXiv:2604.02411  [pdf, ps, other

    hep-ph hep-th

    The Holographic QCD Axion in Five Dimensions

    Authors: Csaba Csáki, Eric Kuflik, Wei Xue, Taewook Youn

    Abstract: We present a holographic construction of the QCD axion based on a warped 5D model. A key ingredient of our setup is the introduction of a bulk scalar field $θ$, which is holographically dual to the topological operator of QCD. This makes the relation among the axion, the $η'$, and the anomalies transparent. We identify the bulk modes corresponding to the $η'$ and axion states, and show that an adj… ▽ More

    Submitted 2 April, 2026; originally announced April 2026.

    Comments: 10 pages, 2 figures

  26. arXiv:2604.02392  [pdf, ps, other

    cs.CV

    Beyond Fixed Inference: Quantitative Flow Matching for Adaptive Image Denoising

    Authors: Jigang Duan, Genwei Ma, Xu Jiang, Wenfeng Xu, Ping Yang, Xing Zhao

    Abstract: Diffusion and flow-based generative models have shown strong potential for image restoration. However, image denoising under unknown and varying noise conditions remains challenging, because the learned vector fields may become inconsistent across different noise levels, leading to degraded restoration quality under mismatch between training and inference. To address this issue, we propose a quant… ▽ More

    Submitted 2 April, 2026; originally announced April 2026.

  27. arXiv:2604.02261  [pdf, ps, other

    astro-ph.HE

    GECAM discovery of a peculiar magnetar X-ray burst (MXB 221120) from SGR J1935+2154 associated with a fast radio burst

    Authors: Wen-Jun Tan, Yue Wang, Chen-Wei Wang, Shao-Lin Xiong, Xiao-Bo Li, Shuang-Nan Zhang, Ce Cai, Wang-Chen Xue, Peng Zhang, Bo-Bing Wu, Zheng-Hua An, Ming Gao, Ming-Yu Ge, Ke Gong, Dong-Ya Guo, Hao-Xuan Guo, Long-Fei Hao, Yue Huang, Yu-Xiang Huang, Ke-Jia Lee, Bing Li, Kui-Cheng Li, Xin-Qiao Li, Jia-Cong Liu, Xiao-Jing Liu , et al. (28 additional authors not shown)

    Abstract: Fast radio bursts (FRBs) are enigmatic cosmic transients of millisecond duration observed in the radio band. The identification of FRB-associated magnetar X-ray bursts (MXBs) from galactic magnetar SGR J1935+2154 suggests that at least a fraction of FRBs can be produced from magnetar activity. However, the sample size of FRB-associated MXBs is still very small. Here we report a bright and peculiar… ▽ More

    Submitted 2 April, 2026; originally announced April 2026.

    Comments: 9 pages, 4 figures, published on A&A: https://ui.adsabs.harvard.edu/abs/2026A%26A...707A.289T/abstract

  28. arXiv:2604.01890  [pdf, ps, other

    cs.SI cs.CY

    Behavior and Sublinear Algorithm for Opinion Disagreement on Noisy Social Networks

    Authors: Wanyue Xu, Yubo Sun, Mingzhe Zhu, Zuobai Zhang, Zhongzhi Zhang

    Abstract: The phenomenon of opinion disagreement has been empirically observed and reported in the literature, which is affected by various factors, such as the structure of social networks. An important discovery in network science is that most real-life networks, including social networks, are scale-free and sparse. In this paper, we study noisy opinion dynamics in sparse scale-free social networks to unc… ▽ More

    Submitted 2 April, 2026; originally announced April 2026.

    Comments: This paper has already been accepted by TKDE

  29. arXiv:2604.01226  [pdf, ps, other

    cs.CV cs.SE

    DOne: Decoupling Structure and Rendering for High-Fidelity Design-to-Code Generation

    Authors: Xinhao Huang, Jinke Yu, Wenhao Xu, Zeyi Wen, Ying Zhou, Junzhuo Liu, Junhao Ji, Zulong Chen

    Abstract: While Vision Language Models (VLMs) have shown promise in Design-to-Code generation, they suffer from a "holistic bottleneck-failing to reconcile high-level structural hierarchy with fine-grained visual details, often resulting in layout distortions or generic placeholders. To bridge this gap, we propose DOne, an end-to-end framework that decouples structure understanding from element rendering. D… ▽ More

    Submitted 11 March, 2026; originally announced April 2026.

  30. arXiv:2604.00820  [pdf, ps, other

    cs.CV

    Continual Vision-Language Learning for Remote Sensing: Benchmarking and Analysis

    Authors: Xingxing Weng, Ruifeng Ni, Chao Pang, XiangYu Hao, Yishan Wang, Xiaokang Zhang, Wei Xu, Gui-Song Xia

    Abstract: Current remote sensing vision-language models (RS VLMs) demonstrate impressive performance in image interpretation but rely on static training data, limiting their ability to accommodate continuously emerging sensing modalities and downstream tasks. This exposes a fundamental challenge: enabling RS VLMs to continually adapt without catastrophic forgetting. Despite its practical importance, the con… ▽ More

    Submitted 1 April, 2026; originally announced April 2026.

    Comments: 23 pages, 7 figures, 9 tables

  31. arXiv:2603.29640  [pdf, ps, other

    cs.AI

    ASI-Evolve: AI Accelerates AI

    Authors: Weixian Xu, Tiantian Mi, Yixiu Liu, Yang Nan, Zhimeng Zhou, Lyumanshan Ye, Lin Zhang, Yu Qiao, Pengfei Liu

    Abstract: Can AI accelerate the development of AI itself? While recent agentic systems have shown strong performance on well-scoped tasks with rapid feedback, it remains unclear whether they can tackle the costly, long-horizon, and weakly supervised research loops that drive real AI progress. We present ASI-Evolve, an agentic framework for AI-for-AI research that closes this loop through a learn-design-expe… ▽ More

    Submitted 31 March, 2026; originally announced March 2026.

    Comments: 19 pages, 6 figures, 6 tables. Code available at https://github.com/GAIR-NLP/ASI-Evolve

  32. arXiv:2603.29384  [pdf, ps, other

    cs.LG

    Causality-inspired Federated Learning for Dynamic Spatio-Temporal Graphs

    Authors: Yuxuan Liu, Wenchao Xu, Haozhao Wang, Zhiming He, Zhaofeng Shi, Chongyang Xu, Peichao Wang, Boyuan Zhang

    Abstract: Federated Graph Learning (FGL) has emerged as a powerful paradigm for decentralized training of graph neural networks while preserving data privacy. However, existing FGL methods are predominantly designed for static graphs and rely on parameter averaging or distribution alignment, which implicitly assume that all features are equally transferable across clients, overlooking both the spatial and t… ▽ More

    Submitted 31 March, 2026; originally announced March 2026.

  33. arXiv:2603.28570  [pdf, ps, other

    astro-ph.HE

    Comprehensive Measurement of Spectral Evolution in a GRB Flare: High Time-Resolution Insights into the "Double-Tracking" Phenomenon

    Authors: Zheng-Hang Yu, Wen-Jun Tan, Chen-Wei Wang, Shao-Lin Xiong, Chao Zheng, Peng Zhang, Hao-Xuan Guo, Zheng-Hua An, Ce Cai, Min Gao, Ke Gong, Dong-Ya Guo, Yue Huang, Bing Li, Cheng-Kui Li, Xiao-Bo Li, Xin-Qiao Li, Jia-Cong Liu, Ya-Qing Liu, Xiao-Jing Liu, Xiang Ma, Wen-Xi Peng, Rui Qiao, Yang-Zhao Ren, Li-Ming Song , et al. (19 additional authors not shown)

    Abstract: The spectral evolution characteristics of the prompt emission in gamma-ray bursts (GRBs) have been extensively studied, but detailed investigations of spectral evolution in a GRB flare remain lacking. In this work, we present the first analysis of spectral parameter evolution in a GRB flare through high time-resolved spectral fitting of the Brightest Flare in GRB 221009A. We find that the $α$-Flux… ▽ More

    Submitted 30 March, 2026; originally announced March 2026.

    Comments: 17 pages, 4 figures, accepted for publication in ApJ

  34. arXiv:2603.28220  [pdf, ps, other

    math.OC

    Bundle EXTRA for Decentralized Optimization

    Authors: Haijuan Liu, Zhuoqing Zheng, Cong Li, Wenying Xu, Xuyang Wu

    Abstract: Decentralized primal-dual methods are widely used for solving decentralized optimization problems, but their updates often rely on the potentially crude first-order Taylor approximations of the objective functions, which can limit convergence speed. To overcome this, we replace the first-order Taylor approximation in the primal update of EXTRA, which can be interpreted as a primal-dual method, wit… ▽ More

    Submitted 30 March, 2026; originally announced March 2026.

    MSC Class: 90C25; 68W15

  35. arXiv:2603.27599  [pdf, ps, other

    cs.CV

    You Only Erase Once: Erasing Anything without Bringing Unexpected Content

    Authors: Yixing Zhu, Qing Zhang, Wenju Xu, Wei-Shi Zheng

    Abstract: We present YOEO, an approach for object erasure. Unlike recent diffusion-based methods which struggle to erase target objects without generating unexpected content within the masked regions due to lack of sufficient paired training data and explicit constraint on content generation, our method allows to produce high-quality object erasure results free of unwanted objects or artifacts while faithfu… ▽ More

    Submitted 29 March, 2026; originally announced March 2026.

    Comments: Accepted by CVPR2026

  36. arXiv:2603.27571  [pdf, ps, other

    cs.HC

    RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition

    Authors: Mingda Han, Huanqi Yang, Zehua Sun, Wenhao Li, Yanni Yang, Guoming Zhang, Yetong Cao, Weitao Xu, Pengfei Hu

    Abstract: Millimeter-wave (mmWave) radar enables privacy-preserving human activity recognition (HAR), yet real-world deployment remains hindered by costly annotation and poor transferability under domain shift. Although prior efforts partially alleviate these challenges, most still require retraining or adaptation for each new deployment setting. This keeps mmWave HAR in a repeated collect-tune-redeploy cyc… ▽ More

    Submitted 29 March, 2026; originally announced March 2026.

  37. arXiv:2603.27562  [pdf, ps, other

    cs.HC

    VoxAnchor: Grounding Speech Authenticity in Throat Vibration via mmWave Radar

    Authors: Mingda Han, Huanqi Yang, Chaoqun Li, Wenhao Li, Guoming Zhang, Yanni Yang, Yetong Cao, Weitao Xu, Pengfei Hu

    Abstract: Rapid advances in speech synthesis and audio editing have made realistic forgeries increasingly accessible, yet existing detection methods remain vulnerable to tampering or depend on visual/wearable sensors. In this paper, we present VoxAnchor, a system that physically grounds audio authentication in vocal dynamics by leveraging the inherent coherence between speech acoustics and radar-sensed thro… ▽ More

    Submitted 29 March, 2026; originally announced March 2026.

  38. arXiv:2603.27516  [pdf, ps, other

    cs.CV

    SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering

    Authors: Jiahao Niu, Rongjia Zheng, Wenju Xu, Wei-Shi Zheng, Qing Zhang

    Abstract: We present SGS-Intrinsic, an indoor inverse rendering framework that works well for sparse-view images. Unlike existing 3D Gaussian Splatting (3DGS) based methods that focus on object-centric reconstruction and fail to work under sparse view settings, our method allows to achieve high-quality geometry reconstruction and accurate disentanglement of material and illumination. The core idea is to con… ▽ More

    Submitted 31 March, 2026; v1 submitted 29 March, 2026; originally announced March 2026.

    Comments: CVPR2026

  39. arXiv:2603.27136  [pdf, ps, other

    cs.SE cs.CY

    The First Issue Matters: Linking Task-Level Characteristics to Long-Term Newcomer Retention in OSS

    Authors: Yichen Hao, Weiwei Xu, Kai Gao, Xiaofang Zhang

    Abstract: Sustaining newcomer participation is critical for the long-term health of open-source communities. Although prior research has explored various task recommendation approaches to help newcomers resolve their first-issue, these methods overlook how characteristics of first-issues may influence newcomers' long-term retention, limiting our understanding of whether initial success leads to sustained pa… ▽ More

    Submitted 28 March, 2026; originally announced March 2026.

  40. arXiv:2603.27035  [pdf, ps, other

    cs.SD

    Diachronic Modeling of Tonal Coherence on the Tonnetz Across Classical and Popular Repertoires

    Authors: Weilun Xu, Edward Hall, Martin Rohrmeier

    Abstract: How do different musical traditions achieve tonal coherence? Most computational measures to date have analysed tonal coherence in terms of a single dimension, whereas a multi-dimensional analyses have not been sufficiently explored. We propose a new model drawing on the concept of the Tonnetz -- we define two partially independent measures: \emph{tonal focus}, the concentration of pitch content ne… ▽ More

    Submitted 27 March, 2026; originally announced March 2026.

  41. arXiv:2603.26757  [pdf, ps, other

    cs.RO

    Beyond Viewpoint Generalization: What Multi-View Demonstrations Offer and How to Synthesize Them for Robot Manipulation?

    Authors: Boyang Cai, Qiwei Liang, Jiawei Li, Shihang Weng, Zhaoxin Zhang, Tao Lin, Xiangyu Chen, Wenjie Zhang, Jiaqi Mao, Weisheng Xu, Bin Yang, Jiaming Liang, Junhao Cai, Renjing Xu

    Abstract: Does multi-view demonstration truly improve robot manipulation, or merely enhance cross-view robustness? We present a systematic study quantifying the performance gains, scaling behavior, and underlying mechanisms of multi-view data for robot manipulation. Controlled experiments show that, under both fixed and randomized backgrounds, multi-view demonstrations consistently improve single-view polic… ▽ More

    Submitted 23 March, 2026; originally announced March 2026.

  42. arXiv:2603.26171  [pdf, ps, other

    astro-ph.HE

    Single-Pulse Study of the Pseudo-nulling Pulsar PSR J1820-0509 Based on FAST Observations

    Authors: Zefeng Tu, Rushuang Zhao, Hui Liu, Biping Gong, D. Li, P. Wang, Chenchen Miao, Q. J. Zhi, S. J. Dang, S. D. Wang, Q. Zhou, Z. J. Zhang, Xu Zhu, R. W. Tian, H. W. Xu, Yi Zhou, D. Y. Yan

    Abstract: Using two observations obtained with the Five-hundred-meter Aperture Spherical radio Telescope (FAST), we present a detailed single-pulse analysis of the high-nulling pulsar PSR J1820-0509. We measure an exceptionally high nulling fraction of approximately 81.78%, significantly exceeding previous estimates from Parkes observations. The single-pulse energy distribution exhibits a clear bimodal stru… ▽ More

    Submitted 27 March, 2026; originally announced March 2026.

  43. Joint Sensing and Covert Communications in RIS-NOMA Systems

    Authors: Jiayi Lei, Xidong Mu, Tiankui Zhang, Wenjun Xu, Ping Zhang

    Abstract: A reconfigurable intelligent surface (RIS)-assisted non-orthogonal multiple access (NOMA) system is investigated, where the transmitter (Alice) is a dual functional radar communication (DFRC) base station (BS) that aims to sense the location of a potential warden (Willie), while simultaneously transmitting public and covert signals to the legitimate users, Carol and Bob, respectively. Both cases o… ▽ More

    Submitted 27 March, 2026; originally announced March 2026.

  44. arXiv:2603.25729  [pdf, ps, other

    hep-ph hep-th

    $θ$ Angle and Axial Anomaly in Holographic QCD

    Authors: Csaba Csáki, Eric Kuflik, Wei Xue, Taewook Youn

    Abstract: We present a bottom-up holographic description of the QCD $θ$-vacuum and the $U(1)_A$ anomaly in five dimensions. The multi-branched $θ$-vacuum structure emerges geometrically from a higher-dimensional gauge field, while the axial anomaly is realized through a Stückelberg coupling that is dual to a Chern-Simons term. In this framework, the $η'$ meson appears as a zero mode of bulk fluctuations, an… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

    Comments: 10 pages, 1 figure

  45. arXiv:2603.25573  [pdf, ps, other

    cs.CV cs.LG

    Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference

    Authors: Sk Miraj Ahmed, Xi Yu, Yunqi Li, Yuewei Lin, Wei Xu

    Abstract: Accurate biodiversity identification from large-scale field data is a foundational problem with direct impact on ecology, conservation, and environmental monitoring. In practice, the core task is taxonomic prediction - inferring order, family, genus, or species from imperfect inputs such as specimen images, DNA barcodes, or both. Existing multimodal methods often treat taxonomy as a flat label spa… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

    Comments: Accepted at the ICLR 2026 Workshop on Foundation Models for Science (FM4Science)

  46. arXiv:2603.25133  [pdf, ps, other

    cs.AI

    RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

    Authors: Tianjun Pan, Xuan Lin, Wenyan Yang, Qianyu He, Shisong Chen, Licai Qi, Wanqing Xu, Hongwei Feng, Bo Xu, Yanghua Xiao

    Abstract: Rubric-based evaluation has become a prevailing paradigm for evaluating instruction following in large language models (LLMs). Despite its widespread use, the reliability of these rubric-level evaluations remains unclear, calling for meta-evaluation. However, prior meta-evaluation efforts largely focus on the response level, failing to assess the fine-grained judgment accuracy that rubric-based ev… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

    Comments: 9 pages, 5 figures

  47. arXiv:2603.25085  [pdf, ps, other

    physics.ins-det

    Beam Test Characterization of Silicon Microstrip Detector Flight-Model Ladders for the AMS-02 Upgrade

    Authors: Dexing Miao, Giovanni Ambrosi, Mattia Barbanera, Baasansuren Batsukh, Hengyi Cai, Mengke Cai, Xudong Cai, Yuman Cai, Yuan-Hann Chang, Shanzhen Chen, Hsin-Yi Chou, Xingzhu Cui, Mingyi Dong, Matteo Duranti, Ke Gong, Mingjie Feng, Valerio Formato, Yisheng Fu, Daojin Hong, Maria Ionica, Xiaojie Jiang, Yaozu Jiang, Liangchenglong Jin, Shengjie Jin, Vladimir Koutsenko , et al. (34 additional authors not shown)

    Abstract: The AMS-02 experiment plans to install a new silicon microstrip tracker layer (Layer-0) on top of the existing detector, increasing the cosmic-ray acceptance by a factor of 3. Layer-0 employs a design in which multiple silicon microstrip detectors (SSDs) are connected in series to form long detector ladders. We present a detailed performance study of the flight-model ladders using a 350~GeV mixed… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

  48. arXiv:2603.25080  [pdf, ps, other

    physics.ins-det astro-ph.IM

    A Telescope System for Charge and Position Measurement of High Energy Nuclei

    Authors: Dexing Miao, Zhiyu Xiang, Giovanni Ambrosi, Mattia Barbanera, Baasansuren Batsukh, Mengke Cai, Xudong Cai, Yuan-Hann Chang, Shanzhen Chen, Hsin-Yi Chou, Xingzhu Cui, Mingyi Dong, Matteo Duranti, Ke Gong, Mingjie Feng, Valerio Formato, Daojin Hong, Maria Ionica, Xiaojie Jiang, Yaozu Jiang, Liangchenglong Jin, Shengjie Jin, Vladimir Koutsenko, Tiange Li, Zuhao Li , et al. (21 additional authors not shown)

    Abstract: A high-granularity telescope system with a large sensitive area and low material budget has been developed for high-energy heavy ion beam tests. The telescope consists of nine layers of silicon microstrip detectors (SSDs), whose performance was validated through a heavy ion beam test at the CERN SPS. A hybrid machine learning algorithm is proposed to address the challenges of nuclear charge measur… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

  49. arXiv:2603.25040  [pdf, ps, other

    cs.LG cs.CL cs.CV

    Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

    Authors: Yicheng Zou, Dongsheng Zhu, Lin Zhu, Tong Zhu, Yunhua Zhou, Peiheng Zhou, Xinyu Zhou, Dongzhan Zhou, Zhiwang Zhou, Yuhao Zhou, Bowen Zhou, Zhanping Zhong, Zhijie Zhong, Haiteng Zhao, Penghao Zhao, Xiaomeng Zhao, Zhiyuan Zhao, Yechen Zhang, Jin Zhang, Wenwei Zhang, Hongjie Zhang, Zhuo Zhang, Wenlong Zhang, Bo Zhang, Chao Zhang , et al. (152 additional authors not shown)

    Abstract: We introduce Intern-S1-Pro, the first one-trillion-parameter scientific multimodal foundation model. Scaling to this unprecedented size, the model delivers a comprehensive enhancement across both general and scientific domains. Beyond stronger reasoning and image-text understanding capabilities, its intelligence is augmented with advanced agent capabilities. Simultaneously, its scientific expertis… ▽ More

    Submitted 2 April, 2026; v1 submitted 26 March, 2026; originally announced March 2026.

  50. arXiv:2603.24975  [pdf, ps, other

    cs.IR

    Unbiased Multimodal Reranking for Long-Tail Short-Video Search

    Authors: Wenyi Xu, Feiran Zhu, Songyang Li, Renzhe Zhou, Chao Zhang, Chenglei Dai, Yuren Mao, Yunjun Gao, Yi Zhang

    Abstract: Kuaishou serving hundreds of millions of searches daily, the quality of short-video search is paramount. However, it suffers from a severe Matthew effect on long-tail queries: sparse user behavior data causes models to amplify low-quality content such as clickbait and shallow content. The recent advancements in Large Language Models (LLMs) offer a new paradigm, as their inherent world knowledge pr… ▽ More

    Submitted 30 March, 2026; v1 submitted 25 March, 2026; originally announced March 2026.