-
Randomized Neural Networks for Integro-Differential Equations with Application to Neutron Transport
Authors:
Haoning Dang,
Fei Wang,
Yifan Chen,
Zhouyu Liu,
Dong Liu,
Hongchun Wu
Abstract:
Integro-differential equations arise in a wide range of applications, including transport, kinetic theory, radiative transfer, and multiphysics modeling, where nonlocal integral operators couple the solution across phase space. Such nonlocality often introduces dense coupling blocks in deterministic discretizations, leading to increased computational cost and memory usage, while physics-informed n…
▽ More
Integro-differential equations arise in a wide range of applications, including transport, kinetic theory, radiative transfer, and multiphysics modeling, where nonlocal integral operators couple the solution across phase space. Such nonlocality often introduces dense coupling blocks in deterministic discretizations, leading to increased computational cost and memory usage, while physics-informed neural networks may suffer from expensive nonconvex training and sensitivity to hyperparameter choices. In this work, we present randomized neural networks (RaNNs) as a mesh-free collocation framework for linear integro-differential equations. Because the RaNN approximation is intrinsically dense through globally supported random features, the nonlocal integral operator does not introduce an additional loss of sparsity, while the approximate solution can still be represented with relatively few trainable degrees of freedom. By randomly fixing the hidden-layer parameters and solving only for the linear output weights, the training procedure reduces to a convex least-squares problem in the output coefficients, enabling stable and efficient optimization. As a representative application, we apply the proposed framework to the steady neutron transport equation, a high-dimensional linear integro-differential model featuring scattering integrals and diverse boundary conditions. Extensive numerical experiments demonstrate that, in the reported test settings, the RaNN approach achieves competitive accuracy while incurring substantially lower training cost than the selected neural and deterministic baselines, highlighting RaNNs as a robust and efficient alternative for the numerical simulation of nonlocal linear operators.
△ Less
Submitted 15 April, 2026;
originally announced April 2026.
-
An End-to-end Building Load Forecasting Framework with Patch-based Information Fusion Network and Error-weighted Adaptive Loss
Authors:
Hang Fan,
Ying Lu,
Weican Liu,
Dunnan Liu,
Xiaotao Chen,
Shengwei Mei
Abstract:
Accurate building load forecasting plays a critical role in facilitating demand response aggregation and optimizing energy management. However, the complex temporal dependencies and high volatility of building loads limit the improvement of prediction accuracy. To this end, we propose a novel end-to-end building load forecasting framework. Specifically, the framework can be divided into two main s…
▽ More
Accurate building load forecasting plays a critical role in facilitating demand response aggregation and optimizing energy management. However, the complex temporal dependencies and high volatility of building loads limit the improvement of prediction accuracy. To this end, we propose a novel end-to-end building load forecasting framework. Specifically, the framework can be divided into two main stages. In the two-stage data preprocessing module enhanced by interpretable feature selection, we utilize the Local Outlier Factor (LOF) algorithm to accurately detect and correct anomalies in the original building load series. Furthermore, we employ SVM-SHAP feature analysis to quantify the impact of environmental variables, filtering out critical feature combinations to mitigate redundancy. In the building load forecasting module, we propose the patch-based information fusion network (PIF-Net). This model applies patching technology to process input series into local blocks, extracting temporal features through a shared Gated Recurrent Unit (GRU) network with residual connections. Subsequently, an information fusion module based on a customized gating mechanism integrates the ensemble hidden states to weight the importance of different temporal patches dynamically. Additionally, the framework is trained using a novel Error-weighted Adaptive Loss (EWAL) function. By combining a rational quadratic function and logarithmic loss to dynamically adjust penalty weights based on real-time prediction error distributions, EWAL significantly enhances the model's robustness under extreme load conditions. Finally, extensive experiments demonstrate the effectiveness and superiority of our proposed framework.
△ Less
Submitted 15 April, 2026;
originally announced April 2026.
-
Probing Coronal Activity Using Radio Signals Based on the 2021 superior conjunction of Mars: the Downlink Data from Tianwen-1
Authors:
Yu-Chen Liu,
De-Qing Kong,
Song Tan,
Zi-Han Zhao,
Zan Wang,
Dong-Hao Liu,
Xin-Ying Zhu,
Yan Su,
Hong-Bo Zhang
Abstract:
During the first superior conjunction of the Tianwen-1 Mars probe in October 2021, its downlink signal received by the Wuqing 70-m radio telescope passed within 4.53 solar radii of the Sun. The signal was significantly perturbed by the solar wind, providing a mechanism to probe coronal activity. We analyze the Doppler frequency scintillation spectrum of the solar wind within 10 solar radii to deri…
▽ More
During the first superior conjunction of the Tianwen-1 Mars probe in October 2021, its downlink signal received by the Wuqing 70-m radio telescope passed within 4.53 solar radii of the Sun. The signal was significantly perturbed by the solar wind, providing a mechanism to probe coronal activity. We analyze the Doppler frequency scintillation spectrum of the solar wind within 10 solar radii to derive a characteristic frequency scintillation parameter. Statistical analysis indicates this parameter increases as the signal path approaches the Sun, with notable anomalies observed on October 5, 13, and 15. Comparisons with SOHO and SDO data reveal strong spatio-temporal correlations between these scintillation anomalies and coronal activity. We demonstrate that this parameter effectively identifies solar phenomena, including coronal streamers, high-speed solar wind, and coronal mass ejections (CMEs). Quantitative analysis confirms a distinct temporal correlation and delay between frequency scintillation and solar wind speed changes, validating the feasibility of spatially localizing solar activity.
△ Less
Submitted 15 April, 2026;
originally announced April 2026.
-
Measurement of the $W$-boson production cross-sections in $pp$ collisions at $\sqrt{s}$ = 13 TeV in the forward region
Authors:
LHCb collaboration,
R. Aaij,
M. Abdelfatah,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1112 additional authors not shown)
Abstract:
A precision measurement of the $W$-boson production cross-section is performed using the $W \to μν$ decay channel, based on a sample of proton-proton collision data collected by the LHCb experiment at $\sqrt{s}$ = 13 TeV and corresponding to an integrated luminosity of 5.1 $fb^{-1}$. The cross-section is measured for muons with transverse momentum between 25 and 55 GeV and pseudorapidity between 2…
▽ More
A precision measurement of the $W$-boson production cross-section is performed using the $W \to μν$ decay channel, based on a sample of proton-proton collision data collected by the LHCb experiment at $\sqrt{s}$ = 13 TeV and corresponding to an integrated luminosity of 5.1 $fb^{-1}$. The cross-section is measured for muons with transverse momentum between 25 and 55 GeV and pseudorapidity between 2.0 and 4.5. The integrated production cross-sections of $W$ bosons are measured to be $$ \begin{array}{lcl} σ_{W^+ \to μ^+ν} &=& 1754.2 \pm 1.5 \pm 11.9 \pm 35.1\text{ pb} \\ σ_{W^- \to μ^-\barν} &=& 1178.1 \pm 1.3 \pm 9.7 \pm 23.6\text{ pb} \end{array} $$ where uncertainties are statistical, systematic, and due to the luminosity determination, respectively. Results are in good agreement with theoretical predictions at next-to-next-to-leading order in perturbative quantum chromodynamics. This measurement is significantly more precise than previous results in this kinematic regime.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
Precision measurement of the muon charge asymmetry from $W$-boson decays in $pp$ collisions at $\sqrt{s}$ = 13 TeV in the forward region
Authors:
LHCb collaboration,
R. Aaij,
M. Abdelfatah,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1112 additional authors not shown)
Abstract:
A precision measurement of the muon charge asymmetry from $W$-boson decays in proton-proton collisions at $\sqrt{s}$ = 13 TeV is presented. The analysis utilizes data corresponding to an integrated luminosity of 5.1 $fb^{-1}$, recorded by the LHCb detector during 2016, 2017 and 2018. The asymmetry is measured for muons with transverse momentum between 25 and 55 GeV and pseudorapidity between 2.0 a…
▽ More
A precision measurement of the muon charge asymmetry from $W$-boson decays in proton-proton collisions at $\sqrt{s}$ = 13 TeV is presented. The analysis utilizes data corresponding to an integrated luminosity of 5.1 $fb^{-1}$, recorded by the LHCb detector during 2016, 2017 and 2018. The asymmetry is measured for muons with transverse momentum between 25 and 55 GeV and pseudorapidity between 2.0 and 4.5. This result represents the most precise determination of the muon charge asymmetry in the forward region to date, exhibiting excellent agreement with next-to-next-to-leading-order predictions in perturbative quantum chromodynamics.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
Observation of the Exotic State $π_{1}(1600)$ in $ψ(2S)\rightarrowγχ_{c1},χ_{c1}\rightarrowπ^{+}π^{-}η'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (728 additional authors not shown)
Abstract:
A partial wave analysis of the process $ψ(2S)\rightarrowγχ_{c1}, χ_{c1}\rightarrowπ^+π^-η^{\prime}$ is performed using $(2712.4\pm14.3)\times10^{6}$ $ψ(2S)$ events collected with the BESIII detector. An isovector state with exotic quantum numbers $J^{PC}=1^{-+}$, denoted as $π_{1}(1600)$, is observed for the first time in the charmonium decay of $χ_{c1}\rightarrowπ_{1}^{\pm}(1600)π^{\mp}$,…
▽ More
A partial wave analysis of the process $ψ(2S)\rightarrowγχ_{c1}, χ_{c1}\rightarrowπ^+π^-η^{\prime}$ is performed using $(2712.4\pm14.3)\times10^{6}$ $ψ(2S)$ events collected with the BESIII detector. An isovector state with exotic quantum numbers $J^{PC}=1^{-+}$, denoted as $π_{1}(1600)$, is observed for the first time in the charmonium decay of $χ_{c1}\rightarrowπ_{1}^{\pm}(1600)π^{\mp}$, $π_{1}^{\pm}(1600)\rightarrowπ^{\pm}η^{\prime}$ with a statistical significance over $21σ$. Its mass and width are determined to be $1828 \pm 8 ({\rm stat})^{+11}_{-33}({\rm syst})~\mathrm{MeV}/c^2$ and $638 \pm 26 ({\rm stat})^{+35}_{-86}({\rm syst})~\mathrm{MeV}$, respectively, using a relativistic Breit-Wigner function with a mass-dependent width. The corresponding product of branching fractions is determined to be $\mathcal{B}\left[χ_{c1}\rightarrowπ_{1}(1600)^{\pm}π^{\mp} \right] \times \mathcal{B}\left[π_{1}(1600)^{\pm}\rightarrowπ^{\pm}η^{\prime}\right] = \left( 4.30 \pm 0.14 ({\rm stat})^{+1.04}_{-1.03}({\rm syst})~ \right) \times 10^{-4}$.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
NTIRE 2026 The 3rd Restore Any Image Model (RAIM) Challenge: Professional Image Quality Assessment (Track 1)
Authors:
Guanyi Qin,
Jie Liang,
Bingbing Zhang,
Lishen Qu,
Ya-nan Guan,
Hui Zeng,
Lei Zhang,
Radu Timofte,
Jianhui Sun,
Xinli Yue,
Tao Shao,
Huan Hou,
Wenjie Liao,
Shuhao Han,
Jieyu Yuan,
Chunle Guo,
Chongyi Li,
Zewen Chen,
Yunze Liu,
Jian Guo,
Juan Wang,
Yun Zeng,
Bing Li,
Weiming Hu,
Hesong Li
, et al. (28 additional authors not shown)
Abstract:
In this paper, we present an overview of the NTIRE 2026 challenge on the 3rd Restore Any Image Model in the Wild, specifically focusing on Track 1: Professional Image Quality Assessment. Conventional Image Quality Assessment (IQA) typically relies on scalar scores. By compressing complex visual characteristics into a single number, these methods fundamentally struggle to distinguish subtle differe…
▽ More
In this paper, we present an overview of the NTIRE 2026 challenge on the 3rd Restore Any Image Model in the Wild, specifically focusing on Track 1: Professional Image Quality Assessment. Conventional Image Quality Assessment (IQA) typically relies on scalar scores. By compressing complex visual characteristics into a single number, these methods fundamentally struggle to distinguish subtle differences among uniformly high-quality images. Furthermore, they fail to articulate why one image is superior, lacking the reasoning capabilities required to provide guidance for vision tasks. To bridge this gap, recent advancements in Multimodal Large Language Models (MLLMs) offer a promising paradigm. Inspired by this potential, our challenge establishes a novel benchmark exploring the ability of MLLMs to mimic human expert cognition in evaluating high-quality image pairs. Participants were tasked with overcoming critical bottlenecks in professional scenarios, centering on two primary objectives: (1) Comparative Quality Selection: reliably identifying the visually superior image within a high-quality pair; and (2) Interpretative Reasoning: generating grounded, expert-level explanations that detail the rationale behind the selection. In total, the challenge attracted nearly 200 registrations and over 2,500 submissions. The top-performing methods significantly advanced the state of the art in professional IQA. The challenge dataset is available at https://github.com/narthchin/RAIM-PIQA, and the official homepage is accessible at https://www.codabench.org/competitions/12789/.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
Three Birds, One Stone: Solving the Communication-Memory-Privacy Trilemma in LLM Fine-tuning Over Wireless Networks with Zeroth-Order Optimization
Authors:
Zhijie Cai,
Yuhao Zheng,
Haolong Chen,
Dongzhu Liu,
Bin Wang,
Guangxu Zhu
Abstract:
Federated Learning (FL) offers a promising pathway for collaboratively fine-tuning Large Language Models (LLMs) at the edge; however, this paradigm faces a critical bottleneck: the prohibitive communication and memory overheads incurred by exchanging high-dimensional gradients. Furthermore, recent studies reveal that user training data can still be recovered from these local gradients, undermining…
▽ More
Federated Learning (FL) offers a promising pathway for collaboratively fine-tuning Large Language Models (LLMs) at the edge; however, this paradigm faces a critical bottleneck: the prohibitive communication and memory overheads incurred by exchanging high-dimensional gradients. Furthermore, recent studies reveal that user training data can still be recovered from these local gradients, undermining the core privacy promise of FL. In this paper, we address this trilemma of communication, memory, and privacy by proposing pAirZero, a novel framework that synergizes Zeroth-Order (ZO) optimization with Over-the-Air (OTA) computation. Uniquely, pAirZero enables resource-constrained devices to submit their local gradient with only bit-level communication loads while participating in federated fine-tuning of LLMs with inference-level memory costs. This approach not only eliminates the high memory requirements needed for LLM fine-tuning but also alleviates the strict synchronization requirements that plague conventional OTA methods. We further formulate a rigorous optimization model to adaptively determine the optimal transmit power and noise levels, ensuring consistent privacy protection regardless of channel conditions. Numerical experiments demonstrate the superiority of pAirZero in enabling secure, efficient LLM fine-tuning over wireless networks, with only 25% peak memory cost on OPT-125M and communication load orders of magnitude lower than conventional methods.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
Self-Adversarial One Step Generation via Condition Shifting
Authors:
Deyuan Liu,
Peng Sun,
Yansen Han,
Zhenglin Cheng,
Chuyan Chen,
Tao Lin
Abstract:
The push for efficient text to image synthesis has moved the field toward one step sampling, yet existing methods still face a three way tradeoff among fidelity, inference speed, and training efficiency. Approaches that rely on external discriminators can sharpen one step performance, but they often introduce training instability, high GPU memory overhead, and slow convergence, which complicates s…
▽ More
The push for efficient text to image synthesis has moved the field toward one step sampling, yet existing methods still face a three way tradeoff among fidelity, inference speed, and training efficiency. Approaches that rely on external discriminators can sharpen one step performance, but they often introduce training instability, high GPU memory overhead, and slow convergence, which complicates scaling and parameter efficient tuning. In contrast, regression based distillation and consistency objectives are easier to optimize, but they typically lose fine details when constrained to a single step. We present APEX, built on a key theoretical insight: adversarial correction signals can be extracted endogenously from a flow model through condition shifting. Using a transformation creates a shifted condition branch whose velocity field serves as an independent estimator of the model's current generation distribution, yielding a gradient that is provably GAN aligned, replacing the sample dependent discriminator terms that cause gradient vanishing. This discriminator free design is architecture preserving, making APEX a plug and play framework compatible with both full parameter and LoRA based tuning. Empirically, our 0.6B model surpasses FLUX-Schnell 12B (20$\times$ more parameters) in one step quality. With LoRA tuning on Qwen-Image 20B, APEX reaches a GenEval score of 0.89 at NFE=1 in 6 hours, surpassing the original 50-step teacher (0.87) and providing a 15.33$\times$ inference speedup. Code is available https://github.com/LINs-lab/APEX.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
The Enforcement and Feasibility of Hate Speech Moderation on Twitter
Authors:
Manuel Tonneau,
Dylan Thurgood,
Diyi Liu,
Niyati Malhotra,
Victor Orozco-Olvera,
Ralph Schroeder,
Scott A. Hale,
Manoel Horta Ribeiro,
Paul Röttger,
Samuel P. Fraiberger
Abstract:
Online hate speech is associated with substantial social harms, yet it remains unclear how consistently platforms enforce hate speech policies or whether enforcement is feasible at scale. We address these questions through a global audit of hate speech moderation on Twitter (now X). Using a complete 24-hour snapshot of public tweets, we construct representative samples comprising 540,000 tweets an…
▽ More
Online hate speech is associated with substantial social harms, yet it remains unclear how consistently platforms enforce hate speech policies or whether enforcement is feasible at scale. We address these questions through a global audit of hate speech moderation on Twitter (now X). Using a complete 24-hour snapshot of public tweets, we construct representative samples comprising 540,000 tweets annotated for hate speech by trained annotators across eight major languages. Five months after posting, 80% of hateful tweets remain online, including explicitly violent hate speech. Such tweets are no more likely to be removed than non-hateful tweets, with neither severity nor visibility increasing the likelihood of removal. We then examine whether these enforcement gaps reflect technical limits of large-scale moderation systems. While fully automated detection systems cannot reliably identify hate speech without generating large numbers of false positives, they effectively prioritize likely violations for human review. Simulations of a human-AI moderation pipeline indicate that substantially reducing user exposure to hate speech is economically feasible at a cost below existing regulatory penalties. These results suggest that the persistence of online hate cannot be explained by technical constraints alone but also reflects institutional choices in the allocation of moderation resources.
△ Less
Submitted 14 April, 2026;
originally announced April 2026.
-
LEADER: Learning Reliable Local-to-Global Correspondences for LiDAR Relocalization
Authors:
Jianshi Wu,
Minghang Zhu,
Dunqiang Liu,
Wen Li,
Sheng Ao,
Siqi Shen,
Chenglu Wen,
Cheng Wang
Abstract:
LiDAR relocalization has attracted increasing attention as it can deliver accurate 6-DoF pose estimation in complex 3D environments. Recent learning-based regression methods offer efficient solutions by directly predicting global poses without the need for explicit map storage. However, these methods often struggle in challenging scenes due to their equal treatment of all predicted points, which i…
▽ More
LiDAR relocalization has attracted increasing attention as it can deliver accurate 6-DoF pose estimation in complex 3D environments. Recent learning-based regression methods offer efficient solutions by directly predicting global poses without the need for explicit map storage. However, these methods often struggle in challenging scenes due to their equal treatment of all predicted points, which is vulnerable to noise and outliers. In this paper, we propose LEADER, a robust LiDAR-based relocalization framework enhanced by a simple, yet effective geometric encoder. Specifically, a Robust Projection-based Geometric Encoder architecture which captures multi-scale geometric features is first presented to enhance descriptiveness in geometric representation. A Truncated Relative Reliability loss is then formulated to model point-wise ambiguity and mitigate the influence of unreliable predictions. Extensive experiments on the Oxford RobotCar and NCLT datasets demonstrate that LEADER outperforms state-of-the-art methods, achieving 24.1% and 73.9% relative reductions in position error over existing techniques, respectively. The source code is released on https://github.com/JiansW/LEADER.
△ Less
Submitted 13 April, 2026;
originally announced April 2026.
-
Measurement of inclusive production of charmonium states in $b$-hadron decays via their decay into $φφ$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1173 additional authors not shown)
Abstract:
The inclusive production of the $η_c(1S)$, $η_c(2S)$ and $χ_{c}$ charmonium states in $b$-hadron decays is studied with LHCb Run~2 data, corresponding to an integrated luminosity of $5.9~\text{fb}^{-1}$, using charmonia decays to $φφ$ pairs. The production branching fractions of the $χ_{c}(1P)$ states in $b$-hadron decays are measured, using $b \to η_c(1S) (\to φφ) X$ as a normalisation channel, w…
▽ More
The inclusive production of the $η_c(1S)$, $η_c(2S)$ and $χ_{c}$ charmonium states in $b$-hadron decays is studied with LHCb Run~2 data, corresponding to an integrated luminosity of $5.9~\text{fb}^{-1}$, using charmonia decays to $φφ$ pairs. The production branching fractions of the $χ_{c}(1P)$ states in $b$-hadron decays are measured, using $b \to η_c(1S) (\to φφ) X$ as a normalisation channel, with $X$ indicating any additional particles. The results are \begin{align*}
&{\cal{B}} (b \to χ_{c0} X) = (1.34 \pm 0.13 \pm 0.06 \pm 0.37) \times 10^{-3},
&{\cal{B}} (b \to χ_{c1} X) = (1.58 \pm 0.12 \pm 0.09 \pm 0.44) \times 10^{-3},
&{\cal{B}} (b \to χ_{c2} X) = (0.55 \pm 0.08 \pm 0.05 \pm 0.15) \times 10^{-3}, \end{align*} where the first uncertainty is statistical, the second systematic and the last is due to the limited knowledge of externally measured branching fractions. The production branching fraction of $η_c(2S)$ times the branching fraction of its decay into $φφ$ is measured as ${\cal{B}} (b \to η_c(2S) X) \times {\cal{B}} (η_c(2S) \to φφ) = (4.0 \pm 0.6 \pm 0.6 \pm 1.1) \times 10^{-7}$. Furthermore, the mass of the $η_c(1S)$ state is measured to be $M_{η_c(1S)} = 2984.1 \pm 0.5 \pm 0.5$ MeV with the best precision to date.
△ Less
Submitted 13 April, 2026;
originally announced April 2026.
-
Intelligent Approval of Access Control Flow in Office Automation Systems via Relational Modeling
Authors:
Dugang Liu,
Zulong Chen,
Chuanfei Xu,
Jiaxuan He,
Yunlu Ma,
Jia Xu
Abstract:
Office automation (OA) systems play a crucial role in enterprise operations and management, with access control flow approval (ACFA) being a key component that manages the accessibility of various resources. However, traditional ACFA requires approval from the person in charge at each step, which consumes a significant amount of manpower and time. Its intelligence is a crucial issue that needs to…
▽ More
Office automation (OA) systems play a crucial role in enterprise operations and management, with access control flow approval (ACFA) being a key component that manages the accessibility of various resources. However, traditional ACFA requires approval from the person in charge at each step, which consumes a significant amount of manpower and time. Its intelligence is a crucial issue that needs to be addressed urgently by all companies. In this paper, we propose a novel relational modeling-driven intelligent approval (RMIA) framework to automate ACFA. Specifically, our RMIA consists of two core modules: (1) The binary relation modeling module aims to characterize the coupling relation between applicants and approvers and provide reliable basic information for ACFA decision-making from a coarse-grained perspective. (2) The ternary relation modeling module utilizes specific resource information as its core, characterizing the complex relations between applicants, resources, and approvers, and thus provides fine-grained gain information for informed decision-making. Then, our RMIA effectively fuses these two kinds of information to form the final decision. Finally, extensive experiments are conducted on two product datasets and an online A/B test to verify the effectiveness of RMIA.
△ Less
Submitted 13 April, 2026;
originally announced April 2026.
-
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
Authors:
Xiaomeng Hu,
Yinger Zhang,
Fei Huang,
Jianhong Tu,
Yang Su,
Lianghao Deng,
Yuxuan Liu,
Yantao Liu,
Dayiheng Liu,
Tsung-Yi Ho
Abstract:
AI agents are expected to perform professional work across hundreds of occupational domains (from emergency department triage to nuclear reactor safety monitoring to customs import processing), yet existing benchmarks can only evaluate agents in the few domains where public environments exist. We introduce OccuBench, a benchmark covering 100 real-world professional task scenarios across 10 industr…
▽ More
AI agents are expected to perform professional work across hundreds of occupational domains (from emergency department triage to nuclear reactor safety monitoring to customs import processing), yet existing benchmarks can only evaluate agents in the few domains where public environments exist. We introduce OccuBench, a benchmark covering 100 real-world professional task scenarios across 10 industry categories and 65 specialized domains, enabled by Language World Models (LWMs) that simulate domain-specific environments through LLM-driven tool response generation. Our multi-agent synthesis pipeline automatically produces evaluation instances with guaranteed solvability, calibrated difficulty, and document-grounded diversity. OccuBench evaluates agents along two complementary dimensions: task completion across professional domains and environmental robustness under controlled fault injection (explicit errors, implicit data degradation, and mixed faults). We evaluate 15 frontier models across 8 model families and find that: (1) no single model dominates all industries, as each has a distinct occupational capability profile; (2) implicit faults (truncated data, missing fields) are harder than both explicit errors (timeouts, 500s) and mixed faults, because they lack overt error signals and require the agent to independently detect data degradation; (3) larger models, newer generations, and higher reasoning effort consistently improve performance. GPT-5.2 improves by 27.5 points from minimal to maximum reasoning effort; and (4) strong agents are not necessarily strong environment simulators. Simulator quality is critical for LWM-based evaluation reliability. OccuBench provides the first systematic cross-industry evaluation of AI agents on professional occupational tasks.
△ Less
Submitted 12 April, 2026;
originally announced April 2026.
-
Measurement of the branching fractions of $χ_{cJ} \to π^{+}π^{-}π^{0}π^{0}$ via $ψ(3686) \to γχ_{cJ}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (741 additional authors not shown)
Abstract:
Using $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector operating at BEPCII, the branching fractions of $χ_{cJ}\toπ^+π^-π^0π^0$ ($J=0,~1,~2$) are measured via the radiative transition $ψ(3686)\toγχ_{cJ}$. The results are $\mathcal{B}(χ_{c0} \to π^{+}π^{-}π^{0}π^{0}) = (3.10 \pm 0.01 \pm 0.14) \times 10^{-2}$,…
▽ More
Using $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector operating at BEPCII, the branching fractions of $χ_{cJ}\toπ^+π^-π^0π^0$ ($J=0,~1,~2$) are measured via the radiative transition $ψ(3686)\toγχ_{cJ}$. The results are $\mathcal{B}(χ_{c0} \to π^{+}π^{-}π^{0}π^{0}) = (3.10 \pm 0.01 \pm 0.14) \times 10^{-2}$, $\mathcal{B}(χ_{c1} \to π^{+}π^{-}π^{0}π^{0}) = (1.16 \pm 0.01 \pm 0.05) \times 10^{-2}$, and $\mathcal{B}(χ_{c2} \to π^{+}π^{-}π^{0}π^{0}) = (1.92 \pm 0.01 \pm 0.08) \times 10^{-2}$, where the first uncertainties are statistical and the second systematic. The dominant intermediate states are found to be $χ_{cJ}\toρ^+ρ^-$. These results supersede the previous most precise measurements and provide significantly improved precision.
△ Less
Submitted 12 April, 2026;
originally announced April 2026.
-
Beyond Compliance: A Resistance-Informed Motivation Reasoning Framework for Challenging Psychological Client Simulation
Authors:
Danni Liu,
Bo Liu,
Yuxin Hu,
Hantao Zhao,
Yan Liu,
Ding Ding,
Jiahui Jin,
Jiuxin Cao
Abstract:
Psychological client simulators have emerged as a scalable solution for training and evaluating counselor trainees and psychological LLMs. Yet existing simulators exhibit unrealistic over-compliance, leaving counselors underprepared for the challenging behaviors common in real-world practice. To bridge this gap, we present ResistClient, which systematically models challenging client behaviors grou…
▽ More
Psychological client simulators have emerged as a scalable solution for training and evaluating counselor trainees and psychological LLMs. Yet existing simulators exhibit unrealistic over-compliance, leaving counselors underprepared for the challenging behaviors common in real-world practice. To bridge this gap, we present ResistClient, which systematically models challenging client behaviors grounded in Client Resistance Theory by integrating external behaviors with underlying motivational mechanisms. To this end, we propose Resistance-Informed Motivation Reasoning (RIMR), a two-stage training framework. First, RIMR mitigates compliance bias via supervised fine-tuning on RPC, a large-scale resistance-oriented psychological conversation dataset covering diverse client profiles. Second, beyond surface-level response imitation, RIMR models psychologically coherent motivation reasoning before response generation, jointly optimizing motivation authenticity and response consistency via process-supervised reinforcement learning. Extensive automatic and expert evaluations show that ResistClient substantially outperforms existing simulators in challenge fidelity, behavioral plausibility, and reasoning coherence. Moreover, ResistClient facilities evaluation of psychological LLMs under challenging conditions, offering new optimization directions for mental health dialogue systems.
△ Less
Submitted 12 April, 2026;
originally announced April 2026.
-
First Observation of \boldmath{$D^+ \to a_0(980)ρ$ and $D^+ \to a_0(980)^+ f_0(500)$} in \boldmath{$D^+ \to π^+π^+π^-η$ and $D^+ \to π^+π^0π^0η$} Decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (734 additional authors not shown)
Abstract:
We perform the first amplitude analysis of the singly Cabibbo-suppressed decays $D^+ \to π^+ π^{+(0)} π^{-(0)} η$, using $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773\,GeV, corresponding to an integrated luminosity of 20.3 $\rm{fb}^{-1}$. The absolute branching fractions of the $D^+ \to π^+ π^+ π^- η$ and $D^+ \to π^+ π^0 π^0 η$ decays are measure…
▽ More
We perform the first amplitude analysis of the singly Cabibbo-suppressed decays $D^+ \to π^+ π^{+(0)} π^{-(0)} η$, using $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773\,GeV, corresponding to an integrated luminosity of 20.3 $\rm{fb}^{-1}$. The absolute branching fractions of the $D^+ \to π^+ π^+ π^- η$ and $D^+ \to π^+ π^0 π^0 η$ decays are measured to be $(3.20\pm0.06_{\text{stat.}}\pm0.03_{\text{syst.}})\times 10^{-3}$ and $(2.43 \pm 0.11_{\text{stat.}} \pm 0.04_{\text{syst.}}) \times 10^{-3}$, respectively. % , both achieving three times better precision than the current PDG values. The decay process $D^{+}\to a_0(980)^{+}f_0(500)$ is observed for the first time with an unexpectedly large branching fraction. Moreover, we observe the decays $D^+ \to a_0(980)^{+(0)} ρ(770)^{0(+)}$ and measure the ratio $r_{+/0} \equiv \frac{\mathcal{B}(D^+ \to a_0(980)^+ ρ(770)^0)}{\mathcal{B}(D^+ \to a_0(980)^0 ρ(770)^+)}$ for the first time to be $0.55\pm0.08_{\text{stat.}}\pm0.05_{\text{syst.}}$. These results offer a novel insight into our comprehension of the nature of the $a_0(980)$ and $f_0(500)$ states.
△ Less
Submitted 11 April, 2026;
originally announced April 2026.
-
Gaussian Graphical Models for Functional Connectivity Analysis: A Statistical Review with Applications to Alzheimer's Disease
Authors:
Panpan Zhang,
Shiying Xiao,
W. Hudson Robb,
Dandan Liu,
Angela L. Jefferson,
Jun Yan
Abstract:
Functional connectivity analysis is an important tool for characterizing interactions among brain regions, particularly in studies of neurodegenerative disorders such as Alzheimer's disease (AD). Gaussian graphical models (GGMs) provide a promising statistical framework for estimating functional connectivity by capturing conditional dependence relationships among brain regions. Although a variety…
▽ More
Functional connectivity analysis is an important tool for characterizing interactions among brain regions, particularly in studies of neurodegenerative disorders such as Alzheimer's disease (AD). Gaussian graphical models (GGMs) provide a promising statistical framework for estimating functional connectivity by capturing conditional dependence relationships among brain regions. Although a variety of regularized precision matrix estimators have been proposed to estimate sparse conditional dependency structures for GGMs, their comparative performance and practical implications for neuroimaging studies are not well understood. In this work, we present a comprehensive statistical review and empirical evaluation of widely used GGM estimation methods, including the graphical lasso (glasso), ridge-based glasso, graphical elastic net, adaptive glasso, smoothly clipped absolute deviation (SCAD), minimax concave penalty (MCP), constrained $\ell_1$ minimization for inverse matrix estimation (CLIME), and tuning-insensitive graph estimation and regression (TIGER). Their performance is evaluated through extensive data-driven simulations designed to reflect realistic neuroimaging settings, along with an application to an AD cohort study to illustrate methodological differences and their impact on downstream network analysis. In addition, a user-friendly R package, spice, is provided to facilitate implementation and enhance the reproducibility of empirical studies.
△ Less
Submitted 11 April, 2026;
originally announced April 2026.
-
Projectively Wakamatsu Tilting Modules over One-Point Extensions
Authors:
Dajun Liu,
Jiaxuan Feng,
Hanpeng Gao
Abstract:
Let $Γ= Λ[M]$ be the one-point extension of an algebra $Λ$ by a $Λ$-module $M$. We establish a method to lift projectively Wakamatsu tilting (PWT) modules from $\mathrm{mod}\,Λ$ to $\mathrm{mod}\,Γ$ by adding the new projective module, and prove that this lifting process perfectly preserves mutation relations under certain homological conditions. Furthermore, for source point extensions of represe…
▽ More
Let $Γ= Λ[M]$ be the one-point extension of an algebra $Λ$ by a $Λ$-module $M$. We establish a method to lift projectively Wakamatsu tilting (PWT) modules from $\mathrm{mod}\,Λ$ to $\mathrm{mod}\,Γ$ by adding the new projective module, and prove that this lifting process perfectly preserves mutation relations under certain homological conditions. Furthermore, for source point extensions of representation-finite algebras, we obtain a complete classification of PWT $Γ$-modules in terms of those over $Λ$. In particular, we establish a bijection \[ \mathrm{PWT}(Γ) \longleftrightarrow \mathrm{PWT}(Λ) \coprod \mathrm{RPWT}(Λ, S_i). \] which yields the counting formula about $|\mathrm{PWT}(Γ)|$.
△ Less
Submitted 11 April, 2026;
originally announced April 2026.
-
Instructing LLMs to Negotiate using Reinforcement Learning with Verifiable Rewards
Authors:
Shuze Daniel Liu,
Claire Chen,
Jiabao Sean Xiao,
Lei Lei,
Yuheng Zhang,
Yisong Yue,
David Simchi-Levi
Abstract:
The recent advancement of Large Language Models (LLMs) has established their potential as autonomous interactive agents. However, they often struggle in strategic games of incomplete information, such as bilateral price negotiation. In this paper, we investigate if Reinforcement Learning from Verifiable Rewards (RLVR) can effectively teach LLMs to negotiate. Specifically, we explore the strategic…
▽ More
The recent advancement of Large Language Models (LLMs) has established their potential as autonomous interactive agents. However, they often struggle in strategic games of incomplete information, such as bilateral price negotiation. In this paper, we investigate if Reinforcement Learning from Verifiable Rewards (RLVR) can effectively teach LLMs to negotiate. Specifically, we explore the strategic behaviors that emerge during the learning process. We introduce a framework that trains a mid-sized buyer agent against a regulated LLM seller across a wide distribution of real-world products. By grounding reward signals directly in the maximization of economic surplus and strict adherence to private budget constraints, we reveal a novel four-phase strategic evolution. The agent progresses from naive bargaining to using aggressive starting prices, moves through a phase of deadlock, and ultimately develops sophisticated persuasive skills. Our results demonstrate that this verifiable training allows a 30B agent to significantly outperform frontier models over ten times its size in extracting surplus. Furthermore, the trained agent generalizes robustly to stronger counterparties unseen during training and remains effective even when facing hostile, adversarial seller personas.
△ Less
Submitted 10 April, 2026;
originally announced April 2026.
-
A first [CII] view of high-z quiescent galaxies
Authors:
C. D'Eugenio,
E. Daddi,
R. Gobat,
S. Jin,
D. Liu,
H. Sun,
F. Gentile,
F. Bruckmann,
Z. Liu,
I. Delvecchio,
L. Vallini,
B. Magnelli,
A. Zanella
Abstract:
We present ALMA detections (or stringent upper limits) of the [CII] 158 $μm$ emission line and underlying dust continuum from five massive quenched galaxies (QGs) at 2<z<4.7. We find extreme variations in the molecular gas fractions ($\rm{f_g=M_{mol}/M_{\star}}$), spanning 0.1%-25%, if a standard $\rm{α_{[CII]}}$ applies. We attempt a first empirical calibration of $\rm{α_{[CII]}}$ with respect to…
▽ More
We present ALMA detections (or stringent upper limits) of the [CII] 158 $μm$ emission line and underlying dust continuum from five massive quenched galaxies (QGs) at 2<z<4.7. We find extreme variations in the molecular gas fractions ($\rm{f_g=M_{mol}/M_{\star}}$), spanning 0.1%-25%, if a standard $\rm{α_{[CII]}}$ applies. We attempt a first empirical calibration of $\rm{α_{[CII]}}$ with respect to dust continuum in a $z=2$ lensed QG and with respect to CO(3-2) in a $z=3.1$ QG, finding no evidence of strong deviations from the standard value. Dust continuum measurements, coupled with JWST/MIRI fluxes, suggest higher dust temperatures compared to expectations from $z<2$ QGs, reaching $T_{d}\sim40-50 \,K$ in two galaxies. Coupled with remarkably high total infrared luminosities (LIR) not explained by observed JWST colors not by energy balance based on literature dust extinction measurements, and with [CII] deficits down to $\rm{[CII]/LIR\sim 2\times10^{-4}}$ typical of (Ultra)Luminous Infrared Galaxies, our findings point to additional dust-heating mechanisms other than dust-absorbed stellar radiation. Surprisingly, JWST/NIRCam and ALMA imaging reveal widespread disturbed stellar morphologies and offsets/tails in dust and gas, indicative of ongoing interactions. While larger samples are needed to assess how common these features are in high-z QGs, these findings support a merger-driven origin for the phenomenology observed in these systems, with key similarities with respect to local post-starburst galaxies where low-velocity shocks and turbulence also inject energy into the residual ISM.
△ Less
Submitted 10 April, 2026;
originally announced April 2026.
-
Tuning Plasmonic Metasurfaces via Phase Change Material Substrates for Modulating Reactivity in Light-Driven Reactions
Authors:
Ning Lyu,
Anjalie Edirisooriya,
Dawei Liu,
Zelio Fusco,
Shenyou Zhao,
Lan Fu,
Fiona J. Beck,
Christin David
Abstract:
Phase change materials provide a powerful platform for dynamically modulating optical responses in nanophotonic systems. While plasmonic metasurfaces have been widely employed to enhance photocatalytic efficiency and promote particular light-driven reactions, active and dynamical control over reaction pathways within a single device remains challenging. Here, we report a phase-induced tunable meta…
▽ More
Phase change materials provide a powerful platform for dynamically modulating optical responses in nanophotonic systems. While plasmonic metasurfaces have been widely employed to enhance photocatalytic efficiency and promote particular light-driven reactions, active and dynamical control over reaction pathways within a single device remains challenging. Here, we report a phase-induced tunable metasurface that tailors photoexcited electron populations through mode hybridization, enabling selective control over the reactivity of light-driven chemical processes. By exploiting thermally induced refractive-index switching in a Sb2S3 cavity, the plasmonic resonance strength of Au nanodisks is actively tuned via cavity-plasmon hybridization. This reconfiguration modulates the product yield of methylene blue degradation by a factor of 2.4, suppressing to 0.45 in the crystalline phase and enhancing to 1.09 in the amorphous phase. Importantly, this reconfigurable platform enables dynamic control of the reaction yield using a single metasurface architecture under identical illumination conditions. Our approach establishes a dynamically programmable light-driven reaction platform capable of precisely manipulating reaction reactivity, offering new opportunities for selective photocatalysis in complex multibranch reaction systems.
△ Less
Submitted 10 April, 2026;
originally announced April 2026.
-
Test of lepton flavour universality with $B^0\to K^{*0}\ell^+\ell^-$ decays at large dilepton invariant mass
Authors:
LHCb collaboration,
R. Aaij,
M. Abdelfatah,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1113 additional authors not shown)
Abstract:
Muon-electron universality is tested in $B^0 \to K^{*0} \ \ell^+ \ell^-$ decays, in the dilepton-invariant-mass region above the $ψ(2S)$ resonance. The analysis uses beauty mesons produced in proton-proton collisions recorded by the LHCb detector at center-of-mass energies of 7, 8, and 13 $\text{TeV}$, corresponding to an integrated luminosity of 9 $\text{fb}^{-1}$. The ratio of branching fraction…
▽ More
Muon-electron universality is tested in $B^0 \to K^{*0} \ \ell^+ \ell^-$ decays, in the dilepton-invariant-mass region above the $ψ(2S)$ resonance. The analysis uses beauty mesons produced in proton-proton collisions recorded by the LHCb detector at center-of-mass energies of 7, 8, and 13 $\text{TeV}$, corresponding to an integrated luminosity of 9 $\text{fb}^{-1}$. The ratio of branching fractions between the muon and electron channels, $R_{K^{*0}}$, is measured to be $1.08\,^{+0.14}_{-0.12}\text{(stat)} \ \pm 0.07\text{(syst)}$ for a dilepton-invariant-mass squared above 14.0 $\text{GeV}^{2}/\text{c}^{4}$, consistent with the Standard Model prediction. This result represents the most precise measurement of $R_{K^{*0}}$ in this region and the first such measurement performed at a hadron collider.
△ Less
Submitted 9 April, 2026;
originally announced April 2026.
-
Mapping generative AI use in the human brain: divergent neural, academic, and mental health profiles of functional versus socio emotional AI use
Authors:
Junjie Wang,
Xianyang Gan,
Dan Liu,
Jingxian He,
Stefania Ferraro,
Keith M. Kendrick,
Weihua Zhao,
Shuxia Yao,
Christian Montag,
Benjamin Becker
Abstract:
The widespread adoption of generative artificial intelligence conversational agents (AICAs) among university students constitutes a novel cognitive social environment whose impact on the maturing brain remains elusive. Combining surveys with high resolution structural MRI, we examined patterns of general, functional, and socio emotional AICA use, academic performance, mental health, and brain stru…
▽ More
The widespread adoption of generative artificial intelligence conversational agents (AICAs) among university students constitutes a novel cognitive social environment whose impact on the maturing brain remains elusive. Combining surveys with high resolution structural MRI, we examined patterns of general, functional, and socio emotional AICA use, academic performance, mental health, and brain structural signatures in a comparatively large sample of 222 young individuals. Across computational anatomy, meta analytic network level, and behavioral decoding analyses, we observed use specific associations. Higher general and functional AICA use frequencies were linked to better academic outcomes (GPA), larger dorsolateral prefrontal and calcarine gray matter volume, and enhanced hippocampal network clustering and local efficiency. In contrast, more frequent socio emotional AICA use was associated with poorer mental health (depression, social anxiety) and lower volume of superior temporal and amygdalar regions central to social and affective processing. These findings indicate that the same class of AI tools exerts distinct effects depending on usage patterns and motivations, engaging prefrontal hippocampal systems that support cognition versus socio emotional systems that may track distress linked usage. These heterogeneities are crucial for designing environments that harness the educational benefits of AI while mitigating mental health risks.
△ Less
Submitted 2 April, 2026;
originally announced April 2026.
-
Search for the lepton-flavour violating decays $B^+ \to π^+ μ^\pm e^\mp$
Authors:
LHCb collaboration,
R. Aaij,
M. Abdelfatah,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An,
L. Anderlini
, et al. (1105 additional authors not shown)
Abstract:
The first search for the lepton-flavour violating decays $B^+ \to π^+ μ^{\pm} e^{\mp}$ in proton-proton collisions is presented, using data collected by the LHCb experiment between 2011 and 2018, corresponding to an integrated luminosity of 9 fb$^{-1}$. No significant signal is observed and an upper limit on the branching fraction is set at…
▽ More
The first search for the lepton-flavour violating decays $B^+ \to π^+ μ^{\pm} e^{\mp}$ in proton-proton collisions is presented, using data collected by the LHCb experiment between 2011 and 2018, corresponding to an integrated luminosity of 9 fb$^{-1}$. No significant signal is observed and an upper limit on the branching fraction is set at $\mathcal{B}(B^+ \to π^+ μ^{\pm} e^{\mp}) < 1.8 \times 10^{-9}$ at the $90\%$ confidence level, two orders of magnitude more restrictive than the current world average. This is the first constraint on lepton-flavour violating $b \to d$ quark transitions at the LHC and also sets the most stringent upper limits to date on $b \to d μ^{\pm} e^{\mp}$ transitions. Limits on left-handed and scalar scenarios beyond the Standard Model are also reported.
△ Less
Submitted 9 April, 2026;
originally announced April 2026.
-
MonoUNet: A Robust Tiny Neural Network for Automated Knee Cartilage Segmentation on Point-of-Care Ultrasound Devices
Authors:
Alvin Kimbowa,
Arjun Parmar,
Ibrahim Mujtaba,
Will Wei,
Maziar Badii,
Matthew Harkey,
David Liu,
Ilker Hacihaliloglu
Abstract:
Objective: To develop a robust and compact deep learning model for automated knee cartilage segmentation on point-of-care ultrasound (POCUS) devices.
Methods: We propose MonoUNet, an ultra-compact U-Net consisting of (i) an aggressively reduced backbone with an asymmetric decoder, (ii) a trainable monogenic block that extracts multi-scale local phase features, and (iii) a gated feature injection…
▽ More
Objective: To develop a robust and compact deep learning model for automated knee cartilage segmentation on point-of-care ultrasound (POCUS) devices.
Methods: We propose MonoUNet, an ultra-compact U-Net consisting of (i) an aggressively reduced backbone with an asymmetric decoder, (ii) a trainable monogenic block that extracts multi-scale local phase features, and (iii) a gated feature injection mechanism that integrates these features into the encoder stages to reduce sensitivity to variations in ultrasound image appearance and improve robustness across devices. MonoUNet was evaluated on a multi-site, multi-device knee cartilage ultrasound dataset acquired using cart-based, portable, and handheld POCUS devices.
Results: Overall, MonoUNet outperformed existing lightweight segmentation models, with average Dice scores ranging from 92.62% to 94.82% and mean average surface distance (MASD) values between 0.133 mm and 0.254 mm. MonoUNet reduces the number of parameters by 10x--700x and computational cost by 14x--2000x relative to existing lightweight models. MonoUNet cartilage outcomes showed excellent reliability and agreement with the manual outcomes: intraclass correlation coefficients (ICC$_{2,k})$=0.96 and bias=2.00% (0.047 mm) for average thickness, and ICC$_{2,k}$=0.99 and bias=0.80% (0.328 a.u.) for echo intensity.
Conclusion: Incorporating trainable local phase features improves the robustness of highly compact neural networks for knee cartilage segmentation across varying acquisition settings and could support scalable ultrasound-based assessment and monitoring of knee osteoarthritis using POCUS devices. The code is publicly available at https://github.com/alvinkimbowa/monounet.
△ Less
Submitted 9 April, 2026;
originally announced April 2026.
-
Mode-Resolved Multiband Ballistic Transport and Conductance Thresholds in Bilayer Graphene Junctions
Authors:
Dan-Na Liu,
Jun Zheng,
Pierre A. Pantaleon
Abstract:
We study ballistic transport in bilayer graphene junctions and show how electrostatic gating, interlayer bias, and homogeneous strain provide complementary control over electron transmission. In the absence of strain, transport is governed by symmetry constraints that suppress transmission at specific incidence angles despite the availability of states. An interlayer bias lifts this suppression th…
▽ More
We study ballistic transport in bilayer graphene junctions and show how electrostatic gating, interlayer bias, and homogeneous strain provide complementary control over electron transmission. In the absence of strain, transport is governed by symmetry constraints that suppress transmission at specific incidence angles despite the availability of states. An interlayer bias lifts this suppression through mode mixing and opens a tunable transport gap. Within a full four-band description, we identify a distinct conductance threshold that marks the onset of propagation of the upper band inside the barrier. This produces a clear change in the slope of the conductance and serves as an experimentally accessible transport fingerprint of the multiband structure and interlayer coupling. Homogeneous in-plane strain acts as a geometric control mechanism. By reshaping the band structure in momentum space, it redistributes the angular transmission window and suppresses conductance without introducing disorder. Importantly, strain preserves the underlying symmetry-based decoupling responsible for transmission suppression while shifting its condition away from normal incidence. These results provide a unified framework for interpreting angle-resolved transport in bilayer graphene and establish multiband ballistic transport as a practical probe of band-structure geometry.
△ Less
Submitted 8 April, 2026;
originally announced April 2026.
-
Self-Discovered Intention-aware Transformer for Multi-modal Vehicle Trajectory Prediction
Authors:
Diyi Liu,
Zihan Niu,
Tu Xu,
Lishan Sun
Abstract:
Predicting vehicle trajectories plays an important role in autonomous driving and ITS applications. Although multiple deep learning algorithms are devised to predict vehicle trajectories, their reliant on specific graph structure (e.g., Graph Neural Network) or explicit intention labeling limit their flexibilities. In this study, we propose a pure Transformer-based network with multiple modals con…
▽ More
Predicting vehicle trajectories plays an important role in autonomous driving and ITS applications. Although multiple deep learning algorithms are devised to predict vehicle trajectories, their reliant on specific graph structure (e.g., Graph Neural Network) or explicit intention labeling limit their flexibilities. In this study, we propose a pure Transformer-based network with multiple modals considering their neighboring vehicles. Two separate tracks are employed. One track focuses on predicting the trajectories while the other focuses on predicting the likelihood of each intention considering neighboring vehicles. Study finds that the two track design can increase the performance by separating spatial module from the trajectory generating module. Also, we find the the model can learn an ordered group of trajectories by predicting residual offsets among K trajectories.
△ Less
Submitted 8 April, 2026;
originally announced April 2026.
-
Bi-level Heterogeneous Learning for Time Series Foundation Models: A Federated Learning Approach
Authors:
Shengchao Chen,
Guodong Long,
Dikai Liu,
Jing Jiang
Abstract:
Heterogeneity in time series data is more pronounced than in vision or language, as temporal dynamics vary substantially across domains and tasks. Existing efforts on training time series foundation models (TSFMs) from scratch are often trained with mixed-batch strategies that merge large-scale datasets, which can cause gradient conflicts and degrade representation quality. To address this, we pro…
▽ More
Heterogeneity in time series data is more pronounced than in vision or language, as temporal dynamics vary substantially across domains and tasks. Existing efforts on training time series foundation models (TSFMs) from scratch are often trained with mixed-batch strategies that merge large-scale datasets, which can cause gradient conflicts and degrade representation quality. To address this, we propose a fine-grained learning method that distills invariant knowledge from heterogeneous series while reducing cross-domain interference. We characterize heterogeneity at two levels: inter-domain and intra-domain. To tackle this bi-level heterogeneity, we design a federated learning method that mitigates intra-domain conflicts by enforcing domain-invariant and semantically consistent representations through local regularization, and addresses inter-domain discrepancies by enhancing cross-domain collaboration via domain-aware aggregation. Experiments across diverse benchmarks show that TSFMs trained with our method consistently outperform both centralized and federated TSFM baselines in point and probabilistic forecasting, while also achieving competitive zero-shot performance at scale, offering a flexible pathway for training TSFMs from scratch in heterogeneous environments.
△ Less
Submitted 8 April, 2026;
originally announced April 2026.
-
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
Authors:
Qihan Ren,
Peng Wang,
Ruikun Cai,
Shuai Shao,
Dadi Guo,
Yuejin Xie,
Yafu Li,
Quanshi Zhang,
Xia Hu,
Jing Shao,
Dongrui Liu
Abstract:
A prevailing narrative in LLM post-training holds that supervised finetuning (SFT) memorizes while reinforcement learning (RL) generalizes. We revisit this claim for reasoning SFT with long chain-of-thought (CoT) supervision and find that cross-domain generalization is not absent but conditional, jointly shaped by optimization dynamics, training data, and base-model capability. Some reported failu…
▽ More
A prevailing narrative in LLM post-training holds that supervised finetuning (SFT) memorizes while reinforcement learning (RL) generalizes. We revisit this claim for reasoning SFT with long chain-of-thought (CoT) supervision and find that cross-domain generalization is not absent but conditional, jointly shaped by optimization dynamics, training data, and base-model capability. Some reported failures are under-optimization artifacts: cross-domain performance first degrades before recovering and improving with extended training (a dip-and-recovery pattern), so shorttraining checkpoints can underestimate generalization. Data quality and structure both matter: low-quality solutions broadly hurt generalization,while verified long-CoT traces yield consistent cross-domain gains. Model capability is essential: stronger models internalize transferable procedural patterns (e.g., backtracking) even from a toy arithmetic game, while weaker ones imitate surface verbosity. This generalization is asymmetric, however: reasoning improves while safety degrades, reframing the question from whether reasoning SFT generalizes to under what conditions and at what cost.
△ Less
Submitted 7 April, 2026;
originally announced April 2026.
-
Generating Synthetic Doctor-Patient Conversations for Long-form Audio Summarization
Authors:
Yanis Labrak,
David Grünert,
Séverin Baroudi,
Jiyun Chun,
Pawel Cyrta,
Sergio Burdisso,
Ahmed Hassoon,
David Liu,
Adam Rothschild,
Reed Van Deusen,
Petr Motlicek,
Andrew Perrault,
Ricard Marxer,
Thomas Schaaf
Abstract:
Long-context audio reasoning is underserved in both training data and evaluation. Existing benchmarks target short-context tasks, and the open-ended generation tasks most relevant to long-context reasoning pose well-known challenges for automatic evaluation. We propose a synthetic data generation pipeline designed to serve both as a training resource and as a controlled evaluation environment, and…
▽ More
Long-context audio reasoning is underserved in both training data and evaluation. Existing benchmarks target short-context tasks, and the open-ended generation tasks most relevant to long-context reasoning pose well-known challenges for automatic evaluation. We propose a synthetic data generation pipeline designed to serve both as a training resource and as a controlled evaluation environment, and instantiate it for first-visit doctor-patient conversations with SOAP note generation as the task. The pipeline has three stages, persona-driven dialogue generation, multi-speaker audio synthesis with overlap/pause modeling, room acoustics, and sound events, and LLM-based reference SOAP note production, built entirely on open-weight models. We release 8,800 synthetic conversations with 1.3k hours of corresponding audio and reference notes. Evaluating current open-weight systems, we find that cascaded approaches still substantially outperform end-to-end models.
△ Less
Submitted 7 April, 2026;
originally announced April 2026.
-
Precise measurement of the CKM angle $γ$ with a novel approach
Authors:
The BESIII,
LHCb Collaborations,
:,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco
, et al. (1936 additional authors not shown)
Abstract:
A measurement of the CKM angle $γ$ is performed by applying a novel, unbinned, model-independent approach to datasets of electron-positron collisions collected by the BESIII experiment and proton-proton collisions by the LHCb experiment, corresponding to integrated luminosities of 8 fb$^{-1}$ and 9 fb$^{-1}$, respectively. The $C\!P$-violating phase $γ$ is determined from…
▽ More
A measurement of the CKM angle $γ$ is performed by applying a novel, unbinned, model-independent approach to datasets of electron-positron collisions collected by the BESIII experiment and proton-proton collisions by the LHCb experiment, corresponding to integrated luminosities of 8 fb$^{-1}$ and 9 fb$^{-1}$, respectively. The $C\!P$-violating phase $γ$ is determined from ${B^{\pm}\rightarrow D(\rightarrow K_{\rm S}^{0} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays in LHCb data, where $h^{(\prime)}$ is either a pion or kaon, while the corresponding strong-phase parameters are measured using doubly tagged ${D\rightarrow K_{\rm S/L}^0 h^{\prime+} h^{\prime-}}$ decays in the quantum-correlated $D\overline{D}$ system present in BESIII data. A joint fit to both datasets, which allows for a simultaneous determination of the associated $C\!P$-violating observables and strong-phase parameters, yields ${γ= (71.3\pm 5.0)^{\circ}}$. The result is the most precise to date and consistent with previous measurements and world averages.
△ Less
Submitted 7 April, 2026;
originally announced April 2026.
-
Measurement of the CKM angle $γ$ in $B^{\pm} \rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-})h^{\pm}$ decays with a novel approach
Authors:
The BESIII,
LHCb Collaborations,
:,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco
, et al. (1936 additional authors not shown)
Abstract:
A measurement of the CKM angle $γ$ and related strong-phase parameters is performed using a novel, model-independent approach in ${B^{\pm}\rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays, where $h^{(\prime)} \equiv π, K$. The analysis uses a joint data sample of electron-positron collisions collected by the BESIII experiment at the Beijing Electron-Positron Collider…
▽ More
A measurement of the CKM angle $γ$ and related strong-phase parameters is performed using a novel, model-independent approach in ${B^{\pm}\rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays, where $h^{(\prime)} \equiv π, K$. The analysis uses a joint data sample of electron-positron collisions collected by the BESIII experiment at the Beijing Electron-Positron Collider II during 2010--2011 and 2021--2022, corresponding to an integrated luminosity of 8 fb$^{-1}$, and proton-proton collisions collected by the LHCb experiment at the Large Hadron Collider during 2011--2018, corresponding to an integrated luminosity of 9 fb$^{-1}$. The two datasets are analyzed simultaneously by applying per-event weights based on the amplitude variation over the $D$-decay phase space to enhance the sensitivity to $C\!P$-violating observables. The CKM angle $γ$ is determined to be $γ= (71.3\pm 5.0)^{\circ}$, which constitutes the most precise single measurement to date.
△ Less
Submitted 7 April, 2026;
originally announced April 2026.
-
Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA
Authors:
Xing Tang,
Hao Chen,
Shiwei Li,
Fuyuan Lyu,
Weijie Shi,
Lingjie Li,
Dugang Liu,
Weihong Luo,
Xiku Du,
Xiuqiang He
Abstract:
Large language models (LLMs) have been incorporated into numerous industrial applications. Meanwhile, a vast array of API assets is scattered across various functions in the financial domain. An online financial question-answering system can leverage both LLMs and private APIs to provide timely financial analysis and information. The key is equipping the LLM model with function calling capability…
▽ More
Large language models (LLMs) have been incorporated into numerous industrial applications. Meanwhile, a vast array of API assets is scattered across various functions in the financial domain. An online financial question-answering system can leverage both LLMs and private APIs to provide timely financial analysis and information. The key is equipping the LLM model with function calling capability tailored to a financial scenario. However, a generic LLM requires customized financial APIs to call and struggles to adapt to the financial domain. Additionally, online user queries are diverse and contain out-of-distribution parameters compared with the required function input parameters, which makes it more difficult for a generic LLM to serve online users. In this paper, we propose a data-driven pipeline to enhance function calling in LLM for our online, deployed financial QA, comprising dataset construction, data augmentation, and model training. Specifically, we construct a dataset based on a previous study and update it periodically, incorporating queries and an augmentation method named AugFC. The addition of user query-related samples will \textit{exploit} our financial toolset in a data-driven manner, and AugFC explores the possible parameter values to enhance the diversity of our updated dataset. Then, we train an LLM with a two-step method, which enables the use of our financial functions. Extensive experiments on existing offline datasets, as well as the deployment of an online scenario, illustrate the superiority of our pipeline. The related pipeline has been adopted in the financial QA of YuanBao\footnote{https://yuanbao.tencent.com/chat/}, one of the largest chat platforms in China.
△ Less
Submitted 6 April, 2026;
originally announced April 2026.
-
Retrieve-then-Adapt: Retrieval-Augmented Test-Time Adaptation for Sequential Recommendation
Authors:
Xing Tang,
Jingyang Bin,
Ziqiang Cui,
Xiaokun Zhang,
Fuyuan Lyu,
Jingyan Jiang,
Dugang Liu,
Chen Ma,
Xiuqiang He
Abstract:
The sequential recommendation (SR) task aims to predict the next item based on users' historical interaction sequences. Typically trained on historical data, SR models often struggle to adapt to real-time preference shifts during inference due to challenges posed by distributional divergence and parameterized constraints. Existing approaches to address this issue include test-time training, test-t…
▽ More
The sequential recommendation (SR) task aims to predict the next item based on users' historical interaction sequences. Typically trained on historical data, SR models often struggle to adapt to real-time preference shifts during inference due to challenges posed by distributional divergence and parameterized constraints. Existing approaches to address this issue include test-time training, test-time augmentation, and retrieval-augmented fine-tuning. However, these methods either introduce significant computational overhead, rely on random augmentation strategies, or require a carefully designed two-stage training paradigm. In this paper, we argue that the key to effective test-time adaptation lies in achieving both effective augmentation and efficient adaptation. To this end, we propose Retrieve-then-Adapt (ReAd), a novel framework that dynamically adapts a deployed SR model to the test distribution through retrieved user preference signals. Specifically, given a trained SR model, ReAd first retrieves collaboratively similar items for a test user from a constructed collaborative memory database. A lightweight retrieval learning module then integrates these items into an informative augmentation embedding that captures both collaborative signals and prediction-refinement cues. Finally, the initial SR prediction is refined via a fusion mechanism that incorporates this embedding. Extensive experiments across five benchmark datasets demonstrate that ReAd consistently outperforms existing SR methods.
△ Less
Submitted 6 April, 2026;
originally announced April 2026.
-
Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills
Authors:
Dawei Liu,
Zongxia Li,
Hongyang Du,
Xiyang Wu,
Shihang Gui,
Yongbei Kuang,
Lichao Sun
Abstract:
Skill usage has become a core component of modern agent systems and can substantially improve agents' ability to complete complex tasks. In real-world settings, where agents must monitor and interact with numerous personal applications, web browsers, and other environment interfaces, skill libraries can scale to thousands of reusable skills. Scaling to larger skill sets introduces two key challeng…
▽ More
Skill usage has become a core component of modern agent systems and can substantially improve agents' ability to complete complex tasks. In real-world settings, where agents must monitor and interact with numerous personal applications, web browsers, and other environment interfaces, skill libraries can scale to thousands of reusable skills. Scaling to larger skill sets introduces two key challenges. First, loading the full skill set saturates the context window, driving up token costs, hallucination, and latency. In this paper, we present Graph of Skills (GoS), an inference-time structural retrieval layer for large skill libraries. GoS constructs an executable skill graph offline from skill packages, then at inference time retrieves a bounded, dependency-aware skill bundle through hybrid semantic-lexical seeding, reverse-weighted Personalized PageRank, and context-budgeted hydration. On SkillsBench and ALFWorld, GoS improves average reward by 43.6% over the vanilla full skill-loading baseline while reducing input tokens by 37.8%, and generalizes across three model families: Claude Sonnet, GPT-5.2 Codex, and MiniMax. Additional ablation studies across skill libraries ranging from 200 to 2,000 skills further demonstrate that GoS consistently outperforms both vanilla skills loading and simple vector retrieval in balancing reward, token efficiency, and runtime.
△ Less
Submitted 8 April, 2026; v1 submitted 6 April, 2026;
originally announced April 2026.
-
Optical Appearance and Ringdown of Black Holes in a Kalb Ramond Field Coupled to Perfect Fluid Dark Matter
Authors:
Qi-Qi Liang,
Zi-Qiang Cai,
Dong Liu,
Zheng-Wen Long
Abstract:
This paper investigates the optical and dynamical properties of a static spherically symmetric black hole in the presence of a Kalb--Ramond (KR) field coupled to perfect fluid dark matter (PFDM). We analyze the effects of the Lorentz-violating parameter $α$ and the dark matter parameter $λ$ on photon trajectories and their observational signatures in the strong-gravity regime. Furthermore, we stud…
▽ More
This paper investigates the optical and dynamical properties of a static spherically symmetric black hole in the presence of a Kalb--Ramond (KR) field coupled to perfect fluid dark matter (PFDM). We analyze the effects of the Lorentz-violating parameter $α$ and the dark matter parameter $λ$ on photon trajectories and their observational signatures in the strong-gravity regime. Furthermore, we study the quasinormal mode spectrum under scalar, electromagnetic, and gravitational perturbations, examining how the model parameters influence the characteristic oscillation frequencies and damping rates. In particular, the interplay between the effective potential structure and perturbative dynamics is clarified, and it is found that, within the validity of the eikonal approximation, the quasinormal modes of the black hole considered here exhibit good agreement with the properties of null geodesics. Our results show that the model parameters significantly affect both the optical appearance of the black hole and the dynamical features of the ringdown phase, providing potential observational constraints on Lorentz-violating effects and dark matter environments in strong-field regimes.
△ Less
Submitted 6 April, 2026;
originally announced April 2026.
-
NAIMA: Semantics Aware RGB Guided Depth Super-Resolution
Authors:
Tayyab Nasir,
Daochang Liu,
Ajmal Mian
Abstract:
Guided depth super-resolution (GDSR) is a multi-modal approach for depth map super-resolution that relies on a low-resolution depth map and a high-resolution RGB image to restore finer structural details. However, the misleading color and texture cues indicating depth discontinuities in RGB images often lead to artifacts and blurred depth boundaries in the generated depth map. We propose a solutio…
▽ More
Guided depth super-resolution (GDSR) is a multi-modal approach for depth map super-resolution that relies on a low-resolution depth map and a high-resolution RGB image to restore finer structural details. However, the misleading color and texture cues indicating depth discontinuities in RGB images often lead to artifacts and blurred depth boundaries in the generated depth map. We propose a solution that introduces global contextual semantic priors, generated from pretrained vision transformer token embeddings. Our approach to distilling semantic knowledge from pretrained token embeddings is motivated by their demonstrated effectiveness in related monocular depth estimation tasks. We introduce a Guided Token Attention (GTA) module, which iteratively aligns encoded RGB spatial features with depth encodings, using cross-attention for selectively injecting global semantic context extracted from different layers of a pretrained vision transformer. Additionally, we present an architecture called Neural Attention for Implicit Multi-token Alignment (NAIMA), which integrates DINOv2 with GTA blocks for a semantics-aware GDSR. Our proposed architecture, with its ability to distill semantic knowledge, achieves significant improvements over existing methods across multiple scaling factors and datasets.
△ Less
Submitted 6 April, 2026;
originally announced April 2026.
-
JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency
Authors:
Aichen Cai,
Anmeng Zhang,
Anyu Li,
Bo Zhang,
Bohua Cai,
Chang Li,
Changjian Jiang,
Changkai Lu,
Chao Xue,
Chaocai Liang,
Cheng Zhang,
Dongkai Liu,
Fei Wang,
Guoqiang Huang,
Haijian Ke,
Han Lin,
Hao Wang,
Ji Miao,
Jiacheng Zhang,
Jialong Shi,
Jifeng Zhu,
Jingjing Qian,
Junhui Luo,
Junwu Xiong,
Lam So
, et al. (44 additional authors not shown)
Abstract:
We introduce JoyAI-LLM Flash, an efficient Mixture-of-Experts (MoE) language model designed to redefine the trade-off between strong performance and token efficiency in the sub-50B parameter regime. JoyAI-LLM Flash is pretrained on a massive corpus of 20 trillion tokens and further optimized through a rigorous post-training pipeline, including supervised fine-tuning (SFT), Direct Preference Optimi…
▽ More
We introduce JoyAI-LLM Flash, an efficient Mixture-of-Experts (MoE) language model designed to redefine the trade-off between strong performance and token efficiency in the sub-50B parameter regime. JoyAI-LLM Flash is pretrained on a massive corpus of 20 trillion tokens and further optimized through a rigorous post-training pipeline, including supervised fine-tuning (SFT), Direct Preference Optimization (DPO), and large-scale reinforcement learning (RL) across diverse environments. To improve token efficiency, JoyAI-LLM Flash strategically balances \emph{thinking} and \emph{non-thinking} cognitive modes and introduces FiberPO, a novel RL algorithm inspired by fibration theory that decomposes trust-region maintenance into global and local components, providing unified multi-scale stability control for LLM policy optimization. To enhance architectural sparsity, the model comprises 48B total parameters while activating only 2.7B parameters per forward pass, achieving a substantially higher sparsity ratio than contemporary industry leading models of comparable scale. To further improve inference throughput, we adopt a joint training-inference co-design that incorporates dense Multi-Token Prediction (MTP) and Quantization-Aware Training (QAT). We release the checkpoints for both JoyAI-LLM-48B-A3B Base and its post-trained variants on Hugging Face to support the open-source community.
△ Less
Submitted 8 April, 2026; v1 submitted 3 April, 2026;
originally announced April 2026.
-
Search for the decays $B_{(s)}^0\to J/ψγ$ at LHCb
Authors:
LHCb collaboration,
R. Aaij,
M. Abdelfatah,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1114 additional authors not shown)
Abstract:
A search for the rare decays $B_{(s)}^0\to J/ψγ$ is performed with proton-proton collision data collected by the LHCb experiment, corresponding to integrated luminosities of $3~\rm{fb}^{-1}$ at centre-of-mass energies of 7 and 8 TeV, and $6~\rm{fb}^{-1}$ at 13 TeV. Assuming no contribution from $B^0\to J/ψγ$ decay, an upper limit is set on the branching fraction…
▽ More
A search for the rare decays $B_{(s)}^0\to J/ψγ$ is performed with proton-proton collision data collected by the LHCb experiment, corresponding to integrated luminosities of $3~\rm{fb}^{-1}$ at centre-of-mass energies of 7 and 8 TeV, and $6~\rm{fb}^{-1}$ at 13 TeV. Assuming no contribution from $B^0\to J/ψγ$ decay, an upper limit is set on the branching fraction $\mathcal{B}(B_{s}^0\to J/ψγ)<2.9\times10^{-6}$ at the 90% confidence level. If instead no contribution from $B_{s}^0\to J/ψγ$ decay is assumed, the limit is $\mathcal{B}(B^0\to J/ψγ)<2.5\times10^{-6}$ at the 90% confidence level. These results supersede the previous LHCb results, with the limit for $B_{s}^0\to J/ψγ$ improved by a factor of 2.5.
△ Less
Submitted 3 April, 2026;
originally announced April 2026.
-
Goal-Conditioned Neural ODEs with Guaranteed Safety and Stability for Learning-Based All-Pairs Motion Planning
Authors:
Dechuan Liu,
Ruigang Wang,
Ian R. Manchester
Abstract:
This paper presents a learning-based approach for all-pairs motion planning, where the initial and goal states are allowed to be arbitrary points in a safe set. We construct smooth goal-conditioned neural ordinary differential equations (neural ODEs) via bi-Lipschitz diffeomorphisms. Theoretical results show that the proposed model can provide guarantees of global exponential stability and safety…
▽ More
This paper presents a learning-based approach for all-pairs motion planning, where the initial and goal states are allowed to be arbitrary points in a safe set. We construct smooth goal-conditioned neural ordinary differential equations (neural ODEs) via bi-Lipschitz diffeomorphisms. Theoretical results show that the proposed model can provide guarantees of global exponential stability and safety (safe set forward invariance) regardless of goal location. Moreover, explicit bounds on convergence rate, tracking error, and vector field magnitude are established. Our approach admits a tractable learning implementation using bi-Lipschitz neural networks and can incorporate demonstration data. We illustrate the effectiveness of the proposed method on a 2D corridor navigation task.
△ Less
Submitted 13 April, 2026; v1 submitted 3 April, 2026;
originally announced April 2026.
-
Axial gravitational perturbations and echo-like signals of a hairy black hole from gravitational decoupling
Authors:
Yi Yang,
Ali Ovgun,
Gaetano Lambiase,
Dong Liu,
Zheng-Wen Long
Abstract:
We study axial gravitational perturbations of a hairy black hole constructed in the framework of gravitational decoupling and investigate the geometric origin of echo-like late-time signals in this spacetime. We derive the odd-parity master equation and the corresponding effective potential, and we compute the quasinormal-mode spectrum by using frequency-domain and time-domain methods. We show tha…
▽ More
We study axial gravitational perturbations of a hairy black hole constructed in the framework of gravitational decoupling and investigate the geometric origin of echo-like late-time signals in this spacetime. We derive the odd-parity master equation and the corresponding effective potential, and we compute the quasinormal-mode spectrum by using frequency-domain and time-domain methods. We show that, in a suitable region of parameter space, the axial potential develops a double-peak structure that supports a trapping cavity and gives rise to echo-like late-time waveforms. Rather than imposing near-horizon reflectivity by hand, the delayed pulses therefore arise dynamically from the geometry of the effective potential. We also clarify that the parameter region exhibiting echoes need not coincide with the region in which the weak energy condition is satisfied everywhere outside the event horizon, and this distinction must be taken into account when interpreting the physical status of the solution. Our results provide a useful framework for probing black-hole hair through gravitational-wave ringdown and for exploring possible observational departures from the standard no-hair paradigm.
△ Less
Submitted 8 April, 2026; v1 submitted 2 April, 2026;
originally announced April 2026.
-
ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis
Authors:
Yu Li,
Haoyu Luo,
Yuejin Xie,
Yuqian Fu,
Zhonghao Yang,
Shuai Shao,
Qihan Ren,
Wanying Qu,
Yanwei Fu,
Yujiu Yang,
Jing Shao,
Xia Hu,
Dongrui Liu
Abstract:
Evaluating the safety of LLM-based agents is increasingly important because risks in realistic deployments often emerge over multi-step interactions rather than isolated prompts or final responses. Existing trajectory-level benchmarks remain limited by insufficient interaction diversity, coarse observability of safety failures, and weak long-horizon realism. We introduce ATBench, a trajectory-leve…
▽ More
Evaluating the safety of LLM-based agents is increasingly important because risks in realistic deployments often emerge over multi-step interactions rather than isolated prompts or final responses. Existing trajectory-level benchmarks remain limited by insufficient interaction diversity, coarse observability of safety failures, and weak long-horizon realism. We introduce ATBench, a trajectory-level benchmark for structured, diverse, and realistic evaluation of agent safety. ATBench organizes agentic risk along three dimensions: risk source, failure mode, and real-world harm. Based on this taxonomy, we construct trajectories with heterogeneous tool pools and a long-context delayed-trigger protocol that captures realistic risk emergence across multiple stages. The benchmark contains 1,000 trajectories (503 safe and 497 unsafe), averaging 9.01 turns and 3.95k tokens, with 1,954 invoked tools drawn from pools spanning 2,084 available tools. Data quality is supported by rule-based and LLM-based filtering plus full human audit. Experiments on frontier LLMs, open-source models, and specialized guard systems show that ATBench is challenging even for strong evaluators, while enabling taxonomy-stratified analysis, cross-benchmark comparison, and diagnosis of long-horizon failure patterns.
△ Less
Submitted 8 April, 2026; v1 submitted 2 April, 2026;
originally announced April 2026.
-
Are VLMs Lost Between Sky and Space? LinkS$^2$Bench for UAV-Satellite Dynamic Cross-View Spatial Intelligence
Authors:
Dian Liu,
Jie Feng,
Di Li,
Yuhui Zheng,
Guanbin Li,
Weisheng Dong,
Guangming Shi
Abstract:
Synergistic spatial intelligence between UAVs and satellites is indispensable for emergency response and security operations, as it uniquely integrates macro-scale global coverage with dynamic, real-time local perception. However, the capacity of Vision-Language Models (VLMs) to master this complex interplay remains largely unexplored. This gap persists primarily because existing benchmarks are co…
▽ More
Synergistic spatial intelligence between UAVs and satellites is indispensable for emergency response and security operations, as it uniquely integrates macro-scale global coverage with dynamic, real-time local perception. However, the capacity of Vision-Language Models (VLMs) to master this complex interplay remains largely unexplored. This gap persists primarily because existing benchmarks are confined to isolated Unmanned Aerial Vehicle (UAV) videos or static satellite imagery, failing to evaluate the dynamic local-to-global spatial mapping essential for comprehensive cross-view reasoning. To bridge this gap, we introduce LinkS$^2$Bench, the first comprehensive benchmark designed to evaluate VLMs' wide-area, dynamic cross-view spatial intelligence. LinkS$^2$Bench links 1,022 minutes of dynamic UAV footage with high-resolution satellite imagery covering over 200 km$^2$. Through an LMM-assisted pipeline and rigorous human annotation, we constructed 17.9k high-quality question-answer pairs comprising 12 fine-grained tasks across four dimensions: perception, localization, relation, and reasoning. Evaluations of 18 representative VLMs reveal a substantial gap compared to human baselines, identifying accurate cross-view dynamic alignment as the critical bottleneck. To alleviate this, we design a Cross-View Alignment Adapter, demonstrating that explicit alignment significantly improves model performance. Furthermore, fine-tuning experiments underscore the potential of LinkS$^2$Bench in advancing VLM adaptation for complex spatial reasoning.
△ Less
Submitted 2 April, 2026;
originally announced April 2026.
-
From Understanding to Erasing: Towards Complete and Stable Video Object Removal
Authors:
Dingming Liu,
Wenjing Wang,
Chen Li,
Jing Lyu
Abstract:
Video object removal aims to eliminate target objects from videos while plausibly completing missing regions and preserving spatio-temporal consistency. Although diffusion models have recently advanced this task, it remains challenging to remove object-induced side effects (e.g., shadows, reflections, and illumination changes) without compromising overall coherence. This limitation stems from the…
▽ More
Video object removal aims to eliminate target objects from videos while plausibly completing missing regions and preserving spatio-temporal consistency. Although diffusion models have recently advanced this task, it remains challenging to remove object-induced side effects (e.g., shadows, reflections, and illumination changes) without compromising overall coherence. This limitation stems from the insufficient physical and semantic understanding of the target object and its interactions with the scene. In this paper, we propose to introduce understanding into erasing from two complementary perspectives. Externally, we introduce a distillation scheme that transfers the relationships between objects and their induced effects from vision foundation models to video diffusion models. Internally, we propose a framewise context cross-attention mechanism that grounds each denoising block in informative, unmasked context surrounding the target region. External and internal guidance jointly enable our model to understand the target object, its induced effects, and the global background context, resulting in clear and coherent object removal. Extensive experiments demonstrate our state-of-the-art performance, and we establish the first real-world benchmark for video object removal to facilitate future research and community progress. Our code, data, and models are available at: https://github.com/WeChatCV/UnderEraser.
△ Less
Submitted 2 April, 2026;
originally announced April 2026.
-
Informed Machine Learning with Knowledge Landmarks
Authors:
Chuyi Dai,
Witold Pedrycz,
Suping Xu,
Ding Liu,
Xianmin Wang
Abstract:
Informed Machine Learning has emerged as a viable generalization of Machine Learning (ML) by building a unified conceptual and algorithmic setting for constructing models on a unified basis of knowledge and data. Physics-informed ML involving physics equations is one of the developments within Informed Machine Learning. This study proposes a novel direction of Knowledge-Data ML, referred to as KD-…
▽ More
Informed Machine Learning has emerged as a viable generalization of Machine Learning (ML) by building a unified conceptual and algorithmic setting for constructing models on a unified basis of knowledge and data. Physics-informed ML involving physics equations is one of the developments within Informed Machine Learning. This study proposes a novel direction of Knowledge-Data ML, referred to as KD-ML, where numeric data are integrated with knowledge tidbits expressed in the form of granular knowledge landmarks. We advocate that data and knowledge are complementary in several fundamental ways: data are precise (numeric) and local, usually confined to some region of the input space, while knowledge is global and formulated at a higher level of abstraction. The knowledge can be represented as information granules and organized as a collection of input-output information granules called knowledge landmarks. In virtue of this evident complementarity, we develop a comprehensive design process of the KD-ML model and formulate an original augmented loss function L, which additively embraces the component responsible for optimizing the model based on available numeric data, while the second component, playing the role of a granular regularizer, so that it adheres to the granular constraints (knowledge landmarks). We show the role of the hyperparameter positioned in the loss function, which balances the contribution and guiding role of data and knowledge, and point to some essential tendencies associated with the quality of data (noise level) and the level of granularity of the knowledge landmarks. Experiments on two physics-governed benchmarks demonstrate that the proposed KD model consistently outperforms data-driven ML models.
△ Less
Submitted 31 March, 2026;
originally announced April 2026.
-
First energy scan measurement of $e^{+}e^{-}\to K^{+}K^{-}$ around the $ψ(2S)$ resonance
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (683 additional authors not shown)
Abstract:
We report the first measurement of the $e^{+}e^{-}\to K^{+}K^{-}$ cross sections around the $ψ(2S)$ resonance using the energy scan method. The analysis is based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 495~pb$^{-1}$ collected with the BESIII detector at BEPCII. By analyzing the cross section line-shape, we extract the relative phase $Φ$ between the strong and el…
▽ More
We report the first measurement of the $e^{+}e^{-}\to K^{+}K^{-}$ cross sections around the $ψ(2S)$ resonance using the energy scan method. The analysis is based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 495~pb$^{-1}$ collected with the BESIII detector at BEPCII. By analyzing the cross section line-shape, we extract the relative phase $Φ$ between the strong and electromagnetic amplitudes of the $ψ(2S)$ resonance, a fundamental parameter in charmonium physics, based on the assumption that the relative phase between the electromagnetic amplitude of the $ψ(2S)$ resonance and the continuum is zero. Two distinct solutions for the branching fraction $\mathcal{B}$ of $ψ(2S)\to K^{+}K^{-}$ are observed: a constructive interference solution with $\mathcal{B}=(7.49\pm0.41)\times10^{-5}$ and $Φ=(110.1 \pm6.7)^\circ$, and a destructive interference solution with $\mathcal{B}=(10.94\pm0.48)\times10^{-5}$ and $Φ=(-106.8\pm5.7)^\circ$. A significant correlation between $Φ$ and $\mathcal{B}$ is established, demonstrating that interference effects must be taken into account in the $ψ(2S)$ branching fraction measurements. Additionally, the first results for both the $ψ(2S)$ strong form factor, which characterizes the strong coupling between $ψ(2S)$ and $K^{+}K^{-}$, and the energy-dependent electromagnetic form factor of the charged kaon in this energy region are here reported.
△ Less
Submitted 31 March, 2026;
originally announced March 2026.
-
Constraints on the host galaxy and AGN properties of three z > 6 JWST AGN from NOEMA observations
Authors:
Giovanni Mazzolari,
Hannah Übler,
Rodrigo Herrera Camus,
Ric Davies,
Linda Tacconi,
Dieter Lutz,
Natascha Förster Schreiber,
Francesco D'Eugenio,
Minju Lee,
Capucine Barfety,
Elena Bertola,
Andrew Bunker,
Andreas Burkert,
Jianhang Chen,
Giovanni Cresci,
Frank Eisenhauer,
Juan Manuel Espejo Salcedo,
Simon Flesch,
Reinhard Genzel,
Xihan Ji,
Lilian Lee,
Daizhong Liu,
Cosimo Marconcini,
Roberto Maiolino,
Thorsten Naab
, et al. (10 additional authors not shown)
Abstract:
We targeted with deep NOEMA observations the [CII]158$μ$m emission of three JWST-discovered AGN at z>6. Two of them have the typical features of Little Red Dots (LRDs), while the third one is a blue, extended, Type I AGN. We do not significantly detect [CII] emission or dust continuum in any of the targets, even after stacking. The resulting [CII] luminosity upper limits,…
▽ More
We targeted with deep NOEMA observations the [CII]158$μ$m emission of three JWST-discovered AGN at z>6. Two of them have the typical features of Little Red Dots (LRDs), while the third one is a blue, extended, Type I AGN. We do not significantly detect [CII] emission or dust continuum in any of the targets, even after stacking. The resulting [CII] luminosity upper limits, $\log (L_{[CII]}/L_{\odot})<7.77-8.1$, lie $\sim2σ$ below the values expected from the [CII]-SFR relation, and we explore different scenarios to explain the lack of [CII]. We obtained upper limits on the gas masses of $\log (M_{gas}/M_{\odot})<9.26-9.59$ corresponding to $\log( M_{dust}/M_{\odot})<5.68-6.55$ assuming a metallicity dependent dust to gas ratio. Using the continuum non-detections (rms $\sim 16-25 ~μJy$) together with JWST/MIRI constraints, we performed a revised SED-fitting decomposition, resulting in stellar masses up to $\sim 2$ dex lower than previously reported, and implying $0.03\lesssim M_{BH}/M_{*}\lesssim0.7$. For the two LRDs, the SED is well reproduced by stellar emission in the rest-frame UV, while the rising rest-frame optical slope, flattening toward the near-infrared, is consistent with emission from a Type I AGN partially obscured along the polar direction with $E(B-V)_{\rm polar}\simeq 1$, in agreement with attenuation derived from the broad lines Balmer decrement. This decomposition demonstrates that a relatively standard AGN configuration can reproduce the SEDs of the two LRDs, without invoking more exotic scenarios. Finally, we investigate the positions of the three sources in the $IRX-β_{UV}$ plane, finding that they lie in a parameter space where galaxies are typically characterized by patchy dust distributions. Our analysis highlights the importance of millimeter constraints to characterize the different physical properties of high-z AGN.
△ Less
Submitted 31 March, 2026;
originally announced March 2026.
-
C2RustXW: Program-Structure-Aware C-to-Rust Translation via Program Analysis and LLM
Authors:
Yanyan Yan,
Yang Feng,
Jiangshan Liu,
Di Liu,
Zixi Liu,
Hao Teng,
Baowen Xu
Abstract:
The growing adoption of Rust for its memory safety and performance has increased the demand for effective migration of legacy C codebases. However, existing rule-based translators (e.g., \ctorust) often generate verbose, non-idiomatic code that preserves unsafe C semantics, limiting readability, maintainability, and practical adoption. Moreover, manual post-processing of such outputs is labor-inte…
▽ More
The growing adoption of Rust for its memory safety and performance has increased the demand for effective migration of legacy C codebases. However, existing rule-based translators (e.g., \ctorust) often generate verbose, non-idiomatic code that preserves unsafe C semantics, limiting readability, maintainability, and practical adoption. Moreover, manual post-processing of such outputs is labor-intensive and rarely yields high-quality Rust code, posing a significant barrier to large-scale migration. To address these limitations, we present \tool, a program-structure-aware C-to-Rust translation approach that integrates program analysis with Large Language Models (LLMs). \tool extracts the multi-level program structure, including global symbols, function dependencies, and control- and data-flow information, and encodes these as structured textual representations injected into LLM prompts to guide translation and repair. Based on this design, \tool performs dependency-aware translation and adopts a multi-stage repair pipeline that combines rule-based and structure-guided LLM-based techniques to ensure syntactic correctness. For semantic correctness, \tool further integrates execution-based validation with structure-guided reasoning to localize and repair behavioral inconsistencies. Experimental results show that \tool achieves 100\% syntactic correctness on CodeNet and 97.78\% on GitHub, while significantly reducing code size (up to 43.70\%) and unsafe usage (to 5.75\%). At the project level, \tool achieves perfect syntactic correctness and an average semantic correctness of 78.87\%, demonstrating its effectiveness for practical and scalable C-to-Rust migration.
△ Less
Submitted 30 March, 2026;
originally announced March 2026.
-
Observation of the doubly charmed baryon $\itΞ_{cc}^+$ with the LHCb Run 3 detector
Authors:
LHCb collaboration,
R. Aaij,
M. Abdelfatah,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An,
L. Anderlini
, et al. (1107 additional authors not shown)
Abstract:
The first observation of the doubly charmed baryon $\itΞ_{cc}^+$ is reported through its decay to the $\itΛ_c^+ K^-π^+$ final state, with a statistical significance exceeding seven standard deviations. The observation is made using proton-proton collision data collected in 2024 with the LHCb Run 3 detector at a center-of-mass energy of 13.6 TeV, corresponding to a total integrated luminosity of…
▽ More
The first observation of the doubly charmed baryon $\itΞ_{cc}^+$ is reported through its decay to the $\itΛ_c^+ K^-π^+$ final state, with a statistical significance exceeding seven standard deviations. The observation is made using proton-proton collision data collected in 2024 with the LHCb Run 3 detector at a center-of-mass energy of 13.6 TeV, corresponding to a total integrated luminosity of $6.9\,\mathrm{fb}^{-1}$. The $\itΞ_{cc}^+$ mass is measured to be $3619.97 \pm 0.83 \pm 0.26 \,^{+1.90}_{-1.30}\,\mathrm{MeV}/c^2$, where the first uncertainty is statistical, the second is systematic, and the third is due to the unknown $\itΞ_{cc}^+$ lifetime, which is assumed to lie in the range 15-160 fs with a baseline value of 45 fs. The difference between the masses of the $\itΞ_{cc}^+$ and $\itΞ_{cc}^{++}$ baryons is determined to be $-1.77 \pm 0.84 \pm 0.15 \,^{+1.90}_{-1.30}\,\mathrm{MeV}/c^2$. This is the first observation of a new particle made with the LHCb Run 3 detector.
△ Less
Submitted 30 March, 2026;
originally announced March 2026.