-
DySkew: Dynamic Data Redistribution for Skew-Resilient Snowpark UDF Execution
Authors:
Chenwei Xie,
Urjeet Shrestha,
Corbin McElhanney,
Lukas Lorimer,
Gopal V,
Zihao Ye,
Yi Pan,
Nic Crouch,
Elliott Brossard,
Florian Funke,
Yuxiong He
Abstract:
Snowflake revolutionized data warehousing with an elastic architecture that decouples compute and storage, enabling scalable solutions for diverse data analytics needs. Building on this foundation, Snowflake has advanced its AI Data Cloud vision by introducing Snowpark, a managed turnkey solution that supports data engineering and AI/ML workloads using Python and other programming languages. While Snowpark's User-Defined Function (UDF) execution model offers high throughput, it is highly vulnerable to performance degradation from data skew, where uneven data partitioning causes straggler tasks and unpredictable latency. The non-uniform computational cost of arbitrary user code further exacerbates this classic challenge. This paper presents DySkew, a novel, data-skew-aware execution strategy for Snowpark UDFs. Built upon Snowflake's new generalized skew handling solution, an adaptive data distribution mechanism utilizing per-link state machines, DySkew addresses the unique challenges of user-defined logic with the goals of fine-grained per-row mitigation, dynamic runtime adaptation, and low-overhead, cost-aware redistribution. Specifically, for Snowpark, we introduce crucial optimizations, including an eager redistribution strategy and a Row Size Model to dynamically manage overhead for extremely large rows. This dynamic approach replaces the previous static round-robin method and overcomes its limitations. We detail the architecture of this framework and showcase its effectiveness through performance evaluations and real-world case studies, demonstrating significant improvements in the execution time and resource utilization for large-scale Snowpark UDF workloads.
Submitted 14 April, 2026;
originally announced April 2026.
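To illustrate the skew problem the DySkew abstract describes, the following minimal sketch compares a static round-robin assignment with a simple cost-aware "least-loaded" assignment. It is not the DySkew algorithm: the per-row costs, the function names `round_robin` and `least_loaded`, and the assumption that each row's cost is known before dispatch are all hypothetical and chosen only for illustration.

```python
import heapq


def round_robin(costs, n_workers):
    """Assign row i to worker i % n_workers, ignoring row cost."""
    loads = [0.0] * n_workers
    for i, cost in enumerate(costs):
        loads[i % n_workers] += cost
    return loads


def least_loaded(costs, n_workers):
    """Assign each row to the worker with the smallest accumulated cost.

    Assumes the cost of a row is known when it is dispatched, which is
    a simplification; real UDF cost is generally not known in advance.
    """
    heap = [(0.0, w) for w in range(n_workers)]
    heapq.heapify(heap)
    loads = [0.0] * n_workers
    for cost in costs:
        load, worker = heapq.heappop(heap)
        load += cost
        loads[worker] = load
        heapq.heappush(heap, (load, worker))
    return loads


# Hypothetical skewed workload: every fourth row is 10x more expensive.
costs = [10.0, 1.0, 1.0, 1.0] * 4
rr_loads = round_robin(costs, 4)
ll_loads = least_loaded(costs, 4)

# The slowest worker (the straggler) determines the completion time.
print("round-robin straggler load:", max(rr_loads))
print("least-loaded straggler load:", max(ll_loads))
```

With this input, round-robin sends every expensive row to the same worker, so one straggler carries most of the work, while the cost-aware assignment spreads the expensive rows across workers.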
-
Observation of the Exotic State $π_{1}(1600)$ in $ψ(2S)\rightarrowγχ_{c1},χ_{c1}\rightarrowπ^{+}π^{-}η'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (728 additional authors not shown)
Abstract:
A partial wave analysis of the process $ψ(2S)\rightarrowγχ_{c1}, χ_{c1}\rightarrowπ^+π^-η^{\prime}$ is performed using $(2712.4\pm14.3)\times10^{6}$ $ψ(2S)$ events collected with the BESIII detector. An isovector state with exotic quantum numbers $J^{PC}=1^{-+}$, denoted as $π_{1}(1600)$, is observed for the first time in the charmonium decay of $χ_{c1}\rightarrowπ_{1}^{\pm}(1600)π^{\mp}$, $π_{1}^{\pm}(1600)\rightarrowπ^{\pm}η^{\prime}$ with a statistical significance over $21σ$. Its mass and width are determined to be $1828 \pm 8 ({\rm stat})^{+11}_{-33}({\rm syst})~\mathrm{MeV}/c^2$ and $638 \pm 26 ({\rm stat})^{+35}_{-86}({\rm syst})~\mathrm{MeV}$, respectively, using a relativistic Breit-Wigner function with a mass-dependent width. The corresponding product of branching fractions is determined to be $\mathcal{B}\left[χ_{c1}\rightarrowπ_{1}(1600)^{\pm}π^{\mp} \right] \times \mathcal{B}\left[π_{1}(1600)^{\pm}\rightarrowπ^{\pm}η^{\prime}\right] = \left( 4.30 \pm 0.14 ({\rm stat})^{+1.04}_{-1.03}({\rm syst})~ \right) \times 10^{-4}$.
Submitted 14 April, 2026; v1 submitted 14 April, 2026;
originally announced April 2026.
-
Measurement of the branching fractions of $χ_{cJ} \to π^{+}π^{-}π^{0}π^{0}$ via $ψ(3686) \to γχ_{cJ}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (741 additional authors not shown)
Abstract:
Using $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector operating at BEPCII, the branching fractions of $χ_{cJ}\toπ^+π^-π^0π^0$ ($J=0,~1,~2$) are measured via the radiative transition $ψ(3686)\toγχ_{cJ}$. The results are $\mathcal{B}(χ_{c0} \to π^{+}π^{-}π^{0}π^{0}) = (3.10 \pm 0.01 \pm 0.14) \times 10^{-2}$, $\mathcal{B}(χ_{c1} \to π^{+}π^{-}π^{0}π^{0}) = (1.16 \pm 0.01 \pm 0.05) \times 10^{-2}$, and $\mathcal{B}(χ_{c2} \to π^{+}π^{-}π^{0}π^{0}) = (1.92 \pm 0.01 \pm 0.08) \times 10^{-2}$, where the first uncertainties are statistical and the second systematic. The dominant intermediate states are found to be $χ_{cJ}\toρ^+ρ^-$. These results supersede the previous most precise measurements and provide significantly improved precision.
Submitted 12 April, 2026;
originally announced April 2026.
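As a reading aid for the branching fractions quoted in the abstract above, the sketch below combines the statistical and systematic uncertainties in quadrature and reports the relative precision. Combining in quadrature assumes the two uncertainties are independent; that is a common convention, not something the abstract states, and the variable names are illustrative.

```python
import math

# (central value, statistical, systematic), all in units of 10^-2,
# copied from the abstract above.
results = {
    "chi_c0": (3.10, 0.01, 0.14),
    "chi_c1": (1.16, 0.01, 0.05),
    "chi_c2": (1.92, 0.01, 0.08),
}

summary = {}
for name, (value, stat, syst) in results.items():
    # Quadrature sum, assuming independent stat and syst uncertainties.
    total = math.hypot(stat, syst)
    summary[name] = (total, total / value)
    print(f"{name}: total = {total:.3f}e-2, relative = {100 * total / value:.1f}%")
```

In all three channels the systematic term dominates the combined uncertainty.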
-
First Observation of $D^+ \to a_0(980)ρ$ and $D^+ \to a_0(980)^+ f_0(500)$ in $D^+ \to π^+π^+π^-η$ and $D^+ \to π^+π^0π^0η$ Decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (734 additional authors not shown)
Abstract:
We perform the first amplitude analysis of the singly Cabibbo-suppressed decays $D^+ \to π^+ π^{+(0)} π^{-(0)} η$, using $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV, corresponding to an integrated luminosity of 20.3 $\rm{fb}^{-1}$. The absolute branching fractions of the $D^+ \to π^+ π^+ π^- η$ and $D^+ \to π^+ π^0 π^0 η$ decays are measured to be $(3.20\pm0.06_{\text{stat.}}\pm0.03_{\text{syst.}})\times 10^{-3}$ and $(2.43 \pm 0.11_{\text{stat.}} \pm 0.04_{\text{syst.}}) \times 10^{-3}$, respectively. The decay process $D^{+}\to a_0(980)^{+}f_0(500)$ is observed for the first time with an unexpectedly large branching fraction. Moreover, we observe the decays $D^+ \to a_0(980)^{+(0)} ρ(770)^{0(+)}$ and measure the ratio $r_{+/0} \equiv \frac{\mathcal{B}(D^+ \to a_0(980)^+ ρ(770)^0)}{\mathcal{B}(D^+ \to a_0(980)^0 ρ(770)^+)}$ for the first time to be $0.55\pm0.08_{\text{stat.}}\pm0.05_{\text{syst.}}$. These results offer novel insight into the nature of the $a_0(980)$ and $f_0(500)$ states.
Submitted 15 April, 2026; v1 submitted 11 April, 2026;
originally announced April 2026.
-
Does the spectral break in the IceCube diffuse neutrino spectrum originate from AGN evolution?
Authors:
Caijin Xie,
Zijian Qiu,
Yudong Cui,
Sujie Lin,
Lili Yang
Abstract:
The enigmatic origin of the diffuse neutrino background detected by IceCube in the energy range from TeV to PeV remains one of the central open problems in high-energy astrophysics, and this puzzle is further deepened by the recent evidence for a spectral break. Could this convex-spectrum background arise predominantly from the evolution of active galactic nuclei (AGNs)? In this work, we claim that the spectral break is naturally predicted when AGN evolution is taken into account, and the diffuse flux can be interpreted as the superposition of contributions from AGNs at different evolutionary phases. We develop a unified framework that incorporates AGN evolution, where cosmic rays (CRs) accelerated during the active phase subsequently diffuse and interact in the host galaxy after the central engine switches off, producing a long-lived hadronic afterglow. Adopting physically plausible parameters, our model successfully reproduces both the spectral features of the diffuse background and the observed neutrino emission from representative sources such as TXS 0506$+$056 and NGC 1068. Our results suggest that AGN host galaxies are more efficient CR reservoirs than previously expected. Moreover, the model favors a lepton-dominated scenario for most AGNs. This conclusion accounts for the relatively low detection rate of point-like sources by IceCube and underscores the need for next-generation neutrino telescopes with larger effective areas and higher sensitivity.
Submitted 9 April, 2026;
originally announced April 2026.
-
Precise measurement of the CKM angle $γ$ with a novel approach
Authors:
The BESIII and LHCb Collaborations,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco
, et al. (1936 additional authors not shown)
Abstract:
A measurement of the CKM angle $γ$ is performed by applying a novel, unbinned, model-independent approach to datasets of electron-positron collisions collected by the BESIII experiment and proton-proton collisions by the LHCb experiment, corresponding to integrated luminosities of 8 fb$^{-1}$ and 9 fb$^{-1}$, respectively. The $C\!P$-violating phase $γ$ is determined from ${B^{\pm}\rightarrow D(\rightarrow K_{\rm S}^{0} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays in LHCb data, where $h^{(\prime)}$ is either a pion or kaon, while the corresponding strong-phase parameters are measured using doubly tagged ${D\rightarrow K_{\rm S/L}^0 h^{\prime+} h^{\prime-}}$ decays in the quantum-correlated $D\overline{D}$ system present in BESIII data. A joint fit to both datasets, which allows for a simultaneous determination of the associated $C\!P$-violating observables and strong-phase parameters, yields ${γ= (71.3\pm 5.0)^{\circ}}$. The result is the most precise to date and consistent with previous measurements and world averages.
Submitted 7 April, 2026;
originally announced April 2026.
-
Measurement of the CKM angle $γ$ in $B^{\pm} \rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-})h^{\pm}$ decays with a novel approach
Authors:
The BESIII and LHCb Collaborations,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco
, et al. (1936 additional authors not shown)
Abstract:
A measurement of the CKM angle $γ$ and related strong-phase parameters is performed using a novel, model-independent approach in ${B^{\pm}\rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-}) h^{\pm}}$ decays, where $h^{(\prime)} \equiv π, K$. The analysis uses a joint data sample of electron-positron collisions collected by the BESIII experiment at the Beijing Electron-Positron Collider II during 2010--2011 and 2021--2022, corresponding to an integrated luminosity of 8 fb$^{-1}$, and proton-proton collisions collected by the LHCb experiment at the Large Hadron Collider during 2011--2018, corresponding to an integrated luminosity of 9 fb$^{-1}$. The two datasets are analyzed simultaneously by applying per-event weights based on the amplitude variation over the $D$-decay phase space to enhance the sensitivity to $C\!P$-violating observables. The CKM angle $γ$ is determined to be $γ= (71.3\pm 5.0)^{\circ}$, which constitutes the most precise single measurement to date.
Submitted 7 April, 2026;
originally announced April 2026.
-
Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw
Authors:
Zijun Wang,
Haoqin Tu,
Letian Zhang,
Hardy Chen,
Juncheng Wu,
Xiangyan Liu,
Zhenlong Yuan,
Tianyu Pang,
Michael Qizhe Shieh,
Fengze Liu,
Zeyu Zheng,
Huaxiu Yao,
Yuyin Zhou,
Cihang Xie
Abstract:
OpenClaw, the most widely deployed personal AI agent in early 2026, operates with full local system access and integrates with sensitive services such as Gmail, Stripe, and the filesystem. While these broad privileges enable high levels of automation and powerful personalization, they also expose a substantial attack surface that existing sandboxed evaluations fail to capture. To address this gap, we present the first real-world safety evaluation of OpenClaw and introduce the CIK taxonomy, which unifies an agent's persistent state into three dimensions, i.e., Capability, Identity, and Knowledge, for safety analysis. Our evaluations cover 12 attack scenarios on a live OpenClaw instance across four backbone models (Claude Sonnet 4.5, Opus 4.6, Gemini 3.1 Pro, and GPT-5.4). The results show that poisoning any single CIK dimension increases the average attack success rate from 24.6% to 64-74%, with even the most robust model exhibiting more than a threefold increase over its baseline vulnerability. We further assess three CIK-aligned defense strategies alongside a file-protection mechanism; however, the strongest defense still yields a 63.8% success rate under Capability-targeted attacks, while file protection blocks 97% of malicious injections but also prevents legitimate updates. Taken together, these findings show that the vulnerabilities are inherent to the agent architecture, necessitating more systematic safeguards to secure personal AI agents. Our project page is https://ucsc-vlaa.github.io/CIK-Bench.
Submitted 6 April, 2026;
originally announced April 2026.
-
ClawArena: Benchmarking AI Agents in Evolving Information Environments
Authors:
Haonian Ji,
Kaiwen Xiong,
Siwei Han,
Peng Xia,
Shi Qiu,
Yiyang Zhou,
Jiaqi Liu,
Jinlong Li,
Bingzhou Li,
Zeyu Zheng,
Cihang Xie,
Huaxiu Yao
Abstract:
AI agents deployed as persistent assistants must maintain correct beliefs as their information environment evolves. In practice, evidence is scattered across heterogeneous sources that often contradict one another, new information can invalidate earlier conclusions, and user preferences surface through corrections rather than explicit instructions. Existing benchmarks largely assume static, single-authority settings and do not evaluate whether agents can keep up with this complexity. We introduce ClawArena, a benchmark for evaluating AI agents in evolving information environments. Each scenario maintains a complete hidden ground truth while exposing the agent only to noisy, partial, and sometimes contradictory traces across multi-channel sessions, workspace files, and staged updates. Evaluation is organized around three coupled challenges: multi-source conflict reasoning, dynamic belief revision, and implicit personalization, whose interactions yield a 14-category question taxonomy. Two question formats, multi-choice (set-selection) and shell-based executable checks, test both reasoning and workspace grounding. The current release contains 64 scenarios across 8 professional domains, totaling 1,879 evaluation rounds and 365 dynamic updates. Experiments on five agent frameworks and five language models show that both model capability (15.4% range) and framework design (9.2%) substantially affect performance, that self-evolving skill frameworks can partially close model-capability gaps, and that belief revision difficulty is determined by update design strategy rather than the mere presence of updates. Code is available at https://github.com/aiming-lab/ClawArena.
Submitted 5 April, 2026;
originally announced April 2026.
-
Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR
Authors:
Zhennan Lin,
Shuai Wang,
Zhaokai Sun,
Pengyuan Xie,
Chuan Xie,
Jie Liu,
Qiang Zhang,
Lei Xie
Abstract:
Transcribing and understanding multi-speaker conversations requires speech recognition, speaker attribution, and timestamp localization. While speech LLMs excel at single-speaker tasks, multi-speaker scenarios remain challenging due to overlapping speech, backchannels, rapid turn-taking, and context window constraints. We propose Speaker-Reasoner, an end-to-end Speech LLM with agentic multi-turn temporal reasoning. Instead of single-pass inference, the model iteratively analyzes global audio structure, autonomously predicts temporal boundaries, and performs fine-grained segment analysis, jointly modeling speaker identity, gender, timestamps, and transcription. A speaker-aware cache further extends processing to audio exceeding the training context window. Trained with a three-stage progressive strategy, Speaker-Reasoner achieves consistent improvements over strong baselines on AliMeeting and AISHELL-4 datasets, particularly in handling overlapping speech and complex turn-taking.
Submitted 3 April, 2026;
originally announced April 2026.
-
A new framework for atom-resolved decomposition of second-harmonic generation in nonlinear-optical crystals
Authors:
YingXing Cheng,
Congwei Xie,
Zhihua Yang,
Shili Pan
Abstract:
In this work, we develop a new framework for computing atom-resolved contributions to optical properties based on atoms-in-molecules (AIM) schemes. The formalism is independent of the specific AIM method and is made rigorous by partitioning momentum matrix elements into atomic contributions while exactly satisfying the relevant sum rules. We apply it to second-harmonic generation (SHG) in six representative UV and deep-UV nonlinear-optical crystals, namely $β$-\ce{BaB2O4} (BBO), \ce{LiB3O5} (LBO), \ce{CsB3O5} (CBO), \ce{CsLiB6O10} (CLBO), \ce{KBe2BO3F2} (KBBF), and \ce{LiCs2PO4} (LCPO). The atom-triplet decomposition reveals a clear hierarchy for the largest SHG component of each crystal. In general, two-center terms provide the leading contribution, one-center terms remain comparatively small, and fully three-center terms supply an important secondary contribution. A motif-triplet decomposition further indicates behavior dominated by the anionic framework in KBBF and LBO. In BBO, CBO, and CLBO, contributions from the anionic framework and the cation sublattice act cooperatively, although the cation contribution is crystal dependent. Moreover, cooperative contributions from the phosphate framework and the Cs sublattice are also observed in LCPO, where the O-Cs contribution is particularly significant. These results may provide a new perspective for understanding the microscopic origin of SHG in nonlinear-optical materials.
Submitted 2 April, 2026;
originally announced April 2026.
-
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
Authors:
Jiaqi Liu,
Zipeng Ling,
Shi Qiu,
Yanqing Liu,
Siwei Han,
Peng Xia,
Haoqin Tu,
Zeyu Zheng,
Cihang Xie,
Charles Fleming,
Mingyu Ding,
Huaxiu Yao
Abstract:
AI agents increasingly operate over extended time horizons, yet their ability to retain, organize, and recall multimodal experiences remains a critical bottleneck. Building effective lifelong memory requires navigating a vast design space spanning architecture, retrieval strategies, prompt engineering, and data pipelines; this space is too large and interconnected to be explored effectively by hand or with traditional AutoML. We deploy an autonomous research pipeline to discover Omni-SimpleMem, a unified multimodal memory framework for lifelong AI agents. Starting from a naïve baseline (F1=0.117 on LoCoMo), the pipeline autonomously executes ${\sim}50$ experiments across two benchmarks, diagnosing failure modes, proposing architectural modifications, and repairing data pipeline bugs, all without human intervention in the inner loop. The resulting system achieves state-of-the-art results on both benchmarks, improving F1 by +411% on LoCoMo (0.117$\to$0.598) and +214% on Mem-Gallery (0.254$\to$0.797) relative to the initial configurations. Critically, the most impactful discoveries are not hyperparameter adjustments: bug fixes (+175%), architectural changes (+44%), and prompt engineering (+188% on specific categories) each individually exceed the cumulative contribution of all hyperparameter tuning, demonstrating capabilities fundamentally beyond the reach of traditional AutoML. We provide a taxonomy of six discovery types and identify four properties that make multimodal memory particularly suited for autoresearch, offering guidance for applying autonomous research pipelines to other AI system domains. Code is available at https://github.com/aiming-lab/SimpleMem.
Submitted 2 April, 2026; v1 submitted 1 April, 2026;
originally announced April 2026.
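The percentage gains quoted in the Omni-SimpleMem abstract above are relative improvements over the initial F1 scores, which can be checked with a few lines of arithmetic. The helper name `relative_gain_percent` is illustrative and not taken from the paper or its code.

```python
def relative_gain_percent(before, after):
    """Relative improvement of `after` over `before`, in percent."""
    return 100.0 * (after - before) / before


# F1 scores as quoted in the abstract (initial -> final).
locomo_gain = relative_gain_percent(0.117, 0.598)
mem_gallery_gain = relative_gain_percent(0.254, 0.797)

print(f"LoCoMo: +{locomo_gain:.0f}%")
print(f"Mem-Gallery: +{mem_gallery_gain:.0f}%")
```

Rounded to the nearest percent, these reproduce the +411% and +214% figures, confirming that the gains are measured relative to the baseline rather than as absolute F1 differences.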
-
Omni-MMSI: Toward Identity-attributed Social Interaction Understanding
Authors:
Xinpeng Li,
Bolin Lai,
Hardy Chen,
Shijian Deng,
Cihang Xie,
Yuyin Zhou,
James Matthew Rehg,
Yapeng Tian
Abstract:
We introduce Omni-MMSI, a new task that requires comprehensive social interaction understanding from raw audio, vision, and speech input. The task involves perceiving identity-attributed social cues (e.g., who is saying what) and reasoning about the social interaction (e.g., whom the speaker refers to). This task is essential for developing AI assistants that can perceive and respond to human interactions. Unlike prior studies that operate on oracle-preprocessed social cues, Omni-MMSI reflects realistic scenarios where AI assistants must perceive and reason from raw data. However, existing pipelines and multi-modal LLMs perform poorly on Omni-MMSI because they lack reliable identity attribution capabilities, which leads to inaccurate social interaction understanding. To address this challenge, we propose Omni-MMSI-R, a reference-guided pipeline that produces identity-attributed social cues with tools and conducts chain-of-thought social reasoning. To facilitate this pipeline, we construct participant-level reference pairs and curate reasoning annotations on top of the existing datasets. Experiments demonstrate that Omni-MMSI-R outperforms advanced LLMs and counterparts on Omni-MMSI. Project page: https://sampson-lee.github.io/omni-mmsi-project-page.
Submitted 31 March, 2026;
originally announced April 2026.
-
First energy scan measurement of $e^{+}e^{-}\to K^{+}K^{-}$ around the $ψ(2S)$ resonance
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (683 additional authors not shown)
Abstract:
We report the first measurement of the $e^{+}e^{-}\to K^{+}K^{-}$ cross sections around the $ψ(2S)$ resonance using the energy scan method. The analysis is based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 495 pb$^{-1}$ collected with the BESIII detector at BEPCII. By analyzing the cross section line-shape, we extract the relative phase $Φ$ between the strong and electromagnetic amplitudes of the $ψ(2S)$ resonance, a fundamental parameter in charmonium physics, based on the assumption that the relative phase between the electromagnetic amplitude of the $ψ(2S)$ resonance and the continuum is zero. Two distinct solutions for the branching fraction $\mathcal{B}$ of $ψ(2S)\to K^{+}K^{-}$ are observed: a constructive interference solution with $\mathcal{B}=(7.49\pm0.41)\times10^{-5}$ and $Φ=(110.1 \pm6.7)^\circ$, and a destructive interference solution with $\mathcal{B}=(10.94\pm0.48)\times10^{-5}$ and $Φ=(-106.8\pm5.7)^\circ$. A significant correlation between $Φ$ and $\mathcal{B}$ is established, demonstrating that interference effects must be taken into account in the $ψ(2S)$ branching fraction measurements. Additionally, we report the first results for both the $ψ(2S)$ strong form factor, which characterizes the strong coupling between $ψ(2S)$ and $K^{+}K^{-}$, and the energy-dependent electromagnetic form factor of the charged kaon in this energy region.
Submitted 31 March, 2026;
originally announced March 2026.
-
IMAGAgent: Orchestrating Multi-Turn Image Editing via Constraint-Aware Planning and Reflection
Authors:
Fei Shen,
Chengyu Xie,
Lihong Wang,
Zhanyi Zhang,
Xin Jiang,
Xiaoyu Du,
Jinhui Tang
Abstract:
Existing multi-turn image editing paradigms are often confined to isolated single-step execution. Due to a lack of context-awareness and closed-loop feedback mechanisms, they are prone to error accumulation and semantic drift during multi-turn interactions, ultimately resulting in severe structural distortion of the generated images. To address this, we propose IMAGAgent, a multi-turn image editing agent framework based on a "plan-execute-reflect" closed-loop mechanism that achieves deep synergy among instruction parsing, tool scheduling, and adaptive correction within a unified pipeline. Specifically, we first present a constraint-aware planning module that leverages a vision-language model (VLM) to precisely decompose complex natural language instructions into a series of executable sub-tasks, governed by target singularity, semantic atomicity, and visual perceptibility. Then, the tool-chain orchestration module dynamically constructs execution paths based on the current image, the current sub-task, and the historical context, enabling adaptive scheduling and collaborative operation among heterogeneous operation models covering image retrieval, segmentation, detection, and editing. Finally, we devise a multi-expert collaborative reflection mechanism where a central large language model (LLM) receives the image to be edited and synthesizes VLM critiques into holistic feedback, simultaneously triggering fine-grained self-correction and recording feedback outcomes to optimize future decisions. Extensive experiments on our constructed MTEditBench and the MagicBrush dataset demonstrate that IMAGAgent achieves performance significantly superior to existing methods in terms of instruction consistency, editing precision, and overall quality. The code is available at https://github.com/hackermmzz/IMAGAgent.git.
Submitted 11 February, 2026;
originally announced March 2026.
-
Pandora: Articulated 3D Scene Graphs from Egocentric Vision
Authors:
Alan Yu,
Yun Chang,
Christopher Xie,
Luca Carlone
Abstract:
Robotic mapping systems typically approach building metric-semantic scene representations from the robot's own sensors and cameras. However, these "first person" maps inherit the robot's own limitations due to its embodiment or skillset, which may leave many aspects of the environment unexplored. For example, the robot might not be able to open drawers or access wall cabinets. In this sense, the map representation is not as complete, and requires a more capable robot to fill in the gaps. We narrow these blind spots in current methods by leveraging egocentric data captured as a human naturally explores a scene wearing Project Aria glasses, giving a way to directly transfer knowledge about articulation from the human to any deployable robot. We demonstrate that, by using simple heuristics, we can leverage egocentric data to recover models of articulate object parts, with quality comparable to those of state-of-the-art methods based on other input modalities. We also show how to integrate these models into 3D scene graph representations, leading to a better understanding of object dynamics and object-container relationships. We finally demonstrate that these articulated 3D scene graphs enhance a robot's ability to perform mobile manipulation tasks, showcasing an application where a Boston Dynamics Spot is tasked with retrieving concealed target items, given only the 3D scene graph as input.
Submitted 30 March, 2026;
originally announced March 2026.
-
Observation of $Λ^+_c\to nπ^+η$ and search for $Λ^+_c\to na_0(980)^+$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (722 additional authors not shown)
Abstract:
By analysing 6.1 ${\rm fb}^{-1}$ of data collected at center-of-mass energies between $\sqrt{s}=4.600$ and 4.843 $\rm GeV$ with the BESIII detector at the BEPCII collider, we observe the decay $Λ_c^+\to nπ^+η$ for the first time with a statistical significance of $9.5σ$. The ratio of branching fractions $\mathcal{B}(Λ_c^+\to nπ^+η)/\mathcal{B}(Λ_c^+\to Λπ^+η)$ is measured to be $0.155\pm0.031_{\rm stat.}\pm0.012_{\rm syst.}$. Taking the world average of $\mathcal{B}(Λ_c^+\to Λπ^+η)$ as reference, the absolute branching fraction is calculated to be $\mathcal{B}(Λ_c^+\to nπ^+η)=(2.94\pm0.59_{\rm stat.}\pm0.23_{\rm syst.}\pm0.13_{\rm ref.})\times10^{-3}$. The intermediate process $Λ_c^+\to na_0(980)^+$ is also searched for in the $π^+η$ invariant mass spectrum. Since no significant signal is found, the upper limit on $\mathcal{B}(Λ_c^+\to na_0(980)^+)\times\mathcal{B}(a_0(980)^+\toπ^+η)$ is set to $8.4\times10^{-4}$ at 90\% confidence level. A sophisticated deep learning approach using a Transformer-based architecture is employed to distinguish signals from prevalent hadronic backgrounds, complemented by thorough validation and systematic uncertainty quantification.
Submitted 30 March, 2026;
originally announced March 2026.
-
AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation
Authors:
Zhaohe Liao,
Kaixun Jiang,
Zhihang Liu,
Yujie Wei,
Junqiu Yu,
Quanhao Li,
Hong-Tao Yu,
Pandeng Li,
Yuzheng Wang,
Zhen Xing,
Shiwei Zhang,
Chen-Wei Xie,
Yun Zheng,
Xihui Liu
Abstract:
Although image generation has boosted various applications via its rapid evolution, whether state-of-the-art models are able to produce ready-to-use academic illustrations for papers is still largely unexplored. Directly comparing or evaluating the illustration with a VLM is naive but requires oracle multi-modal understanding ability, which is unreliable for long and complex texts and illustrations. To address this, we propose AIBench, the first benchmark using VQA for evaluating the logic correctness of academic illustrations and VLMs for assessing aesthetics. In detail, we design four levels of questions derived from a logic diagram summarized from the method part of the paper, which query whether the generated illustration aligns with the paper on different scales. Our VQA-based approach yields more accurate and detailed evaluations of visual-logical consistency while relying less on the ability of the judge VLM. With our high-quality AIBench, we conduct extensive experiments and conclude that the performance gap between models on this task is significantly larger than on general ones, reflecting their varying complex reasoning and high-density generation abilities. Further, logic and aesthetics are hard to optimize simultaneously, as in handcrafted illustrations. Additional experiments show that test-time scaling on both abilities significantly boosts performance on this task.
Submitted 30 March, 2026; v1 submitted 30 March, 2026;
originally announced March 2026.
-
Wan-Weaver: Interleaved Multi-modal Generation via Decoupled Training
Authors:
Jinbo Xing,
Zeyinzi Jiang,
Yuxiang Tuo,
Chaojie Mao,
Xiaotang Gai,
Xi Chen,
Jingfeng Zhang,
Yulin Pan,
Zhen Han,
Jie Xiao,
Keyu Yan,
Chenwei Xie,
Chongyang Zhong,
Kai Zhu,
Tong Shen,
Lianghua Huang,
Yu Liu,
Yujiu Yang
Abstract:
Recent unified models have made unprecedented progress in both understanding and generation. However, while most of them accept multi-modal inputs, they typically produce only single-modality outputs. This challenge of producing interleaved content is mainly due to training data scarcity and the difficulty of modeling long-range cross-modal context. To address this issue, we decompose interleaved generation into textual planning and visual consistency modeling, and introduce a framework consisting of a planner and a visualizer. The planner produces dense textual descriptions for visual content, while the visualizer synthesizes images accordingly. Under this guidance, we construct large-scale textual-proxy interleaved data (where visual content is represented in text) to train the planner, and curate reference-guided image data to train the visualizer. These designs give rise to Wan-Weaver, which exhibits emergent interleaved generation ability with long-range textual coherence and visual consistency. Meanwhile, the integration of diverse understanding and generation data into planner training enables Wan-Weaver to achieve robust task reasoning and generation proficiency. To assess the model's capability in interleaved generation, we further construct a benchmark that spans a wide range of use cases across multiple dimensions. Extensive experiments demonstrate that, even without access to any real interleaved data, Wan-Weaver achieves superior performance over existing methods.
Submitted 29 March, 2026; v1 submitted 26 March, 2026;
originally announced March 2026.
-
Amplitude analysis and branching fraction measurement of the decay $D^0 \to K^+K^-π^0π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
M. S. Anderson,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone
, et al. (749 additional authors not shown)
Abstract:
An amplitude analysis of the singly Cabibbo-suppressed decay $D^0 \to K^+ K^- π^0 π^0$ is performed, for the first time, to determine the relative magnitudes and phases of different intermediate processes. The analysis uses $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy 3.773~GeV corresponding to an integrated luminosity of 20.3 $\rm fb^{-1}$. The absolute branching fraction of $D^0 \to K^+ K^- π^0 π^0$ is measured to be \BF. The dominant intermediate process is $D^0 \to K^{*}(892)^+K^{*}(892)^-$, with a branching fraction of $(2.79 \pm 0.13_{\rm{stat.}} \pm 0.11_{\rm{syst.}}) \times 10^{-3}$. Amplitude analysis reveals that the $D^0 \to K^{*}(892)^+K^{*}(892)^-$ decay is S-wave dominant. The longitudinal polarization fraction of $D^0 \to K^{*}(892)^+ K^{*}(892)^-$ is measured to be $0.468\pm0.046_{\rm{stat.}}\pm0.011_{\rm{syst.}}$.
Submitted 30 March, 2026; v1 submitted 26 March, 2026;
originally announced March 2026.
-
CSI-tuples-based 3D Channel Fingerprints Construction Assisted by MultiModal Learning
Authors:
Chenjie Xie,
Li You,
Ruirong Chen,
Gaoning He,
Xiqi Gao
Abstract:
Low-altitude communications can promote the integration of aerial and terrestrial wireless resources, expand network coverage, and enhance transmission quality, thereby empowering the development of sixth-generation (6G) mobile communications. As an enabler for low-altitude transmission, 3D channel fingerprints (3D-CF), also referred to as the 3D radio map or 3D channel knowledge map, are expected to enhance the understanding of communication environments and assist in the acquisition of channel state information (CSI), thereby avoiding repeated estimations and reducing computational complexity. In this paper, we propose a modularized multimodal framework to construct 3D-CF. Specifically, we first establish the 3D-CF model as a collection of CSI-tuples based on Rician fading channels, with each tuple comprising the low-altitude vehicle's (LAV) positions and its corresponding statistical CSI. In consideration of the heterogeneous structures of different prior data, we formulate the 3D-CF construction problem as a multimodal regression task, where the target channel information in the CSI-tuple can be estimated directly by its corresponding LAV positions, together with communication measurements and geographic environment maps. Then, a high-efficiency multimodal framework is proposed accordingly, which includes a correlation-based multimodal fusion (Corr-MMF) module, a multimodal representation (MMR) module, and a CSI regression (CSI-R) module. Numerical results show that our proposed framework can efficiently construct 3D-CF and achieve at least 27.5% higher accuracy than the state-of-the-art algorithms under different communication scenarios, demonstrating its competitive performance and excellent generalization ability. We also analyze the computational complexity and illustrate its superiority in terms of the inference time.
Submitted 26 March, 2026;
originally announced March 2026.
-
Cross Section Measurements of $\bar{n}p \rightarrow K^{+}K^{-}π^{+}(π^{0})$ via Antineutrons Produced by $J/ψ\to p π^{-} \bar{n}$ Decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (737 additional authors not shown)
Abstract:
Based on a novel method for producing antineutrons via $J/ψ$ decays, we report a study of $\bar{n}p$ inelastic scattering into final states containing kaons. The analysis uses $(10087\pm44)\times 10^6$ $J/ψ$ events collected at the BESIII detector operating at the BEPCII storage ring. Antineutrons are produced via $J/ψ\to p π^{-} \bar{n}$ decays and tagged by the detected protons and pions, resulting in antineutron momenta ranging from 0 to 1174~MeV/$c$, while target protons are provided by the hydrogen in the beam-pipe material. The cross sections of the reactions $\bar{n}p \rightarrow K^{+}K^{-}π^{+}$ and $\bar{n}p \rightarrow K^{+}K^{-}π^{+}π^{0}$ are measured to be $0.53^{+0.15}_{-0.12} \pm 0.08$~mb and $1.09^{+0.36}_{-0.30} \pm 0.31$~mb respectively, where the first uncertainties are statistical and the second systematic. Due to limited statistics, the intermediate states in these processes are not investigated. The observation of clean antineutron-proton scattering events indicates the potential of this approach for future investigations of antineutron-proton interactions.
Submitted 25 March, 2026;
originally announced March 2026.
-
Electron Dynamics Reconstruction and Nontrivial Transport by Acoustic Waves
Authors:
Zi-Qian Zhou,
Zhi-Fan Zhang,
Cong Xiao,
Hua Jiang,
X. C. Xie
Abstract:
Surface acoustic waves (SAWs) have become a popular driving source in modern condensed matter physics, but most existing theories simplify them as electric fields and ignore the non-uniform Brillouin zone folding effect. We develop a semiclassical framework and reconstruct the electron dynamics by treating the SAW as a quasi-periodic potential modulating the electronic momentum distribution. This framework naturally explains the experimentally observed DC drag current and predicts an acousto-electric Hall effect. The theory further reveals various SAW-driven transport phenomena, with anomalous Hall, thermal Hall, and Nernst effects emerging within time-reversal symmetric systems. Illustrated in bilayer graphene and $\mathrm{MX_2}$ (M = Mo, W; X = S, Se, Te), the angular-dependent acousto-electric Hall effect provides an experimental probe for the Berry curvature distribution.
Submitted 25 March, 2026;
originally announced March 2026.
-
Kirchhoff-Inspired Neural Networks for Evolving High-Order Perception
Authors:
Tongfei Chen,
Jingying Yang,
Linlin Yang,
Jinhu Lü,
David Doermann,
Chunyu Xie,
Long He,
Tian Wang,
Juan Zhang,
Guodong Guo,
Baochang Zhang
Abstract:
Deep learning architectures are fundamentally inspired by neuroscience, particularly the structure of the brain's sensory pathways, and have achieved remarkable success in learning informative data representations. Although these architectures mimic the communication mechanisms of biological neurons, their strategies for information encoding and transmission are fundamentally distinct. Biological systems depend on dynamic fluctuations in membrane potential; by contrast, conventional deep networks optimize weights and biases by adjusting the strengths of inter-neural connections, lacking a systematic mechanism to jointly characterize the interplay among signal intensity, coupling structure, and state evolution. To tackle this limitation, we propose the Kirchhoff-Inspired Neural Network (KINN), a state-variable-based network architecture constructed based on Kirchhoff's current law. KINN derives numerically stable state updates from fundamental ordinary differential equations, enabling the explicit decoupling and encoding of higher-order evolutionary components within a single layer while preserving physical consistency, interpretability, and end-to-end trainability. Extensive experiments on partial differential equation (PDE) solving and ImageNet image classification validate that KINN outperforms state-of-the-art existing methods.
Submitted 25 March, 2026;
originally announced March 2026.
-
Amplitude Analysis of the Isospin-Violating Decay $J/ψ\rightarrowγηπ^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. -R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (736 additional authors not shown)
Abstract:
Using $(10087 \pm 44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, we perform the first amplitude analysis of the process $J/ψ\toγηπ^{0}$. The decay is dominated by the intermediate processes $J/ψ\toπ^{0} \bo \left( \toγη\right)$, $J/ψ\toπ^{0}ρ(1450)^0 \left( \toγη\right)$ and $J/ψ\toηh_1(1170) \left( \toγπ^{0}\right)$. Contributions from $J/ψ\toγa_0(980)^0(\toηπ^{0})$, $J/ψ\toγa_2(1320)^0(\toηπ^{0})$ and $J/ψ\toγa_2(1700)^0(\toηπ^{0})$ are observed with a statistical significance exceeding $5σ$, constituting the first observation of radiative transitions of $J/ψ$ to isospin-triplet scalar mesons. The total branching fraction of $J/ψ\toγηπ^{0}$ is measured to be $(25.7\pm0.3\pm1.5)\times10^{-6}$, where the first uncertainty is statistical and the second systematic. This result is consistent with the previous measurement, with the precision improved by more than a factor of two.
Submitted 24 March, 2026;
originally announced March 2026.
-
Search for the radiative decays $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (678 additional authors not shown)
Abstract:
A search for the radiative decays $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$ is conducted using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and upper limits on the branching fractions of $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$ at 90\% confidence level are determined to be $7.7\times10^{-4}$ and $3.9\times10^{-5}$, respectively. This represents the first test of the Vector Meson Dominance mechanism in the radiative decays of charmed mesons to axial-vector mesons.
Submitted 24 March, 2026;
originally announced March 2026.
-
OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement
Authors:
Jingbin Hu,
Haoyu Zhang,
Dake Guo,
Qirui Zhan,
Wenhao Li,
Huakang Chen,
Guobin Ma,
Hanke Xie,
Chengyou Wang,
Pengyuan Xie,
Chuan Xie,
Qiang Zhang,
Lei Xie
Abstract:
Large Language Models (LLMs) have advanced audio generation through discrete representation learning. However, most existing neural codecs focus on speech and emphasize reconstruction fidelity, overlooking unified low frame rate modeling across diverse audio domains, including speech, music, and general sound. Moreover, high reconstruction quality does not necessarily yield semantically informative representations, limiting effectiveness in downstream generation tasks. We propose OmniCodec, a universal neural audio codec tailored for low frame rate. It adopts a hierarchical multi-codebook design with semantic-acoustic decoupling by leveraging the audio encoder of the pre-trained understanding model, along with a self-guidance strategy to improve codebook utilization and reconstruction. Compared with the Mimi codec, experiments show that OmniCodec achieves outstanding performance at the same bitrate, delivering superior reconstruction quality while also providing more semantically informative representations that benefit downstream generation tasks. Our model and code will be open-sourced. Our demo page is available.
Submitted 21 March, 2026;
originally announced March 2026.
-
Lithium in Wide Binaries: Effective Temperature Governs Depletion while Rotation Plays a Minor Role
Authors:
Cheng-Cheng Xie,
Hai-Jun Tian,
Jian-Rong Shi,
Ze-Ming Zhou
Abstract:
Using a sample of 116 wide binary systems as coeval and chemically homogeneous stellar pairs, we investigate the factors governing lithium depletion in main-sequence stars. We recover the well-established morphology of the lithium--effective temperature ($T_{\mathrm{eff}}$) relation, including the Li dip (6200--6600\,K), the Li plateau (6000--6200\,K), and a linear trend for cooler stars ($T_{\mathrm{eff}}$ $<$ 6000\,K), where lithium abundance increases by $\sim$0.15\,dex per 100\,K. We demonstrate that the apparent correlation between projected rotational velocity ($v\sin i$) and lithium abundance is secondary to the underlying $T_{\mathrm{eff}}$ dependence; $v\sin i$ is not an independent driver of lithium depletion in our sample. Notably, we identify an anomalous system within the Li dip where the primary star exhibits a $\sim$1.4\,dex lithium excess compared to its secondary companion at nearly identical $T_{\mathrm{eff}}$. We discuss two plausible origins for this anomaly: external enrichment via planetesimal accretion or planetary engulfment, and binary interactions with an unresolved tertiary companion. Our results confirm $T_{\mathrm{eff}}$ as the dominant parameter controlling lithium depletion, while highlighting that additional, non-standard processes can occasionally produce significant lithium enrichment.
Submitted 19 March, 2026;
originally announced March 2026.
-
Observation of $D_s^+ \to a_0(980)^+f_0(500)$ in the Amplitude Analysis of $D_s^+ \to π^+ π^0 π^0 η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (719 additional authors not shown)
Abstract:
We report the first observation of the decay $D_s^+ \to π^+π^0π^0η$ in a data set corresponding to an integrated luminosity of 7.33 fb$^{-1}$, collected in $e^+e^-$ collisions by the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV. An unexpectedly large branching fraction $\mathcal{B}( D_s^+ \to a_0(980)^+ f_0(500), a_0(980)^+ \to π^+η, f_0(500)\to π^0π^0) = (0.98 \pm 0.16_{\rm{stat.}} \pm 0.22_{\rm{syst.}})\%$ is measured with a significance exceeding $10σ$, offering new constraints on the internal structure of light scalar mesons. The dominant intermediate process is $D_s^+ \to a_1(1260)^+η, a_1(1260)^+\to ρ(770)^+π^0$ with a branching fraction of $(1.77 \pm 0.21_{\rm stat.} \pm 0.12_{\rm syst.})\%$. The isospin symmetry has been validated to the decays of $a_1(1260)^+\to ρ(770)^0π^+$ and $a_1(1260)^+\to ρ(770)^+π^0$. Moreover, the measured $\mathcal{B}(D_s^+\to π^+π^0π^0η|_{\rm{non}-η^\prime})=(2.97 \pm 0.23_{\rm stat.} \pm 0.14_{\rm sys.})$ reduces the undetected $D_s^+ \to ηX$ decay branching fractions to (0.1 $\pm$ 3.1)\%.
Submitted 19 March, 2026;
originally announced March 2026.
-
Global Uniqueness of Subsonic Flows for the Steady Euler-Poisson System
Authors:
Myoungjean Bae,
Ben Duan,
Chunjing Xie
Abstract:
We prove the global uniqueness of subsonic solutions to the steady Euler-Poisson system in a bounded domain. Previous works established the existence and local uniqueness of multidimensional subsonic flows by constructing solutions as small perturbations of one-dimensional background states, where a contraction mapping argument applies in a small perturbation regime. In contrast, the present paper removes the smallness assumption and proves global uniqueness within a class of subsonic solutions satisfying the same boundary data.
Submitted 18 March, 2026;
originally announced March 2026.
-
A New Fractional Step Structure Preserving Method for The Landau-Lifshitz-Gilbert Equation
Authors:
Changjian Xie
Abstract:
In this paper, we propose a structure-preserving method that uses a Crank-Nicolson-type scheme with an implicit Gauss-Seidel fractional iteration. The method is first-order accurate in time and second-order accurate in space, stable, and length-preserving. The proposed method also offers significant benefits for theoretical analysis. Numerical accuracy, norm preservation, and stability are verified in 1D and 3D tests.
Submitted 18 March, 2026;
originally announced March 2026.
-
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Authors:
Peng Xia,
Jianwen Chen,
Xinyu Yang,
Haoqin Tu,
Jiaqi Liu,
Kaiwen Xiong,
Siwei Han,
Shi Qiu,
Haonian Ji,
Yuyin Zhou,
Zeyu Zheng,
Cihang Xie,
Huaxiu Yao
Abstract:
Large language model (LLM) agents are increasingly used for complex tasks, yet deployed agents often remain static, failing to adapt as user needs evolve. This creates a tension between the need for continuous service and the necessity of updating capabilities to match shifting task distributions. On platforms like OpenClaw, which handle diverse workloads across 20+ channels, existing methods either store raw trajectories without distilling knowledge, maintain static skill libraries, or require disruptive downtime for retraining. We present MetaClaw, a continual meta-learning framework that jointly evolves a base LLM policy and a library of reusable behavioral skills. MetaClaw employs two complementary mechanisms. Skill-driven fast adaptation analyzes failure trajectories via an LLM evolver to synthesize new skills, enabling immediate improvement with zero downtime. Opportunistic policy optimization performs gradient-based updates via cloud LoRA fine-tuning and Reinforcement Learning with a Process Reward Model (RL-PRM). This is triggered during user-inactive windows by the Opportunistic Meta-Learning Scheduler (OMLS), which monitors system inactivity and calendar data. These mechanisms are mutually reinforcing: a refined policy generates better trajectories for skill synthesis, while richer skills provide higher-quality data for policy optimization. To prevent data contamination, a versioning mechanism separates support and query data. Built on a proxy-based architecture, MetaClaw scales to production-size LLMs without local GPUs. Experiments on MetaClaw-Bench and AutoResearchClaw show that skill-driven adaptation improves accuracy by up to 32% relative. The full pipeline advances Kimi-K2.5 accuracy from 21.4% to 40.6% and increases composite robustness by 18.3%. Code is available at https://github.com/aiming-lab/MetaClaw.
Submitted 17 March, 2026;
originally announced March 2026.
-
Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation
Authors:
Jiawei Mao,
Hardy Chen,
Haoqin Tu,
Yuhan Wang,
Letian Zhang,
Zeyu Zheng,
Huaxiu Yao,
Zirui Wang,
Cihang Xie,
Yuyin Zhou
Abstract:
Large vision-language models (LVLMs) have become increasingly strong but remain prone to hallucinations in multimodal tasks, which significantly narrows their deployment. As training these LVLMs to avoid hallucinations becomes prohibitively expensive for larger models, training-free methods offer a cheap and flexible solution to this problem, yet existing approaches based on decoding or tool use often bring limited gains and/or weak interpretability. We propose Kestrel, a training-free framework for LVLM hallucination mitigation that combines an explicit visual-grounding agent with an evidence-verified self-refinement mechanism. In detail, Kestrel first collects explicit visual evidence and converts tool outputs into reusable and structured textual evidence. Second, to take full advantage of this evidence, Kestrel verifies it via an LVLM judge, then iteratively self-refines answers based on the verified evidence to reduce the risk of over-correction. Extensive experiments show that Kestrel improves performance over strong baselines across hallucination benchmarks (e.g., average +3.31% on POPE and +28.34 on MME-Hallucination with Qwen3-VL), while providing transparent verification traces for hallucination diagnosis and analysis -- e.g., with both the integrated self-refinement module and the grounding agent contributing an average +2.0% gain on POPE.
Submitted 17 March, 2026;
originally announced March 2026.
-
FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection
Authors:
Jie Li,
Yingying Feng,
Chi Xie,
Jie Hu,
Lei Tan,
Jiayi Ji
Abstract:
The remarkable realism of images generated by diffusion models poses critical detection challenges. Current methods utilize reconstruction error as a discriminative feature, exploiting the observation that real images exhibit higher reconstruction errors when processed through diffusion models. However, these approaches require costly reconstruction computations and depend on specific diffusion models, making their performance highly model-dependent. We identify a fundamental difference: real images are more difficult to fit with Gaussian distributions compared to synthetic ones. In this paper, we propose Forgery Identification via Noise Disturbance (FIND), a novel method that requires only a simple binary classifier. It eliminates reconstruction by directly targeting the core distributional difference between real and synthetic images. Our key operation is to add Gaussian noise to real images during training and label these noisy versions as synthetic. This step allows the classifier to focus on the statistical patterns that distinguish real from synthetic images. We theoretically prove that the noise-augmented real images resemble diffusion-generated images in their ease of Gaussian fitting. Furthermore, simply by adding noise, they still retain visual similarity to the original images, highlighting the most discriminative distribution-related features. The proposed FIND improves performance by 11.7% on the GenImage benchmark while running 126x faster than existing methods. By removing the need for auxiliary diffusion models and reconstruction, it offers a practical, efficient, and generalizable way to detect diffusion-generated content.
Submitted 15 March, 2026;
originally announced March 2026.
-
TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection
Authors:
Xiong Zhang,
Hong Peng,
Changlong Fu,
Xin Jin,
Yun Yang,
Cheng Xie
Abstract:
A significant number of anomalous nodes in the real world, such as fake news, noncompliant users, malicious transactions, and malicious posts, severely compromises the health of the graph data ecosystem and urgently requires effective identification and processing. With anomalies that span multiple data domains yet exhibit vast differences in features, cross-domain detection models face severe domain shift issues, which limit their generalizability across all domains. This study identifies and quantitatively analyzes a specific feature mismatch pattern exhibited by domain shift in graph anomaly detection, which we define as the \emph{Anomaly Disassortativity} issue ($\mathcal{AD}$). Based on the modeling of the issue $\mathcal{AD}$, we introduce a novel graph foundation model for anomaly detection. It achieves cross-domain generalization in different graphs, requiring only a single training phase to perform effectively across diverse domains. The experimental findings, based on fourteen diverse real-world graphs, confirm a breakthrough in the model's cross-domain adaptation, achieving a pioneering state-of-the-art (SOTA) level in terms of detection accuracy. In summary, the proposed theory of $\mathcal{AD}$ provides a novel theoretical perspective and a practical route for future research in generalist graph anomaly detection (GGAD). The code is available at https://anonymous.4open.science/r/Anonymization-TA-GGAD/.
Submitted 10 March, 2026;
originally announced March 2026.
-
CAST: Modeling Visual State Transitions for Consistent Video Retrieval
Authors:
Yanqing Liu,
Yingcheng Liu,
Fanghong Dong,
Budianto Budianto,
Cihang Xie,
Yan Jiao
Abstract:
As video content creation shifts toward long-form narratives, composing short clips into coherent storylines becomes increasingly important. However, prevailing retrieval formulations remain context-agnostic at inference time, prioritizing local semantic alignment while neglecting state and identity consistency. To address this structural limitation, we formalize the task of Consistent Video Retrieval (CVR) and introduce a diagnostic benchmark spanning YouCook2, COIN, and CrossTask. We propose CAST (Context-Aware State Transition), a lightweight, plug-and-play adapter compatible with diverse frozen vision-language embedding spaces. By predicting a state-conditioned residual update ($Δ$) from visual history, CAST introduces an explicit inductive bias for latent state evolution. Extensive experiments show that CAST improves performance on YouCook2 and CrossTask, remains competitive on COIN, and consistently outperforms zero-shot baselines across diverse foundation backbones. Furthermore, CAST provides a useful reranking signal for black-box video generation candidates (e.g., from Veo), promoting more temporally coherent continuations.
Submitted 9 March, 2026;
originally announced March 2026.
-
Amplitude Analysis of Singly Cabibbo-Suppressed Decay $Λ^{+}_{c}\to p K^{+} K^{-}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (681 additional authors not shown)
Abstract:
Using a sample of $e^{+}e^{-}$ annihilation data corresponding to an integrated luminosity of 4.4 $\rm{fb}^{-1}$ collected with the BESIII detector at the BEPCII collider and produced at center-of-mass energies from $4600$ to $4698~\rm{MeV}$, an amplitude analysis is performed of the singly Cabibbo-suppressed decay $Λ^{+}_{c}\to pK^{+}K^{-}$. The branching fractions of $Λ^{+}_{c}\to pφ(1020)$, $pf_{0}(980)$, $Λ(1405)K^{+}$, and $Λ(1670)K^{+}$ are measured, where the latter two modes are decays that are observed for the first time. At the same time, with the detection efficiency based on the results of the amplitude analysis, the branching fraction of $Λ^{+}_{c}\to pK^{+}K^{-}$ is updated to be $(9.94\pm0.65_{\text{stat.}}\pm0.50_{\text{syst.}})\times10^{-4}$, which is consistent with the current world average value within one standard deviation. The result supersedes the previous BESIII measurement with precision improved by approximately a factor of 1.5.
Submitted 9 March, 2026;
originally announced March 2026.
-
An improved measurement of $η^\prime\rightarrow e^{+}e^{-}ω$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (751 additional authors not shown)
Abstract:
Using a sample of $(10087 \pm 44) \times 10^{6}$ $J/ψ$ events collected with the BESIII detector, an improved measurement of the decay $η^{\prime}\rightarrow e^{+}e^{-}ω$, with $ω\rightarrowπ^{+}π^{-}π^{0}$ and $π^{0}\rightarrowγγ$ is performed. The branching fraction is determined to be $\mathcal{B}(η^{\prime}\rightarrow e^{+}e^{-}ω) = (1.79 \pm 0.09 \pm 0.12) \times 10^{-4}$, where the first uncertainty is statistical and the second is systematic. This result is consistent with the previous measurement and is obtained with significantly improved precision. Furthermore, the first measurement of the transition form factor cutoff parameter for this decay is reported, with $Λ^{-1} = (2.92 \pm 0.83 \pm 0.15)~\text{GeV}^{-1}$. These measurements provide valuable input for understanding the internal structure of the $η^{\prime}$ meson and testing theoretical models.
Submitted 9 March, 2026;
originally announced March 2026.
-
In-Context Reinforcement Learning for Tool Use in Large Language Models
Authors:
Yaoqi Ye,
Yiran Zhao,
Keyu Duan,
Zeyu Zheng,
Kenji Kawaguchi,
Cihang Xie,
Michael Qizhe Shieh
Abstract:
While large language models (LLMs) exhibit strong reasoning abilities, their performance on complex tasks is often constrained by the limitations of their internal knowledge. A compelling approach to overcome this challenge is to augment these models with external tools -- such as Python interpreters for mathematical computations or search engines for retrieving factual information. However, enabling models to use these tools effectively remains a significant challenge. Existing methods typically rely on cold-start pipelines that begin with supervised fine-tuning (SFT), followed by reinforcement learning (RL). These approaches often require substantial amounts of labeled data for SFT, which is expensive to annotate or synthesize. In this work, we propose In-Context Reinforcement Learning (ICRL), an RL-only framework that eliminates the need for SFT by leveraging few-shot prompting during the rollout stage of RL. Specifically, ICRL introduces in-context examples within the rollout prompts to teach the model how to invoke external tools. Furthermore, as training progresses, the number of in-context examples is gradually reduced, eventually reaching a zero-shot setting where the model learns to call tools independently. We conduct extensive experiments across a range of reasoning and tool-use benchmarks. Results show that ICRL achieves state-of-the-art performance, demonstrating its effectiveness as a scalable, data-efficient alternative to traditional SFT-based pipelines.
Submitted 9 March, 2026;
originally announced March 2026.
-
Multi-channel joint analysis of the exotic charmonium-like state $T_{c\bar{c}}(4020)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (700 additional authors not shown)
Abstract:
This paper reports the first multi-channel joint analysis to identify the properties of the exotic charmonium-like state $T_{c\bar{c}}(4020)$ via the electron-positron annihilation process $e^{+}e^{-}\toπ^{+}T_{c\bar{c}}(4020)^{-}+c.c$. A partial wave analysis is performed simultaneously in three decay channels $T_{c\bar{c}}(4020)^{-}\to {D}^{*0}D^{*-}$, $π^{-}J/ψ$, and $π^{-}h_{c}$, based on data samples taken at $\sqrt{s}=4.395$ and $4.416\,\mathrm{GeV}$ with an integrated luminosity of $1598.9\,\mathrm{pb}^{-1}$ collected with the BESIII detector operating on the BEPCII collider. For the first time, the spin-parity of the $T_{c\bar{c}}(4020)^{-}$ is determined to be $J^{P}=1^{+}$ with a significance of $11.7σ$. Pole positions are extracted on the Riemann sheets with three branch points in the complex energy plane. Furthermore, the relative branching fractions are obtained as $\mathcal{B}[T_{c\bar{c}}(4020)^{-}\toπ^{-}J/ψ]/\mathcal{B}[T_{c\bar{c}}(4020)^{-}\to{D}^{*0}D^{*-}]=(3.6\pm0.6\pm1.6)\times10^{-3}$ and $\mathcal{B}[T_{c\bar{c}}(4020)^{-}\toπ^{-}h_{c}]/\mathcal{B}[T_{c\bar{c}}(4020)^{-}\to{D}^{*0}D^{*-}]=(8.9\pm1.3\pm2.3)\times10^{-2}$, where the first uncertainties are statistical and the second are systematic.
Submitted 5 March, 2026;
originally announced March 2026.
-
Impact of scissors-correction schemes on first-principles calculations of second-harmonic generation in ultraviolet nonlinear-optical crystals
Authors:
YingXing Cheng,
Congwei Xie,
Zhihua Yang,
Shilie Pan
Abstract:
In this work, we assess two widely used scissors-correction schemes for first-principles calculations of second-harmonic generation in representative borate and phosphate ultraviolet nonlinear-optical (UV-NLO) crystals, namely scheme-L [Phys.\ Rev.\ Lett.\ \textbf{63}, 1719 (1989)] and scheme-N [Phys.\ Rev.\ B \textbf{72}, 045223 (2005)]. To enable controlled and numerically robust comparisons, we derive a unified static-limit formulation that avoids spurious divergences and is applicable to both schemes, thereby extending earlier static-limit treatments that were effectively restricted to scheme-L. Benchmark calculations show that both schemes largely preserve the spectral line shape while mainly rescaling the overall response. Scheme-N systematically yields 15\%--25\% larger SHG magnitudes than scheme-L, although for some tensor components and experimental datasets scheme-L shows closer agreement with experiment. We further show that Kleinman symmetry is satisfied in the static limit at the level of the formal theory, whereas apparent violations in practical calculations arise mainly from the numerical approximation used to evaluate generalized derivatives.
Submitted 23 March, 2026; v1 submitted 5 March, 2026;
originally announced March 2026.
-
RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform
Authors:
Kenan Li,
Rongzhi Li,
Linghao Zhang,
Qirui Jin,
Liao Zhu,
Xiaosong Huang,
Geng Zhang,
Yikai Zhang,
Shilin He,
Chengxing Xie,
Xin Zhang,
Zijian Jin,
Bowen Li,
Chaoyun Zhang,
Yu Kang,
Yufan Huang,
Elsie Nallipogu,
Saravan Rajmohan,
Qingwei Lin,
Dongmei Zhang
Abstract:
Building software repositories typically requires significant manual effort. Recent advances in large language model (LLM) agents have accelerated automation in software engineering (SWE). We introduce RepoLaunch, the first agent capable of automatically resolving dependencies, compiling source code, and extracting test results for repositories across arbitrary programming languages and operating systems. To demonstrate its utility, we further propose a fully automated pipeline for SWE dataset creation, where task design is the only human intervention. RepoLaunch automates the remaining steps, enabling scalable benchmarking and training of coding agents and LLMs. Notably, several works on agentic benchmarking and training have recently adopted RepoLaunch for automated task generation.
Submitted 5 March, 2026;
originally announced March 2026.
-
Precise measurement of the form factors in $D^0\rightarrow K^*(892)^-\ell^+ν_{\ell}$ and observation of $D^0\rightarrow K_2^*(1430)^-\ell^+ν_{\ell}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (678 additional authors not shown)
Abstract:
We report a study of the semileptonic decays $D^0 \rightarrow \bar{K}^0π^-\ell^+ν_{\ell}$ (where $\ell=e,~μ$) based on a sample of $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. Based on an investigation of the decay dynamics in $D^0 \rightarrow \bar{K}^0π^-\ell^+ν_{\ell}$ decays, a $\mathcal{D}-$wave component of $D^0\rightarrow K_2^*(1430)^-\ell^+ν_{\ell}$ is observed for the first time with a statistical significance of $8.0σ$, in addition to the dominant $K^*(892)^-$ and $\mathcal{S}-$wave components. The $\mathcal{D}-$wave component is determined to account for $(0.092 \pm 0.028_{\rm stat} \pm 0.018_{\rm syst})\%$ of the total decay rate. The branching fractions of the dominant $K^*(892)^-$ components are measured as $\mathcal{B}(D^0\rightarrow K^{*}(892)^-e^+ν_{e}) = (2.043 \pm 0.018_{\rm stat} \pm 0.012_{\rm syst})\%$ and $\mathcal{B}(D^0\rightarrow K^{*}(892)^-μ^+ν_μ) = (1.964 \pm 0.018_{\rm stat} \pm 0.012_{\rm syst})\%$, which are the most precise measurements to date and represent significant improvements over the previous world averages. The hadronic form-factor parameters are measured to be $r_{V} = V(0)/A_1(0) = 1.444 \pm 0.026_{\rm stat} \pm 0.010_{\rm syst}$, $r_{2} = A_2(0)/A_1(0) = 0.752 \pm 0.020_{\rm stat} \pm 0.004_{\rm syst}$, and $A_1(0)=0.618\pm0.002_{\rm stat} \pm0.004_{\rm syst}$, where $V(0)$ is the vector form factor and $A_{1,2}(0)$ are the axial-vector form factors evaluated at $q^2=0$. This is the most precise determination of the form-factor parameters to date measured in a $D\rightarrow K^*(892)$ transition. In addition, we report the first model-independent measurement of the $\mathcal{S}-$wave phase shift in the hadronic $\bar{K}^0π^-$ system.
Submitted 4 March, 2026;
originally announced March 2026.
-
Search for a massless particle beyond the Standard Model in the $Ξ^0\toΛ+ \text{invisible}$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
H. R. Bao,
X. L. Bao,
M. Barbagiovanni,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (745 additional authors not shown)
Abstract:
A search for a massless beyond-standard-model particle is performed in the decay $Ξ^{0}\toΛ+\text{invisible}$ using $(1.0087 \pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider. No significant signal is observed and the upper limit on the branching fraction $\mathcal{B}(Ξ^{0}\toΛ+\text{invisible})$ is set to be $2.3 \times 10^{-4}$ at the $90\%$ confidence level. This is the first search for a flavor-changing neutral current process with missing energy in $Ξ^0$ decays. Throughout this paper, charge-conjugate processes are always implied.
Submitted 12 March, 2026; v1 submitted 3 March, 2026;
originally announced March 2026.
-
GCTAM: Global and Contextual Truncated Affinity Combined Maximization Model For Unsupervised Graph Anomaly Detection
Authors:
Xiong Zhang,
Hong Peng,
Zhenli He,
Cheng Xie,
Xin Jin,
Hua Jiang
Abstract:
Anomalies often occur in real-world information networks/graphs, such as malevolent users, malicious comments, banned users, and fake news in social graphs. The latest graph anomaly detection methods use a novel mechanism called truncated affinity maximization (TAM) to detect anomalous nodes without using any label information and achieve impressive results. TAM maximizes the affinities among the normal nodes while truncating the affinities of the anomalous nodes to identify the anomalies. However, existing TAM-based methods truncate suspicious nodes according to a rigid threshold that ignores the specificity and high-order affinities of different nodes. This inevitably causes inefficient truncations from both normal and anomalous nodes, limiting the effectiveness of anomaly detection. To this end, this paper proposes a novel truncation model combining contextual and global affinity to truncate the anomalous nodes. The core idea of the work is to use contextual truncation to decrease the affinity of anomalous nodes, while global truncation increases the affinity of normal nodes. Extensive experiments on massive real-world datasets show that our method surpasses peer methods in most graph anomaly detection tasks. In particular, compared with previous state-of-the-art methods, the proposed method achieves improvements of +15\% $\sim$ +20\% on two well-known real-world datasets, Amazon and YelpChi. Notably, our method works well on the large datasets Amazon-all and YelpChi-all and achieves the best results, while most previous models cannot complete the tasks.
Submitted 2 March, 2026;
originally announced March 2026.
-
Search for the charmonium weak decay $ψ(2S)\to D_s^-π^+ + c.c.$ and $ψ(2S)\to D_s^-ρ^+ + c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (722 additional authors not shown)
Abstract:
We search for the weak decays $ψ(2S)\to D_s^-π^+ + c.c.$ and $ψ(2S)\to D_s^-ρ^+ + c.c.$ for the first time. The search is based on $(2712.4\pm14.3)\times 10^6$ events containing the charmonium state $ψ(2S)$ collected at the center-of-mass energy $\sqrt{s}=3.686\ \rm{GeV}$ with the BESIII detector. This search offers a unique opportunity to test the Standard Model and search for new physics. Since no signal excess above the background is observed, the upper limits on the branching fractions at the 90\% confidence level are set to be $1.4\times 10^{-6}$ and $7.0\times 10^{-6}$ for $ψ(2S)\to D_s^-π^+ + c.c.$ and $ψ(2S)\to D_s^-ρ^+ + c.c.$, respectively.
Submitted 2 March, 2026;
originally announced March 2026.
-
First Amplitude Analysis of $D^0\rightarrow K^-π^0e^+ν_e$ and Observation of $D^0\rightarrow K^*_2(1430)^-e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (686 additional authors not shown)
Abstract:
We present the first amplitude analysis of the semileptonic decay $D^0\to K^-π^0 e^{+}ν_{e}$ by analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 20.3 fb$^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. A tiny $\mathcal{D}$-wave component of the $K^*_2(1430)^-$ accounting for $(0.16 \pm 0.05_{\rm stat} \pm 0.02_{\rm syst})\%$ of the $K^-π^0$ is observed for the first time with a significance of $7.9σ$ in addition to the dominant $\mathcal{P}$-wave component of $K^*(892)^-$ and the sub-dominant $K^-π^0$ $\mathcal{S}$-wave. The hadronic form factors of the $D^0 \to K^*(892)^-$ transition are measured precisely as $r_V=V(0)/A_1(0)=1.41 \pm 0.05_{\rm stat} \pm 0.01_{\rm syst}$ and $r_2=A_2(0)/A_1(0)=0.77 \pm 0.04_{\rm stat} \pm 0.02_{\rm syst}$. The branching fraction of $D^0\to K^*(892)^-e^+ν_e$ with $K^*(892)^-\to K^-π^0$ is measured to be $(7.403\pm0.061_{\rm stat.} \pm 0.048_{\rm syst.})\times10^{-3}$. Combining the measurements of the $D^0\to K^*(892)^-(K^*(892)^-\to K^-π^0)\ell^+ ν_\ell$, lepton flavor universality is tested by the ratio $\mathcal{R}_{\rm LFU}=\mathcal{B}(D^0\to K^*(892)^-μ^+ ν_μ)/\mathcal{B}(D^0\to K^*(892)^-e^+ν_e)=0.928\pm0.020_{\rm stat}\pm0.012_{\rm syst}$ with unprecedented precision; no violation is found. Furthermore, isospin symmetry in the decay $K^*(892) \to Kπ$ is tested by $\mathcal R_{K^{*-}} =\mathcal{B}(K^*(892)^-\to K^- π^0)/\mathcal{B}(K^*(892)^-\to K_S^0 π^-)= 1.09\pm0.02_{\rm stat}\pm0.02_{\rm syst}$ for the first time using the previous measurement of $D^0\to K^*(892)^-e^+ν_e$ with $K^*(892)^-\to K^0_Sπ^-$. Finally, the phase shift of the $Kπ$ $\mathcal{S}$-wave is extracted in a model-independent way, which sheds light on the nature of the lightest strange scalar meson, the $K^*_0(700)$.
Submitted 28 February, 2026;
originally announced March 2026.
-
Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark
Authors:
Ke Cao,
Xuanhua He,
Xueheng Li,
Lingting Zhu,
Yingying Wang,
Ao Ma,
Zhanjie Zhang,
Man Zhou,
Chengjun Xie,
Jie Zhang
Abstract:
Pansharpening aims to generate high-resolution multi-spectral images by fusing the spatial detail of panchromatic images with the spectral richness of low-resolution MS data. However, most existing methods are evaluated under limited, low-resolution settings, limiting their generalization to real-world, high-resolution scenarios. To bridge this gap, we systematically investigate the data, algorithmic, and computational challenges of cross-scale pansharpening. We first introduce PanScale, the first large-scale, cross-scale pansharpening dataset, accompanied by PanScale-Bench, a comprehensive benchmark for evaluating generalization across varying resolutions and scales. To realize scale generalization, we propose ScaleFormer, a novel architecture designed for multi-scale pansharpening. ScaleFormer reframes generalization across image resolutions as generalization across sequence lengths: it tokenizes images into patch sequences of the same resolution but variable length proportional to image scale. A Scale-Aware Patchify module enables training for such variations from fixed-size crops. ScaleFormer then decouples intra-patch spatial feature learning from inter-patch sequential dependency modeling, incorporating Rotary Positional Encoding to enhance extrapolation to unseen scales. Extensive experiments show that our approach outperforms SOTA methods in fusion quality and cross-scale generalization. The datasets and source code are available at https://github.com/caoke-963/ScaleFormer.
Submitted 5 March, 2026; v1 submitted 28 February, 2026;
originally announced March 2026.
-
SvfEye: A Semantic-Visual Fusion Framework with Multi-Scale Visual Context for Multimodal Reasoning
Authors:
Yuxiang Shen,
Hailong Huang,
Zhenkun Gao,
Xueheng Li,
Man Zhou,
Chengjun Xie,
Haoxuan Che,
Xuanhua He,
Jie Zhang
Abstract:
Multimodal Large Language Models (MLLMs) often struggle to accurately perceive fine-grained visual details, especially when targets are tiny or visually subtle. This challenge can be addressed through semantic-visual information fusion, which integrates global image context with fine-grained local evidence for multi-scale visual understanding. Recently, a paradigm termed "Thinking with Images" has emerged, enabling models to acquire high-resolution visual evidence by zooming or cropping image regions and fusing these local details with global context during reasoning. Although training-based approaches demonstrate the effectiveness of this capability, they require extensive computational resources and large-scale task-specific data. Consequently, lightweight training-free methods have been proposed as a practical alternative to incorporate local visual evidence during inference. However, existing training-free approaches still suffer from two key limitations. First, they indiscriminately extract and fuse local visual regions for all inputs regardless of necessity, introducing computational redundancy and perceptual noise. Second, they exhibit drift between semantic intent and visual attention, preventing accurate localization of user-focused regions. To address these challenges, we propose SvfEye, a training-free framework for adaptive visual-semantic fusion. SvfEye follows a two-stage pipeline with a confidence-based decision module to determine whether additional local visual information is needed, and a semantic-attention fusion module to identify informative local regions. Experiments show that SvfEye achieves substantial performance gains while obtaining an approximately 4.0x inference speedup over the state-of-the-art method ZoomEye.
Submitted 13 March, 2026; v1 submitted 26 February, 2026;
originally announced March 2026.
-
Precision Studies and Searches for CP Asymmetries in the Inclusive Decay $Λ_{c}^{+}\to ΛX$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
C. S. Akondi,
R. Aliberti,
A. Amoroso,
Q. An,
Y. H. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
X. L. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. B. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (719 additional authors not shown)
Abstract:
Based on $e^+e^-$ annihilation data collected with the BESIII detector at center-of-mass energies from 4.600 to 4.699 GeV, corresponding to an integrated luminosity of 4.5 fb$^{-1}$, we present the first measurement of the longitudinal polarization of $Λ$ hyperons produced in the inclusive decay $Λ_c^+ \to ΛX$, where $X$ denotes any allowed final state. The polarizations are determined to be $\mathcal{P}_Λ = -0.393 \pm 0.055_{\mathrm{sta.}} \pm 0.020_{\mathrm{sys.}}$ and $\mathcal{P}_{\barΛ} = 0.288 \pm 0.056_{\mathrm{sta.}} \pm 0.017_{\mathrm{sys.}}$. We then search for CP violation using an asymmetry constructed from the $Λ$ polarization and the $Λ\to p π^-$ decay asymmetry parameters, and obtain $\mathcal{A}_{\mathrm{CP}}^{\mathrm{pol}} = 0.15 \pm 0.12_{\mathrm{sta.}} \pm 0.04_{\mathrm{sys.}}$. We also perform an updated measurement of the absolute branching fraction, obtaining $\mathcal{B}(Λ_c^+ \to ΛX) = (38.07 \pm 0.38_{\mathrm{sta.}} \pm 0.49_{\mathrm{sys.}})\%$, with precision improved by a factor of four relative to the current world average. A search for direct CP violation yields $\mathcal{A}_{\mathrm{CP}}^{\mathrm{dir}} = (1.5 \pm 1.0_{\mathrm{sta.}} \pm 1.0_{\mathrm{sys.}})\%$. No evidence for CP violation in inclusive charm baryon decays is observed.
Submitted 27 February, 2026;
originally announced February 2026.