-
FAIR Universe Weak Lensing ML Uncertainty Challenge: Handling Uncertainties and Distribution Shifts for Precision Cosmology
Authors:
Biwei Dai,
Po-Wen Chang,
Wahid Bhimji,
Paolo Calafiura,
Ragansu Chakkappai,
Yuan-Tang Chou,
Sascha Diefenbacher,
Jordan Dudley,
Ibrahim Elsharkawy,
Steven Farrell,
Isabelle Guyon,
Chris Harris,
Elham E Khoda,
Benjamin Nachman,
David Rousseau,
Uroš Seljak,
Ihsan Ullah,
Yulei Zhang
Abstract:
Weak gravitational lensing, the correlated distortion of background galaxy shapes by foreground structures, is a powerful probe of the matter distribution in our universe and allows accurate constraints on the cosmological model. In recent years, high-order statistics and machine learning (ML) techniques have been applied to weak lensing data to extract the nonlinear information beyond traditional…
▽ More
Weak gravitational lensing, the correlated distortion of background galaxy shapes by foreground structures, is a powerful probe of the matter distribution in our universe and allows accurate constraints on the cosmological model. In recent years, high-order statistics and machine learning (ML) techniques have been applied to weak lensing data to extract the nonlinear information beyond traditional two-point analysis. However, these methods typically rely on cosmological simulations, which poses several challenges: simulations are computationally expensive, limiting most realistic setups to a low training data regime; inaccurate modeling of systematics in the simulations create distribution shifts that can bias cosmological parameter constraints; and varying simulation setups across studies make method comparison difficult. To address these difficulties, we present the first weak lensing benchmark dataset with several realistic systematics and launch the FAIR Universe Weak Lensing Machine Learning Uncertainty Challenge. The challenge focuses on measuring the fundamental properties of the universe from weak lensing data with limited training set and potential distribution shifts, while providing a standardized benchmark for rigorous comparison across methods. Organized in two phases, the challenge will bring together the physics and ML communities to advance the methodologies for handling systematic uncertainties, data efficiency, and distribution shifts in weak lensing analysis with ML, ultimately facilitating the deployment of ML approaches into upcoming weak lensing survey analysis.
△ Less
Submitted 15 April, 2026;
originally announced April 2026.
-
Competing with AI Scientists: Agent-Driven Approach to Astrophysics Research
Authors:
Thomas Borrett,
Licong Xu,
Andy Nilipour,
Boris Bolliet,
Sebastien Pierre,
Erwan Allys,
Celia Lecat,
Biwei Dai,
Po-Wen Chang,
Wahid Bhimji
Abstract:
We present an agent-driven approach to the construction of parameter inference pipelines for scientific data analysis. Our method leverages a multi-agent system, Cmbagent (the analysis system of the AI scientist Denario), in which specialized agents collaborate to generate research ideas, write and execute code, evaluate results, and iteratively refine the overall pipeline. As a case study, we app…
▽ More
We present an agent-driven approach to the construction of parameter inference pipelines for scientific data analysis. Our method leverages a multi-agent system, Cmbagent (the analysis system of the AI scientist Denario), in which specialized agents collaborate to generate research ideas, write and execute code, evaluate results, and iteratively refine the overall pipeline. As a case study, we apply this approach to the FAIR Universe Weak Lensing Uncertainty Challenge, a competition under time constraints focused on robust cosmological parameter inference with realistic observational uncertainties. While the fully autonomous exploration initially did not reach expert-level performance, the integration of human intervention enabled our agent-driven workflow to achieve a first-place result in the challenge. This demonstrates that semi-autonomous agentic systems can compete with, and in some cases surpass, expert solutions. We describe our workflow in detail, including both the autonomous and semi-autonomous exploration by Cmbagent. Our final inference pipeline utilizes parameter-efficient convolutional neural networks, likelihood calibration over a known parameter grid, and multiple regularization techniques. Our results suggest that agent-driven research workflows can provide a scalable framework to rapidly explore and construct pipelines for inference problems.
△ Less
Submitted 18 March, 2026;
originally announced April 2026.
-
Audio Language Model for Deepfake Detection Grounded in Acoustic Chain-of-Thought
Authors:
Runkun Chen,
Yixiong Fang,
Pengyu Chang,
Yuante Li,
Massa Baali,
Bhiksha Raj
Abstract:
Deepfake speech detection systems are often limited to binary classification tasks and struggle to generate interpretable reasoning or provide context-rich explanations for their decisions. These models primarily extract latent embeddings for authenticity detection but fail to leverage structured acoustic evidence such as prosodic, spectral, and physiological attributes in a meaningful manner. Thi…
▽ More
Deepfake speech detection systems are often limited to binary classification tasks and struggle to generate interpretable reasoning or provide context-rich explanations for their decisions. These models primarily extract latent embeddings for authenticity detection but fail to leverage structured acoustic evidence such as prosodic, spectral, and physiological attributes in a meaningful manner. This paper introduces CoLMbo-DF, a Feature-Guided Audio Language Model that addresses these limitations by integrating robust deepfake detection with explicit acoustic chain-of-thought reasoning. By injecting structured textual representations of low-level acoustic features directly into the model prompt, our approach grounds the model's reasoning in interpretable evidence and improves detection accuracy. To support this framework, we introduce a novel dataset of audio pairs paired with chain-of-thought annotations. Experiments show that our method, trained on a lightweight open-source language model, significantly outperforms existing audio language model baselines despite its smaller scale, marking a significant advancement in explainable deepfake speech detection.
△ Less
Submitted 31 March, 2026; v1 submitted 30 March, 2026;
originally announced March 2026.
-
Search for the decay $B^+ \rightarrow K^+τ^+τ^-$ using data from the Belle and Belle II experiments
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
K. Adamczyk,
A. Aggarwal,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu
, et al. (414 additional authors not shown)
Abstract:
We report a search for the rare decay $B^{+} \rightarrow K^{+} τ^{+} τ^{-}$ using $1.2 \times 10^9$ $Υ(4S)$ mesons produced near threshold in electron-positron collisions and collected by the Belle and Belle~II experiments. We fully reconstruct the hadronic decay of one $B$ meson produced in the $Υ(4S)\rightarrow B^{+} B^{-}$ decay, and search for $B^{\pm}\rightarrow K^{\pm} τ^{+}τ^{-}$ candidates…
▽ More
We report a search for the rare decay $B^{+} \rightarrow K^{+} τ^{+} τ^{-}$ using $1.2 \times 10^9$ $Υ(4S)$ mesons produced near threshold in electron-positron collisions and collected by the Belle and Belle~II experiments. We fully reconstruct the hadronic decay of one $B$ meson produced in the $Υ(4S)\rightarrow B^{+} B^{-}$ decay, and search for $B^{\pm}\rightarrow K^{\pm} τ^{+}τ^{-}$ candidates among the remaining collision products, reconstructing a charged kaon and leptonic decays of the $τ$ leptons. We optimize the selection for best sensitivity and look for an excess over background at low values of the residual energy detected in the calorimeter after full event reconstruction. We observe no significant excess and set the limit $\mathcal{B}(B^{+}\rightarrow K^{+}τ^{+}τ^{-})< 0.56\times 10^{-3}$ at the 90% confidence level, improving on the only previous result by a factor of four.
△ Less
Submitted 25 March, 2026;
originally announced March 2026.
-
Stars with Plumbing Issues: The Formation of Collimated Outflows on Common-Envelope Simulations and Comparison to Water Fountains Observations
Authors:
Sarah V. Borges,
Philip Chang
Abstract:
Common-envelope evolution (CEE) is one of the biggest open questions in binary stellar evolution, despite being the main channel for the formation of close binaries. One of the main reasons CEE is difficult to model is the lack of direct observations that could constrain numerical simulations. One exception is luminous red novae, which are thought to represent CEEs that end in mergers. Unfortunate…
▽ More
Common-envelope evolution (CEE) is one of the biggest open questions in binary stellar evolution, despite being the main channel for the formation of close binaries. One of the main reasons CEE is difficult to model is the lack of direct observations that could constrain numerical simulations. One exception is luminous red novae, which are thought to represent CEEs that end in mergers. Unfortunately, there are no confirmed direct detections of ongoing events that result in the survival of a close binary, and we must rely on observations of post-CEE systems. Among these, planetary nebulae (PNe) are particularly important because their morphologies can probe how the envelope is ejected. However, post-CEE PNe do not reflect the ejected envelope in its pristine form, as winds from the central core also affect their morphology. In this context, Water Fountains (WFs), a class of objects proposed to form during CEE, provide an ideal comparison. They are identified by their collimated water masers, and most are still in the post-AGB phase. As such, WFs provide some of the best observational constraints for simulations, since they likely capture a snapshot of the envelope ejection while it is still happening. In this paper, we show that the formation of a circumbinary disk with collimated outflows surrounding the central binary arises naturally from hydrodynamical simulations of CEE, and that their morphology and kinematics are consistent with observations of WFs. We also present insights into how the properties of WFs may provide clues to understanding how CEE proceeds and help guide future simulations.
△ Less
Submitted 17 March, 2026;
originally announced March 2026.
-
From Embeddings to Dyson Series: Transformer Mechanics as Non-Hermitian Operator Theory
Authors:
Po-Hao Chang
Abstract:
Transformer architectures are typically described in algorithmic and statistical terms, leaving their internal mechanics without a familiar structural language for researchers trained in physical theories. To bridge this gap, we develop a complementary operator-theoretic framework that recasts their mechanics in a language familiar to many-body physics. Beginning from the token as a discrete index…
▽ More
Transformer architectures are typically described in algorithmic and statistical terms, leaving their internal mechanics without a familiar structural language for researchers trained in physical theories. To bridge this gap, we develop a complementary operator-theoretic framework that recasts their mechanics in a language familiar to many-body physics. Beginning from the token as a discrete index without intrinsic geometry, we show that embedding corresponds to a basis transformation into a continuous representation space. Once such a reference basis is established, self-attention naturally assumes the role of a non-Hermitian interaction operator, and network depth implements an ordered composition of these interactions. Within this formulation, several empirical properties of deep Transformers -- including stability at large depth, representational saturation, and the effectiveness of multi-head decomposition -- find natural structural interpretations as consequences of regulated operator composition. Together, channel factorization and normalization emerge as organizing structural logic rather than isolated architectural choices. This perspective does not rely on post-hoc analogy, but follows a constructive path where each parallel arises from the preceding structural step. By recasting Transformer mechanics in operator language, the framework lowers the conceptual barrier between deep learning and many-body physics through shared mathematical structure, making tools and intuitions from each domain more readily legible to the other.
△ Less
Submitted 16 March, 2026; v1 submitted 11 March, 2026;
originally announced March 2026.
-
Searches for charged-lepton-flavor violation in $χ_{bJ}(1P)$ decays
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
A. Aggarwal,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
H. Bae
, et al. (394 additional authors not shown)
Abstract:
We report the first searches for charged-lepton-flavor violation in decays of $χ_{bJ}(1P)$ ($J=0, 1,$ and $2$) to a pair of charged leptons using 158 million $Υ(2S)$ decays collected with the Belle detector in $e^+e^-$ collisions at the KEKB collider. No significant signal is observed, and we set upper limits on the branching fractions for $χ_{bJ}(1P)$ decays to $e^\pmμ^\mp$ at the level of…
▽ More
We report the first searches for charged-lepton-flavor violation in decays of $χ_{bJ}(1P)$ ($J=0, 1,$ and $2$) to a pair of charged leptons using 158 million $Υ(2S)$ decays collected with the Belle detector in $e^+e^-$ collisions at the KEKB collider. No significant signal is observed, and we set upper limits on the branching fractions for $χ_{bJ}(1P)$ decays to $e^\pmμ^\mp$ at the level of $10^{-6}$ and to $e^\pmτ^\mp$ or $μ^\pmτ^\mp$ at the level of $10^{-5}$. Limits on $χ_{b0}(1P)$ decays are translated into bounds on the corresponding Wilson coefficients of scalar operators that mediate charged-lepton-flavor violation.
△ Less
Submitted 12 March, 2026; v1 submitted 11 March, 2026;
originally announced March 2026.
-
Is altermagnetism in vanadium oxychalcogenides a lost cause?
Authors:
Bishal Thapa,
Po-Hao Chang,
Kirill Belashchenko,
Igor I. Mazin
Abstract:
Vanadium-based oxychalcogenide compounds with the inverse Lieb-lattice (ILL) structural pattern have recently been proposed as candidate altermagnets (AM). However, early studies postulated ferromagnetic interlayer coupling, a critical requirement for preserving the bulk AM state. Here we present a systematic survey of the complete AV2Q2O family (A = K, Rb, Cs; Q = S, Se, Te) in terms of their mag…
▽ More
Vanadium-based oxychalcogenide compounds with the inverse Lieb-lattice (ILL) structural pattern have recently been proposed as candidate altermagnets (AM). However, early studies postulated ferromagnetic interlayer coupling, a critical requirement for preserving the bulk AM state. Here we present a systematic survey of the complete AV2Q2O family (A = K, Rb, Cs; Q = S, Se, Te) in terms of their magnetic ordering and interlayer coupling. While intralayer exchange interaction favors AM ordering in a single ILL layer across the entire family, the relatively weak interlayer coupling in most cases favors Kramers-degenerate antiferromagnetic order with a doubled magnetic unit cell. This means that most stoichiometric bulk materials, including the previously proposed candidate KV2Se2O, are not altermagnetic, with CsV2Te2O being the only exception. Using hole doping to simulate alkali vacancies, we show that realistic deviations from stoichiometry do not change the magnetic ground state in these compounds.
△ Less
Submitted 20 February, 2026;
originally announced February 2026.
-
Building an AI-native Research Ecosystem for Experimental Particle Physics: A Community Vision
Authors:
Thea Klaeboe Aarrestad,
Alaa Abdelhamid,
Haider Abidi,
Jahred Adelman,
Jennifer Adelman-McCarthy,
Shuchin Aeron,
Garvita Agarwal,
Usman Ali,
Cristiano Alpigiani,
Omar Alterkait,
Mohamed Aly,
Oz Amram,
Saeed Ansari Fard,
Aram Apyan,
John Arrington,
Marvin Ascencio-Sosa,
Mohammad Atif,
Aneesha Avasthi,
Muhammad Bilal Azam,
Bhim Bam,
Joshua Barrow,
Rainer Bartoldus,
Amit Bashyal,
Aashwin Basnet,
Ayse Bat
, et al. (435 additional authors not shown)
Abstract:
Experimental particle physics seeks to understand the universe by probing its fundamental particles and forces and exploring how they govern the large-scale processes that shape cosmic evolution. This whitepaper presents a vision for how Artificial Intelligence (AI) can accelerate discovery in this field. We outline grand challenges that must be addressed to enable transformative breakthroughs and…
▽ More
Experimental particle physics seeks to understand the universe by probing its fundamental particles and forces and exploring how they govern the large-scale processes that shape cosmic evolution. This whitepaper presents a vision for how Artificial Intelligence (AI) can accelerate discovery in this field. We outline grand challenges that must be addressed to enable transformative breakthroughs and describe how current and planned experimental facilities can implement this vision to advance our understanding of the vast and complex physical world from the smallest to the largest scales. We show how facilities currently under construction, such as the HL-LHC, DUNE and soon EIC, can both benefit from and serve as proving grounds for this vision, while also enabling a longer-term goal for how future experiments -- like FCC-ee at CERN, IceCube-Gen2, a Muon Collider in the U.S., and smaller to mid-scale projects -- can be fully AI-native. We describe how a truly national-scale collaboration, jointly managed across large funding partners, and involving both DOE laboratories and universities, can make this happen.
△ Less
Submitted 19 February, 2026;
originally announced February 2026.
-
Study of $e^{+}e^{-}\to h^{+}h^{-}J/ψ~(h=π,~K,~p)$ via initial-state radiation at Belle~II
Authors:
Belle II Collaboration,
M. Abumusabh,
I. Adachi,
A. Aggarwal,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati,
P. Bambade
, et al. (396 additional authors not shown)
Abstract:
Using a data sample of 427.9 fb$^{-1}$ collected by the Belle~II detector at or near the $Υ(4S)$ and $Υ(10753)$ resonances, the cross sections for $e^+e^-\to h^+h^-J/ψ$ $(h=π/K/p)$ at center-of-mass energies ranging from 3.8 GeV or the production threshold to 5.5/6.0/7.0 GeV have been measured via initial-state radiation. The cross sections for the processes $e^+e^-\to π^+π^-J/ψ$ and…
▽ More
Using a data sample of 427.9 fb$^{-1}$ collected by the Belle~II detector at or near the $Υ(4S)$ and $Υ(10753)$ resonances, the cross sections for $e^+e^-\to h^+h^-J/ψ$ $(h=π/K/p)$ at center-of-mass energies ranging from 3.8 GeV or the production threshold to 5.5/6.0/7.0 GeV have been measured via initial-state radiation. The cross sections for the processes $e^+e^-\to π^+π^-J/ψ$ and $e^+e^-\to K^+K^-J/ψ$ are consistent with previously published results. The cross sections for these channels obtained by combining with previous Belle results are also given. The process $e^+e^-\to p\bar p J/ψ$ is investigated for the first time. The yields are small and no significant structure is observed in the cross section versus energy. Searches for vector charmonium-like states in the $h^+h^-J/ψ$ systems, and for associated intermediate states in the $h^{\pm} J/ψ$ systems, are also presented.
△ Less
Submitted 13 February, 2026;
originally announced February 2026.
-
Study of $B^+ \to μ^+ ν_μ$ decays at Belle and Belle II
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
K. Adamczyk,
A. Aggarwal,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
H. Bae
, et al. (391 additional authors not shown)
Abstract:
We report a measurement of the branching fraction for the leptonic decay $B^+\toμ^+ν_μ$. This work presents the first $B^+\toμ^+ν_μ$ result using Belle~II data, an updated Belle measurement that supersedes the previous result, and their combination, which yields the most precise search to date. The analysis is based on $1076\,\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-ma…
▽ More
We report a measurement of the branching fraction for the leptonic decay $B^+\toμ^+ν_μ$. This work presents the first $B^+\toμ^+ν_μ$ result using Belle~II data, an updated Belle measurement that supersedes the previous result, and their combination, which yields the most precise search to date. The analysis is based on $1076\,\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $10.58\,\mathrm{GeV}$ with the Belle and Belle~II detectors at the KEKB and SuperKEKB colliders, respectively. We measure $\mathcal{B}(B^+\toμ^+ν_μ)=(4.4\pm1.9\pm 1.0)\times10^{-7}$, where the first uncertainty is statistical and the second systematic. The observed significance relative to the background-only hypothesis is 2.4 standard deviations. We set a 90\% confidence level upper limit of $\mathcal{B}(B^+\toμ^+ν_μ)<6.7\times10^{-7}$ using a frequentist approach and a 90\% credibility level upper limit of $\mathcal{B}(B^+\toμ^+ν_μ)<7.2\times 10^{-7}$ using a Bayesian approach. These are the most stringent limits to date. The result is interpreted as an exclusion region in the parameter space of type~II and type~III two-Higgs-doublet models. We search for stable sterile neutrinos with masses $m_N\in[0,1.5]\,\mathrm{GeV}$. No signal is observed, and the resulting exclusion on the squared mixing parameter $|U_{μN}|^2$ provides improvement over previous limits. We report a measurement of the partial branching fraction of semileptonic $B\to X_u\ellν_\ell$ decays with $p_μ^B>2.2\,\mathrm{GeV}$, obtaining $Δ\mathcal{B}(B\to X_u\ellν_\ell)=(2.72\pm0.05\pm0.29)\times10^{-4}$. We present a model-dependent study of weak annihilation decays using the muon momentum spectrum. We observe a signal of 2.4 standard deviations above the background-only hypothesis in regions where the distribution resembles that of $B\to X_u\ellν_\ell$ decays.
△ Less
Submitted 10 February, 2026;
originally announced February 2026.
-
Anisotropy, frustration and saddle point in the twisted Kagome antiferromagnet ErPdPb
Authors:
Resham Babu Regmi,
Sk Jamaluddin,
Y. Lee,
Hari Bhandari,
Po-Hao Chang,
Peter E. Siegfried,
Abhijeet Nayak,
Mohamed El Gazzah,
Bence G. Márkus,
Anna Nyáry,
Zachary T. Messegee,
Miya P. Zhao,
Xiaoyan Tan,
László Forró,
Liqin Ke,
Igor I. Mazin,
Nirmal J. Ghimire
Abstract:
The kagome lattice, with its inherent geometric frustration, provides a rich platform for exploring intriguing magnetic phenomena and topological electronic structures. In reduced-symmetry structures, such as twisted kagome systems involving rare earth elements, additional anisotropy can arise, enabling intriguing properties including spin-ice states, magnetocaloric effects, noncollinear magnetic…
▽ More
The kagome lattice, with its inherent geometric frustration, provides a rich platform for exploring intriguing magnetic phenomena and topological electronic structures. In reduced-symmetry structures, such as twisted kagome systems involving rare earth elements, additional anisotropy can arise, enabling intriguing properties including spin-ice states, magnetocaloric effects, noncollinear magnetic ordering, and anomalous Hall effect. Here, we report the synthesis of single crystals of ErPdPb, which features a twisted kagome lattice net of Er atoms within the hexagonal ZrNiAl-type structure, and we investigate its magnetic, electronic, and thermal properties. The material exhibits antiferromagnetic ordering below 2.2 K, consistently observed in magnetic, transport, and heat capacity measurements. Magnetization measurements reveal 1/3 metamagnetic steps along the c-axis below the Néel temperature, suggesting an Ising-spin-like state on the twisted kagome lattice. A pronounced anisotropy between in-plane and out-of-plane resistivity is observed throughout the temperature range of 1.8-300 K, and the compound exhibits a significant frustration index of 13.6 (12.7) along the c-axis (ab-plane). Heat capacity measurements show a broad hump at 2.2 K, with an additional increase below 0.5 K. The anisotropic magnetic properties are further explored through density functional theory (DFT) calculations, which suggest strong easy-axis anisotropy, consistent with experimental magnetic measurements and crystal-field model expectations, and quasi-one-dimensional bands and a spin-split saddle point at the zone center.
△ Less
Submitted 9 February, 2026;
originally announced February 2026.
-
Test vs Mutant: Adversarial LLM Agents for Robust Unit Test Generation
Authors:
Pengyu Chang,
Yixiong Fang,
Silin Chen,
Yuling Shi,
Beijun Shen,
Xiaodong Gu
Abstract:
Software testing is a critical, yet resource-intensive phase of the software development lifecycle. Over the years, various automated tools have been developed to aid in this process. Search-based approaches typically achieve high coverage but produce tests with low readability, whereas large language model (LLM)-based methods generate more human-readable tests but often suffer from low coverage a…
▽ More
Software testing is a critical, yet resource-intensive phase of the software development lifecycle. Over the years, various automated tools have been developed to aid in this process. Search-based approaches typically achieve high coverage but produce tests with low readability, whereas large language model (LLM)-based methods generate more human-readable tests but often suffer from low coverage and compilability. While the majority of research efforts have focused on improving test coverage and readability, little attention has been paid to enhancing the robustness of bug detection, particularly in exposing corner cases and vulnerable execution paths. To address this gap, we propose AdverTest, a novel adversarial framework for LLM-powered test case generation. AdverTest comprises two interacting agents: a test case generation agent (T) and a mutant generation agent (M). These agents engage in an adversarial loop, where M persistently creates new mutants "hacking" the blind spots of T's current test suite, while T iteratively refines its test cases to "kill" the challenging mutants produced by M. This interaction loop is guided by both coverage and mutation scores, enabling the system to co-evolve toward both high test coverage and bug detection capability. Experimental results in the Defects4J dataset show that our approach improves fault detection rates by 8.56% over the best existing LLM-based methods and by 63.30% over EvoSuite, while also improving line and branch coverage.
△ Less
Submitted 9 February, 2026; v1 submitted 8 February, 2026;
originally announced February 2026.
-
ShapeGaussian: High-Fidelity 4D Human Reconstruction in Monocular Videos via Vision Priors
Authors:
Zhenxiao Liang,
Ning Zhang,
Youbao Tang,
Ruei-Sung Lin,
Qixing Huang,
Peng Chang,
Jing Xiao
Abstract:
We introduce ShapeGaussian, a high-fidelity, template-free method for 4D human reconstruction from casual monocular videos. Generic reconstruction methods lacking robust vision priors, such as 4DGS, struggle to capture high-deformation human motion without multi-view cues. While template-based approaches, primarily relying on SMPL, such as HUGS, can produce photorealistic results, they are highly…
▽ More
We introduce ShapeGaussian, a high-fidelity, template-free method for 4D human reconstruction from casual monocular videos. Generic reconstruction methods lacking robust vision priors, such as 4DGS, struggle to capture high-deformation human motion without multi-view cues. While template-based approaches, primarily relying on SMPL, such as HUGS, can produce photorealistic results, they are highly susceptible to errors in human pose estimation, often leading to unrealistic artifacts. In contrast, ShapeGaussian effectively integrates template-free vision priors to achieve both high-fidelity and robust scene reconstructions. Our method follows a two-step pipeline: first, we learn a coarse, deformable geometry using pretrained models that estimate data-driven priors, providing a foundation for reconstruction. Then, we refine this geometry using a neural deformation model to capture fine-grained dynamic details. By leveraging 2D vision priors, we mitigate artifacts from erroneous pose estimation in template-based methods and employ multiple reference frames to resolve the invisibility issue of 2D keypoints in a template-free manner. Extensive experiments demonstrate that ShapeGaussian surpasses template-based methods in reconstruction accuracy, achieving superior visual quality and robustness across diverse human motions in casual monocular videos.
△ Less
Submitted 5 February, 2026;
originally announced February 2026.
-
SPICE: Submodular Penalized Information-Conflict Selection for Efficient Large Language Model Training
Authors:
Powei Chang,
Jinpeng Zhang,
Bowen Chen,
Chenyu Wang,
Chenlu Guo,
Yixing Zhang,
Yukang Gao,
JianXiang Xiang,
Yue Gao,
Chaoqun Sun,
Yiyi Chen,
Dongying Kong
Abstract:
Information-based data selection for instruction tuning is compelling: maximizing the log-determinant of the Fisher information yields a monotone submodular objective, enabling greedy algorithms to achieve a $(1-1/e)$ approximation under a cardinality budget. In practice, however, we identify alleviating gradient conflicts, misalignment between per-sample gradients, is a key factor that slows down…
▽ More
Information-based data selection for instruction tuning is compelling: maximizing the log-determinant of the Fisher information yields a monotone submodular objective, enabling greedy algorithms to achieve a $(1-1/e)$ approximation under a cardinality budget. In practice, however, we identify alleviating gradient conflicts, misalignment between per-sample gradients, is a key factor that slows down the decay of marginal log-determinant information gains, thereby preventing significant loss of information. We formalize this via an $\varepsilon$-decomposition that quantifies the deviation from ideal submodularity as a function of conflict statistics, yielding data-dependent approximation factors that tighten as conflicts diminish. Guided by this analysis, we propose SPICE, a conflict-aware selector that maximizes information while penalizing misalignment, and that supports early stopping and proxy models for efficiency. Empirically, SPICE selects subsets with higher log-determinant information than original criteria, and these informational gains translate into performance improvements: across 8 benchmarks with LLaMA2-7B and Qwen2-7B, SPICE uses only 10% of the data, yet matches or exceeds 6 methods including full-data tuning. This achieves performance improvements with substantially lower training cost.
△ Less
Submitted 8 April, 2026; v1 submitted 30 January, 2026;
originally announced January 2026.
-
Search for Cosmic Ray Electron Boosted Dark Matter with the CDEX-10 Experiment
Authors:
R. Xu,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
H. Chen,
Y. H. Chen,
J. P. Cheng,
J. Y. Cui,
W. H. Dai,
Z. Deng,
Y. X. Dong,
C. H. Fang,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
H. X. Huang,
T. C. Huang,
S. Karmakar
, et al. (63 additional authors not shown)
Abstract:
We present new constraints on the cosmic ray electron boosted light dark matter (CReDM) using the 205.4 kg$\cdot$day data of the CDEX-10 experiment located at the China Jinping Underground Laboratory. The cosmic ray electron spectrum and distribution in the Galaxy are generated by the $\tt GALPROP$ code package. In the calculation process of DM-electron scattering process in the Galaxy, we conside…
▽ More
We present new constraints on the cosmic ray electron boosted light dark matter (CReDM) using the 205.4 kg$\cdot$day data of the CDEX-10 experiment located at the China Jinping Underground Laboratory. The cosmic ray electron spectrum and distribution in the Galaxy are generated by the $\tt GALPROP$ code package. In the calculation process of DM-electron scattering process in the Galaxy, we consider the energy-dependency of the DM-electron scattering cross section. The constraints on CReDM are set for both heavy and light mediator scenarios using the CDEX-10 dataset. The result exceeds previous Standard Halo Model (SHM) limits for DM mass lower than 0.6 MeV in heavy mediator case and corresponds to the best sensitivity among all direct detection experiments from 1 keV to 0.5 MeV in the light mediator scenario.
△ Less
Submitted 13 January, 2026;
originally announced January 2026.
-
A search for feebly-interacting particles in $B$ decays with missing energy at Belle
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati
, et al. (385 additional authors not shown)
Abstract:
We present a search for an invisible hidden-sector particle $ X_{\rm{inv}}$, produced in $B^0 \to \bar{D}^0 X_{\rm{inv}}$ and $B^\pm\to h X_{\rm{inv}}$ decays, where $h = π^\pm$, $K^\pm$, $D_{s}^\pm$, $p^\pm$. The search is performed using $e^+ e^-$ collision data recorded with the Belle detector, corresponding to 711~fb$^{-1}$. No significant signal is observed. We set 90\% confidence level upper…
▽ More
We present a search for an invisible hidden-sector particle $ X_{\rm{inv}}$, produced in $B^0 \to \bar{D}^0 X_{\rm{inv}}$ and $B^\pm\to h X_{\rm{inv}}$ decays, where $h = π^\pm$, $K^\pm$, $D_{s}^\pm$, $p^\pm$. The search is performed using $e^+ e^-$ collision data recorded with the Belle detector, corresponding to 711~fb$^{-1}$. No significant signal is observed. We set 90\% confidence level upper limits ranging between $10^{-4}$ and $10^{-6}$ on the branching fraction $\mathcal{B}(B \to h X_{\rm{inv}})$ as a function of $m_{ X_{\rm{inv}}}$. Corresponding limits are set on $\mathcal{B}(B \to hX)\times\mathcal{B}(X\toγγ)$ for lifetimes $cτ_X$ between $10~μ$m and 10 m. Many of these limits are the first direct constraints on their respective decays. Our results provide the most stringent exclusion limits to date on the branching fractions for all search channels, and exclude previously unexplored regions of parameter space relevant to several new physics models.
△ Less
Submitted 11 January, 2026;
originally announced January 2026.
-
EIR: Enhanced Image Representations for Medical Report Generation
Authors:
Qiang Sun,
Zongcheng Ji,
Yinlong Xiao,
Peng Chang,
Jun Yu
Abstract:
Generating medical reports from chest X-ray images is a critical and time-consuming task for radiologists, especially in emergencies. To alleviate the stress on radiologists and reduce the risk of misdiagnosis, numerous research efforts have been dedicated to automatic medical report generation in recent years. Most recent studies have developed methods that represent images by utilizing various m…
▽ More
Generating medical reports from chest X-ray images is a critical and time-consuming task for radiologists, especially in emergencies. To alleviate the stress on radiologists and reduce the risk of misdiagnosis, numerous research efforts have been dedicated to automatic medical report generation in recent years. Most recent studies have developed methods that represent images by utilizing various medical metadata, such as the clinical document history of the current patient and the medical graphs constructed from retrieved reports of other similar patients. However, all existing methods integrate additional metadata representations with visual representations through a simple "Add and LayerNorm" operation, which suffers from the information asymmetry problem due to the distinct distributions between them. In addition, chest X-ray images are usually represented using pre-trained models based on natural domain images, which exhibit an obvious domain gap between general and medical domain images. To this end, we propose a novel approach called Enhanced Image Representations (EIR) for generating accurate chest X-ray reports. We utilize cross-modal transformers to fuse metadata representations with image representations, thereby effectively addressing the information asymmetry problem between them, and we leverage medical domain pre-trained models to encode medical images, effectively bridging the domain gap for image representation. Experimental results on the widely used MIMIC and Open-I datasets demonstrate the effectiveness of our proposed method.
△ Less
Submitted 28 December, 2025;
originally announced December 2025.
-
A search for successful and choked jets in nearby broad-lined Type Ic supernovae
Authors:
Tanner O'Dwyer,
Alessandra Corsi,
Sheng Yang,
Shreya Anand,
S. Bradley Cenko,
Gokul P. Srinivasaragavan,
Anna Y. Q. Ho,
Jesper Sollerman,
Bei Zhou,
Arvind Balasubramanian,
Po-Wen Chang,
Marc Kamionkowski,
Daniel Perley,
Russ R. Laher,
Kohta Murase,
Frank J. Masci,
Mansi M. Kasliwal,
Josiah N. Purdum,
Matthew J. Graham
Abstract:
The observational link between long gamma-ray bursts (GRBs) and broad-lined stripped-envelope core-collapse supernovae (SNe Ic-BL) is well established. Significant progress has been made in constraining what fraction of SNe Ic-BL may power high- or low-luminosity GRBs when viewed at small off-axis angles. However, the GRB-SN connection still lacks a complete understanding in the broader context of…
▽ More
The observational link between long gamma-ray bursts (GRBs) and broad-lined stripped-envelope core-collapse supernovae (SNe Ic-BL) is well established. Significant progress has been made in constraining what fraction of SNe Ic-BL may power high- or low-luminosity GRBs when viewed at small off-axis angles. However, the GRB-SN connection still lacks a complete understanding in the broader context of massive-star evolution and explosion physics. Models predict a continuum of outcomes for the fastest ejecta, from choked to ultra-relativistic jets, and observations from radio to X-rays are key to probing these scenarios across a range of viewing angles and velocities. Here, we present results from a coordinated radio-to-X-ray campaign targeting nearby (z<=0.1) SNe Ic-BL designed to explore this diversity. With eight new radio-monitored events and updated data for one previously observed SN, we further tighten constraints on the fraction of SNe Ic-BL as relativistic as SN 1998bw/GRB 980425. We identify SN 2024rjw as a new radio-loud event likely powered by strong interaction with circumstellar material (CSM), and add evidence supporting a similar interpretation for SN 2020jqm. We also establish new limits on the properties of radio-emitting ejecta with velocities consistent with cocoons from choked jets, highlighting SN 2022xxf as a promising cocoon-dominated candidate. These results refine our understanding of the continuum linking ordinary SNe Ic-BL, engine-driven explosions, and GRBs, and contribute to building a sample that will inform future multi-messenger searches for electromagnetic counterparts to high-energy neutrinos.
△ Less
Submitted 14 April, 2026; v1 submitted 9 December, 2025;
originally announced December 2025.
-
Measurement of inclusive $B \to X_u \ell ν$ partial branching fractions and $|V_{ub}|$ at Belle II
Authors:
Belle II Collaboration,
M. Abumusabh,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati,
P. Bambade
, et al. (398 additional authors not shown)
Abstract:
A sample of 365 fb$^{-1}$ of $e^+e^- \to Υ(4S) \to B\bar{B}$ data collected by the Belle II experiment is used to measure the partial branching fractions of charmless semileptonic $B$ meson decays and determine the magnitude of the Cabibbo-Kobayashi-Maskawa matrix element $V_{ub}$. Events containing a signal electron or muon $\ell$ and a fully reconstructed hadronic $B$ decay that constrains the s…
▽ More
A sample of 365 fb$^{-1}$ of $e^+e^- \to Υ(4S) \to B\bar{B}$ data collected by the Belle II experiment is used to measure the partial branching fractions of charmless semileptonic $B$ meson decays and determine the magnitude of the Cabibbo-Kobayashi-Maskawa matrix element $V_{ub}$. Events containing a signal electron or muon $\ell$ and a fully reconstructed hadronic $B$ decay that constrains the signal kinematics are selected, while the rest of the event defines the hadronic system $X_u$ associated with the signal. To discriminate the signal from the 50-times larger background originating from CKM-favored semileptonic $B$ decays, a template fit is performed in both signal and control regions after applying an optimized selection. The partial branching fraction measured for lepton energies greater than 1 GeV in the signal $B$ meson rest frame is $Δ\mathcal{B}(B \to X_u \ell ν) = (1.54 \pm 0.08 \, {\rm (stat.)} \pm 0.12 \, {\rm (syst.)}) \times 10^{-3}$. From this measurement, using the Gambino, Giordano, Ossola, Uraltsev theoretical framework, $|V_{ub}| = (4.01 \pm 0.19 ^{+0.07} _{-0.08}) \times 10^{-3}$ is determined, where the uncertainties are experimental and theoretical, respectively. This value is consistent with the world average obtained from previous inclusive measurements. Different theoretical predictions and partial branching fractions measured in other phase-space regions, defined by additional selections on the $X_u$ and leptonic system masses, are also used to determine $|V_{ub}|$.
△ Less
Submitted 28 March, 2026; v1 submitted 8 December, 2025;
originally announced December 2025.
-
Measurements of the mass difference $m(B^0)-m(B^+)$ and the energy dependence of the cross-section ratio $σ(e^+e^-\to B^0\bar{B}^0) / σ(e^+e^-\to B^+B^-)$ at Belle and Belle II
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
N. K. Baghel,
S. Bahinipati,
P. Bambade
, et al. (375 additional authors not shown)
Abstract:
Using data samples collected by the Belle and Belle II experiments at the $Υ(4S)$ resonance with integrated luminosities of 571 fb$^{-1}$ and 365 fb$^{-1}$, respectively, we measure the pseudoscalar $B$-meson mass difference to be $m(B^0)-m(B^+) = (0.495\pm0.024\pm0.005)$ MeV/c$^2$. The results are based on a simultaneous fit to the variable $\tilde{M}_{bc}$, which is related to the $B$ momentum,…
▽ More
Using data samples collected by the Belle and Belle II experiments at the $Υ(4S)$ resonance with integrated luminosities of 571 fb$^{-1}$ and 365 fb$^{-1}$, respectively, we measure the pseudoscalar $B$-meson mass difference to be $m(B^0)-m(B^+) = (0.495\pm0.024\pm0.005)$ MeV/c$^2$. The results are based on a simultaneous fit to the variable $\tilde{M}_{bc}$, which is related to the $B$ momentum, for $B^0$ and $B^+$ candidates; and to the energy dependence of ${\cal R}=σ(e^+e^-\to B^0\bar{B}^0) / σ(e^+e^-\to B^+B^-)$, which is measured using changes in the average center-of-mass energy over the data taking periods. The phase-space hypothesis ${\cal R}=(p_{B^0}/p_{B^+})^3$, upon which previous measurements rely, is strongly disfavored by our fit; the measured mass-difference value for the phase-space hypothesis also differs significantly from our measurement. We constrain ${\cal R}$ in a broader energy range than covered by the direct measurement and extract the energy dependence of ${\cal R}$ in the range from the $B\bar{B}$ threshold up to 10.59 GeV. We interpret the results using a phenomenological model and constrain the parameters of the $B\bar{B}$ potential in the isovector channel.
△ Less
Submitted 19 November, 2025;
originally announced November 2025.
-
First search for $B \rightarrow X_{s} ν\barν$ decays
Authors:
Belle II Collaboration,
M. Abumusabh,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati
, et al. (418 additional authors not shown)
Abstract:
We report the first search for the flavor-changing neutral-current decays $B \rightarrow X_{s} ν\barν$, where $X_{s}$ is a hadronic system with strangeness equal to 1, in data collected with the Belle~II detector at the SuperKEKB asymmetric-energy $e^+e^-$ collider. The data sample corresponds to an integrated luminosity of $365~\textrm{fb}^{-1}$ collected at the $Υ(4S)$ resonance and…
▽ More
We report the first search for the flavor-changing neutral-current decays $B \rightarrow X_{s} ν\barν$, where $X_{s}$ is a hadronic system with strangeness equal to 1, in data collected with the Belle~II detector at the SuperKEKB asymmetric-energy $e^+e^-$ collider. The data sample corresponds to an integrated luminosity of $365~\textrm{fb}^{-1}$ collected at the $Υ(4S)$ resonance and $43~\textrm{fb}^{-1}$ collected at a center-of-mass energy $60~\textrm{MeV}$ below resonance for estimation of $e^+e^-\to q\bar{q}$ continuum background. One of the $B$ mesons from the $Υ(4S) \to B\bar{B}$ decay is fully reconstructed in a hadronic decay mode. The $B \to X_s ν\barν$ decay is reconstructed with a sum-of-exclusives approach that uses 30 $X_s$ decay modes. This approach provides high sensitivity to the inclusive decay, despite the presence of two undetected neutrinos. The search is performed in three regions of the $X_{s}$ mass, chosen to separate contributions from prominent resonances. We do not observe a significant signal and set upper limits at 90\% confidence level on the partial branching fractions for the regions $0.0 < M_{X_{s}} < 0.6~\textrm{GeV}/c^{2}$, $0.6 < M_{X_{s}} < 1.0~\textrm{GeV}/c^{2}$, and $1.0~\textrm{GeV}/c^{2} < M_{X_{s}}$ of $2.2 \times 10^{-5}$, $9.5 \times 10^{-5}$, and $31.2 \times 10^{-5}$, respectively. Combining the three mass regions, we obtain the upper limit on the branching fraction, $B(B \to X_s ν\barν) < 3.2 \times 10^{-4}$.
△ Less
Submitted 14 November, 2025;
originally announced November 2025.
-
COGNOS: Universal Enhancement for Time Series Anomaly Detection via Constrained Gaussian-Noise Optimization and Smoothing
Authors:
Wenlong Shang,
Shihao Tian,
Xutong Wan,
Peng Chang
Abstract:
Reconstruction-based methods are a dominant paradigm in time series anomaly detection (TSAD), however, their near-universal reliance on Mean Squared Error (MSE) loss results in statistically flawed reconstruction residuals. This fundamental weakness leads to noisy, unstable anomaly scores, hindering reliable detection. To address this, we propose Constrained Gaussian-Noise Optimization and Smoothi…
▽ More
Reconstruction-based methods are a dominant paradigm in time series anomaly detection (TSAD), however, their near-universal reliance on Mean Squared Error (MSE) loss results in statistically flawed reconstruction residuals. This fundamental weakness leads to noisy, unstable anomaly scores, hindering reliable detection. To address this, we propose Constrained Gaussian-Noise Optimization and Smoothing (COGNOS), a universal, model-agnostic enhancement framework that tackles this issue at its source. COGNOS introduces a novel Gaussian-White Noise Regularization strategy during training, which directly constrains the model's output residuals to conform to a Gaussian white noise distribution. This engineered statistical property creates the ideal precondition for our second contribution: Adaptive Residual Kalman Smoother that operates as a statistically robust estimator to denoise the raw anomaly scores. Extensive experiments on multiple benchmarks demonstrate that COGNOS consistently enhances the performance of state-of-the-art backbones significantly, validating the efficacy of coupling statistical regularization with adaptive filtering.
△ Less
Submitted 19 January, 2026; v1 submitted 10 November, 2025;
originally announced November 2025.
-
Risk Aware Safe Control with Multi-Modal Sensing for Dynamic Obstacle Avoidance
Authors:
Pei Yu Chang,
Qizhe Xu,
Vishnu Renganathan,
Qadeer Ahmed
Abstract:
Safe control in dynamic traffic environments remains a major challenge for autonomous vehicles (AVs), as ego vehicle and obstacle states are inherently affected by sensing noise and estimation uncertainty. However, existing studies have not sufficiently addressed how uncertain multi-modal sensing information can be systematically incorporated into tail-risk-aware safety-critical control. To addres…
▽ More
Safe control in dynamic traffic environments remains a major challenge for autonomous vehicles (AVs), as ego vehicle and obstacle states are inherently affected by sensing noise and estimation uncertainty. However, existing studies have not sufficiently addressed how uncertain multi-modal sensing information can be systematically incorporated into tail-risk-aware safety-critical control. To address this gap, this paper proposes a risk-aware safe control framework that integrates probabilistic state estimation with a conditional value-at-risk (CVaR) control barrier function (CBF) safety filter. Obstacle detections from cameras, LiDAR, and vehicle-to-everything (V2X) communication are combined using a Wasserstein barycenter (WB) to obtain a probabilistic state estimate. A model predictive controller generates the nominal control, which is then filtered through a CVaR-CBF quadratic program to enforce risk-aware safety constraints. The approach is evaluated through numerical studies and further validated on a full-scale AV. Results demonstrate improved safety and robustness over a baseline MPC-CBF design, with an average improvement of 12.7\% in success rate across the evaluated scenarios.
△ Less
Submitted 13 March, 2026; v1 submitted 3 November, 2025;
originally announced November 2025.
-
Observation of the radiative decay $D_s (2317)^+ \to D_s^* γ$
Authors:
Belle II Collaboration,
M. Abumusabh,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
N. K. Baghel,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Barrett
, et al. (345 additional authors not shown)
Abstract:
We observe the radiative decay $D^{*}_{s0}(2317)^{+} \to D_{s}^{*+} γ$ for the first time, with a significance exceeding $10$ standard deviations. The signal is found in the continuum $e^+ e^- \to c\bar{c}$ process with the combined data samples of 980.4~$\rm fb^{-1}$ and 427.9~$\rm fb^{-1}$ collected by the Belle and Belle~II detectors operating at the KEKB and SuperKEKB asymmetric-energy…
▽ More
We observe the radiative decay $D^{*}_{s0}(2317)^{+} \to D_{s}^{*+} γ$ for the first time, with a significance exceeding $10$ standard deviations. The signal is found in the continuum $e^+ e^- \to c\bar{c}$ process with the combined data samples of 980.4~$\rm fb^{-1}$ and 427.9~$\rm fb^{-1}$ collected by the Belle and Belle~II detectors operating at the KEKB and SuperKEKB asymmetric-energy $e^+e^-$ colliders, respectively. The branching fraction ratio ${\cal B}(D^{*}_{s0}(2317)^{+} \to D_{s}^{*+} γ)/{\cal B}(D^{*}_{s0}(2317)^{+} \to D_{s}^{+} π^{0})$ is measured to be $[7.14 \pm 0.70({\rm stat.}) \pm 0.23({\rm syst.})]\%$. This result provides significant new experimental input for the determination of the quark structure of the $D^{*}_{s0}(2317)^{+}$, which remains unknown.
△ Less
Submitted 31 October, 2025;
originally announced October 2025.
-
Improved measurement of Born cross sections for $χ_{bJ}\,ω$ and $χ_{bJ}\,(π^+π^-π^0)_{\rm non-ω}$ ($J$ = 0, 1, 2) at Belle and Belle II
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
M. Alhakami,
A. Aloisio,
N. Althubiti,
M. Angelsmark,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Barrett
, et al. (402 additional authors not shown)
Abstract:
We study the processes $χ_{bJ}\,ω$ and $χ_{bJ}\,(π^+π^-π^0)_{\rm non-ω}$ ($J$ = 0, 1, 2) at center-of-mass energies $\sqrt{s}$ from 10.73--11.02 GeV using a $142.5\,\mathrm{fb}^{-1}$ data sample collected with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider; and at $\sqrt{s}\sim10.75$ GeV using a $19.8\,\mathrm{fb}^{-1}$ sample collected with Belle II at SuperKEKB. We find that…
▽ More
We study the processes $χ_{bJ}\,ω$ and $χ_{bJ}\,(π^+π^-π^0)_{\rm non-ω}$ ($J$ = 0, 1, 2) at center-of-mass energies $\sqrt{s}$ from 10.73--11.02 GeV using a $142.5\,\mathrm{fb}^{-1}$ data sample collected with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider; and at $\sqrt{s}\sim10.75$ GeV using a $19.8\,\mathrm{fb}^{-1}$ sample collected with Belle II at SuperKEKB. We find that the $Υ(10753)$ state decays into $χ_{bJ}\,ω$ but not into $χ_{bJ}\,(π^+π^-π^0)_{\rm non-ω}$, while the $Υ(10860)$ state, in contrast, decays into $χ_{bJ}\,(π^+π^-π^0)_{\rm non-ω}$ but not into $χ_{bJ}\,ω$. The mass and width of the $Υ(10753)$ state are measured to be $(10756.1\pm3.4({\rm stat.})\pm2.7({\rm syst.}))$ MeV/$c^2$ and $(32.2\pm11.3({\rm stat.})\pm14.9({\rm syst.}))$ MeV. The products of the partial width to $e^+e^-$ and branching fractions for $Υ(10753)\toχ_{b1}\,ω$ and $Υ(10753)\toχ_{b2}\,ω$ are ($1.46\pm0.25({\rm stat.})\pm 0.20({\rm syst.})$) eV and ($1.29\pm0.38({\rm stat.})\pm 0.31({\rm syst.})$) eV.
△ Less
Submitted 29 October, 2025;
originally announced October 2025.
-
Waveguide-Plasmon Polariton Quasiparticles with Exceptional Point Characteristics
Authors:
P. Chang,
S. Ramezanpour,
A. Helmy
Abstract:
The growing complexity of integrated photonics necessitates compact, low-power devices that transcend traditional, material-centric design approaches. In this study, we harness non-Hermitian physics to uncover novel properties of coupled plasmonic waveguide modes exhibiting exceptional point (EP) degeneracy. Our hybrid plasmonic waveguide architecture, capable of supporting both strong and weak co…
▽ More
The growing complexity of integrated photonics necessitates compact, low-power devices that transcend traditional, material-centric design approaches. In this study, we harness non-Hermitian physics to uncover novel properties of coupled plasmonic waveguide modes exhibiting exceptional point (EP) degeneracy. Our hybrid plasmonic waveguide architecture, capable of supporting both strong and weak coupling regimes between plasmonic and dielectric waveguide modes, is precisely engineered to reach an EP where eigenmodes coalesce. This strategic tuning not only enhances the modal contrast between minimized-loss and highly dissipative states but also enables unprecedented control over device characteristics. Our findings introduce a new paradigm in integrated photonics, paving the way for ultracompact modulators and highly tunable on-chip communication systems with reduced power consumption.
△ Less
Submitted 28 October, 2025;
originally announced October 2025.
-
Constraints on ultraheavy dark matter from the CDEX-10 experiment at the China Jinping Underground Laboratory
Authors:
Y. F. Wang,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
H. Chen,
Y. H. Chen,
J. P. Cheng,
J. Y. Cui,
W. H. Dai,
Z. Deng,
Y. X. Dong,
C. H. Fang,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
H. X. Huang,
T. C. Huang,
S. Karmakar
, et al. (63 additional authors not shown)
Abstract:
We report a search for ultraheavy dark matter (UHDM) with the CDEX-10 experiment at the China Jinping Underground Laboratory. Using a Monte Carlo framework that incorporates Earth shielding effects, we simulated UHDM propagation and energy deposition in p-type point-contact germanium detectors. Analysis of 205.4 kg$\cdot$day exposure in the 0.16--4.16 keVee range showed no excess above background.…
▽ More
We report a search for ultraheavy dark matter (UHDM) with the CDEX-10 experiment at the China Jinping Underground Laboratory. Using a Monte Carlo framework that incorporates Earth shielding effects, we simulated UHDM propagation and energy deposition in p-type point-contact germanium detectors. Analysis of 205.4 kg$\cdot$day exposure in the 0.16--4.16 keVee range showed no excess above background. Our results exclude the spin-independent UHDM-nucleon scattering with two cross section scales, with the UHDM mass from $10^6$ to $10^{11}$ GeV, and provide the most stringent constraints with solid-state detectors below $10^8$ GeV.
△ Less
Submitted 28 March, 2026; v1 submitted 24 October, 2025;
originally announced October 2025.
-
Measurement of the $CP$ asymmetry in $D^0\toπ^+π^-π^0$ decays at Belle II
Authors:
Belle II Collaboration,
M. Abumusabh,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Barrett
, et al. (378 additional authors not shown)
Abstract:
We measure the time- and phase-space-integrated $CP$ asymmetry $A_{CP}$ in $D^0\toπ^+π^-π^0$ decays reconstructed in $e^+e^-\to c\bar c$ events collected by the Belle II experiment from 2019 to 2022. This sample corresponds to an integrated luminosity of 428 fb$^{-1}$. We require $D^0$ mesons to be produced in $D^{*+}\to D^0π^+$ decays to determine their flavor at production. Control samples of…
▽ More
We measure the time- and phase-space-integrated $CP$ asymmetry $A_{CP}$ in $D^0\toπ^+π^-π^0$ decays reconstructed in $e^+e^-\to c\bar c$ events collected by the Belle II experiment from 2019 to 2022. This sample corresponds to an integrated luminosity of 428 fb$^{-1}$. We require $D^0$ mesons to be produced in $D^{*+}\to D^0π^+$ decays to determine their flavor at production. Control samples of $D^0\to K^-π^+$ decays are used to correct for reconstruction-induced asymmetries. The result, $A_{CP}(D^0\toπ^+π^-π^0)=(0.29\pm0.27\pm0.13)\%$, where the first uncertainty is statistical and the second systematic, is the most precise result to date and is consistent with $CP$ conservation.
△ Less
Submitted 17 March, 2026; v1 submitted 24 October, 2025;
originally announced October 2025.
-
Neutron Star-Main Sequence Collisions Robustly Form Dynamically Stable Thorne-Żytkow Objects
Authors:
Lauryn E. Williams,
Philip Chang,
Emily M. Levesque,
Thomas R. Quinn
Abstract:
Thorne-Żytkow Objects (TŻOs) are hypothetical hybrid stars with a neutron star at the core of a large, diffuse envelope. (TŻOs) may be formed when a newly formed neutron star that is kicked by its supernova collides with its main-sequence companion. Using a moving-mesh hydrodynamics solver integrated into the parallel-code Charm N-body GrAvity solver, we demonstrate that these ``impact scenario''…
▽ More
Thorne-Żytkow Objects (TŻOs) are hypothetical hybrid stars with a neutron star at the core of a large, diffuse envelope. (TŻOs) may be formed when a newly formed neutron star that is kicked by its supernova collides with its main-sequence companion. Using a moving-mesh hydrodynamics solver integrated into the parallel-code Charm N-body GrAvity solver, we demonstrate that these ``impact scenario'' formation processes robustly form (TŻOs) for periastron distances less than one stellar radius. These (TŻOs) are dynamically stable and they can serve as initial models for further evolutionary studies.
△ Less
Submitted 17 October, 2025;
originally announced October 2025.
-
Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following
Authors:
Qingyu Ren,
Qianyu He,
Powei Chang,
Jie Zeng,
Zeye Sun,
Fei Yu,
Jiaqing Liang,
Yanghua Xiao
Abstract:
Language models often struggle to follow multi-constraint instructions that are crucial for real-world applications. Existing reinforcement learning (RL) approaches suffer from dependency on external supervision and sparse reward signals from multi-constraint tasks. We propose a label-free self-supervised RL framework that eliminates dependency on external supervision by deriving reward signals di…
▽ More
Language models often struggle to follow multi-constraint instructions that are crucial for real-world applications. Existing reinforcement learning (RL) approaches suffer from dependency on external supervision and sparse reward signals from multi-constraint tasks. We propose a label-free self-supervised RL framework that eliminates dependency on external supervision by deriving reward signals directly from instructions and generating pseudo-labels for reward model training. Our approach introduces constraint decomposition strategies and efficient constraint-wise binary classification to address sparse reward challenges while maintaining computational efficiency. Experiments show that our approach generalizes well, achieving strong improvements across 3 in-domain and 5 out-of-domain datasets, including challenging agentic and multi-turn instruction following. The data and code are publicly available at https://github.com/Rainier-rq/verl-if
△ Less
Submitted 14 April, 2026; v1 submitted 16 October, 2025;
originally announced October 2025.
-
PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference
Authors:
Yang Yang,
Severi Rissanen,
Paul E. Chang,
Nasrulloh Loka,
Daolang Huang,
Arno Solin,
Markus Heinonen,
Luigi Acerbi
Abstract:
Amortized simulator-based inference offers a powerful framework for tackling Bayesian inference in computational fields such as engineering or neuroscience, increasingly leveraging modern generative methods like diffusion models to map observed data to model parameters or future predictions. These approaches yield posterior or posterior-predictive samples for new datasets without requiring further…
▽ More
Amortized simulator-based inference offers a powerful framework for tackling Bayesian inference in computational fields such as engineering or neuroscience, increasingly leveraging modern generative methods like diffusion models to map observed data to model parameters or future predictions. These approaches yield posterior or posterior-predictive samples for new datasets without requiring further simulator calls after training on simulated parameter-data pairs. However, their applicability is often limited by the prior distribution(s) used to generate model parameters during this training phase. To overcome this constraint, we introduce PriorGuide, a technique specifically designed for diffusion-based amortized inference methods. PriorGuide leverages a novel guidance approximation that enables flexible adaptation of the trained diffusion model to new priors at test time, crucially without costly retraining. This allows users to readily incorporate updated information or expert knowledge post-training, enhancing the versatility of pre-trained inference models.
△ Less
Submitted 15 October, 2025;
originally announced October 2025.
-
Risk-Budgeted Control Framework for Balanced Performance and Safety in Autonomous Vehicles
Authors:
Pei Yu Chang,
Vishnu Renganathan,
Qadeer Ahmed
Abstract:
This paper presents a hybrid control framework with a risk-budgeted monitor for safety-certified autonomous driving. A sliding-window monitor tracks insufficient barrier residuals and triggers switching from a relaxed control barrier function (R-CBF) to a more conservative conditional value-at-risk CBF (CVaR-CBF) when the safety margin deteriorates. Two real-time triggers are considered: feasibili…
▽ More
This paper presents a hybrid control framework with a risk-budgeted monitor for safety-certified autonomous driving. A sliding-window monitor tracks insufficient barrier residuals and triggers switching from a relaxed control barrier function (R-CBF) to a more conservative conditional value-at-risk CBF (CVaR-CBF) when the safety margin deteriorates. Two real-time triggers are considered: feasibility-triggered (FT), which activates CVaR-CBF when the R-CBF problem is reported infeasible, and quality-triggered (QT), which switches when the residual falls below a prescribed safety margin.
The framework is evaluated with model predictive control (MPC) under vehicle localization noise and obstacle position uncertainty across multiple AV-pedestrian interaction scenarios with 1,500 Monte Carlo runs. In the most challenging case with 5 m pedestrian detection uncertainty, the proposed method achieves a 94--96\% collision-free success rate over 300 trials while maintaining the lowest mean cross-track error (CTE = 3.2--3.6 m), indicating faster trajectory recovery after obstacle avoidance and a favorable balance between safety and performance.
△ Less
Submitted 13 March, 2026; v1 submitted 12 October, 2025;
originally announced October 2025.
-
Efficient Autoregressive Inference for Transformer Probabilistic Models
Authors:
Conor Hassan,
Nasrulloh Loka,
Cen-You Li,
Daolang Huang,
Paul E. Chang,
Yang Yang,
Francesco Silvestrin,
Samuel Kaski,
Luigi Acerbi
Abstract:
Transformer-based models for amortized probabilistic inference, such as neural processes, prior-fitted networks, and tabular foundation models, excel at single-pass marginal prediction. However, many real-world applications, from signal interpolation to multi-column tabular predictions, require coherent joint distributions that capture dependencies between predictions. While purely autoregressive…
▽ More
Transformer-based models for amortized probabilistic inference, such as neural processes, prior-fitted networks, and tabular foundation models, excel at single-pass marginal prediction. However, many real-world applications, from signal interpolation to multi-column tabular predictions, require coherent joint distributions that capture dependencies between predictions. While purely autoregressive architectures efficiently generate such distributions, they sacrifice the flexible set-conditioning that makes these models powerful for meta-learning. Conversely, the standard approach to obtain joint distributions from set-based models requires expensive re-encoding of the entire augmented conditioning set at each autoregressive step. We introduce a causal autoregressive buffer that preserves the advantages of both paradigms. Our approach decouples context encoding from updating the conditioning set. The model processes the context once and caches it. A dynamic buffer then captures target dependencies: as targets are incorporated, they enter the buffer and attend to both the cached context and previously buffered targets. This enables efficient batched autoregressive generation and one-pass joint log-likelihood evaluation. A unified training strategy allows seamless integration of set-based and autoregressive modes at minimal additional cost. Across synthetic functions, EEG signals, cognitive models, and tabular data, our method matches predictive accuracy of strong baselines while delivering up to 20 times faster joint sampling. Our approach combines the efficiency of autoregressive generative models with the representational power of set-based conditioning, making joint prediction practical for transformer-based probabilistic models.
△ Less
Submitted 10 October, 2025;
originally announced October 2025.
-
Constraints on inelastic dark matter from the CDEX-1B experiment
Authors:
Y. F. Liang,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
H. Chen,
Y. H. Chen,
J. P. Cheng,
J. Y. Cui,
W. H. Dai,
Z. Deng,
Y. X. Dong,
C. H. Fang,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
H. X. Huang,
T. C. Huang,
S. Karmakar
, et al. (63 additional authors not shown)
Abstract:
We present limits on spin-independent inelastic weakly interacting massive particles (WIMP)-nucleus scattering using the 737.1 kg$\cdot$day dataset from the CDEX-1B experiment. Expected nuclear recoil spectra for various inelastic WIMP masses $m_χ$ and mass splittings $δ$ are calculated under the standard halo model. An accurate background model of CDEX-1B is constructed by simulating all major ba…
▽ More
We present limits on spin-independent inelastic weakly interacting massive particles (WIMP)-nucleus scattering using the 737.1 kg$\cdot$day dataset from the CDEX-1B experiment. Expected nuclear recoil spectra for various inelastic WIMP masses $m_χ$ and mass splittings $δ$ are calculated under the standard halo model. An accurate background model of CDEX-1B is constructed by simulating all major background sources. The model parameters are then determined through maximum likelihood estimation and Markov chain Monte Carlo fitting. The resulting 90\% confidence level upper limits on the WIMP-nucleon cross section $σ_{\mathrm{n}}$ exclude certain DAMA/LIBRA allowed regions: the $χ^2 < 4$ regions for $δ< 30$ keV at $m_χ= 250$ GeV and the $χ^2 < 9$ region for $δ< 50$ keV at $m_χ= 500$ GeV. The method is applicable to other inelastic dark matter scenarios, and the upcoming CDEX-50 experiment is expected to improve sensitivity by four orders of magnitude.
△ Less
Submitted 31 December, 2025; v1 submitted 9 October, 2025;
originally announced October 2025.
-
Generative World Modelling for Humanoids: 1X World Model Challenge Technical Report
Authors:
Riccardo Mereu,
Aidan Scannell,
Yuxin Hou,
Yi Zhao,
Aditya Jitta,
Antonio Dominguez,
Luigi Acerbi,
Amos Storkey,
Paul Chang
Abstract:
World models are a powerful paradigm in AI and robotics, enabling agents to reason about the future by predicting visual observations or compact latent states. The 1X World Model Challenge introduces an open-source benchmark of real-world humanoid interaction, with two complementary tracks: sampling, focused on forecasting future image frames, and compression, focused on predicting future discrete…
▽ More
World models are a powerful paradigm in AI and robotics, enabling agents to reason about the future by predicting visual observations or compact latent states. The 1X World Model Challenge introduces an open-source benchmark of real-world humanoid interaction, with two complementary tracks: sampling, focused on forecasting future image frames, and compression, focused on predicting future discrete latent codes. For the sampling track, we adapt the video generation foundation model Wan-2.2 TI2V-5B to video-state-conditioned future frame prediction. We condition the video generation on robot states using AdaLN-Zero, and further post-train the model using LoRA. For the compression track, we train a Spatio-Temporal Transformer model from scratch. Our models achieve 23.0 dB PSNR in the sampling task and a Top-500 CE of 6.6386 in the compression task, securing 1st place in both challenges.
△ Less
Submitted 8 October, 2025;
originally announced October 2025.
-
Search for $CP$ violation in $Ξ_c^+\toΣ^+h^+h^-$ and $Λ_c^+\to ph^+h^-$ at Belle II
Authors:
Belle II Collaboration,
M. Abumusabh,
I. Adachi,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
N. Althubiti,
K. Amos,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
R. Ayad,
V. Babu,
N. K. Baghel,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Bartl,
J. Baudot,
A. Beaubien,
J. Becker,
J. V. Bennett
, et al. (322 additional authors not shown)
Abstract:
We report decay-rate $CP$ asymmetries of the singly-Cabibbo-suppressed decays $Ξ_c^+\toΣ^+h^+h^-$ and $Λ_c^+\to ph^+h^-$, with $h=K,π$, measured using 428 fb$^{-1}$ of $e^+e^-$ collisions collected by the Belle II experiment at the SuperKEKB collider. The results, \begin{equation}
A_{CP}(Ξ_c^+\toΣ^+K^+K^-) = (3.7\pm6.6\pm0.6)\%, \end{equation} \begin{equation}
A_{CP}(Ξ_c^+\toΣ^+π^+π^-) = (9.5\…
▽ More
We report decay-rate $CP$ asymmetries of the singly-Cabibbo-suppressed decays $Ξ_c^+\toΣ^+h^+h^-$ and $Λ_c^+\to ph^+h^-$, with $h=K,π$, measured using 428 fb$^{-1}$ of $e^+e^-$ collisions collected by the Belle II experiment at the SuperKEKB collider. The results, \begin{equation}
A_{CP}(Ξ_c^+\toΣ^+K^+K^-) = (3.7\pm6.6\pm0.6)\%, \end{equation} \begin{equation}
A_{CP}(Ξ_c^+\toΣ^+π^+π^-) = (9.5\pm6.8\pm0.5)\%, \end{equation} \begin{equation}
A_{CP}(Λ_c^+\to pK^+K^-) = (3.9\pm1.7\pm0.7)\%, \end{equation} \begin{equation}
A_{CP}(Λ_c^+\to pπ^+π^-) = (0.3\pm1.0\pm0.2)\%, \end{equation} where the first uncertainties are statistical and the second systematic, agree with $CP$ symmetry. From these results we derive the sums \begin{equation}
A_{CP}(Ξ_c^+\toΣ^+π^+π^-) \, + \, A_{CP}(Λ_c^+\to pK^+K^-) = (13.4 \pm 7.0\pm 0.9)\%, \end{equation} \begin{equation}
A_{CP}(Ξ_c^+\toΣ^+K^+K^-) \, + \, A_{CP}(Λ_c^+\to pπ^+π^-) = (\phantom{0}4.0 \pm 6.6\pm 0.7)\%, \end{equation} which are consistent with the $U$-spin symmetry prediction of zero. These are the first measurements of $CP$ asymmetries for individual hadronic three-body charmed-baryon decays.
△ Less
Submitted 30 September, 2025;
originally announced September 2025.
-
Fair Universe Higgs Uncertainty Challenge
Authors:
Ragansu Chakkappai,
Wahid Bhimji,
Paolo Calafiura,
Po-Wen Chang,
Yuan-Tang Chou,
Sascha Diefenbacher,
Jordan Dudley,
Steven Farrell,
Aishik Ghosh,
Isabelle Guyon,
Chris Harris,
Shih-Chieh Hsu,
Elham E. Khoda,
Benjamin Nachman,
Peter Nugent,
David Rousseau,
Benjamin Thorne,
Ihsan Ullah,
Yulei Zhang
Abstract:
This competition in high-energy physics (HEP) and machine learning was the first to strongly emphasise uncertainties in $(H \rightarrow τ^+ τ^-)$ cross-section measurement. Participants were tasked with developing advanced analysis techniques capable of dealing with uncertainties in the input training data and providing credible confidence intervals. The accuracy of these intervals was evaluated u…
▽ More
This competition in high-energy physics (HEP) and machine learning was the first to strongly emphasise uncertainties in $(H \rightarrow τ^+ τ^-)$ cross-section measurement. Participants were tasked with developing advanced analysis techniques capable of dealing with uncertainties in the input training data and providing credible confidence intervals. The accuracy of these intervals was evaluated using pseudo-experiments to assess correct coverage. The dataset is now published in Zenodo, and the winning submissions are fully documented.
△ Less
Submitted 4 March, 2026; v1 submitted 26 September, 2025;
originally announced September 2025.
-
Dimer-driven multiple reentrant localization with composite potential
Authors:
Pei-Jie Chang,
Dong Ruan,
Gui-Lu Long
Abstract:
Recent studies have revealed reentrant localization transitions in quasi-periodic one-dimensional lattices, where the competition between dimerized hopping and staggered disorder plays a central role. Yet the extent to which such reentrant localization persists under more general conditions, such as additional periodic potentials, modified quasi-periodic modulations remains unclear. Here we invest…
▽ More
Recent studies have revealed reentrant localization transitions in quasi-periodic one-dimensional lattices, where the competition between dimerized hopping and staggered disorder plays a central role. Yet the extent to which such reentrant localization persists under more general conditions, such as additional periodic potentials, modified quasi-periodic modulations remains unclear. Here we investigate localization phenomena in a one-dimensional lattice subject to a periodic potential and an additional quasi-periodic modulation. Using both eigenstate-based indicators and experimentally accessible dynamical observables, we identify robust reentrant, or multiple, localization transitions. We show that these transitions are uniquely stabilized by the dimer structure of the unit cell, where the competition between the onsite periodic potential and the quasi-periodic modulation becomes most pronounced. By systematically varying the periodicity parameter $α$ and the quasi-periodic frequency $β$, we find that the robust multiple reentrant localization behavior disappears for any deviation from the dimer configuration, confirming its essential role. Our results suggest that the interplay between these competing factors drives the multiple reentrant localization transitions.
△ Less
Submitted 28 September, 2025; v1 submitted 12 September, 2025;
originally announced September 2025.
-
Embodied Intelligence in Disassembly: Multimodal Perception Cross-validation and Continual Learning in Neuro-Symbolic TAMP
Authors:
Ziwen He,
Zhigang Wang,
Yanlong Peng,
Pengxu Chang,
Hong Yang,
Ming Chen
Abstract:
With the rapid development of the new energy vehicle industry, the efficient disassembly and recycling of power batteries have become a critical challenge for the circular economy. In current unstructured disassembly scenarios, the dynamic nature of the environment severely limits the robustness of robotic perception, posing a significant barrier to autonomous disassembly in industrial application…
▽ More
With the rapid development of the new energy vehicle industry, the efficient disassembly and recycling of power batteries have become a critical challenge for the circular economy. In current unstructured disassembly scenarios, the dynamic nature of the environment severely limits the robustness of robotic perception, posing a significant barrier to autonomous disassembly in industrial applications. This paper proposes a continual learning framework based on Neuro-Symbolic task and motion planning (TAMP) to enhance the adaptability of embodied intelligence systems in dynamic environments. Our approach integrates a multimodal perception cross-validation mechanism into a bidirectional reasoning flow: the forward working flow dynamically refines and optimizes action strategies, while the backward learning flow autonomously collects effective data from historical task executions to facilitate continual system learning, enabling self-optimization. Experimental results show that the proposed framework improves the task success rate in dynamic disassembly scenarios from 81.68% to 100%, while reducing the average number of perception misjudgments from 3.389 to 1.128. This research provides a new paradigm for enhancing the robustness and adaptability of embodied intelligence in complex industrial environments.
△ Less
Submitted 14 September, 2025;
originally announced September 2025.
-
PT symmetry-enriched non-unitary criticality
Authors:
Kuang-Hung Chou,
Xue-Jia Yu,
Po-Yao Chang
Abstract:
The interplay between topology and quantum criticality gives rise to the notion of symmetry-enriched criticality, which has attracted considerable attention in recent years. However, its non-Hermitian counterpart remains largely unexplored. In this Letter, we show how parity-time (PT) symmetry enriches non-Hermitian critical points, giving rise to a topologically distinct non-unitary universality…
▽ More
The interplay between topology and quantum criticality gives rise to the notion of symmetry-enriched criticality, which has attracted considerable attention in recent years. However, its non-Hermitian counterpart remains largely unexplored. In this Letter, we show how parity-time (PT) symmetry enriches non-Hermitian critical points, giving rise to a topologically distinct non-unitary universality class. By analytically investigating non-Hermitian free fermion models with $PT$ symmetry, we uncover a new class of conformally invariant non-unitary critical points that host robust topological edge modes. Remarkably, the associated topological degeneracy is surprisingly encoded in the purely imaginary part of the entanglement entropy scaling-a feature absent in Hermitian systems. The underlying mechanism for the emergence of edge states at non-Hermitian criticality is traced to a generalized mass inversion that is absent in Hermitian systems.
△ Less
Submitted 8 October, 2025; v1 submitted 11 September, 2025;
originally announced September 2025.
-
Decoding RobKiNet: Insights into Efficient Training of Robotic Kinematics Informed Neural Network
Authors:
Yanlong Peng,
Zhigang Wang,
Ziwen He,
Pengxu Chang,
Chuangchuang Zhou,
Yu Yan,
Ming Chen
Abstract:
In robots task and motion planning (TAMP), it is crucial to sample within the robot's configuration space to meet task-level global constraints and enhance the efficiency of subsequent motion planning. Due to the complexity of joint configuration sampling under multi-level constraints, traditional methods often lack efficiency. This paper introduces the principle of RobKiNet, a kinematics-informed…
▽ More
In robots task and motion planning (TAMP), it is crucial to sample within the robot's configuration space to meet task-level global constraints and enhance the efficiency of subsequent motion planning. Due to the complexity of joint configuration sampling under multi-level constraints, traditional methods often lack efficiency. This paper introduces the principle of RobKiNet, a kinematics-informed neural network, for end-to-end sampling within the Continuous Feasible Set (CFS) under multiple constraints in configuration space, establishing its Optimization Expectation Model. Comparisons with traditional sampling and learning-based approaches reveal that RobKiNet's kinematic knowledge infusion enhances training efficiency by ensuring stable and accurate gradient optimization.Visualizations and quantitative analyses in a 2-DOF space validate its theoretical efficiency, while its application on a 9-DOF autonomous mobile manipulator robot(AMMR) demonstrates superior whole-body and decoupled control, excelling in battery disassembly tasks. RobKiNet outperforms deep reinforcement learning with a training speed 74.29 times faster and a sampling accuracy of up to 99.25%, achieving a 97.33% task completion rate in real-world scenarios.
△ Less
Submitted 9 September, 2025;
originally announced September 2025.
-
Nonparametric Bayesian Multi-Treatment Mixture Cure Survival Model with Application in Pediatric Oncology
Authors:
Peter Chang,
John Kairalla,
Arkaprava Roy
Abstract:
Heterogeneous treatment effect estimation is critical in oncology, particularly in multi-arm trials with overlapping therapeutic components and long-term survivors. These shared mechanisms pose a central challenge to identifying causal effects in precision medicine. We propose a novel covariate-dependent nonparametric Bayesian multi-treatment cure survival model that jointly accounts for common st…
▽ More
Heterogeneous treatment effect estimation is critical in oncology, particularly in multi-arm trials with overlapping therapeutic components and long-term survivors. These shared mechanisms pose a central challenge to identifying causal effects in precision medicine. We propose a novel covariate-dependent nonparametric Bayesian multi-treatment cure survival model that jointly accounts for common structures among treatments and cure fractions. Through latent link functions, our model leverages sharing among treatments through a flexible modeling approach, enabling individualized survival inference. We adopt a Bayesian route for inference and implement an efficient MCMC algorithm for approximating the posterior. Simulation studies demonstrate the method's robustness and superiority in various specification scenarios. Finally, application to the AALL0434 trial reveals clinically meaningful differences in survival across methotrexate-based regimens and their associations with different covariates, underscoring its practical utility for learning treatment effects in real-world pediatric oncology data.
△ Less
Submitted 27 October, 2025; v1 submitted 12 August, 2025;
originally announced August 2025.
-
Magic Entropy in Hybrid Spin-Boson Systems
Authors:
Samuel Crew,
Ying-Lin Li,
Heng-Hsi Li,
Po-Yao Chang
Abstract:
We introduce entropic measures to quantify non-classical resource in hybrid spin-boson systems. We discuss the stabilizer Rényi entropy in the framework of phase space quantisation and define an analogous hybrid magic entropy and a mutual magic entropy that capture the distribution of quantum magic across spin and bosonic subsystems. We use these entropic measures to demonstrate two key phenomena:…
▽ More
We introduce entropic measures to quantify non-classical resource in hybrid spin-boson systems. We discuss the stabilizer Rényi entropy in the framework of phase space quantisation and define an analogous hybrid magic entropy and a mutual magic entropy that capture the distribution of quantum magic across spin and bosonic subsystems. We use these entropic measures to demonstrate two key phenomena: the detection of the superradiant phase transition in the Dicke model and the dynamics of magic in the Jaynes-Cummings model following a quench. We develop a Monte Carlo numerical scheme to enable practical computation in many-body examples.
△ Less
Submitted 8 August, 2025;
originally announced August 2025.
-
Inverse Lieb Materials: Altermagnetism and More
Authors:
Po-Hao Chang,
Igor I. Mazin,
Kirill D. Belashchenko
Abstract:
The Lieb lattice, originally proposed for cuprate superconductors, has gained new attention in the emerging field of altermagnetism as a minimal analytical model for the latter. While initially the so-called inverse Lieb lattice (ILL) was deemed only a theoretical model, recently several real materials with this crystallographic motif have been found. The unique geometry of ILL can accommodate com…
▽ More
The Lieb lattice, originally proposed for cuprate superconductors, has gained new attention in the emerging field of altermagnetism as a minimal analytical model for the latter. While initially the so-called inverse Lieb lattice (ILL) was deemed only a theoretical model, recently several real materials with this crystallographic motif have been found. The unique geometry of ILL can accommodate complex magnetic orderings arising from competing exchange interactions and geometric frustration, offering great tunability for magnetic properties. In this work, we provide comprehensive insights into magnetic phases in ILL materials and establish guidelines for efficient identification of altermagnetic materials within this family. We begin by constructing phase diagrams using a simple Heisenberg model to elucidate the fundamental mechanisms underlying altermagnetism and other complex magnetic phases observed experimentally. To bridge theory with experiment, we systematically investigate a series of existing ILL compounds using density functional theory (DFT) calculations to determine their magnetic ground states. Our computational results are in good agreement with experimental observations. Importantly, we identify a trend linking magnetic ordering to the $d$-shell filling of transition metal ions, with $d^{2-3}$ and $d^{5}$ configurations showing propensity for altermagnetic behavior. Additionally, we identify a promising metallic compound Sr$_{2}$CrO$_{2}$Cr$_{2}$OAs$_{2}$ as an altermagnet that is highly anisotropic in its $J_2$ exchange couplings with large Néel temperature ($\sim 600$ K). Using exchange coupling parameters extracted from DFT calculations, we compute the magnon spectra for altermagnetic systems. As expected, chiral splittings in the magnon dispersion are directly correlated with anisotropy between crystallographically inequivalent $J_{2}$ exchange interactions.
△ Less
Submitted 8 August, 2025; v1 submitted 6 August, 2025;
originally announced August 2025.
-
Magentic-UI: Towards Human-in-the-loop Agentic Systems
Authors:
Hussein Mozannar,
Gagan Bansal,
Cheng Tan,
Adam Fourney,
Victor Dibia,
Jingya Chen,
Jack Gerrits,
Tyler Payne,
Matheus Kunzler Maldaner,
Madeleine Grunde-McLaughlin,
Eric Zhu,
Griffin Bassman,
Jacob Alber,
Peter Chang,
Ricky Loynd,
Friederike Niedtner,
Ece Kamar,
Maya Murad,
Rafah Hosn,
Saleema Amershi
Abstract:
AI agents powered by large language models are increasingly capable of autonomously completing complex, multi-step tasks using external tools. Yet, they still fall short of human-level performance in most domains including computer use, software development, and research. Their growing autonomy and ability to interact with the outside world, also introduces safety and security risks including pote…
▽ More
AI agents powered by large language models are increasingly capable of autonomously completing complex, multi-step tasks using external tools. Yet, they still fall short of human-level performance in most domains including computer use, software development, and research. Their growing autonomy and ability to interact with the outside world, also introduces safety and security risks including potentially misaligned actions and adversarial manipulation. We argue that human-in-the-loop agentic systems offer a promising path forward, combining human oversight and control with AI efficiency to unlock productivity from imperfect systems. We introduce Magentic-UI, an open-source web interface for developing and studying human-agent interaction. Built on a flexible multi-agent architecture, Magentic-UI supports web browsing, code execution, and file manipulation, and can be extended with diverse tools via Model Context Protocol (MCP). Moreover, Magentic-UI presents six interaction mechanisms for enabling effective, low-cost human involvement: co-planning, co-tasking, multi-tasking, action guards, and long-term memory. We evaluate Magentic-UI across four dimensions: autonomous task completion on agentic benchmarks, simulated user testing of its interaction capabilities, qualitative studies with real users, and targeted safety assessments. Our findings highlight Magentic-UI's potential to advance safe and efficient human-agent collaboration.
△ Less
Submitted 29 July, 2025;
originally announced July 2025.
-
Biorthogonal quench dynamics of entanglement and quantum geometry in PT-symmetric non-Hermitian systems
Authors:
Hsueh-Hao Lu,
Po-Yao Chang
Abstract:
We explore the quench dynamics of PT-symmetric non-Hermitian systems by utilizing the biorthogonal formalism. We analyze quench dynamics of observable quantities, the quantum geometric tensor, and various entanglement quantities, including the entanglement entropy, the SVD entropy, and the Tu-Tzeng-Chang entropy. Our results show that a sudden quench into a PT-broken phase generally leads to expon…
▽ More
We explore the quench dynamics of PT-symmetric non-Hermitian systems by utilizing the biorthogonal formalism. We analyze quench dynamics of observable quantities, the quantum geometric tensor, and various entanglement quantities, including the entanglement entropy, the SVD entropy, and the Tu-Tzeng-Chang entropy. Our results show that a sudden quench into a PT-broken phase generally leads to exponential growth in these quantities, driven by the biorthogonal density matrix's non-positivity. In contrast to generic interacting systems, we observe a surprising linear decay in the TTC entropy for non-interacting fermionic systems. This finding originates from the approximate spectral symmetry of the biorthogonal reduced density matrix, and we confirm our findings using the Yang-Lee and non-Hermitian XXZ models.
△ Less
Submitted 27 July, 2025;
originally announced July 2025.
-
Conversations Gone Awry, But Then? Evaluating Conversational Forecasting Models
Authors:
Son Quoc Tran,
Tushaar Gangavarapu,
Nicholas Chernogor,
Jonathan P. Chang,
Cristian Danescu-Niculescu-Mizil
Abstract:
We often rely on our intuition to anticipate the direction of a conversation. Endowing automated systems with similar foresight can enable them to assist human-human interactions. Recent work on developing models with this predictive capacity has focused on the Conversations Gone Awry (CGA) task: forecasting whether an ongoing conversation will derail. In this work, we revisit this task and introd…
▽ More
We often rely on our intuition to anticipate the direction of a conversation. Endowing automated systems with similar foresight can enable them to assist human-human interactions. Recent work on developing models with this predictive capacity has focused on the Conversations Gone Awry (CGA) task: forecasting whether an ongoing conversation will derail. In this work, we revisit this task and introduce the first uniform evaluation framework, creating a benchmark that enables direct and reliable comparisons between different architectures. This allows us to present an up-to-date overview of the current progress in CGA models, in light of recent advancements in language modeling. Our framework also introduces a novel metric that captures a model's ability to revise its forecast as the conversation progresses.
△ Less
Submitted 25 July, 2025;
originally announced July 2025.
-
Constraints on a dark matter sub-halo near the Sun from pulsar timing
Authors:
Sukanya Chakrabarti,
Philip Chang,
Stefano Profumo,
Peter Craig
Abstract:
Using pulsar accelerations, we identify and constrain the properties of a dark matter sub-halo in the Galaxy for the first time from analyzing the acceleration field of binary and solitary pulsars. Our MCMC calculations show that this sub-halo has a mass of $2.45^{+1.07}_{-0.96} \times 10^{7}~M_{\odot}$ and is located at Galactocentric coordinates $X = 7.43^{+0.2}_{-0.12}~\rm$ kpc,…
▽ More
Using pulsar accelerations, we identify and constrain the properties of a dark matter sub-halo in the Galaxy for the first time from analyzing the acceleration field of binary and solitary pulsars. Our MCMC calculations show that this sub-halo has a mass of $2.45^{+1.07}_{-0.96} \times 10^{7}~M_{\odot}$ and is located at Galactocentric coordinates $X = 7.43^{+0.2}_{-0.12}~\rm$ kpc, $Y = 0.38^{+0.11}_{-0.16} ~\rm kpc$, $Z = 0.21^{+0.06}_{-0.11} ~\rm kpc$, using flat, uninformative priors, where we have modeled the sub-halo as a compact object. The Bayes factors for the models are in the range of $\sim$ 20-40, which indicates tentative evidence (though not yet decisive) for the sub-halo. Modeling the sub-halo with a NFW profile gives a sub-halo mass within the scale radius (0.1 kpc) of $0.48^{0.15}_{-0.16} \times 10^{7} M_{\odot}$, located at $X = 7.47^{+0.21}_{-0.14}$, $Y=0.38^{+0.11}_{-0.16}$, $Z=0.21^{+0.06}_{-0.11}$. We examine \textit{Gaia} data and the atomic and molecular hydrogen data of our Galaxy and show that the measured deviation from a smooth potential cannot arise from the gas or the stars in our Galaxy. By analyzing the full sample of binary pulsars with available acceleration measurements, we find that massive (with mass $>10^{8}~M_{\odot}~$) sub-halos are disfavored for the Milky Way within several kiloparsec of the Sun. Smaller sub-halos are beyond the reach of current direct acceleration measurements. The presence of a $\sim 10^{7}~M_{\odot}$ sub-halo within a few kpc of the Sun is potentially consistent with the expected number counts of sub-halos in the prevailing $Λ$CDM paradigm, for a substantial sub-halo mass fraction. This work now provides a proof of principle for probing nearby, low-mass sub-halos, and has implications across many fields of astrophysics - from understanding the nature of dark matter to galaxy formation.
△ Less
Submitted 2 December, 2025; v1 submitted 22 July, 2025;
originally announced July 2025.
-
What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models
Authors:
Keyon Vafa,
Peter G. Chang,
Ashesh Rambachan,
Sendhil Mullainathan
Abstract:
Foundation models are premised on the idea that sequence prediction can uncover deeper domain understanding, much like how Kepler's predictions of planetary motion later led to the discovery of Newtonian mechanics. However, evaluating whether these models truly capture deeper structure remains a challenge. We develop a technique for evaluating foundation models that examines how they adapt to synt…
▽ More
Foundation models are premised on the idea that sequence prediction can uncover deeper domain understanding, much like how Kepler's predictions of planetary motion later led to the discovery of Newtonian mechanics. However, evaluating whether these models truly capture deeper structure remains a challenge. We develop a technique for evaluating foundation models that examines how they adapt to synthetic datasets generated from some postulated world model. Our technique measures whether the foundation model's inductive bias aligns with the world model, and so we refer to it as an inductive bias probe. Across multiple domains, we find that foundation models can excel at their training tasks yet fail to develop inductive biases towards the underlying world model when adapted to new tasks. We particularly find that foundation models trained on orbital trajectories consistently fail to apply Newtonian mechanics when adapted to new physics tasks. Further analysis reveals that these models behave as if they develop task-specific heuristics that fail to generalize.
△ Less
Submitted 27 December, 2025; v1 submitted 9 July, 2025;
originally announced July 2025.