Skip to main content

Showing 1–50 of 242 results for author: Ning, Y

.
  1. arXiv:2604.13630  [pdf, ps, other

    cs.CR cs.AI

    SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment

    Authors: Xixun Lin, Yang Liu, Yancheng Chen, Yongxuan Wu, Yucheng Ning, Yilong Liu, Nan Sun, Shun Zhang, Bin Chong, Chuan Zhou, Yanan Cao, Li Guo

    Abstract: The performance of large language model (LLM) agents depends critically on the execution harness, the system layer that orchestrates tool use, context management, and state persistence. Yet this same architectural centrality makes the harness a high-value attack surface: a single compromise at the harness level can cascade through the entire execution pipeline. We observe that existing security ap… ▽ More

    Submitted 15 April, 2026; originally announced April 2026.

    Comments: 26 pages, 6 figures

  2. arXiv:2604.11986  [pdf, ps, other

    cs.LG

    Exploring Concept Subspace for Self-explainable Text-Attributed Graph Learning

    Authors: Xiaoxue Han, Libo Zhang, Zining Zhu, Yue Ning

    Abstract: We introduce Graph Concept Bottleneck (GCB) as a new paradigm for self-explainable text-attributed graph learning. GCB maps graphs into a subspace, concept bottleneck, where each concept is a meaningful phrase, and predictions are made based on the activation of these concepts. Unlike existing interpretable graph learning methods that primarily rely on subgraphs as explanations, the concept bottle… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  3. arXiv:2603.28676  [pdf, ps, other

    physics.optics

    Kramers-Kronig causality in integrated photonics: The spectral tension between ultraviolet transition and mid-infrared absorption

    Authors: Yue Hu, Zhenyuan Shang, Chenxi Zhang, Yuanjie Ning, Weiqin Zheng, Dengke Chen, Sanli Huang, Baoqi Shi, Zeying Zhong, Hao Tan, Wei Sun, Yi-Han Luo, Xinmao Yin, Zhi-Chuan Niu, Junqiu Liu

    Abstract: Dispersion engineering via geometric confinement is essential to integrated photonics, enabling phenomena such as soliton microcombs, supercontinua, parametric oscillators, and entangled photons. However, prevailing methodologies rely on semi-empirical Sellmeier models that assume idealized material purity, neglecting the pronounced dispersion shifts induced by residual impurities like hydrogen-re… ▽ More

    Submitted 31 March, 2026; v1 submitted 30 March, 2026; originally announced March 2026.

    Comments: 11 pages, 5 figures

  4. arXiv:2603.27869  [pdf, ps, other

    stat.ME

    Dependable Exploitation of High-Dimensional Unlabeled Data in an Assumption-Lean Framework

    Authors: Chao Ying, Siyi Deng, Yang Ning, Jiwei Zhao, Heping Zhang

    Abstract: Semi-supervised learning has attracted significant attention due to the proliferation of applications featuring limited labeled data but abundant unlabeled data. In this paper, we examine the statistical inference problem in an assumption-lean framework which involves a high-dimensional regression parameter, defined by minimizing the least squares, within the context of semi-supervised learning.… ▽ More

    Submitted 29 March, 2026; originally announced March 2026.

  5. arXiv:2603.23780  [pdf, ps, other

    cs.LG

    Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

    Authors: Nan Cui, Wendy Hui Wang, Yue Ning

    Abstract: Large Language Models (LLMs) have introduced new capabilities to recommender systems, enabling dynamic, context-aware, and conversational recommendations. However, LLM-based recommender systems inherit and may amplify social biases embedded in their pre-training data, especially when demographic cues are present. Existing fairness solutions either require extra parameters fine-tuning, or suffer fr… ▽ More

    Submitted 24 March, 2026; originally announced March 2026.

  6. arXiv:2603.21521  [pdf

    cs.IT physics.optics

    Ultrafast microwave sensing and automatic recognition of dynamic objects in open world using programmable surface plasmonic neural networks

    Authors: Qian Ma, Ze Gu, Zi Rui Feng, Qian Wen Wu, Yu Ming Ning, Zhi Qiao Han, Rui Si Li, Xinxin Gao, Tie Jun Cui

    Abstract: The evolution toward next-generation intelligent sensing requires microwave systems to move beyond static detection and achieve high-speed and adaptive perception of dynamic scenes. However, the existing microwave sensing systems have bottlenecks owing to their sequential digital processing chain, limiting the refresh rates to hundreds of hertz, while the existing integrated microwave processors a… ▽ More

    Submitted 22 March, 2026; originally announced March 2026.

  7. arXiv:2602.20617  [pdf, ps, other

    math.AC

    Grade and Cohen-Macaulayness for DG-modules

    Authors: Yuancheng Ning, Xiaoyan Yang

    Abstract: We establish an inequality relating the projective dimension of a DG-module in $\mathrm{D}^\mathrm{b}_\mathrm{f}(A)$ to its grade and introduce the concept of perfect DG-modules as a natural generalization of perfect modules. It is proved that a DG-module $M$ over a local Cohen-Macaulay DG-ring with constant amplitude is Cohen-Macaulay if and only if $M$ is perfect and… ▽ More

    Submitted 24 February, 2026; originally announced February 2026.

    Comments: comments welcome!

  8. arXiv:2602.12542  [pdf, ps, other

    cs.LG cs.AI

    Exploring Accurate and Transparent Domain Adaptation in Predictive Healthcare via Concept-Grounded Orthogonal Inference

    Authors: Pengfei Hu, Chang Lu, Feifan Liu, Yue Ning

    Abstract: Deep learning models for clinical event prediction on electronic health records (EHR) often suffer performance degradation when deployed under different data distributions. While domain adaptation (DA) methods can mitigate such shifts, its "black-box" nature prevents widespread adoption in clinical practice where transparency is essential for trust and safety. We propose ExtraCare to decompose pat… ▽ More

    Submitted 12 February, 2026; originally announced February 2026.

  9. arXiv:2602.08995  [pdf, ps, other

    cs.CL

    When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents

    Authors: Yuting Ning, Jaylen Jones, Zhehao Zhang, Chentao Ye, Weitong Ruan, Junyi Li, Rahul Gupta, Huan Sun

    Abstract: Computer-use agents (CUAs) have made tremendous progress in the past year, yet they still frequently produce misaligned actions that deviate from the user's original intent. Such misaligned actions may arise from external attacks (e.g., indirect prompt injection) or from internal limitations (e.g., erroneous reasoning). They not only expose CUAs to safety risks, but also degrade task efficiency an… ▽ More

    Submitted 9 February, 2026; originally announced February 2026.

    Comments: Project Homepage: https://osu-nlp-group.github.io/Misaligned-Action-Detection/

  10. arXiv:2602.08235  [pdf, ps, other

    cs.CL cs.AI cs.CR

    When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

    Authors: Jaylen Jones, Zhehao Zhang, Yuting Ning, Eric Fosler-Lussier, Pierre-Luc St-Charles, Yoshua Bengio, Dawn Song, Yu Su, Huan Sun

    Abstract: Although computer-use agents (CUAs) hold significant potential to automate increasingly complex OS workflows, they can demonstrate unsafe unintended behaviors that deviate from expected outcomes even under benign input contexts. However, exploration of this risk remains largely anecdotal, lacking concrete characterization and automated methods to proactively surface long-tail unintended behaviors… ▽ More

    Submitted 8 February, 2026; originally announced February 2026.

    Comments: Project Homepage: https://osu-nlp-group.github.io/AutoElicit/

  11. arXiv:2602.04284  [pdf, ps, other

    cs.AI cs.LG

    Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning

    Authors: Yansong Ning, Jun Fang, Naiqiang Tan, Hao Liu

    Abstract: Managing agent thought and observation during multi-turn agent-environment interactions is an emerging strategy to improve agent efficiency. However, existing studies treat the entire interaction trajectories equally, overlooking the thought necessity and observation utility varies across turns. To this end, we first conduct quantitative investigations into how thought and observation affect agent… ▽ More

    Submitted 4 February, 2026; originally announced February 2026.

    Comments: Under Review

  12. arXiv:2602.04186  [pdf

    cond-mat.mtrl-sci

    Long-range orbital transport and inverse orbital Hall effect in Co/Ru-based terahertz emitters

    Authors: Zhou Chao, Zhang Shaohua, Hao Lei, Jin Yaxuan, Jiang Xianguo, Yang Ning, Zheng Li, Meng Hao, Lu Chao, Huang Wendeng, Wu Yizheng, Zhou Yan, Jia Xu

    Abstract: The utilization of terahertz (THz) emission spectroscopy in femtosecond photoexcited spintronic heterostructures has emerged as a versatile tool for investigating ultrafast spin-transport in a noncontact and non-invasive manner. However, the investigation of ultrafast orbital-transport is still in the primitive stage. Here, we experimentally demonstrate the orbital-to-charge current conversion in… ▽ More

    Submitted 3 February, 2026; originally announced February 2026.

    Comments: 22 pages, 5 figures

  13. arXiv:2601.13587  [pdf, ps, other

    astro-ph.IM

    The R2Pub Telescopes for Surveying: An Overview and Performance Evaluation of the System

    Authors: Xuan Song, Xiaofeng Wang, Jin Zhu, Jian Li, Jincheng Guo, Danfeng Xiang, Xin Li, Cheng Liu, Yuanhang Ning, Zhishuai Ge, Zhenzhen Shao, Xiaochen Zheng, Yi Yang, Lei Zhang, Yaqing Shi, Dongyao Zhao, Xiangyun Zeng, Jun Mo, Tengfei Song, Yufeng Fan, Yu Liu, Jingxing Wang, Shousheng He, Ciren Wangdui, Jujia Zhang , et al. (7 additional authors not shown)

    Abstract: The R2Pub telescope, built by the Beijing Planetarium, is a 60 cm equatorial binocular telescope located at the Daocheng site of Yunnan Observatories in China, at an altitude of about 4700 m. This paper presents an overview of the R2Pub telescope system, including its design, instrumentation, and survey capabilities, and reports an initial evaluation of its system performance. R2Pub is a prime-foc… ▽ More

    Submitted 19 January, 2026; originally announced January 2026.

    Comments: Accepted for publication in Publications of the Astronomical Society of the Pacific

  14. arXiv:2601.11152  [pdf, ps, other

    math.NA math.AP

    An efficient solver based on low-rank approximation and Neumann matrix series for unsteady diffusion-type partial differential equations with random coefficients

    Authors: Yujun Zhu, Min Li, Yulan Ning, Ju Ming

    Abstract: In this paper, we develop an efficient numerical solver for unsteady diffusion-type partial differential equations with random coefficients. A major computational challenge in such problems lies in repeatedly handling large-scale linear systems arising from spatial and temporal discretizations under uncertainty. To address this issue, we propose a novel generalized low-rank matrix approximation to… ▽ More

    Submitted 16 January, 2026; originally announced January 2026.

  15. arXiv:2601.02186  [pdf

    cs.CL

    Toward Global Large Language Models in Medicine

    Authors: Rui Yang, Huitao Li, Weihao Xuan, Heli Qi, Xin Li, Kunyu Yu, Yingjian Chen, Rongrong Wang, Jacques Behmoaras, Tianxi Cai, Bibhas Chakraborty, Qingyu Chen, Lionel Tim-Ee Cheng, Marie-Louise Damwanza, Chido Dzinotyiwei, Aosong Feng, Chuan Hong, Yusuke Iwasawa, Yuhe Ke, Linah Kitala, Taehoon Ko, Jisan Lee, Irene Li, Jonathan Chong Kai Liew, Hongfang Liu , et al. (25 additional authors not shown)

    Abstract: Despite continuous advances in medical technology, the global distribution of health care resources remains uneven. The development of large language models (LLMs) has transformed the landscape of medicine and holds promise for improving health care quality and expanding access to medical information globally. However, existing LLMs are primarily trained on high-resource languages, limiting their… ▽ More

    Submitted 5 January, 2026; originally announced January 2026.

    Comments: 182 pages, 65 figures

  16. arXiv:2601.00627  [pdf, ps, other

    cs.CR cs.SE

    Towards Understanding and Characterizing Vulnerabilities in Intelligent Connected Vehicles through Real-World Exploits

    Authors: Yuelin Wang, Yuqiao Ning, Yanbang Sun, Xiaofei Xie, Zhihua Xie, Yang Chen, Zhen Guo, Shihao Xue, Junjie Wang, Sen Chen

    Abstract: Intelligent Connected Vehicles (ICVs) are a core component of modern transportation systems, and their security is crucial as it directly relates to user safety. Despite prior research, most existing studies focus only on specific sub-components of ICVs due to their inherent complexity. As a result, there is a lack of systematic understanding of ICV vulnerabilities. Moreover, much of the current l… ▽ More

    Submitted 2 January, 2026; originally announced January 2026.

  17. arXiv:2512.21840  [pdf, ps, other

    stat.ME

    Targeted learning via probabilistic subpopulation matching

    Authors: Xiaokang Liu, Jie Hu, Naimin Jing, Yang Ning, Cheng Yong Tang, Runze Li, Yong Chen

    Abstract: In biomedical research, to obtain more accurate prediction results from a target study, leveraging information from multiple similar source studies is proved to be useful. However, in many biomedical applications based on real-world data, populations under consideration in different studies, e.g., clinical sites, can be heterogeneous, leading to challenges in properly borrowing information towards… ▽ More

    Submitted 25 December, 2025; originally announced December 2025.

  18. arXiv:2512.16833  [pdf, ps, other

    stat.ME

    Distributed inference for heterogeneous mixture models using multi-site data

    Authors: Xiaokang Liu, Rui Duan, Raymond J. Carroll, Yang Ning, Yong Chen

    Abstract: Mixture models postulate the overall population as a mixture of finite subpopulations with unobserved membership. Fitting mixture models usually requires large sample sizes and combining data from multiple sites can be beneficial. However, sharing individual participant data across sites is often less feasible due to various types of practical constraints, such as data privacy concerns. Moreover,… ▽ More

    Submitted 18 December, 2025; originally announced December 2025.

    Comments: 70 pages, 5 figures

  19. arXiv:2512.11426  [pdf, ps, other

    cs.AI

    AgentBalance: Backbone-then-Topology Design for Cost-Effective Multi-Agent Systems under Budget Constraints

    Authors: Shuowei Cai, Yansong Ning, Hao Liu

    Abstract: Large Language Model (LLM)-based multi-agent systems (MAS) are becoming indispensable building blocks for web-scale applications such as web search, social network analytics, and online customer support, where cost-effectiveness is increasingly the primary constraint for large-scale deployment. While recent work improves MAS cost-effectiveness by shaping inter-agent communication topologies and se… ▽ More

    Submitted 12 December, 2025; originally announced December 2025.

  20. arXiv:2512.09547  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci physics.app-ph

    Checkerboard-type Zhang-Rice States in Overdoped Cuprate Superconductors

    Authors: Xiongfang Liu, Kun Han, Yan Peng, Yuanjie Ning, Jing Wu, Zhaoyang Luo, Difan Zhou, Zhigang Zeng, Qian He, Chuanbing Cai, Mark. B. H. Breese, Ariando Ariando, Chi Sin Tang, George A. Sawatzky, Mi Jiang, Xinmao Yin

    Abstract: Cuprate superconductors remain central to condensed matter physics due to their technological relevance and unconventional, incompletely understood electronic behavior. While the canonical phase diagram and low-energy models have been shaped largely by studies of underdoped and moderately doped cuprates, the overdoped regime has received comparatively limited attention.Here, we track the evolution… ▽ More

    Submitted 10 December, 2025; originally announced December 2025.

  21. arXiv:2512.03300  [pdf, ps, other

    cs.LG cs.AI

    HydroDCM: Hydrological Domain-Conditioned Modulation for Cross-Reservoir Inflow Prediction

    Authors: Pengfei Hu, Fan Ming, Xiaoxue Han, Chang Lu, Yue Ning, Dan Lu

    Abstract: Deep learning models have shown promise in reservoir inflow prediction, yet their performance often deteriorates when applied to different reservoirs due to distributional differences, referred to as the domain shift problem. Domain generalization (DG) solutions aim to address this issue by extracting domain-invariant representations that mitigate errors in unseen domains. However, in hydrological… ▽ More

    Submitted 2 December, 2025; originally announced December 2025.

    Comments: Accepted by AAAI 2026 workshop (oral) on AI for Environmental Science

  22. arXiv:2512.01195  [pdf, ps, other

    math.CO quant-ph

    Quantum Chromatic Number of Subgraphs of Orthogonality Graphs and the Distance-2 Hamming Graph

    Authors: Tao Luo, Yu Ning, Xiande Zhang

    Abstract: The determination of the quantum chromatic number of graphs has attracted considerable attention recently. However, there are few families of graphs whose quantum chromatic numbers are determined. A notable exception is the family of orthogonality graphs, whose quantum chromatic numbers are fully determined. In this paper, we extend these results by determining the exact quantum chromatic number o… ▽ More

    Submitted 30 November, 2025; originally announced December 2025.

    MSC Class: 05C15; 05B30

  23. arXiv:2511.21775  [pdf

    eess.IV

    Attention-Guided Fair AI Modeling for Skin Cancer Diagnosis

    Authors: Mingcheng Zhu, Mingxuan Liu, Han Yuan, Yilin Ning, Zhiyao Luo, Tingting Zhu, Nan Liu

    Abstract: Artificial intelligence (AI) has shown remarkable promise in dermatology, offering accurate and non-invasive diagnosis of skin cancer. While extensive research has addressed skin tone-related bias, gender bias in dermatologic AI remains underexplored, leading to unequal care and reinforcing existing gender disparities. In this study, we developed LesionAttn, a fairness-aware algorithm that integra… ▽ More

    Submitted 26 November, 2025; originally announced November 2025.

  24. arXiv:2511.07649  [pdf, ps, other

    cs.LG cs.AI

    Adaptive Graph Learning with Transformer for Multi-Reservoir Inflow Prediction

    Authors: Pengfei Hu, Ming Fan, Xiaoxue Han, Chang Lu, Wei Zhang, Hyun Kang, Yue Ning, Dan Lu

    Abstract: Reservoir inflow prediction is crucial for water resource management, yet existing approaches mainly focus on single-reservoir models that ignore spatial dependencies among interconnected reservoirs. We introduce AdaTrip as an adaptive, time-varying graph learning framework for multi-reservoir inflow forecasting. AdaTrip constructs dynamic graphs where reservoirs are nodes with directed edges refl… ▽ More

    Submitted 10 November, 2025; originally announced November 2025.

    Comments: ICDM 2025 DMESS Workshop

  25. arXiv:2510.24714  [pdf, ps, other

    stat.ME econ.EM stat.ML

    Machine-Learning-Assisted Comparison of Regression Functions

    Authors: Jian Yan, Zhuoxi Li, Yang Ning, Yong Chen

    Abstract: We revisit the classical problem of comparing regression functions, a fundamental question in statistical inference with broad relevance to modern applications such as data integration, transfer learning, and causal inference. Existing approaches typically rely on smoothing techniques and are thus hindered by the curse of dimensionality. We propose a generalized notion of kernel-based conditional… ▽ More

    Submitted 28 October, 2025; originally announced October 2025.

  26. arXiv:2510.22967  [pdf, ps, other

    cs.CL cs.AI

    MAD-Fact: A Multi-Agent Debate Framework for Long-Form Factuality Evaluation in LLMs

    Authors: Yucheng Ning, Xixun Lin, Fang Fang, Yanan Cao

    Abstract: The widespread adoption of Large Language Models (LLMs) raises critical concerns about the factual accuracy of their outputs, especially in high-risk domains such as biomedicine, law, and education. Existing evaluation methods for short texts often fail on long-form content due to complex reasoning chains, intertwined perspectives, and cumulative information. To address this, we propose a systemat… ▽ More

    Submitted 29 October, 2025; v1 submitted 26 October, 2025; originally announced October 2025.

    Comments: The article has been accepted by Frontiers of Computer Science (FCS), with the DOI: {10.1007/s11704-025-51369-x}

  27. arXiv:2510.20629  [pdf

    cs.LG cs.AI

    Equitable Survival Prediction: A Fairness-Aware Survival Modeling (FASM) Approach

    Authors: Mingxuan Liu, Yilin Ning, Haoyuan Wang, Chuan Hong, Matthew Engelhard, Danielle S. Bitterman, William G. La Cava, Nan Liu

    Abstract: As machine learning models become increasingly integrated into healthcare, structural inequities and social biases embedded in clinical data can be perpetuated or even amplified by data-driven models. In survival analysis, censoring and time dynamics can further add complexity to fair model development. Additionally, algorithmic fairness approaches often overlook disparities in cross-group ranking… ▽ More

    Submitted 23 October, 2025; originally announced October 2025.

  28. arXiv:2510.17414  [pdf

    cs.LG

    A Conditional Diffusion Model for Probabilistic Prediction of Battery Capacity Degradation

    Authors: Hequn Li, Zhongwei Deng, Chunlin Jiang, Yvxin He andZhansheng Ning

    Abstract: Accurate prediction of lithium-ion battery capacity and its associated uncertainty is essential for reliable battery management but remains challenging due to the stochastic nature of aging. This paper presents a novel method, termed the Condition Diffusion U-Net with Attention (CDUA), which integrates feature engineering and deep learning to address this challenge. The proposed approach employs a… ▽ More

    Submitted 20 October, 2025; originally announced October 2025.

  29. arXiv:2510.15533  [pdf, ps, other

    cs.RO

    Improved Extended Kalman Filter-Based Disturbance Observers for Exoskeletons

    Authors: Shilei Li, Dawei Shi, Makoto Iwasaki, Yan Ning, Hongpeng Zhou, Ling Shi

    Abstract: The nominal performance of mechanical systems is often degraded by unknown disturbances. A two-degree-of-freedom control structure can decouple nominal performance from disturbance rejection. However, perfect disturbance rejection is unattainable when the disturbance dynamic is unknown. In this work, we reveal an inherent trade-off in disturbance estimation subject to tracking speed and tracking u… ▽ More

    Submitted 17 October, 2025; originally announced October 2025.

  30. arXiv:2510.14400  [pdf, ps, other

    cs.CL cs.AI cs.IR

    MedTrust-RAG: Evidence Verification and Trust Alignment for Biomedical Question Answering

    Authors: Yingpeng Ning, Yuanyuan Sun, Ling Luo, Yanhua Wang, Yuchen Pan, Hongfei Lin

    Abstract: Biomedical question answering (QA) requires accurate interpretation of complex medical knowledge. Large language models (LLMs) have shown promising capabilities in this domain, with retrieval-augmented generation (RAG) systems enhancing performance by incorporating external medical literature. However, RAG-based approaches in biomedical QA suffer from hallucinations due to post-retrieval noise and… ▽ More

    Submitted 18 October, 2025; v1 submitted 16 October, 2025; originally announced October 2025.

    Comments: Accepted as a short paper at BlBM2025

  31. arXiv:2510.08614  [pdf

    cs.CL

    Gender Bias in Large Language Models for Healthcare: Assignment Consistency and Clinical Implications

    Authors: Mingxuan Liu, Yuhe Ke, Wentao Zhu, Mayli Mertens, Yilin Ning, Jingchi Liao, Chuan Hong, Daniel Shu Wei Ting, Yifan Peng, Danielle S. Bitterman, Marcus Eng Hock Ong, Nan Liu

    Abstract: The integration of large language models (LLMs) into healthcare holds promise to enhance clinical decision-making, yet their susceptibility to biases remains a critical concern. Gender has long influenced physician behaviors and patient outcomes, raising concerns that LLMs assuming human-like roles, such as clinicians or medical educators, may replicate or amplify gender-related biases. Using case… ▽ More

    Submitted 7 October, 2025; originally announced October 2025.

  32. arXiv:2510.08558  [pdf, ps, other

    cs.AI cs.CL cs.IR cs.LG

    Agent Learning via Early Experience

    Authors: Kai Zhang, Xiangchao Chen, Bo Liu, Tianci Xue, Zeyi Liao, Zhihan Liu, Xiyao Wang, Yuting Ning, Zhaorun Chen, Xiaohan Fu, Jian Xie, Yuxuan Sun, Boyu Gou, Qi Qi, Zihang Meng, Jianwei Yang, Ning Zhang, Xian Li, Ashish Shah, Dat Huynh, Hengduo Li, Zi Yang, Sara Cao, Lawrence Jang, Shuyan Zhou , et al. (5 additional authors not shown)

    Abstract: A long-term goal of language agents is to learn and improve through their own experience, ultimately outperforming humans in complex, real-world tasks. However, training agents from experience data with reinforcement learning remains difficult in many environments, which either lack verifiable rewards (e.g., websites) or require inefficient long-horizon rollouts (e.g., multi-turn tool use). As a r… ▽ More

    Submitted 13 October, 2025; v1 submitted 9 October, 2025; originally announced October 2025.

    Comments: Work in progress

  33. arXiv:2510.00603  [pdf

    cs.CV

    LVLMs as inspectors: an agentic framework for category-level structural defect annotation

    Authors: Sheng Jiang, Yuanmin Ning, Bingxi Huang, Peiyin Chen, Zhaohui Chen

    Abstract: Automated structural defect annotation is essential for ensuring infrastructure safety while minimizing the high costs and inefficiencies of manual labeling. A novel agentic annotation framework, Agent-based Defect Pattern Tagger (ADPT), is introduced that integrates Large Vision-Language Models (LVLMs) with a semantic pattern matching module and an iterative self-questioning refinement mechanism.… ▽ More

    Submitted 1 October, 2025; originally announced October 2025.

  34. arXiv:2509.21842  [pdf, ps, other

    cs.AI

    DeepTravel: An End-to-End Agentic Reinforcement Learning Framework for Autonomous Travel Planning Agents

    Authors: Yansong Ning, Rui Liu, Jun Wang, Kai Chen, Wei Li, Jun Fang, Kan Zheng, Naiqiang Tan, Hao Liu

    Abstract: Travel planning (TP) agent has recently worked as an emerging building block to interact with external tools and resources for travel itinerary generation, ensuring enjoyable user experience. Despite its benefits, existing studies rely on hand craft prompt and fixed agent workflow, hindering more flexible and autonomous TP agent. This paper proposes DeepTravel, an end to end agentic reinforcement… ▽ More

    Submitted 26 September, 2025; originally announced September 2025.

    Comments: Under review

  35. arXiv:2509.20586  [pdf, ps, other

    stat.ME stat.ML

    Incorporating External Controls for Estimating the Average Treatment Effect on the Treated with High-Dimensional Data: Retaining Double Robustness and Ensuring Double Safety

    Authors: Chi-Shian Dai, Chao Ying, Yang Ning, Jiwei Zhao

    Abstract: Randomized controlled trials (RCTs) are widely regarded as the gold standard for causal inference in biomedical research. For instance, when estimating the average treatment effect on the treated (ATT), a doubly robust estimation procedure can be applied, requiring either the propensity score model or the control outcome model to be correctly specified. In this paper, we address scenarios where ex… ▽ More

    Submitted 24 September, 2025; originally announced September 2025.

  36. arXiv:2509.20383  [pdf, ps, other

    cs.CR cs.AI

    MARS: A Malignity-Aware Backdoor Defense in Federated Learning

    Authors: Wei Wan, Yuxuan Ning, Zhicong Huang, Cheng Hong, Shengshan Hu, Ziqi Zhou, Yechao Zhang, Tianqing Zhu, Wanlei Zhou, Leo Yu Zhang

    Abstract: Federated Learning (FL) is a distributed paradigm aimed at protecting participant data privacy by exchanging model parameters to achieve high-quality model training. However, this distributed nature also makes FL highly vulnerable to backdoor attacks. Notably, the recently proposed state-of-the-art (SOTA) attack, 3DFed (SP2023), uses an indicator mechanism to determine whether the backdoor models… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

    Comments: NeurIPS 2025

  37. arXiv:2509.18970  [pdf, ps, other

    cs.AI

    LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions

    Authors: Xixun Lin, Yucheng Ning, Jingwen Zhang, Yan Dong, Yilong Liu, Yongxuan Wu, Xiaohua Qi, Nan Sun, Yanmin Shang, Kun Wang, Pengfei Cao, Qingyue Wang, Lixin Zou, Xu Chen, Chuan Zhou, Jia Wu, Peng Zhang, Qingsong Wen, Shirui Pan, Bin Wang, Yanan Cao, Kai Chen, Songlin Hu, Li Guo

    Abstract: Driven by the rapid advancements of Large Language Models (LLMs), LLM-based agents have emerged as powerful intelligent systems capable of human-like cognition, reasoning, and interaction. These agents are increasingly being deployed across diverse real-world applications, including student education, scientific research, and financial analysis. However, despite their remarkable potential, LLM-bas… ▽ More

    Submitted 18 November, 2025; v1 submitted 23 September, 2025; originally announced September 2025.

  38. arXiv:2509.08278  [pdf, ps, other

    math.RA

    Fundamental theorem of transposed Poisson $(A,H)$-Hopf modules

    Authors: Yan Ning, Daowei Lu, Dingguo Wang

    Abstract: Transposed Poisson algebra was introduced as a dual notion of the Poisson algebra by switching the roles played by the commutative associative operation and Lie operation in the Leibniz rule defining the Poisson algebra. Let $H$ be a Hopf algebra with a bijective antipode and $A$ an $H$-comodule transposed Poisson algebra. Assume that there exists an $H$-colinear map which is also an algebra map f… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

  39. arXiv:2509.02341  [pdf, ps, other

    cs.LG cs.AI

    RDIT: Residual-based Diffusion Implicit Models for Probabilistic Time Series Forecasting

    Authors: Chih-Yu Lai, Yu-Chien Ning, Duane S. Boning

    Abstract: Probabilistic Time Series Forecasting (PTSF) plays a critical role in domains requiring accurate and uncertainty-aware predictions for decision-making. However, existing methods offer suboptimal distribution modeling and suffer from a mismatch between training and evaluation metrics. Surprisingly, we found that augmenting a strong point estimator with a zero-mean Gaussian, whose standard deviation… ▽ More

    Submitted 2 September, 2025; originally announced September 2025.

  40. arXiv:2509.00196  [pdf, ps, other

    stat.ME math.ST

    G-HIVE: Parameter Estimation and Approximate Inference for Multivariate Response Generalized Linear Models with Hidden Variables

    Authors: Inbeom Lee, Yang Ning

    Abstract: In practice, there often exist unobserved variables, also termed hidden variables, associated with both the response and covariates. Existing works in the literature mostly focus on linear regression with hidden variables. However, when the regression model is non-linear, the presence of hidden variables leads to new challenges in parameter identification, estimation, and statistical inference. Th… ▽ More

    Submitted 29 August, 2025; originally announced September 2025.

  41. arXiv:2508.20942  [pdf, ps, other

    stat.ML cs.LG math.ST stat.ME

    Transfer Learning for Classification under Decision Rule Drift with Application to Optimal Individualized Treatment Rule Estimation

    Authors: Xiaohan Wang, Yang Ning

    Abstract: In this paper, we extend the transfer learning classification framework from regression function-based methods to decision rules. We propose a novel methodology for modeling posterior drift through Bayes decision rules. By exploiting the geometric transformation of the Bayes decision boundary, our method reformulates the problem as a low-dimensional empirical risk minimization problem. Under mild… ▽ More

    Submitted 28 August, 2025; originally announced August 2025.

  42. Atomistic understanding of hydrogen bubble-induced embrittlement in tungsten enabled by machine learning molecular dynamics

    Authors: Yu Bao, Keke Song, Jiahui Liu, Yanzhou Wang, Yifei Ning, Penghua Ying, Ping Qian

    Abstract: Hydrogen bubble formation within nanoscale voids is a critical mechanism underlying the embrittlement of metallic materials, yet its atomistic origins remains elusive. Here, we present an accurate and transferable machine-learned potential (MLP) for the tungsten-hydrogen binary system within the neuroevolution potential (NEP) framework, trained through active learning on extensive density function… ▽ More

    Submitted 27 August, 2025; originally announced August 2025.

    Comments: 14pages,7 figures

    Journal ref: npj Computational Materials,12,108 (2026)

  43. arXiv:2508.12727  [pdf, ps, other

    cs.LG

    FedSODA: Federated Fine-tuning of LLMs via Similarity Group Pruning and Orchestrated Distillation Alignment

    Authors: Manning Zhu, Songtao Guo, Pengzhan Zhou, Yansong Ning, Chang Han, Dewen Qiao

    Abstract: Federated fine-tuning (FFT) of large language models (LLMs) has recently emerged as a promising solution to enable domain-specific adaptation while preserving data privacy. Despite its benefits, FFT on resource-constrained clients relies on the high computational and memory demands of full-model fine-tuning, which limits the potential advancement. This paper presents FedSODA, a resource-efficient… ▽ More

    Submitted 18 August, 2025; originally announced August 2025.

  44. arXiv:2508.09749  [pdf, ps, other

    astro-ph.GA

    A dual AGN at z = 5.4 associated with a Lyman-alpha Nebula in the Center of a Cosmic Filament

    Authors: Qiong Li, Christopher J. Conselice, Qiao Duan, Duncan Austin, Tom Harvey, Nathan Adams, George Bendo, Lewi Westcott, Vadim Rusakov, Zheng Cai, Yuanhang Ning, Shiwu Zhang

    Abstract: Predictions from current theories and simulations suggest that dual AGN systems are exceedingly rare at high redshifts. The intense radiation and powerful outflows from AGNs regulate star formation, heat the interstellar medium, and drive massive gas outflows that shape the host galaxy and its surroundings. One manifestation of AGN feedback is the creation of extended Ly$α$ nebulae. However, ident… ▽ More

    Submitted 29 August, 2025; v1 submitted 13 August, 2025; originally announced August 2025.

    Comments: 12 pages, 7 figures, submitted to MNRAS

  45. arXiv:2508.08785  [pdf, ps, other

    cs.CL

    Privacy-protected Retrieval-Augmented Generation for Knowledge Graph Question Answering

    Authors: Yunfeng Ning, Mayi Xu, Jintao Wen, Qiankun Pi, Yuanyuan Zhu, Ming Zhong, Jiawei Jiang, Tieyun Qian

    Abstract: LLMs often suffer from hallucinations and outdated or incomplete knowledge. RAG is proposed to address these issues by integrating external knowledge like that in KGs into LLMs. However, leveraging private KGs in RAG systems poses significant privacy risks due to the black-box nature of LLMs and potential insecure data transmission, especially when using third-party LLM APIs lacking transparency a… ▽ More

    Submitted 3 December, 2025; v1 submitted 12 August, 2025; originally announced August 2025.

    Comments: Accepted by AAAI 2026, camera ready version

  46. arXiv:2508.05328  [pdf, ps, other

    math.NA

    A low-rank solver for the Stokes-Darcy model with random hydraulic conductivity and Beavers-Joseph condition

    Authors: Yujun Zhu, Yulan Ning, Zhipeng Yang, Xiaoming He, Ju Ming

    Abstract: This paper proposes, analyzes, and demonstrates an efficient low-rank solver for the stochastic Stokes-Darcy interface model with a random hydraulic conductivity both in the porous media domain and on the interface. We consider three interface conditions with randomness, including the Beavers-Joseph interface condition with the random hydraulic conductivity, on the interface between the free flow… ▽ More

    Submitted 7 August, 2025; originally announced August 2025.

  47. arXiv:2507.20176  [pdf, ps, other

    math.RA math-ph

    Post-Hopf group algebras, Hopf group braces and Rota-Baxter operators on Hopf group algebras

    Authors: Yan Ning, Xing Wang, Daowei Lu

    Abstract: In this paper, we introduce the notions of Hopf group braces, post-Hopf group algebras and Rota-Baxter Hopf group algebras as important generalizations of Hopf brace, post Hopf algebra and Rota-Baxter Hopf algebras respectively. We also discuss their relationships. Explicitly under the condition of cocomutativity, Hopf group braces, post-Hopf group algebras could be mutually obtained, and Rota-Bax… ▽ More

    Submitted 1 August, 2025; v1 submitted 27 July, 2025; originally announced July 2025.

  48. arXiv:2507.06043  [pdf, ps, other

    cs.CR cs.AI

    CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations

    Authors: Xiaohu Li, Yunfeng Ning, Zepeng Bao, Mayi Xu, Jianhao Chen, Tieyun Qian

    Abstract: Security alignment enables the Large Language Model (LLM) to gain the protection against malicious queries, but various jailbreak attack methods reveal the vulnerability of this security mechanism. Previous studies have isolated LLM jailbreak attacks and defenses. We analyze the security protection mechanism of the LLM, and propose a framework that combines attack and defense. Our method is based… ▽ More

    Submitted 6 August, 2025; v1 submitted 8 July, 2025; originally announced July 2025.

    Comments: Accepted to ACL 2025 (Findings), camera-ready version

  49. arXiv:2507.04689  [pdf, ps, other

    cs.IT

    On subcodes of the generalized Reed-Solomon codes

    Authors: Yu Ning

    Abstract: In this paper, we study a class of subcodes of codimension $1$ in the $[n,k+1]_q$ generalized Reed-Solomon (GRS) codes, whose generator matrix is derived by removing the row of degree $k-r$ from the generator matrix of the $[n,k+1]_q$ GRS codes, where $1 \le r \le k-1$. We show equivalent characterizations for this class of subcodes of the GRS codes being self-dual or near-MDS, which extends the r… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

  50. arXiv:2507.00914  [pdf, ps, other

    cs.MA cs.AI

    Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications

    Authors: Jindong Han, Yansong Ning, Zirui Yuan, Hang Ni, Fan Liu, Tengfei Lyu, Hao Liu

    Abstract: The long-standing vision of intelligent cities is to create efficient, livable, and sustainable urban environments using big data and artificial intelligence technologies. Recently, the advent of Large Language Models (LLMs) has opened new ways toward realizing this vision. With powerful semantic understanding and reasoning capabilities, LLMs can be deployed as intelligent agents capable of autono… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.