Skip to main content

Showing 1–50 of 142 results for author: Ouyang, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2604.07765  [pdf, ps, other

    cs.CV

    RemoteAgent: Bridging Vague Human Intents and Earth Observation with RL-based Agentic MLLMs

    Authors: Liang Yao, Shengxiang Xu, Fan Liu, Chuanyi Zhang, Bishun Yao, Rui Min, Yongjun Li, Chaoqian Ouyang, Shimin Di, Min-Ling Zhang

    Abstract: Earth Observation (EO) systems are essentially designed to support domain experts who often express their requirements through vague natural language rather than precise, machine-friendly instructions. Depending on the specific application scenario, these vague queries can demand vastly different levels of visual precision. Consequently, a practical EO AI system must bridge the gap between ambiguo… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

  2. arXiv:2604.07735  [pdf, ps, other

    cs.IT

    Modeling and Analysis for Joint Design of Communication and Control

    Authors: Xu Gan, Chongjun Ouyang, Yuanwei Liu

    Abstract: A unified analytical framework for joint design of communication and control (JDCC) is proposed. Within this framework, communication transmission delay and steady-state control variance are derived as the two fundamental JDCC performance metrics. The Pareto boundary is then established to characterize the optimal communication-control trade-off in JDCC systems. To further obtain closed-form expre… ▽ More

    Submitted 8 April, 2026; originally announced April 2026.

  3. arXiv:2604.04074  [pdf, ps, other

    cs.AI cs.LG

    FactReview: Evidence-Grounded Reviews with Literature Positioning and Execution-Based Claim Verification

    Authors: Hang Xu, Ling Yue, Chaoqian Ouyang, Yuchen Liu, Libin Zheng, Shaowu Pan, Shimin Di, Min-Ling Zhang

    Abstract: Peer review in machine learning is under growing pressure from rising submission volume and limited reviewer time. Most LLM-based reviewing systems read only the manuscript and generate comments from the paper's own narrative. This makes their outputs sensitive to presentation quality and leaves them weak when the evidence needed for review lies in related work or released code. We present FactRev… ▽ More

    Submitted 7 April, 2026; v1 submitted 5 April, 2026; originally announced April 2026.

  4. arXiv:2603.15844  [pdf, ps, other

    cs.IT

    Low-complexity tuning of pinching-antenna systems for integrated sensing and communication

    Authors: Saba Asaad, Chongjun Ouyang, Zhiguo Ding, Ali Bereyhi

    Abstract: Pinching antenna systems (PASSs) can dynamically adapt their transmit and receive arrays for sensing and communication in wireless systems. This work explores the potential of PASSs for integrated sensing and communication (ISAC) by proposing a novel PASS-aided ISAC design, in which pinching locations are adaptively adjusted to enable simultaneous sensing and data transmission with minimal interfe… ▽ More

    Submitted 16 March, 2026; originally announced March 2026.

    Comments: 6 pages, 2 figures, ICC workshop

  5. arXiv:2603.09290  [pdf, ps, other

    cs.SE cs.CE cs.MA

    ToolRosetta: Scalable Tool Access for Open-World Scientific Agents

    Authors: Shimin Di, Xujie Yuan, Hanghui Guo, Chaoqian Ouyang, Yongxu Liu, Ling Yue, Zhangze Chen, Libin Zheng, Jia Zhu, Shaowu Pan, Jian Yin, Yong Rui, Min-Ling Zhang

    Abstract: Large Language Model (LLM)-based agent systems are increasingly being used for scientific discovery, yet their practical capability remains constrained by a narrow and manually curated tool layer. Much scientific computational capability already exists in open-source repositories, software packages and APIs, but these resources remain difficult to standardize, operationalize and invoke reliably. H… ▽ More

    Submitted 10 April, 2026; v1 submitted 10 March, 2026; originally announced March 2026.

    Comments: 22 pages

  6. arXiv:2603.07906  [pdf, ps, other

    cs.SE

    IOTEL: A Tool for Generating IoT-enriched Object-Centric Event Logs

    Authors: Jia Wei, Xin Su, Chun Ouyang

    Abstract: Integrating Internet of Things (IoT) data with business process event logs is crucial for analysing IoT-enhanced processes, yet remains challenging due to differences in abstraction levels and the separation of data sources. Simply incorporating raw IoT data increases the size and complexity of the resulting log, often requiring additional processing before process analysis can be performed. While… ▽ More

    Submitted 8 March, 2026; originally announced March 2026.

  7. arXiv:2603.07654  [pdf, ps, other

    math.OC cs.LG

    Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data

    Authors: Pu Qiu, Chen Ouyang, Yongyang Xiong, Keyou You, Wanquan Liu, Yang Shi

    Abstract: Federated Composite Optimization (FCO) has emerged as a promising framework for training models with structural constraints (e.g., sparsity) in distributed edge networks. However, simultaneously achieving communication efficiency and convergence robustness remains a significant challenge, particularly when dealing with non-smooth regularizers, statistical heterogeneity, and the restrictions of bia… ▽ More

    Submitted 8 March, 2026; originally announced March 2026.

    Comments: 14 pages, 4 figures

  8. arXiv:2603.06171  [pdf, ps, other

    cs.IT eess.SP

    On the Secrecy Performance of Continuous-Aperture Arrays Over Fading Channels

    Authors: Xuan Yang, Chongjun Ouyang, Dongming Li, Yuanwei Liu

    Abstract: The secrecy performance of continuous-aperture array (CAPA)-based wiretap channels in terms of secrecy rate and secrecy outage probability (SOP) is analyzed. First, the system models of CAPA systems with maximum-ratio transmission under a Rayleigh fading channel are established, and approximate probability density functions for the legitimate user Bob's signal-to-noise ratio (SNR) and the eavesdro… ▽ More

    Submitted 6 March, 2026; originally announced March 2026.

  9. arXiv:2603.02468  [pdf, ps, other

    cs.RO

    A Novel Modular Cable-Driven Soft Robotic Arm with Multi-Segment Reconfigurability

    Authors: Moeen Ul Islam, Cheng Ouyang, Xinda Qi, Azlan Zahid, Xiaobo Tan, Dong Chen

    Abstract: This paper presents a novel, modular, cable-driven soft robotic arm featuring multi-segment reconfigurability. The proposed architecture enables a stackable system with independent segment control, allowing scalable adaptation to diverse structural and application requirements. The system is fabricated from soft silicone material and incorporates embedded tendon-routing channels with a protective… ▽ More

    Submitted 4 March, 2026; v1 submitted 2 March, 2026; originally announced March 2026.

    Comments: 6 pages, 8 figures

  10. arXiv:2602.19605  [pdf, ps, other

    cs.CV cs.AI cs.MM

    CLCR: Cross-Level Semantic Collaborative Representation for Multimodal Learning

    Authors: Chunlei Meng, Guanhong Huang, Rong Fu, Runmin Jian, Zhongxue Gan, Chun Ouyang

    Abstract: Multimodal learning aims to capture both shared and private information from multiple modalities. However, existing methods that project all modalities into a single latent space for fusion often overlook the asynchronous, multi-level semantic structure of multimodal data. This oversight induces semantic misalignment and error propagation, thereby degrading representation quality. To address this… ▽ More

    Submitted 23 February, 2026; originally announced February 2026.

    Comments: This study has been Accepted by CVPR 2026

  11. arXiv:2602.19585  [pdf, ps, other

    cs.MM cs.AI

    Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis

    Authors: Chunlei Meng, Jiabin Luo, Zhenglin Yan, Zhenyu Yu, Rong Fu, Zhongxue Gan, Chun Ouyang

    Abstract: Multimodal Sentiment Analysis (MSA) integrates language, visual, and acoustic modalities to infer human sentiment. Most existing methods either focus on globally shared representations or modality-specific features, while overlooking signals that are shared only by certain modality pairs. This limits the expressiveness and discriminative power of multimodal representations. To address this limitat… ▽ More

    Submitted 23 February, 2026; originally announced February 2026.

    Comments: This study has been Accepted by CVPR 2026

  12. arXiv:2602.01157  [pdf, ps, other

    cs.LG cs.AI

    Deep Time-Series Models Meet Volatility: Multi-Horizon Electricity Price Forecasting in the Australian National Electricity Market

    Authors: Mohammed Osman Gani, Zhipeng He, Chun Ouyang, Sara Khalifa

    Abstract: Accurate electricity price forecasting (EPF) is increasingly difficult in markets characterised by extreme volatility, frequent price spikes, and rapid structural shifts. Deep learning (DL) has been increasingly adopted in EPF due to its ability to achieve high forecasting accuracy. Recently, state-of-the-art (SOTA) deep time-series models have demonstrated promising performance across general for… ▽ More

    Submitted 13 February, 2026; v1 submitted 1 February, 2026; originally announced February 2026.

    Comments: 10 pages, 4 figures, 6 tables

  13. arXiv:2601.13659  [pdf, ps, other

    cs.CL cs.AI cs.MM

    Temporal-Spatial Decouple before Act: Disentangled Representation Learning for Multimodal Sentiment Analysis

    Authors: Chunlei Meng, Ziyang Zhou, Lucas He, Xiaojing Du, Chun Ouyang, Zhongxue Gan

    Abstract: Multimodal Sentiment Analysis integrates Linguistic, Visual, and Acoustic. Mainstream approaches based on modality-invariant and modality-specific factorization or on complex fusion still rely on spatiotemporal mixed modeling. This ignores spatiotemporal heterogeneity, leading to spatiotemporal information asymmetry and thus limited performance. Hence, we propose TSDA, Temporal-Spatial Decouple be… ▽ More

    Submitted 20 January, 2026; originally announced January 2026.

    Comments: This study has been accepted by IEEE ICASSP2026

  14. arXiv:2601.07597  [pdf, ps, other

    cs.NE cs.AI

    Pheromone-Focused Ant Colony Optimization algorithm for path planning

    Authors: Yi Liu, Hongda Zhang, Zhongxue Gan, Yuning Chen, Ziqing Zhou, Chunlei Meng, Chun Ouyang

    Abstract: Ant Colony Optimization (ACO) is a prominent swarm intelligence algorithm extensively applied to path planning. However, traditional ACO methods often exhibit shortcomings, such as blind search behavior and slow convergence within complex environments. To address these challenges, this paper proposes the Pheromone-Focused Ant Colony Optimization (PFACO) algorithm, which introduces three key strate… ▽ More

    Submitted 12 January, 2026; originally announced January 2026.

    Comments: Accepted to 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

  15. arXiv:2601.05686  [pdf, ps, other

    cs.IT eess.SP

    Secure Multiuser Beamforming With Movable Antenna Arrays

    Authors: Zhenqiao Cheng, Chongjun Ouyang, Boqun Zhao, Xingqi Zhang

    Abstract: A movable antennas (MAs)-enabled secure multiuser transmission framework is developed to enhance physical-layer security. Novel expressions are derived to characterize the achievable sum secrecy rate based on the secure channel coding theorem. On this basis, a joint optimization algorithm for digital beamforming and MA placement is proposed to maximize the sum secrecy rate via fractional programmi… ▽ More

    Submitted 9 January, 2026; originally announced January 2026.

    Comments: 6 pages; code available at https://github.com/DragonAim0597/Secure-Multiuser-Beamforming-With-Movable-Antenna-Arrays

  16. arXiv:2512.21652  [pdf

    eess.IV cs.AI physics.med-ph

    Enabling Ultra-Fast Cardiovascular Imaging Across Heterogeneous Clinical Environments with a Generalist Foundation Model and Multimodal Database

    Authors: Zi Wang, Mingkai Huang, Zhang Shi, Hongjie Hu, Lan Lan, Hui Zhang, Yan Li, Xi Hu, Qing Lu, Zongming Zhu, Qiong Yao, Yuxiang Dai, Fanwen Wang, Yinzhe Wu, Jun Lyu, Qianqian Gao, Guangming Xu, Zhenxuan Zhang, Haosen Zhang, Qing Li, Guangming Wang, Tianxing He, Lizhen Lan, Siyue Li, Le Xue , et al. (39 additional authors not shown)

    Abstract: Multimodal cardiovascular magnetic resonance (CMR) imaging provides comprehensive and non-invasive insights into cardiovascular disease (CVD) diagnosis and underlying mechanisms. Despite decades of advancements, its widespread clinical adoption remains constrained by prolonged scan times and heterogeneity across medical environments. This underscores the urgent need for a generalist reconstruction… ▽ More

    Submitted 25 December, 2025; originally announced December 2025.

    Comments: Github: https://github.com/wangziblake/CardioMM_MMCMR-427K

  17. arXiv:2511.08720  [pdf, ps, other

    eess.SP cs.IT

    Dynamic and Static Energy Efficient Design of Pinching Antenna Systems

    Authors: Saba Asaad, Chongjun Ouyang, Ali Bereyhi, Zhiguo Ding

    Abstract: We study the energy efficiency of pinching-antenna systems (PASSs) by developing a consistent formulation for power distribution in these systems. The per-antenna power distribution in PASSs is not controlled explicitly by a power allocation policy, but rather implicitly through tuning of pinching couplings and locations. Both these factors are tunable: (i) pinching locations are tuned using movab… ▽ More

    Submitted 15 February, 2026; v1 submitted 11 November, 2025; originally announced November 2025.

    Comments: To be presented at IEEE International Conference on Communications (ICC 2026): Symposium on Wireless Communications. 6 pages, 4 figures, 3 algorithms

  18. arXiv:2510.08953  [pdf, ps, other

    cs.RO eess.SY

    Direct Data-Driven Predictive Control for a Three-dimensional Cable-Driven Soft Robotic Arm

    Authors: Cheng Ouyang, Moeen Ul Islam, Dong Chen, Kaixiang Zhang, Zhaojian Li, Xiaobo Tan

    Abstract: Soft robots offer significant advantages in safety and adaptability, yet achieving precise and dynamic control remains a major challenge due to their inherently complex and nonlinear dynamics. Recently, Data-enabled Predictive Control (DeePC) has emerged as a promising model-free approach that bypasses explicit system identification by directly leveraging input-output data. While DeePC has shown s… ▽ More

    Submitted 19 March, 2026; v1 submitted 9 October, 2025; originally announced October 2025.

  19. arXiv:2510.00861  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs

    Authors: Ziliang Wang, Kang An, Xuhui Zheng, Faqiang Qian, Weikun Zhang, Cijun Ouyang, Jialu Cai, Yuhang Wang, Yichao Wu

    Abstract: While search-augmented large language models (LLMs) exhibit impressive capabilities, their reliability in complex multi-hop reasoning remains limited. This limitation arises from three fundamental challenges: decomposition errors, where tasks are incorrectly broken down; retrieval missing, where key evidence fails to be retrieved; and reasoning errors, where flawed logic propagates through the rea… ▽ More

    Submitted 1 October, 2025; originally announced October 2025.

    Comments: 10 pages, 4 figures

  20. arXiv:2509.23803  [pdf, ps, other

    cs.LG cs.AI cs.CV cs.DC cs.MA

    FedAgentBench: Towards Automating Real-world Federated Medical Image Analysis with Server-Client LLM Agents

    Authors: Pramit Saha, Joshua Strong, Divyanshu Mishra, Cheng Ouyang, J. Alison Noble

    Abstract: Federated learning (FL) allows collaborative model training across healthcare sites without sharing sensitive patient data. However, real-world FL deployment is often hindered by complex operational challenges that demand substantial human efforts. This includes: (a) selecting appropriate clients (hospitals), (b) coordinating between the central server and clients, (c) client-level data pre-proces… ▽ More

    Submitted 28 September, 2025; originally announced September 2025.

  21. arXiv:2509.12815  [pdf, ps, other

    cs.CV

    Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

    Authors: Biwen Lei, Yang Li, Xinhai Liu, Shuhui Yang, Lixin Xu, Jingwei Huang, Ruining Tang, Haohan Weng, Jian Liu, Jing Xu, Zhen Zhou, Yiling Zhu, Jiankai Xing, Jiachen Xu, Changfeng Ma, Xinhao Yan, Yunhan Yang, Chunshi Wang, Duoteng Xu, Xueqi Ma, Yuguang Chen, Jing Li, Mingxin Yang, Sheng Zhang, Yifei Feng , et al. (75 additional authors not shown)

    Abstract: The creation of high-quality 3D assets, a cornerstone of modern game development, has long been characterized by labor-intensive and specialized workflows. This paper presents Hunyuan3D Studio, an end-to-end AI-powered content creation platform designed to revolutionize the game production pipeline by automating and streamlining the generation of game-ready 3D assets. At its core, Hunyuan3D Studio… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

    Comments: Technical Report

  22. arXiv:2509.06467  [pdf, ps, other

    cs.CV

    Does DINOv3 Set a New Medical Vision Standard? Benchmarking 2D and 3D Classification, Segmentation, and Registration

    Authors: Che Liu, Yinda Chen, Haoyuan Shi, Jinpeng Lu, Bailiang Jian, Jiazhen Pan, Linghan Cai, Jiayi Wang, Jieming Yu, Ziqi Gao, Xiaoran Zhang, Long Bai, Yundi Zhang, Jun Li, Cosmin I. Bercea, Cheng Ouyang, Chen Chen, Zhiwei Xiong, Benedikt Wiestler, Christian Wachinger, James S. Duncan, Daniel Rueckert, Wenjia Bai, Rossella Arcucci

    Abstract: The advent of large-scale vision foundation models, pre-trained on diverse natural images, has marked a paradigm shift in computer vision. However, how the frontier vision foundation models' efficacies transfer to specialised domains such as medical imaging remains an open question. This report investigates whether DINOv3, a state-of-the-art self-supervised vision transformer (ViT) pre-trained on… ▽ More

    Submitted 17 January, 2026; v1 submitted 8 September, 2025; originally announced September 2025.

    Comments: Technical Report

  23. arXiv:2509.05941  [pdf, ps, other

    cs.SE cs.LG cs.MA

    Code2MCP: Transforming Code Repositories into MCP Services

    Authors: Chaoqian Ouyang, Ling Yue, Shimin Di, Libin Zheng, Linan Yue, Shaowu Pan, Jian Yin, Min-Ling Zhang

    Abstract: The Model Context Protocol (MCP) aims to create a standard for how Large Language Models use tools. However, most current research focuses on selecting tools from an existing pool. A more fundamental, yet largely overlooked, problem is how to populate this pool by converting the vast number of existing software projects into MCP-compatible services. To bridge this gap, we introduce Code2MCP, an ag… ▽ More

    Submitted 10 February, 2026; v1 submitted 7 September, 2025; originally announced September 2025.

  24. arXiv:2509.05612  [pdf, ps, other

    cs.IT

    Multiport Network Modeling and Optimization for Reconfigurable Pinching-Antenna Systems

    Authors: Zhaolin Wang, Jiaqi Xu, Chongjun Ouyang, Xidong Mu, Yuanwei Liu

    Abstract: A reconfigurable pinching-antenna system (PASS) is presented, endowing pinching antennas (PAs) with both amplitude- and phase-controllable radiation beyond conventional implementations. To characterize this feature, a general and physically consistent model is established for PASS via multiport network theory. Within this model, the fundamental constraint of ideal reconfigurability of PAs is ident… ▽ More

    Submitted 6 September, 2025; originally announced September 2025.

    Comments: 13 pages, 9 figures

  25. A Multi-stage Low-latency Enhancement System for Hearing Aids

    Authors: Chengwei Ouyang, Kexin Fei, Haoshuai Zhou, Congxi Lu, Linkai Li

    Abstract: This paper proposes an end-to-end system for the ICASSP 2023 Clarity Challenge. In this work, we introduce four major novelties: (1) a novel multi-stage system in both the magnitude and complex domains to better utilize phase information; (2) an asymmetric window pair to achieve higher frequency resolution with the 5ms latency constraint; (3) the integration of head rotation information and the mi… ▽ More

    Submitted 6 August, 2025; originally announced August 2025.

    Comments: 2 pages, 1 figure, 1 table. accepted to ICASSP 2023

  26. arXiv:2508.00923  [pdf, ps, other

    cs.LG

    Beyond Benchmarks: Dynamic, Automatic And Systematic Red-Teaming Agents For Trustworthy Medical Language Models

    Authors: Jiazhen Pan, Bailiang Jian, Paul Hager, Yundi Zhang, Che Liu, Friedrike Jungmann, Hongwei Bran Li, Chenyu You, Junde Wu, Jiayuan Zhu, Fenglin Liu, Yuyuan Liu, Niklas Bubeck, Christian Wachinger, Chen, Chen, Zhenyu Gong, Cheng Ouyang, Georgios Kaissis, Benedikt Wiestler, Daniel Rueckert

    Abstract: Ensuring the safety and reliability of large language models (LLMs) in clinical practice is critical to prevent patient harm. However, LLMs are advancing so rapidly that static benchmarks quickly become obsolete or prone to overfitting, yielding a misleading picture of model trustworthiness. Here we introduce a Dynamic, Automatic, and Systematic (DAS) red-teaming framework that continuously stress… ▽ More

    Submitted 9 March, 2026; v1 submitted 30 July, 2025; originally announced August 2025.

  27. arXiv:2507.23763  [pdf, ps, other

    eess.IV cs.CV

    Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic

    Authors: Liu Li, Qiang Ma, Cheng Ouyang, Johannes C. Paetzold, Daniel Rueckert, Bernhard Kainz

    Abstract: Deep learning-based medical image segmentation techniques have shown promising results when evaluated based on conventional metrics such as the Dice score or Intersection-over-Union. However, these fully automatic methods often fail to meet clinically acceptable accuracy, especially when topological constraints should be observed, e.g., continuous boundaries or closed surfaces. In medical image se… ▽ More

    Submitted 5 August, 2025; v1 submitted 31 July, 2025; originally announced July 2025.

  28. Extreme Cardiac MRI Analysis under Respiratory Motion: Results of the CMRxMotion Challenge

    Authors: Kang Wang, Chen Qin, Zhang Shi, Haoran Wang, Xiwen Zhang, Chen Chen, Cheng Ouyang, Chengliang Dai, Yuanhan Mo, Chenchen Dai, Xutong Kuang, Ruizhe Li, Xin Chen, Xiuzheng Yue, Song Tian, Alejandro Mora-Rubio, Kumaradevan Punithakumar, Shizhan Gong, Qi Dou, Sina Amirrajab, Yasmina Al Khalil, Cian M. Scannell, Lexiaozi Fan, Huili Yang, Xiaowu Sun , et al. (24 additional authors not shown)

    Abstract: Deep learning models have achieved state-of-the-art performance in automated Cardiac Magnetic Resonance (CMR) analysis. However, the efficacy of these models is highly dependent on the availability of high-quality, artifact-free images. In clinical practice, CMR acquisitions are frequently degraded by respiratory motion, yet the robustness of deep learning models against such artifacts remains an… ▽ More

    Submitted 25 July, 2025; originally announced July 2025.

  29. Crafting Imperceptible On-Manifold Adversarial Attacks for Tabular Data

    Authors: Zhipeng He, Alexander Stevens, Chun Ouyang, Johannes De Smedt, Alistair Barros, Catarina Moreira

    Abstract: Adversarial attacks on tabular data present unique challenges due to the heterogeneous nature of mixed categorical and numerical features. Unlike images where pixel perturbations maintain visual similarity, tabular data lacks intuitive similarity metrics, making it difficult to define imperceptible modifications. Additionally, traditional gradient-based methods prioritise $\ell_p$-norm constraints… ▽ More

    Submitted 21 November, 2025; v1 submitted 15 July, 2025; originally announced July 2025.

    Comments: Final Version

    Journal ref: Applied Soft Computing 186, Part D (2026) 114286

  30. arXiv:2507.08119  [pdf, ps, other

    cs.NI

    Photonic Rails in ML Datacenters

    Authors: Eric Ding, Chuhan Ouyang, Rachee Singh

    Abstract: Rail-optimized network fabrics have become the de facto datacenter scale-out fabric for large-scale ML training. However, the use of high-radix electrical switches to provide all-to-all connectivity in rails imposes massive power, cost, and complexity overheads. We propose a rethinking of the rail abstraction by retaining its communication semantics, but realizing it using optical circuit switches… ▽ More

    Submitted 10 July, 2025; originally announced July 2025.

  31. arXiv:2506.14298  [pdf, ps, other

    cs.IT eess.SP

    Capacity Characterization of Pinching-Antenna Systems

    Authors: Chongjun Ouyang, Zhaolin Wang, Yuanwei Liu, Zhiguo Ding

    Abstract: Unlike conventional systems using a fixed-location antenna, the channel capacity of the pinching-antenna system (PASS) is determined by the activated positions of pinching antennas. This article characterizes the capacity region of multiuser PASS, where a single pinched waveguide is deployed to enable both uplink and downlink communications. The capacity region of the uplink channel is first chara… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: submit to possible IEEE journal

  32. arXiv:2506.09570  [pdf, ps, other

    cs.IT eess.SP

    Spectral Efficiency Maximization for DMA-enabled Multiuser MISO with Statistical CSI

    Authors: Hao Xu, Boyu Ning, Chongjun Ouyang, Hongwen Yang

    Abstract: Dynamic metasurface antennas (DMAs) offer the potential to achieve large-scale antenna arrays with low power consumption and reduced hardware costs, making them a promising technology for future communication systems. This paper investigates the spectral efficiency (SE) of DMA-enabled multiuser multiple-input single-output (MISO) systems in both uplink and downlink transmissions, using only statis… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  33. TabAttackBench: A Benchmark for Adversarial Attacks on Tabular Data

    Authors: Zhipeng He, Chun Ouyang, Lijie Wen, Cong Liu, Catarina Moreira

    Abstract: Adversarial attacks pose a significant threat to machine learning models by inducing incorrect predictions through imperceptible perturbations to input data. While these attacks are well studied in unstructured domains such as images, their behaviour on tabular data remains underexplored due to mixed feature types and complex inter-feature dependencies. This study introduces a comprehensive benchm… ▽ More

    Submitted 12 October, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 71 pages, 21 figures, 11 tables

    Journal ref: Expert Systems with Applications 301 (2026) 130491

  34. arXiv:2505.19389  [pdf, ps, other

    cs.DB

    Curation and Analysis of MIMICEL -- An Event Log for MIMIC-IV Emergency Department

    Authors: Jia Wei, Chun Ouyang, Bemali Wickramanayake, Zhipeng He, Keshara Perera, Catarina Moreira

    Abstract: The global issue of overcrowding in emergency departments (ED) necessitates the analysis of patient flow through ED to enhance efficiency and alleviate overcrowding. However, traditional analytical methods are time-consuming and costly. The healthcare industry is embracing process mining tools to analyse healthcare processes and patient flows. Process mining aims to discover, monitor, and enhance… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  35. arXiv:2505.15107  [pdf, other

    cs.CL cs.AI cs.IR

    StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization

    Authors: Ziliang Wang, Xuhui Zheng, Kang An, Cijun Ouyang, Jialu Cai, Yuhang Wang, Yichao Wu

    Abstract: Efficient multi-hop reasoning requires Large Language Models (LLMs) based agents to acquire high-value external knowledge iteratively. Previous work has explored reinforcement learning (RL) to train LLMs to perform search-based document retrieval, achieving notable improvements in QA performance, but underperform on complex, multi-hop QA resulting from the sparse rewards from global signal only. T… ▽ More

    Submitted 26 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: 20 pages, 6 figures

  36. arXiv:2505.06998  [pdf, ps, other

    cs.SI physics.soc-ph

    Assessing the Robustness and Reducibility of Multiplex Networks with Embedding-Aided Interlayer Similarities

    Authors: Haoran Nan, Senquan Wang, Chun Ouyang, Yanchen Zhou, Weiwei Gu

    Abstract: The study of interlayer similarity of multiplex networks helps to understand the intrinsic structure of complex systems, revealing how changes in one layer can propagate and affect others, thus enabling broad implications for transportation, social, and biological systems. Existing algorithms that measure similarity between network layers typically encode only partial information, which limits the… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  37. arXiv:2505.06647  [pdf, other

    cs.CV

    Dataset Distillation with Probabilistic Latent Features

    Authors: Zhe Li, Sarah Cechnicka, Cheng Ouyang, Katharina Breininger, Peter Schüffler, Bernhard Kainz

    Abstract: As deep learning models grow in complexity and the volume of training data increases, reducing storage and computational costs becomes increasingly important. Dataset distillation addresses this challenge by synthesizing a compact set of synthetic data that can effectively replace the original dataset in downstream classification tasks. While existing methods typically rely on mapping data from pi… ▽ More

    Submitted 17 May, 2025; v1 submitted 10 May, 2025; originally announced May 2025.

    Comments: 23 pages

  38. arXiv:2504.16577  [pdf, ps, other

    eess.SP cs.IT

    Uplink Sum Rate Maximization for Pinching Antenna-Assisted Multiuser MISO

    Authors: Jiarui Zhang, Hao Xu, Chongjun Ouyang, Qiuyun Zou, Hongwen Yang

    Abstract: This article investigates the application of pinching-antenna systems (PASS) in multiuser multiple-input single-output (MISO) communications. Two sum-rate maximization problems are formulated under minimum mean square error (MMSE) decoding, with and without successive interference cancellation (SIC). To address the joint optimization of pinching antenna locations and user transmit powers, a fracti… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  39. arXiv:2504.13599  [pdf, other

    eess.IV cs.CV

    ViG3D-UNet: Volumetric Vascular Connectivity-Aware Segmentation via 3D Vision Graph Representation

    Authors: Bowen Liu, Chunlei Meng, Wei Lin, Hongda Zhang, Ziqing Zhou, Zhongxue Gan, Chun Ouyang

    Abstract: Accurate vascular segmentation is essential for coronary visualization and the diagnosis of coronary heart disease. This task involves the extraction of sparse tree-like vascular branches from the volumetric space. However, existing methods have faced significant challenges due to discontinuous vascular segmentation and missing endpoints. To address this issue, a 3D vision graph neural network fra… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  40. arXiv:2504.00181  [pdf, ps, other

    cs.IT

    Beamforming Design for Continuous Aperture Array (CAPA)-Based MIMO Systems

    Authors: Zhaolin Wang, Chongjun Ouyang, Yuanwei Liu

    Abstract: An efficient beamforming design is proposed for continuous aperture array (CAPA)-based point-to-point multiple-input multiple-output (MIMO) systems. In contrast to conventional spatially discrete array (SPDA)-MIMO systems, whose optimal beamforming can be obtained using singular-value decomposition, CAPA-MIMO systems require solving the eigendecomposition of a Hermitian kernel operator, which is c… ▽ More

    Submitted 18 July, 2025; v1 submitted 31 March, 2025; originally announced April 2025.

    Comments: 16 pages, 10 figures

  41. arXiv:2503.15414  [pdf, ps, other

    eess.IV cs.CV

    Federated Continual 3D Segmentation With Single-round Communication

    Authors: Can Peng, Qianhui Men, Pramit Saha, Qianye Yang, Cheng Ouyang, J. Alison Noble

    Abstract: Federated learning seeks to foster collaboration among distributed clients while preserving the privacy of their local data. Traditionally, federated learning methods assume a fixed setting in which client data and learning objectives remain constant. However, in real-world scenarios, new clients may join, and existing clients may expand the segmentation label set as task requirements evolve. In s… ▽ More

    Submitted 16 November, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

  42. arXiv:2503.03117  [pdf, ps, other

    cs.IT eess.SP

    MIMO-PASS: Uplink and Downlink Transmission via MIMO Pinching-Antenna Systems

    Authors: Ali Bereyhi, Chongjun Ouyang, Saba Asaad, Zhiguo Ding, H. Vincent Poor

    Abstract: Pinching-antenna systems (PASSs) are a recent flexible-antenna technology that is realized by attaching simple components, referred to as pinching elements, to dielectric waveguides. This work explores the potential of deploying PASS for uplink and downlink transmission in multiuser MIMO settings. For downlink PASS-aided communication, we formulate the optimal hybrid beamforming, in which the digi… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 13 pages and 10 figures

  43. arXiv:2502.19634  [pdf, other

    cs.CV cs.AI

    MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

    Authors: Jiazhen Pan, Che Liu, Junde Wu, Fenglin Liu, Jiayuan Zhu, Hongwei Bran Li, Chen Chen, Cheng Ouyang, Daniel Rueckert

    Abstract: Reasoning is a critical frontier for advancing medical image analysis, where transparency and trustworthiness play a central role in both clinician trust and regulatory approval. Although Medical Visual Language Models (VLMs) show promise for radiological tasks, most existing VLMs merely produce final answers without revealing the underlying reasoning. To address this gap, we introduce MedVLM-R1,… ▽ More

    Submitted 19 March, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  44. arXiv:2502.17900  [pdf, other

    cs.LG cs.AI

    Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs

    Authors: Che Liu, Cheng Ouyang, Zhongwei Wan, Haozhe Wang, Wenjia Bai, Rossella Arcucci

    Abstract: Recent advances in multimodal ECG representation learning center on aligning ECG signals with paired free-text reports. However, suboptimal alignment persists due to the complexity of medical language and the reliance on a full 12-lead setup, which is often unavailable in under-resourced settings. To tackle these issues, we propose **K-MERL**, a knowledge-enhanced multimodal ECG representation lea… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  45. arXiv:2502.10533  [pdf, ps, other

    cs.LG cs.HC

    Identity-Free Deferral For Unseen Experts

    Authors: Joshua Strong, Pramit Saha, Yasin Ibrahim, Cheng Ouyang, Alison Noble

    Abstract: Learning to Defer (L2D) improves AI reliability in decision-critical environments by training AI to either make its own prediction or defer the decision to a human expert. A key challenge is adapting to unseen experts at test time, whose competence can differ from the training population. Current methods for this task, however, can falter when unseen experts are out-of-distribution (OOD) relative… ▽ More

    Submitted 2 March, 2026; v1 submitted 14 February, 2025; originally announced February 2025.

    Comments: Fourteenth International Conference on Learning Representations (ICLR) 2026

  46. Downlink and Uplink ISAC in Continuous-Aperture Array (CAPA) Systems

    Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Hyundong Shin, Yuanwei Liu

    Abstract: A continuous-aperture array (CAPA)-based integrated sensing and communications (ISAC) framework is proposed for both downlink and uplink scenarios. Within this framework, continuous operator-based signal models are employed to describe the sensing and communication processes. The performance of communication and sensing is analyzed using two information-theoretic metrics: the communication rate (C… ▽ More

    Submitted 20 July, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 16 pages, 12 figures

    Journal ref: IEEE Trans. Wireless Commun., vol. 25, pp. 3592-3609, 2026

  47. arXiv:2502.05917  [pdf, ps, other

    cs.IT

    Modeling and Beamforming Optimization for Pinching-Antenna Systems

    Authors: Zhaolin Wang, Chongjun Ouyang, Xidong Mu, Yuanwei Liu, Zhiguo Ding

    Abstract: The Pinching-Antenna SyStem (PASS) is a revolutionary flexible antenna technology designed to enhance wireless communication by establishing strong line-of-sight (LoS) links, reducing free-space path loss and enabling antenna array reconfigurability. PASS uses dielectric waveguides with low propagation loss for signal transmission, radiating via a passive pinching antenna, which is a small dielect… ▽ More

    Submitted 12 June, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: 15 pages, 12 figures

  48. arXiv:2502.01590  [pdf, other

    cs.IT eess.SP

    Downlink Beamforming with Pinching-Antenna Assisted MIMO Systems

    Authors: Ali Bereyhi, Saba Asaad, Chongjun Ouyang, Zhiguo Ding, H. Vincent Poor

    Abstract: Pinching antennas have been recently proposed as a promising flexible-antenna technology, which can be implemented by attaching low-cost pinching elements to dielectric waveguides. This work explores the potential of employing pinching antenna systems (PASs) for downlink transmission in a multiuser MIMO setting. We consider the problem of hybrid beamforming, where the digital precoder at the acces… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  49. arXiv:2412.13748  [pdf, ps, other

    cs.IT eess.SP

    On the Performance of Physical Layer Security for Continuous-Aperture Array (CAPA) Systems

    Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

    Abstract: A continuous-aperture array (CAPA)-based secure transmission framework is proposed to enhance physical layer security. Continuous current distributions, or beamformers, are designed to maximize the secrecy transmission rate under a power constraint and to minimize the required transmission power for achieving a specific target secrecy rate. On this basis, the fundamental secrecy performance limits… ▽ More

    Submitted 2 April, 2026; v1 submitted 18 December, 2024; originally announced December 2024.

  50. arXiv:2411.15513  [pdf, ps, other

    eess.IV cs.CV

    SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation

    Authors: Jiayuan Zhu, Junde Wu, Cheng Ouyang, Konstantinos Kamnitsas, J. Alison Noble

    Abstract: Medical image segmentation data inherently contain uncertainty. This can stem from both imperfect image quality and variability in labeling preferences on ambiguous pixels, which depend on annotator expertise and the clinical context of the annotations. For instance, a boundary pixel might be labeled as tumor in diagnosis to avoid under-estimation of severity, but as normal tissue in radiotherapy… ▽ More

    Submitted 16 July, 2025; v1 submitted 23 November, 2024; originally announced November 2024.