Skip to main content

Showing 1–19 of 19 results for author: Xuan, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.04999  [pdf, ps, other

    cs.CV

    Physics-consistent deep learning for blind aberration recovery in mobile optics

    Authors: Kartik Jhawar, Tamo Sancho Miguel Tandoc, Khoo Jun Xuan, Wang Lipo

    Abstract: Mobile photography is often limited by complex, lens-specific optical aberrations. While recent deep learning methods approach this as an end-to-end deblurring task, these "black-box" models lack explicit optical modeling and can hallucinate details. Conversely, classical blind deconvolution remains highly unstable. To bridge this gap, we present Lens2Zernike, a deep learning framework that blindl… ▽ More

    Submitted 5 March, 2026; originally announced March 2026.

    Comments: 4 pages, 3 figures

  2. arXiv:2602.07276  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs

    Authors: Pengrui Han, Xueqiang Xu, Keyang Xuan, Peiyang Song, Siru Ouyang, Runchu Tian, Yuqing Jiang, Cheng Qian, Pengcheng Jiang, Jiashuo Sun, Junxia Cui, Ming Zhong, Ge Liu, Jiawei Han, Jiaxuan You

    Abstract: Activation steering has emerged as a promising approach for efficiently adapting large language models (LLMs) to downstream behaviors. However, most existing steering methods rely on a single static direction per task or concept, making them inflexible under task variation and inadequate for complex tasks that require multiple coordinated capabilities. To address this limitation, we propose STEER2… ▽ More

    Submitted 6 February, 2026; originally announced February 2026.

  3. arXiv:2602.05115  [pdf, ps, other

    cs.AI cs.CL

    SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers

    Authors: Keyang Xuan, Pengda Wang, Chongrui Ye, Haofei Yu, Tal August, Jiaxuan You

    Abstract: Large language models (LLMs) are increasingly evaluated in interactive environments to test their social intelligence. However, existing benchmarks often assume idealized communication between agents, limiting our ability to diagnose whether LLMs can maintain and repair interactions in more realistic, imperfect settings. To close this gap, we present \textsc{SocialVeil}, a social learning environm… ▽ More

    Submitted 4 February, 2026; originally announced February 2026.

    Comments: 10 pages

  4. arXiv:2510.06579  [pdf, ps, other

    cs.CL

    TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents

    Authors: Haofei Yu, Keyang Xuan, Fenghai Li, Kunlun Zhu, Zijie Lei, Jiaxun Zhang, Ziheng Qi, Kyle Richardson, Jiaxuan You

    Abstract: Automatic research with Large Language Models (LLMs) is rapidly gaining importance, driving the development of increasingly complex workflows involving multi-agent systems, planning, tool usage, code execution, and human-agent interaction to accelerate research processes. However, as more researchers and developers begin to use and build upon these tools and platforms, the complexity and difficult… ▽ More

    Submitted 7 October, 2025; originally announced October 2025.

    Comments: 7 pages, EMNLP 2025 Demo track

  5. arXiv:2508.03905  [pdf, ps, other

    cs.CL

    Sotopia-RL: Reward Design for Social Intelligence

    Authors: Haofei Yu, Zhengyang Qi, Yining Zhao, Kolby Nottingham, Keyang Xuan, Bodhisattwa Prasad Majumder, Hao Zhu, Paul Pu Liang, Jiaxuan You

    Abstract: Social intelligence has become a critical capability for large language models (LLMs), enabling them to engage effectively in real-world social tasks such as collaboration and negotiation. Reinforcement learning (RL) is a natural fit for training socially intelligent agents because it allows models to learn sophisticated strategies directly through social interactions without requiring human annot… ▽ More

    Submitted 7 October, 2025; v1 submitted 5 August, 2025; originally announced August 2025.

    Comments: 10 pages

  6. arXiv:2504.09897  [pdf, other

    cs.CV

    TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models

    Authors: Jaewoo Lee, Keyang Xuan, Chanakya Ekbote, Sandeep Polisetty, Yi R. Fung, Paul Pu Liang

    Abstract: Multimodal Large Language Models (MLLMs) have shown remarkable versatility in understanding diverse multimodal data and tasks. However, these capabilities come with an increased model scale. While post-training pruning reduces model size in unimodal models, its application to MLLMs often yields limited success. Our analysis discovers that conventional methods fail to account for the unique token a… ▽ More

    Submitted 17 May, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: ACL Findings 2025

  7. arXiv:2503.12399  [pdf, other

    cs.CV eess.IV

    Pathology Image Restoration via Mixture of Prompts

    Authors: Jiangdong Cai, Yan Chen, Zhenrong Shen, Haotian Jiang, Honglin Xiong, Kai Xuan, Lichi Zhang, Qian Wang

    Abstract: In digital pathology, acquiring all-in-focus images is essential to high-quality imaging and high-efficient clinical workflow. Traditional scanners achieve this by scanning at multiple focal planes of varying depths and then merging them, which is relatively slow and often struggles with complex tissue defocus. Recent prevailing image restoration technique provides a means to restore high-quality… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  8. arXiv:2412.17767  [pdf, ps, other

    cs.CL cs.LG

    ResearchTown: Simulator of Human Research Community

    Authors: Haofei Yu, Zhaochen Hong, Zirui Cheng, Kunlun Zhu, Keyang Xuan, Jinwei Yao, Tao Feng, Jiaxuan You

    Abstract: Large Language Models (LLMs) have demonstrated remarkable potential in scientific domains, yet a fundamental question remains unanswered: Can we simulate human research communities with LLMs? Addressing this question can deepen our understanding of the processes behind idea brainstorming and inspire the automatic discovery of novel scientific insights. In this work, we propose ResearchTown, a mult… ▽ More

    Submitted 6 June, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: 9 pages, ICML 2025

  9. arXiv:2409.16526  [pdf, other

    cs.CR

    APILOT: Navigating Large Language Models to Generate Secure Code by Sidestepping Outdated API Pitfalls

    Authors: Weiheng Bai, Keyang Xuan, Pengxiang Huang, Qiushi Wu, Jianing Wen, Jingjing Wu, Kangjie Lu

    Abstract: With the rapid development of large language models (LLMs), their applications have expanded into diverse fields, such as code assistance. However, the substantial size of LLMs makes their training highly resource- and time-intensive, rendering frequent retraining or updates impractical. Consequently, time-sensitive data can become outdated, potentially misleading LLMs in time-aware tasks. For exa… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  10. arXiv:2402.11943  [pdf, other

    cs.CL

    LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation

    Authors: Keyang Xuan, Li Yi, Fan Yang, Ruochen Wu, Yi R. Fung, Heng Ji

    Abstract: The rise of multimodal misinformation on social platforms poses significant challenges for individuals and societies. Its increased credibility and broader impact compared to textual misinformation make detection complex, requiring robust reasoning across diverse media types and profound knowledge for accurate verification. The emergence of Large Vision Language Model (LVLM) offers a potential sol… ▽ More

    Submitted 20 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  11. arXiv:2304.07756  [pdf, other

    eess.IV cs.CV

    Arbitrary Reduction of MRI Inter-slice Spacing Using Hierarchical Feature Conditional Diffusion

    Authors: Xin Wang, Zhenrong Shen, Zhiyun Song, Sheng Wang, Mengjun Liu, Lichi Zhang, Kai Xuan, Qian Wang

    Abstract: Magnetic resonance (MR) images collected in 2D scanning protocols typically have large inter-slice spacing, resulting in high in-plane resolution but reduced through-plane resolution. Super-resolution techniques can reduce the inter-slice spacing of 2D scanned MR images, facilitating the downstream visual experience and computer-aided diagnosis. However, most existing super-resolution methods are… ▽ More

    Submitted 15 September, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: new version

  12. arXiv:2210.07522  [pdf, other

    cs.LG

    Spatiotemporal Classification with limited labels using Constrained Clustering for large datasets

    Authors: Praveen Ravirathinam, Rahul Ghosh, Ke Wang, Keyang Xuan, Ankush Khandelwal, Hilary Dugan, Paul Hanson, Vipin Kumar

    Abstract: Creating separable representations via representation learning and clustering is critical in analyzing large unstructured datasets with only a few labels. Separable representations can lead to supervised models with better classification capabilities and additionally aid in generating new labeled samples. Most unsupervised and semisupervised methods to analyze large datasets do not leverage the ex… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 9 pages

  13. arXiv:2208.06099  [pdf, other

    eess.IV cs.CV

    TBI-GAN: An Adversarial Learning Approach for Data Synthesis on Traumatic Brain Segmentation

    Authors: Xiangyu Zhao, Di Zang, Sheng Wang, Zhenrong Shen, Kai Xuan, Zeyu Wei, Zhe Wang, Ruizhe Zheng, Xuehai Wu, Zheren Li, Qian Wang, Zengxin Qi, Lichi Zhang

    Abstract: Brain network analysis for traumatic brain injury (TBI) patients is critical for its consciousness level assessment and prognosis evaluation, which requires the segmentation of certain consciousness-related brain regions. However, it is difficult to construct a TBI segmentation model as manually annotated MR scans of TBI patients are hard to collect. Data augmentation techniques can be applied to… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  14. arXiv:2205.11346  [pdf, other

    eess.IV cs.CV

    Spatial Attention-based Implicit Neural Representation for Arbitrary Reduction of MRI Slice Spacing

    Authors: Xin Wang, Sheng Wang, Honglin Xiong, Kai Xuan, Zixu Zhuang, Mengjun Liu, Zhenrong Shen, Xiangyu Zhao, Lichi Zhang, Qian Wang

    Abstract: Magnetic resonance (MR) images collected in 2D clinical protocols typically have large inter-slice spacing, resulting in high in-plane resolution and reduced through-plane resolution. Super-resolution technique can enhance the through-plane resolution of MR images to facilitate downstream visualization and computer-aided diagnosis. However, most existing works train the super-resolution network at… ▽ More

    Submitted 19 March, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  15. arXiv:2201.04318  [pdf, other

    eess.IV cs.CV

    Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution

    Authors: Zixu Zhuang, Liping Si, Sheng Wang, Kai Xuan, Xi Ouyang, Yiqiang Zhan, Zhong Xue, Lichi Zhang, Dinggang Shen, Weiwu Yao, Qian Wang

    Abstract: Knee osteoarthritis (OA) is the most common osteoarthritis and a leading cause of disability. Cartilage defects are regarded as major manifestations of knee OA, which are visible by magnetic resonance imaging (MRI). Thus early detection and assessment for knee cartilage defects are important for protecting patients from knee OA. In this way, many attempts have been made on knee cartilage defect as… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 10 pages, 4 figures

  16. Multi-Modal MRI Reconstruction Assisted with Spatial Alignment Network

    Authors: Kai Xuan, Lei Xiang, Xiaoqian Huang, Lichi Zhang, Shu Liao, Dinggang Shen, Qian Wang

    Abstract: In clinical practice, multi-modal magnetic resonance imaging (MRI) with different contrasts is usually acquired in a single study to assess different properties of the same region of interest in the human body. The whole acquisition process can be accelerated by having one or more modalities under-sampled in the $k$-space. Recent research has shown that, considering the redundancy between differen… ▽ More

    Submitted 2 April, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Final version, IEEE Transactions on Medical Imaging, code available at \url{https://github.com/woxuankai/SpatialAlignmentNetwork}

  17. arXiv:2104.11597  [pdf

    cs.AI

    Generalized-TODIM Method for Multi-criteria Decision Making with Basic Uncertain Information and its Application

    Authors: Zhiyuan Zhou, Kai Xuan, Zhifu Tao, Ligang Zhou

    Abstract: Due to the fact that basic uncertain information provides a simple form for decision information with certainty degree, it has been developed to reflect the quality of observed or subjective assessments. In order to study the algebra structure and preference relation of basic uncertain information, we develop some algebra operations for basic uncertain information. The order relation of such type… ▽ More

    Submitted 27 April, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: 24 pages, 2 figure, 1 table

  18. arXiv:2005.09212  [pdf, other

    eess.IV cs.CV

    A Self-ensembling Framework for Semi-supervised Knee Cartilage Defects Assessment with Dual-Consistency

    Authors: Jiayu Huo, Liping Si, Xi Ouyang, Kai Xuan, Weiwu Yao, Zhong Xue, Qian Wang, Dinggang Shen, Lichi Zhang

    Abstract: Knee osteoarthritis (OA) is one of the most common musculoskeletal disorders and requires early-stage diagnosis. Nowadays, the deep convolutional neural networks have achieved greatly in the computer-aided diagnosis field. However, the construction of the deep learning models usually requires great amounts of annotated data, which is generally high-cost. In this paper, we propose a novel approach… ▽ More

    Submitted 12 October, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: accepted by International Workshop on PRedictive Intelligence In MEdicine, 2020

  19. arXiv:2001.03857  [pdf, other

    eess.IV cs.CV

    Robust Brain Magnetic Resonance Image Segmentation for Hydrocephalus Patients: Hard and Soft Attention

    Authors: Xuhua Ren, Jiayu Huo, Kai Xuan, Dongming Wei, Lichi Zhang, Qian Wang

    Abstract: Brain magnetic resonance (MR) segmentation for hydrocephalus patients is considered as a challenging work. Encoding the variation of the brain anatomical structures from different individuals cannot be easily achieved. The task becomes even more difficult especially when the image data from hydrocephalus patients are considered, which often have large deformations and differ significantly from the… ▽ More

    Submitted 12 January, 2020; originally announced January 2020.

    Comments: ISBI 2020