Skip to main content

Showing 1–50 of 1,991 results for author: Yan, J

.
  1. arXiv:2512.21183  [pdf, ps, other

    cs.CV

    Towards Arbitrary Motion Completing via Hierarchical Continuous Representation

    Authors: Chenghao Xu, Guangtao Lyu, Qi Liu, Jiexi Yan, Muli Yang, Cheng Deng

    Abstract: Physical motions are inherently continuous, and higher camera frame rates typically contribute to improved smoothness and temporal coherence. For the first time, we explore continuous representations of human motion sequences, featuring the ability to interpolate, inbetween, and even extrapolate any input motion sequences at arbitrary frame rates. To achieve this, we propose a novel parametric act… ▽ More

    Submitted 24 December, 2025; originally announced December 2025.

  2. arXiv:2512.21174  [pdf, ps, other

    cs.CV

    A Turn Toward Better Alignment: Few-Shot Generative Adaptation with Equivariant Feature Rotation

    Authors: Chenghao Xu, Qi Liu, Jiexi Yan, Muli Yang, Cheng Deng

    Abstract: Few-shot image generation aims to effectively adapt a source generative model to a target domain using very few training images. Most existing approaches introduce consistency constraints-typically through instance-level or distribution-level loss functions-to directly align the distribution patterns of source and target domains within their respective latent spaces. However, these strategies ofte… ▽ More

    Submitted 24 December, 2025; originally announced December 2025.

  3. arXiv:2512.20917  [pdf, ps, other

    eess.SP

    Semantic Radio Access Networks: Architecture, State-of-the-Art, and Future Directions

    Authors: Rui Meng, Zixuan Huang, Jingshu Yan, Mengying Sun, Yiming Liu, Chenyuan Feng, Xiaodong Xu, Zhidi Zhang, Song Gao, Ping Zhang, Tony Q. S. Quek

    Abstract: Radio Access Network (RAN) is a bridge between user devices and the core network in mobile communication systems, responsible for the transmission and reception of wireless signals and air interface management. In recent years, Semantic Communication (SemCom) has represented a transformative communication paradigm that prioritizes meaning-level transmission over conventional bit-level delivery, th… ▽ More

    Submitted 23 December, 2025; originally announced December 2025.

    Comments: 19 pages, 8 figures

  4. arXiv:2512.18813  [pdf, ps, other

    cs.CV

    Revealing Perception and Generation Dynamics in LVLMs: Mitigating Hallucinations via Validated Dominance Correction

    Authors: Guangtao Lyu, Xinyi Cheng, Chenghao Xu, Qi Liu, Muli Yang, Fen Fang, Huilin Chen, Jiexi Yan, Xu Yang, Cheng Deng

    Abstract: Large Vision-Language Models (LVLMs) have shown remarkable capabilities, yet hallucinations remain a persistent challenge. This work presents a systematic analysis of the internal evolution of visual perception and token generation in LVLMs, revealing two key patterns. First, perception follows a three-stage GATE process: early layers perform a Global scan, intermediate layers Approach and Tighten… ▽ More

    Submitted 21 December, 2025; originally announced December 2025.

  5. arXiv:2512.18804  [pdf, ps, other

    cs.CV cs.MM cs.SD

    Tempo as the Stable Cue: Hierarchical Mixture of Tempo and Beat Experts for Music to 3D Dance Generation

    Authors: Guangtao Lyu, Chenghao Xu, Qi Liu, Jiexi Yan, Muli Yang, Fen Fang, Cheng Deng

    Abstract: Music to 3D dance generation aims to synthesize realistic and rhythmically synchronized human dance from music. While existing methods often rely on additional genre labels to further improve dance generation, such labels are typically noisy, coarse, unavailable, or insufficient to capture the diversity of real-world music, which can result in rhythm misalignment or stylistic drift. In contrast, w… ▽ More

    Submitted 21 December, 2025; originally announced December 2025.

  6. arXiv:2512.18712  [pdf, ps, other

    cs.RO

    DSO-VSA: a Variable Stiffness Actuator with Decoupled Stiffness and Output Characteristics for Rehabilitation Robotics

    Authors: Maozeng Zhang, Ke Shi, Huijun Li, Tongshu Chen, Jiejun Yan, Aiguo Song

    Abstract: Stroke-induced motor impairment often results in substantial loss of upper-limb function, creating a strong demand for rehabilitation robots that enable safe and transparent physical human-robot interaction (pHRI). Variable stiffness actuators are well suited for such applications. However, in most existing designs, stiffness is coupled with the deflection angle, complicating both modeling and con… ▽ More

    Submitted 21 December, 2025; originally announced December 2025.

  7. arXiv:2512.18375  [pdf

    cond-mat.supr-con cond-mat.str-el

    Lattice-decoupled rotatable stripe-like charge order within the strange metal phase of 2M-WS2

    Authors: Kebin Xiao, Yunkai Guo, Daran Fu, Yuqiang Fang, Yating Hu, Jingming Yan, Yucong Peng, Yuyang Wang, Yongkang Ju, Peizhe Tang, Xiangang Wan, Fuqiang Huang, Qi-Kun Xue, Wei Li

    Abstract: In quantum materials, charge orders typically stabilize in specific crystallographic orientations, though their formation mechanisms may vary. Here, using low-temperature scanning tunneling microscopy (STM), we discover a lattice-decoupled rotatable stripe-like charge order coexisting with superconductivity in 2M-WS2. The charge order manifests five distinct orientations across different sample re… ▽ More

    Submitted 20 December, 2025; originally announced December 2025.

    Comments: 12 pages, 5 figures. This article was published on PNAS (https://doi.org/10.1073/pnas.2513493122)

  8. arXiv:2512.18190  [pdf, ps, other

    cs.AI cs.CL cs.LG

    External Hippocampus: Topological Cognitive Maps for Guiding Large Language Model Reasoning

    Authors: Jian Yan

    Abstract: This paper proposes the External Hippocampus framework, which models language model reasoning from a cognitive dynamics perspective as the flow of information energy in semantic space. Unlike traditional weight-space optimization methods, this framework constructs topological cognitive maps through dimensionality reduction projection, enabling precise navigation and intervention of energy flow at… ▽ More

    Submitted 23 December, 2025; v1 submitted 19 December, 2025; originally announced December 2025.

    Comments: 12 pages, 7 figures

    ACM Class: I.2.7

  9. arXiv:2512.17206  [pdf, ps, other

    cs.CV

    Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs

    Authors: Rujiao Long, Yang Li, Xingyao Zhang, Weixun Wang, Tianqianjin Lin, Xi Zhao, Yuchi Xu, Wenbo Su, Junchi Yan, Bo Zheng

    Abstract: Exploration capacity shapes both inference-time performance and reinforcement learning (RL) training for large (vision-) language models, as stochastic sampling often yields redundant reasoning paths with little high-level diversity. This paper proposes Reasoning Palette, a novel latent-modulation framework that endows the model with a stochastic latent variable for strategic contextualization, gu… ▽ More

    Submitted 18 December, 2025; originally announced December 2025.

  10. arXiv:2512.16969  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

    Authors: Wanghan Xu, Yuhao Zhou, Yifan Zhou, Qinglong Cao, Shuo Li, Jia Bu, Bo Liu, Yixin Chen, Xuming He, Xiangyu Zhao, Xiang Zhuang, Fengxiang Wang, Zhiwang Zhou, Qiantai Feng, Wenxuan Huang, Jiaqi Wei, Hao Wu, Yuejin Yang, Guangshuai Wang, Sheng Xu, Ziyan Huang, Xinyao Liu, Jiyao Liu, Cheng Tang, Wei Li , et al. (82 additional authors not shown)

    Abstract: Despite advances in scientific AI, a coherent framework for Scientific General Intelligence (SGI)-the ability to autonomously conceive, investigate, and reason across scientific domains-remains lacking. We present an operational SGI definition grounded in the Practical Inquiry Model (PIM: Deliberation, Conception, Action, Perception) and operationalize it via four scientist-aligned tasks: deep res… ▽ More

    Submitted 18 December, 2025; originally announced December 2025.

  11. arXiv:2512.16229  [pdf, ps, other

    cs.CL

    LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

    Authors: Chenkai Xu, Yijie Jin, Jiajun Li, Yi Tu, Guoping Long, Dandan Tu, Mingcong Song, Hongjie Si, Tianqi Hou, Junchi Yan, Zhijie Deng

    Abstract: Diffusion Large Language Models (dLLMs) have demonstrated significant potential for high-speed inference. However, current confidence-driven decoding strategies are constrained by limited parallelism, typically achieving only 1--3 tokens per forward pass (TPF). In this work, we identify that the degree of parallelism during dLLM inference is highly sensitive to the Token Filling Order (TFO). Then,… ▽ More

    Submitted 22 December, 2025; v1 submitted 18 December, 2025; originally announced December 2025.

  12. arXiv:2512.13120  [pdf, ps, other

    cs.IR cs.LG

    Towards Practical Large-scale Dynamical Heterogeneous Graph Embedding: Cold-start Resilient Recommendation

    Authors: Mabiao Long, Jiaxi Liu, Yufeng Li, Hao Xiong, Junchi Yan, Kefan Wang, Yi Cao, Jiandong Ding

    Abstract: Deploying dynamic heterogeneous graph embeddings in production faces key challenges of scalability, data freshness, and cold-start. This paper introduces a practical, two-stage solution that balances deep graph representation with low-latency incremental updates. Our framework combines HetSGFormer, a scalable graph transformer for static learning, with Incremental Locally Linear Embedding (ILLE),… ▽ More

    Submitted 15 December, 2025; originally announced December 2025.

  13. arXiv:2512.12916  [pdf

    cond-mat.mtrl-sci

    Experimental Demonstration and Transformation Mechanism of Quenchable Two-dimensional Diamond

    Authors: Jiayin Li, Guoshuai Du, Lili Zhao, Wuxiao Han, Jiaxin Ming, Shang Chen, Pengcheng Zhao, Lu Bai, Jiaohui Yan, Yubing Du, Jiajia Feng, Hongliang Dong, Ke Jin, Weigao Xu, Bin Chen, Jianguo Zhang, Yabin Chen

    Abstract: Two-dimensional (2D) diamond has aroused tremendous interest in nanoelectronics and optoelectronics, owing to its superior properties and flexible characteristics compared to bulk diamond. Despite significant efforts, great challenges lie in the experimental synthesis and transformation conditions of 2D diamond. Herein, we have demonstrated the experimental preparation of high quality 2D diamond w… ▽ More

    Submitted 14 December, 2025; originally announced December 2025.

    Comments: 26 pages, 5 figures

  14. arXiv:2512.12676  [pdf, ps, other

    stat.ME cs.LG stat.ML

    Robust Variational Bayes by Min-Max Median Aggregation

    Authors: Jiawei Yan, Ju Liu, Weidong Liu, Jiyuan Tu

    Abstract: We propose a robust and scalable variational Bayes (VB) framework designed to effectively handle contamination and outliers in dataset. Our approach partitions the data into $m$ disjoint subsets and formulates a joint optimization problem based on robust aggregation principles. A key insight is that the full posterior distribution is equivalent to the minimizer of the mean Kullback-Leibler (KL) di… ▽ More

    Submitted 14 December, 2025; originally announced December 2025.

    Comments: 34 pages, 11 figures

  15. arXiv:2512.09736  [pdf, ps, other

    cs.AI

    Analyzing Planner Design Trade-offs for MAPF under Realistic Simulation

    Authors: Jingtian Yan, Zhifei Li, William Kang, Stephen F. Smith, Jiaoyang Li

    Abstract: Multi-Agent Path Finding (MAPF) algorithms are increasingly deployed in industrial warehouses and automated manufacturing facilities, where robots must operate reliably under real-world physical constraints. However, existing MAPF evaluation frameworks typically rely on simplified robot models, leaving a substantial gap between algorithmic benchmarks and practical performance. Recent frameworks su… ▽ More

    Submitted 10 December, 2025; originally announced December 2025.

  16. Dual-Branch Center-Surrounding Contrast: Rethinking Contrastive Learning for 3D Point Clouds

    Authors: Shaofeng Zhang, Xuanqi Chen, Xiangdong Zhang, Sitong Wu, Junchi Yan

    Abstract: Most existing self-supervised learning (SSL) approaches for 3D point clouds are dominated by generative methods based on Masked Autoencoders (MAE). However, these generative methods have been proven to struggle to capture high-level discriminative features effectively, leading to poor performance on linear probing and other downstream tasks. In contrast, contrastive methods excel in discriminative… ▽ More

    Submitted 9 December, 2025; originally announced December 2025.

    Comments: 16 pages, 6 figures

  17. arXiv:2512.08648  [pdf, ps, other

    cs.CV

    Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

    Authors: Shaofeng Zhang, Xuanqi Chen, Ning Liao, Haoxiang Zhao, Xiaoxing Wang, Haoru Tan, Sitong Wu, Xiaosong Jia, Qi Fan, Junchi Yan

    Abstract: The dominance of denoising generative models (e.g., diffusion, flow-matching) in visual synthesis is tempered by their substantial training costs and inefficiencies in representation learning. While injecting discriminative representations via auxiliary alignment has proven effective, this approach still faces key limitations: the reliance on external, pre-trained encoders introduces overhead and… ▽ More

    Submitted 13 December, 2025; v1 submitted 9 December, 2025; originally announced December 2025.

    Comments: 19 pages, 19 figures

  18. arXiv:2512.08478  [pdf, ps, other

    cs.CV cs.AI cs.GR

    Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

    Authors: Yuning Gong, Yifei Liu, Yifan Zhan, Muyao Niu, Xueying Li, Yuanjun Liao, Jiaming Chen, Yuanyuan Gao, Jiaqi Chen, Minming Chen, Li Zhou, Yuning Zhang, Wei Wang, Xiaoqing Hou, Huaxi Huang, Shixiang Tang, Le Ma, Dingwen Zhang, Xue Yang, Junchi Yan, Yanchi Zhang, Yinqiang Zheng, Xiao Sun, Zhihang Zhong

    Abstract: Neural rendering, particularly 3D Gaussian Splatting (3DGS), has evolved rapidly and become a key component for building world models. However, existing viewer solutions remain fragmented, heavy, or constrained by legacy pipelines, resulting in high deployment friction and limited support for dynamic content and generative models. In this work, we present Visionary, an open, web-native platform fo… ▽ More

    Submitted 9 December, 2025; originally announced December 2025.

    Comments: Project page: https://visionary-laboratory.github.io/visionary

  19. arXiv:2512.06865  [pdf, ps, other

    cs.CV

    Spatial Retrieval Augmented Autonomous Driving

    Authors: Xiaosong Jia, Chenhe Zhang, Yule Jiang, Songbur Wong, Zhiyuan Zhang, Chen Chen, Shaofeng Zhang, Xuanhe Zhou, Xue Yang, Junchi Yan, Yu-Gang Jiang

    Abstract: Existing autonomous driving systems rely on onboard sensors (cameras, LiDAR, IMU, etc) for environmental perception. However, this paradigm is limited by the drive-time perception horizon and often fails under limited view scope, occlusion or extreme conditions such as darkness and rain. In contrast, human drivers are able to recall road structure even under poor visibility. To endow models with t… ▽ More

    Submitted 7 December, 2025; originally announced December 2025.

    Comments: Demo Page: https://spatialretrievalad.github.io/ with open sourced code, dataset, and checkpoints

  20. arXiv:2512.06024  [pdf

    cs.CV physics.flu-dyn

    Neural reconstruction of 3D ocean wave hydrodynamics from camera sensing

    Authors: Jiabin Liu, Zihao Zhou, Jialei Yan, Anxin Guo, Alvise Benetazzo, Hui Li

    Abstract: Precise three-dimensional (3D) reconstruction of wave free surfaces and associated velocity fields is essential for developing a comprehensive understanding of ocean physics. To address the high computational cost of dense visual reconstruction in long-term ocean wave observation tasks and the challenges introduced by persistent visual occlusions, we propose an wave free surface visual reconstruct… ▽ More

    Submitted 4 December, 2025; originally announced December 2025.

  21. arXiv:2512.04753  [pdf, ps, other

    cs.CL

    EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

    Authors: Ruilin Li, Yibin Wang, Wenhong Zhu, Chenglin Li, Jinghao Zhang, Chenliang Li, Junchi Yan, Jiaqi Wang

    Abstract: Knowledge editing aims to update specific facts in large language models (LLMs) without full retraining. Prior efforts sought to tune the knowledge layers of LLMs, proving effective for making selective edits. However, a significant gap exists between their performance in controlled, teacher-forcing evaluations and their real-world effectiveness in lifelong learning scenarios, which greatly limits… ▽ More

    Submitted 4 December, 2025; originally announced December 2025.

  22. arXiv:2512.03111  [pdf, ps, other

    q-bio.GN cs.AI cs.CV

    PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer

    Authors: Xiaoshui Huang, Tianlin Zhu, Yifan Zuo, Xue Xia, Zonghan Wu, Jiebin Yan, Dingli Hua, Zongyi Xu, Yuming Fang, Jian Zhang

    Abstract: Single-cell RNA sequencing (scRNA-seq) is essential for decoding tumor heterogeneity. However, pan-cancer research still faces two key challenges: learning discriminative and efficient single-cell representations, and establishing a comprehensive evaluation benchmark. In this paper, we introduce PanFoMa, a lightweight hybrid neural network that combines the strengths of Transformers and state-spac… ▽ More

    Submitted 2 December, 2025; originally announced December 2025.

    Comments: Accepted by AAAI 2026

  23. arXiv:2512.01366  [pdf, ps, other

    cs.CV cs.HC cs.LG

    BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single Earbud

    Authors: Yunzhe Li, Jiajun Yan, Yuzhou Wei, Kechen Liu, Yize Zhao, Chong Zhang, Hongzi Zhu, Li Lu, Shan Chang, Minyi Guo

    Abstract: Failing to be aware of speeding vehicles approaching from behind poses a huge threat to the road safety of pedestrians and cyclists. In this paper, we propose BlinkBud, which utilizes a single earbud and a paired phone to online detect hazardous objects approaching from behind of a user. The core idea is to accurately track visually identified objects utilizing a small number of sampled camera ima… ▽ More

    Submitted 1 December, 2025; originally announced December 2025.

    Comments: This is the author-accepted version of the paper published in Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Vol. 9, No. 4, Article 191, 2025. Final published version: https://doi.org/10.1145/3770707

  24. arXiv:2511.22264  [pdf, ps, other

    cs.CV

    DriveVGGT: Visual Geometry Transformer for Autonomous Driving

    Authors: Xiaosong Jia, Yanhao Liu, Junqi You, Renqiu Xia, Yu Hong, Junchi Yan

    Abstract: Feed-forward reconstruction has recently gained significant attention, with VGGT being a notable example. However, directly applying VGGT to autonomous driving (AD) systems leads to sub-optimal results due to the different priors between the two tasks. In AD systems, several important new priors need to be considered: (i) The overlap between camera views is minimal, as autonomous driving sensor se… ▽ More

    Submitted 27 November, 2025; originally announced November 2025.

  25. arXiv:2511.21886  [pdf, ps, other

    cs.RO cs.AI

    Bridging Planning and Execution: Multi-Agent Path Finding Under Real-World Deadlines

    Authors: Jingtian Yan, Shuai Zhou, Stephen F. Smith, Jiaoyang Li

    Abstract: The Multi-Agent Path Finding (MAPF) problem aims to find collision-free paths for multiple agents while optimizing objectives such as the sum of costs or makespan. MAPF has wide applications in domains like automated warehouses, manufacturing systems, and airport logistics. However, most MAPF formulations assume a simplified robot model for planning, which overlooks execution-time factors such as… ▽ More

    Submitted 26 November, 2025; originally announced November 2025.

  26. arXiv:2511.21272  [pdf, ps, other

    cs.CV

    Co-Training Vision Language Models for Remote Sensing Multi-task Learning

    Authors: Qingyun Li, Shuran Ma, Junwei Luo, Yi Yu, Yue Zhou, Fengxiang Wang, Xudong Lu, Xiaoxing Wang, Xin He, Yushi Chen, Xue Yang, Junchi Yan

    Abstract: With Transformers achieving outstanding performance on individual remote sensing (RS) tasks, we are now approaching the realization of a unified model that excels across multiple tasks through multi-task learning (MTL). Compared to single-task approaches, MTL methods offer improved generalization, enhanced scalability, and greater practical applicability. Recently, vision language models (VLMs) ha… ▽ More

    Submitted 26 November, 2025; originally announced November 2025.

    Comments: 14 pages, 6 figures

  27. arXiv:2511.21256  [pdf, ps, other

    cs.CV

    LaGen: Towards Autoregressive LiDAR Scene Generation

    Authors: Sizhuo Zhou, Xiaosong Jia, Fanrui Zhang, Junjie Li, Juyong Zhang, Yukang Feng, Jianwen Sun, Songbur Wong, Junqi You, Junchi Yan

    Abstract: Generative world models for autonomous driving (AD) have become a trending topic. Unlike the widely studied image modality, in this work we explore generative world models for LiDAR data. Existing generation methods for LiDAR data only support single frame generation, while existing prediction approaches require multiple frames of historical input and can only deterministically predict multiple fr… ▽ More

    Submitted 26 November, 2025; originally announced November 2025.

  28. arXiv:2511.20223  [pdf, ps, other

    cs.CV

    V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs

    Authors: Sen Nie, Jie Zhang, Jianxin Yan, Shiguang Shan, Xilin Chen

    Abstract: Adversarial attacks have evolved from simply disrupting predictions on conventional task-specific models to the more complex goal of manipulating image semantics on Large Vision-Language Models (LVLMs). However, existing methods struggle with controllability and fail to precisely manipulate the semantics of specific concepts in the image. We attribute this limitation to semantic entanglement in th… ▽ More

    Submitted 25 November, 2025; originally announced November 2025.

    Comments: 21 pages

  29. arXiv:2511.19331  [pdf, ps, other

    cs.CR

    Evolution of Cybersecurity Subdisciplines: A Science of Science Study

    Authors: Yao Chen, Jeff Yan

    Abstract: The science of science is an emerging field that studies the practice of science itself. We present the first study of the cybersecurity discipline from a science of science perspective. We examine the evolution of two comparable interdisciplinary communities in cybersecurity: the Symposium on Usable Privacy and Security (SOUPS) and Financial Cryptography and Data Security (FC).

    Submitted 24 November, 2025; originally announced November 2025.

    Comments: 17 pages, 18 figures

  30. arXiv:2511.17302  [pdf, ps, other

    stat.ME stat.CO

    Covariate Connectivity Combined Clustering for Weighted Networks

    Authors: Zeyu Hu, Wenrui Li, Jun Yan, Panpan Zhang

    Abstract: Community detection is a central task in network analysis, with applications in social, biological, and technological systems. Traditional algorithms rely primarily on network topology, which can fail when community signals are partly encoded in node-specific attributes. Existing covariate-assisted methods often assume the number of clusters is known, involve computationally intensive inference, o… ▽ More

    Submitted 21 November, 2025; originally announced November 2025.

  31. arXiv:2511.17006  [pdf, ps, other

    cs.AI

    Budget-Aware Tool-Use Enables Effective Agent Scaling

    Authors: Tengxiao Liu, Zifeng Wang, Jin Miao, I-Hung Hsu, Jun Yan, Jiefeng Chen, Rujun Han, Fangyuan Xu, Yanfei Chen, Ke Jiang, Samira Daruki, Yi Liang, William Yang Wang, Tomas Pfister, Chen-Yu Lee

    Abstract: Scaling test-time computation improves performance across different tasks on large language models (LLMs), which has also been extended to tool-augmented agents. For these agents, scaling involves not only "thinking" in tokens but also "acting" via tool calls. The number of tool calls directly bounds the agent's interaction with the external environment. However, we find that simply granting agent… ▽ More

    Submitted 21 November, 2025; originally announced November 2025.

  32. arXiv:2511.16136  [pdf, ps, other

    cs.CV

    How Noise Benefits AI-generated Image Detection

    Authors: Jiazhen Yan, Ziqiang Li, Fan Wang, Kai Zeng, Zhangjie Fu

    Abstract: The rapid advancement of generative models has made real and synthetic images increasingly indistinguishable. Although extensive efforts have been devoted to detecting AI-generated images, out-of-distribution generalization remains a persistent challenge. We trace this weakness to spurious shortcuts exploited during training and we also observe that small feature-space perturbations can mitigate s… ▽ More

    Submitted 20 November, 2025; originally announced November 2025.

  33. arXiv:2511.16097  [pdf, ps, other

    astro-ph.CO

    Forecasting the Constraint on the Hu-Sawicki $f(R)$ Modified Gravity in the CSST $3\times2$pt Photometric Survey

    Authors: Jun-Hui Yan, Yan Gong, Qi Xiong, Xuelei Chen, Qi Guo, Ming Li, Yun Liu, Wenxiang Pei

    Abstract: We forecast the constraint on the Hu-Sawicki $f(R)$ model from the photometric survey operated by the Chinese Space Station Survey Telescope (CSST). The simulated $3\times2$pt data of galaxy clustering, weak lensing, and galaxy-galaxy lensing measurements within 100 deg$^{2}$ are used in the analysis. The mock observational maps are constructed from a light cone, redshift sampling and noise. The a… ▽ More

    Submitted 11 December, 2025; v1 submitted 20 November, 2025; originally announced November 2025.

    Comments: 13 pages, 9 figures, 1 table. Accepted for publication in ApJ

  34. arXiv:2511.15673  [pdf, ps, other

    math.CO

    Asymmetric Ramsey numbers of trees

    Authors: Jun Yan

    Abstract: Let $n\geqν$, let $T$ be an $n$-vertex tree with bipartition class sizes $t_1\geq t_2$, and let $S$ be a $ν$-vertex tree with bipartition class sizes $Ï„_1\geqÏ„_2$. Using four natural constructions, we show that the Ramsey number $R(T,S)$ is lower bounded by $\underline{R}(T,S)=\max\{n+Ï„_2,ν+\min\{t_2,ν\},\min\{2t_1,2ν\},2Ï„_1\}-1$. Our main result shows that there exists a constant $c>0$, such th… ▽ More

    Submitted 19 November, 2025; originally announced November 2025.

    Comments: 30 pages, 3 figures

    MSC Class: 05D10; 05C05

  35. arXiv:2511.15456  [pdf, ps, other

    cs.AI q-fin.GN

    Know Your Intent: An Autonomous Multi-Perspective LLM Agent Framework for DeFi User Transaction Intent Mining

    Authors: Qian'ang Mao, Yuxuan Zhang, Jiaman Chen, Wenjun Zhou, Jiaqi Yan

    Abstract: As Decentralized Finance (DeFi) develops, understanding user intent behind DeFi transactions is crucial yet challenging due to complex smart contract interactions, multifaceted on-/off-chain factors, and opaque hex logs. Existing methods lack deep semantic insight. To address this, we propose the Transaction Intent Mining (TIM) framework. TIM leverages a DeFi intent taxonomy built on grounded theo… ▽ More

    Submitted 19 November, 2025; originally announced November 2025.

    Comments: Written in 2025 Q1

  36. arXiv:2511.13108  [pdf, ps, other

    cs.CV

    DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection

    Authors: Jiazhen Yan, Ziqiang Li, Fan Wang, Boyu Wang, Zhangjie Fu

    Abstract: The rapid progress of generative models such as GANs and diffusion models has led to the widespread proliferation of AI-generated images, raising concerns about misinformation, privacy violations, and trust erosion in digital media. Although large-scale multimodal models like CLIP offer strong transferable representations for detecting synthetic content, fine-tuning them often induces catastrophic… ▽ More

    Submitted 17 November, 2025; originally announced November 2025.

  37. arXiv:2511.12525  [pdf, ps, other

    cs.CV

    MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven Semantics

    Authors: Jing Li, Yifan Wang, Jiafeng Yan, Renlong Zhang, Bin Yang

    Abstract: Infrared and visible image fusion aims to integrate complementary multi-modal information into a single fused result. However, existing methods 1) fail to account for the degradation visible images under adverse weather conditions, thereby compromising fusion performance; and 2) rely on fixed network architectures, limiting their adaptability to diverse degradation scenarios. To address these issu… ▽ More

    Submitted 16 November, 2025; originally announced November 2025.

    Comments: 10 pages, 7 figures. Accepted by AAAI 2026

    ACM Class: I.4.3; I.4.4; I.4.9

  38. arXiv:2511.09907  [pdf, ps, other

    cs.AI cs.CV

    Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models

    Authors: Yongxian Wei, Yilin Zhao, Li Shen, Xinrui Chen, Runxi Cheng, Sinan Du, Hao Yu, Gang Liu, Jiahong Yan, Chun Yuan, Dian Li

    Abstract: Data synthesis for training large reasoning models offers a scalable alternative to limited, human-curated datasets, enabling the creation of high-quality data. However, existing approaches face several challenges: (i) indiscriminate generation that ignores the solver's ability and yields low-value problems, or reliance on complex data pipelines to balance problem difficulty; and (ii) a lack of re… ▽ More

    Submitted 15 December, 2025; v1 submitted 12 November, 2025; originally announced November 2025.

  39. arXiv:2511.09823  [pdf, ps, other

    stat.CO stat.ME

    Diagnostics for Semiparametric Accelerated Failure Time Models with R Package afttest

    Authors: Woojung Bae, Dongrak Choi, Jun Yan, Sangwook Kang

    Abstract: The semiparametric accelerated failure time (AFT) model is a useful alternative to the widely used Cox proportional hazard model, which directly links the logarithm of the failure time to the covariates, yielding more interpretable regression coefficients. However, diagnostic procedures for the semiparametric AFT model have received relatively little attention. This paper introduces afttest, an R… ▽ More

    Submitted 12 November, 2025; originally announced November 2025.

  40. arXiv:2511.09512  [pdf, ps, other

    cs.LG

    GenePheno: Interpretable Gene Knockout-Induced Phenotype Abnormality Prediction from Gene Sequences

    Authors: Jingquan Yan, Yuwei Miao, Lei Yu, Yuzhi Guo, Xue Xiao, Lin Xu, Junzhou Huang

    Abstract: Exploring how genetic sequences shape phenotypes is a fundamental challenge in biology and a key step toward scalable, hypothesis-driven experimentation. The task is complicated by the large modality gap between sequences and phenotypes, as well as the pleiotropic nature of gene-phenotype relationships. Existing sequence-based efforts focus on the degree to which variants of specific genes alter a… ▽ More

    Submitted 14 November, 2025; v1 submitted 12 November, 2025; originally announced November 2025.

    Comments: AAAI 2026 Oral

  41. arXiv:2511.08910  [pdf, ps, other

    eess.SP cs.CV

    OG-PCL: Efficient Sparse Point Cloud Processing for Human Activity Recognition

    Authors: Jiuqi Yan, Chendong Xu, Dongyu Liu

    Abstract: Human activity recognition (HAR) with millimeter-wave (mmWave) radar offers a privacy-preserving and robust alternative to camera- and wearable-based approaches. In this work, we propose the Occupancy-Gated Parallel-CNN Bi-LSTM (OG-PCL) network to process sparse 3D radar point clouds produced by mmWave sensing. Designed for lightweight deployment, the parameter size of the proposed OG-PCL is only… ▽ More

    Submitted 11 November, 2025; originally announced November 2025.

  42. arXiv:2511.06460  [pdf, ps, other

    cs.OS

    Guidelines for Building Indexes on Partially Cache-Coherent CXL Shared Memory

    Authors: Fangnuo Wu, Mingkai Dong, Wenjun Cai, Jingsheng Yan, Haibo Chen

    Abstract: The \emph{Partial Cache-Coherence (PCC)} model maintains hardware cache coherence only within subsets of cores, enabling large-scale memory sharing with emerging memory interconnect technologies like Compute Express Link (CXL). However, PCC's relaxation of global cache coherence compromises the correctness of existing single-machine software. This paper focuses on building consistent and efficie… ▽ More

    Submitted 9 November, 2025; originally announced November 2025.

  43. arXiv:2511.06448  [pdf, ps, other

    cs.MA cs.AI cs.CL cs.SI

    When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms

    Authors: Qibing Ren, Zhijie Zheng, Jiaxuan Guo, Junchi Yan, Lizhuang Ma, Jing Shao

    Abstract: In this work, we study the risks of collective financial fraud in large-scale multi-agent systems powered by large language model (LLM) agents. We investigate whether agents can collaborate in fraudulent behaviors, how such collaboration amplifies risks, and what factors influence fraud success. To support this research, we present MultiAgentFraudBench, a large-scale benchmark for simulating finan… ▽ More

    Submitted 9 November, 2025; originally announced November 2025.

    Comments: Code is available at https://github.com/zheng977/MutiAgent4Fraud

  44. arXiv:2511.05733  [pdf, ps, other

    stat.ME

    Nonparametric Block Bootstrap Kolmogorov-Smirnov Goodness-of-Fit Test

    Authors: Mathew Chandy, Elizabeth Schifano, Jun Yan, Xianyang Zhang

    Abstract: The Kolmogorov--Smirnov (KS) test is a widely used statistical test that assesses the conformity of a sample to a specified distribution. Its efficacy, however, diminishes with serially dependent data and when parameters within the hypothesized distribution are unknown. For independent data, parametric and nonparametric bootstrap procedures are available to adjust for estimated parameters. For ser… ▽ More

    Submitted 7 November, 2025; originally announced November 2025.

  45. arXiv:2511.05039  [pdf, ps, other

    eess.SP cs.AI

    PECL: A Heterogeneous Parallel Multi-Domain Network for Radar-Based Human Activity Recognition

    Authors: Jiuqi Yan, Chendong Xu, Dongyu Liu

    Abstract: Radar systems are increasingly favored for medical applications because they provide non-intrusive monitoring with high privacy and robustness to lighting conditions. However, existing research typically relies on single-domain radar signals and overlooks the temporal dependencies inherent in human activity, which complicates the classification of similar actions. To address this issue, we designe… ▽ More

    Submitted 7 November, 2025; originally announced November 2025.

  46. arXiv:2511.04966  [pdf, ps, other

    astro-ph.HE astro-ph.IM

    Detecting FRB by DANCE: a method based on DEnsity ANalysis and Cluster Extraction

    Authors: Mao Yuan, Jiarui Niu, Yi Feng, Xu-ning Lv, Chenchen Miao, Lingqi Meng, Bo Peng, Li Deng, Jingye Yan, Weiwei Zhu

    Abstract: Fast radio bursts (FRBs) are transient signals exhibiting diverse strengths and emission bandwidths. Traditional single-pulse search techniques are widely employed for FRB detection; yet weak, narrow-band bursts often remain undetectable due to low signal-to-noise ratios (SNR) in integrated profiles. We developed DANCE, a detection tool based on cluster analysis of the original spectrum. It is spe… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

    Comments: 12 pages, 12 figures

    Journal ref: MNRAS, 547, staf1910 (2025)

  47. AStF: Motion Style Transfer via Adaptive Statistics Fusor

    Authors: Hanmo Chen, Chenghao Xu, Jiexi Yan, Cheng Deng

    Abstract: Human motion style transfer allows characters to appear less rigidity and more realism with specific style. Traditional arbitrary image style transfer typically process mean and variance which is proved effective. Meanwhile, similar methods have been adapted for motion style transfer. However, due to the fundamental differences between images and motion, relying on mean and variance is insufficien… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

  48. arXiv:2511.04099  [pdf, ps, other

    astro-ph.CO

    Exploring Cosmological Constraints of the Void-Lensing Cross-Correlation in the CSST Photometric Survey

    Authors: Qi Xiong, Yan Gong, Junhui Yan, Furen Deng, Hengjie Lin, Xingchen Zhou, Xuelei Chen, Qi Guo, Ming Li, Yun Liu, Wenxiang Pei

    Abstract: We investigate the cosmological constraints from the void-lensing cross-correlation assuming the $w$CDM model for the Chinese Space Station Survey Telescope (CSST) photometric survey. Using Jiutian simulations, we construct a mock galaxy catalog to $z=3$ covering 100 deg$^2$, which incorporates the instrumental and observational effects of the CSST. We divide the galaxy sample into seven photometr… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

    Comments: 13 pages, 8 figures, 2 tables

  49. arXiv:2511.01354  [pdf, ps, other

    cs.CL cs.AI

    Thinking with DistilQwen: A Tale of Four Distilled Reasoning and Reward Model Series

    Authors: Wenrui Cai, Chengyu Wang, Junbing Yan, Jun Huang, Xiangzhong Fang

    Abstract: Recently, the demand for small and efficient reasoning models to support real-world applications has driven the development of knowledge distillation techniques that balance reasoning performance and inference speed. In this paper, we further extend the DistilQwen model family, initialized from the Qwen models, by introducing four model series specifically designed to meet industrial requirements.… ▽ More

    Submitted 3 November, 2025; originally announced November 2025.

    Comments: emnlp 2025 industry track

  50. arXiv:2511.00209  [pdf, ps, other

    cs.LG cs.AI q-bio.BM q-bio.QM

    Diffusion Models at the Drug Discovery Frontier: A Review on Generating Small Molecules versus Therapeutic Peptides

    Authors: Yiquan Wang, Yahui Ma, Yuhan Chang, Jiayao Yan, Jialin Zhang, Minnuo Cai, Kai Wei

    Abstract: Diffusion models have emerged as a leading framework in generative modeling, poised to transform the traditionally slow and costly process of drug discovery. This review provides a systematic comparison of their application in designing two principal therapeutic modalities: small molecules and therapeutic peptides. We dissect how the unified framework of iterative denoising is adapted to the disti… ▽ More

    Submitted 26 November, 2025; v1 submitted 31 October, 2025; originally announced November 2025.

    Comments: Published in Biology

    Journal ref: Biology 2025, 14(12), 1665