Skip to main content

Showing 1–50 of 95 results for author: Cai, F

.
  1. arXiv:2603.28478  [pdf

    cs.CE

    Physics-Enforced Neural Ordinary Differential Equation for Chemical Kinetics Optimization in Reaction-Diffusion Systems

    Authors: Feixue Cai, Hua Zhou, Zhuyin Ren

    Abstract: Calibrating chemical kinetics in a reaction-diffusion system is challenging because of complex dynamics governed by tightly coupled chemistry and transport, while experimental observations are often sparse and noisy. We propose a physics consistent diffusion-chemistry coupled neural ordinary differential equation (Diff-Chem Neural ODE) that embeds Arrhenius-structured reaction neurons into a fully… ▽ More

    Submitted 30 March, 2026; originally announced March 2026.

  2. arXiv:2603.21856  [pdf, ps, other

    cs.CV

    Climate Prompting: Generating the Madden-Julian Oscillation using Video Diffusion and Low-Dimensional Conditioning

    Authors: Sulian Thual, Feiyang Cai, Jingjing Wang, Feng Luo

    Abstract: Generative Deep Learning is a powerful tool for modeling of the Madden-Julian oscillation (MJO) in the tropics, yet its relationship to traditional theoretical frameworks remains poorly understood. Here we propose a video diffusion model, trained on atmospheric reanalysis, to synthetize long MJO sequences conditioned on key low-dimensional metrics. The generated MJOs capture key features including… ▽ More

    Submitted 23 March, 2026; originally announced March 2026.

  3. arXiv:2603.08254  [pdf, ps, other

    cs.CV

    DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving

    Authors: Zhuolin He, Jing Li, Guanghao Li, Xiaolei Chen, Jiacheng Tang, Siyang Zhang, Zhounan Jin, Feipeng Cai, Bin Li, Jian Pu, Jia Cai, Xiangyang Xue

    Abstract: Dynamic scene reconstruction in autonomous driving remains a fundamental challenge due to significant temporal variations, moving objects, and complex scene dynamics. Existing feed-forward 3D models have demonstrated strong performance in static reconstruction but still struggle to capture dynamic motion. To address these limitations, we propose DynamicVGGT, a unified feed-forward framework that e… ▽ More

    Submitted 9 March, 2026; originally announced March 2026.

  4. arXiv:2603.01221  [pdf, ps, other

    cs.MA

    Epistemic Gain, Aleatoric Cost: Uncertainty Decomposition in Multi-Agent Debate for Math Reasoning

    Authors: Dan Qiao, Binbin Chen, Fengyu Cai, Jianlong Chen, Wenhao Li, Fuxin Jiang, Zuzhi Chen, Hongyuan Zha, Tieying Zhang, Baoxiang Wang

    Abstract: Multi-Agent Debate (MAD) has shown promise in leveraging collective intelligence to improve reasoning and reduce hallucinations, yet it remains unclear how information exchange shapes the underlying ability. Empirically, MAD exhibits paradoxical phenomena, such as accuracy improvement accompanied by substantial increase in token entropy, and remarkable divergence between homogeneous and heterogene… ▽ More

    Submitted 1 March, 2026; originally announced March 2026.

  5. arXiv:2602.02320  [pdf, ps, other

    cs.CL cs.AI q-bio.BM

    A Large-Scale Dataset for Molecular Structure-Language Description via a Rule-Regularized Method

    Authors: Feiyang Cai, Guijuan He, Yi Hu, Jingjing Wang, Joshua Luo, Tianyu Zhu, Srikanth Pilla, Gang Li, Ling Liu, Feng Luo

    Abstract: Molecular function is largely determined by structure. Accurately aligning molecular structure with natural language is therefore essential for enabling large language models (LLMs) to reason about downstream chemical tasks. However, the substantial cost of human annotation makes it infeasible to construct large-scale, high-quality datasets of structure-grounded descriptions. In this work, we prop… ▽ More

    Submitted 10 February, 2026; v1 submitted 2 February, 2026; originally announced February 2026.

  6. arXiv:2512.11872  [pdf, ps, other

    cs.RO cs.AI cs.CV

    WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving

    Authors: Mingwang Xu, Jiahao Cui, Feipeng Cai, Hanlin Shang, Zhihao Zhu, Shan Luan, Yifang Xu, Neng Zhang, Yaoyi Li, Jia Cai, Siyu Zhu

    Abstract: End-to-end autonomous driving systems based on vision-language-action (VLA) models integrate multimodal sensor inputs and language instructions to generate planning and control signals. While autoregressive large language models and continuous diffusion policies are prevalent, the potential of discrete masked diffusion for trajectory generation remains largely unexplored. This paper presents WAM-D… ▽ More

    Submitted 6 December, 2025; originally announced December 2025.

  7. arXiv:2512.06112  [pdf, ps, other

    cs.RO cs.AI cs.CV

    WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving

    Authors: Yifang Xu, Jiahao Cui, Feipeng Cai, Zhihao Zhu, Hanlin Shang, Shan Luan, Mingwang Xu, Neng Zhang, Yaoyi Li, Jia Cai, Siyu Zhu

    Abstract: We introduce WAM-Flow, a vision-language-action (VLA) model that casts ego-trajectory planning as discrete flow matching over a structured token space. In contrast to autoregressive decoders, WAM-Flow performs fully parallel, bidirectional denoising, enabling coarse-to-fine refinement with a tunable compute-accuracy trade-off. Specifically, the approach combines a metric-aligned numerical tokenize… ▽ More

    Submitted 11 December, 2025; v1 submitted 5 December, 2025; originally announced December 2025.

    Comments: 18 pages, 11 figures. Code & Model: https://github.com/fudan-generative-vision/WAM-Flow

  8. arXiv:2511.21970  [pdf, ps, other

    cs.LG eess.SP

    MOTIF-RF: Multi-template On-chip Transformer Synthesis Incorporating Frequency-domain Self-transfer Learning for RFIC Design Automation

    Authors: Houbo He, Yizhou Xu, Lei Xia, Yaolong Hu, Fan Cai, Taiyun Chi

    Abstract: This paper presents a systematic study on developing multi-template machine learning (ML) surrogate models and applying them to the inverse design of transformers (XFMRs) in radio-frequency integrated circuits (RFICs). Our study starts with benchmarking four widely used ML architectures, including MLP-, CNN-, UNet-, and GT-based models, using the same datasets across different XFMR topologies. To… ▽ More

    Submitted 26 November, 2025; originally announced November 2025.

    Comments: Accepted at ASP-DAC 2026

  9. arXiv:2508.15304  [pdf, ps, other

    cs.IR

    MLLMRec: A Preference Reasoning Paradigm with Graph Refinement for Multimodal Recommendation

    Authors: Yuzhuo Dang, Xin Zhang, Zhiqiang Pan, Yuxiao Duan, Wanyu Chen, Fei Cai, Honghui Chen

    Abstract: Multimodal recommendation combines the user historical behaviors with the modal features of items to capture the tangible user preferences, presenting superior performance compared to the conventional ID-based recommender systems. However, existing methods still encounter two key problems in the representation learning of users and items, respectively: (1) the initialization of multimodal user rep… ▽ More

    Submitted 24 January, 2026; v1 submitted 21 August, 2025; originally announced August 2025.

  10. arXiv:2507.14580  [pdf

    cond-mat.mtrl-sci

    Investigation on high-order planar Hall effect in trigonal PtBi$_2$

    Authors: Fangqi Cai, Mingxi Chi, Yingjie Hu, Heyao Liu, Yangyang Chen, Chao Jing, Wei Ren, He Wang

    Abstract: The trigonal PtBi$_2$ (t-PtBi$_2$) as a Weyl semimetal possessing triply degenerate points in its electronic bands near the Fermi level endows it with rich electronic properties. Previous studies have already measured the planar Hall effect (PHE) and in-plane anisotropic magnetoresistance (AMR) of t-PtBi$_2$. We noticed that their experimental results exhibited high-order features in both the PHE… ▽ More

    Submitted 19 July, 2025; originally announced July 2025.

    Comments: 18 pages, 4 figures,

    Journal ref: Appl. Phys. Lett. 126, 233101 (2025)

  11. arXiv:2506.16552  [pdf, ps, other

    cs.IR cs.CL

    Revela: Dense Retriever Learning via Language Modeling

    Authors: Fengyu Cai, Tong Chen, Xinran Zhao, Sihao Chen, Hongming Zhang, Sherry Tongshuang Wu, Iryna Gurevych, Heinz Koeppl

    Abstract: Dense retrievers play a vital role in accessing external and specialized knowledge to augment language models (LMs). Training dense retrievers typically requires annotated query-document pairs, which are costly to create and scarce in specialized domains (e.g., code) or in complex settings (e.g., requiring reasoning). These practical challenges have sparked growing interest in self-supervised retr… ▽ More

    Submitted 20 February, 2026; v1 submitted 19 June, 2025; originally announced June 2025.

    Comments: Accepted to ICLR 2026 (Oral). Camera-ready version

  12. arXiv:2506.15862  [pdf, ps, other

    cs.IR cs.AI cs.CL

    MoR: Better Handling Diverse Queries with a Mixture of Sparse, Dense, and Human Retrievers

    Authors: Jushaan Singh Kalra, Xinran Zhao, To Eun Kim, Fengyu Cai, Fernando Diaz, Tongshuang Wu

    Abstract: Retrieval-augmented Generation (RAG) is powerful, but its effectiveness hinges on which retrievers we use and how. Different retrievers offer distinct, often complementary signals: BM25 captures lexical matches; dense retrievers, semantic similarity. Yet in practice, we typically fix a single retriever based on heuristics, which fails to generalize across diverse information needs. Can we dynamica… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 19 pages, 3 figures

  13. arXiv:2506.11066  [pdf, ps, other

    cs.SE cs.AI

    CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval

    Authors: Jiahui Geng, Fengyu Cai, Shaobo Cui, Qing Li, Liangwei Chen, Chenyang Lyu, Haonan Li, Derui Zhu, Walter Pretschner, Heinz Koeppl, Fakhri Karray

    Abstract: Code retrieval is essential in modern software development, as it boosts code reuse and accelerates debugging. However, current benchmarks primarily emphasize functional relevance while neglecting critical dimensions of software quality. Motivated by this gap, we introduce CoQuIR, the first large-scale, multilingual benchmark specifically designed to evaluate quality-aware code retrieval across fo… ▽ More

    Submitted 27 August, 2025; v1 submitted 31 May, 2025; originally announced June 2025.

  14. arXiv:2506.10035  [pdf, ps, other

    cs.GR cs.AI

    FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

    Authors: Fuhan Cai, Yong Guo, Jie Li, Wenbo Li, Jian Chen, Xiangzhong Fang

    Abstract: Recent advancements in text-to-image (T2I) generation have led to the emergence of highly expressive models such as diffusion transformers (DiTs), exemplified by FLUX. However, their massive parameter sizes lead to slow inference, high memory usage, and poor deployability. Existing acceleration methods (e.g., single-step distillation and attention pruning) often suffer from significant performance… ▽ More

    Submitted 13 January, 2026; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: 14 pages

  15. arXiv:2506.03747  [pdf, other

    physics.optics

    Fast Non-Line-of-Sight Transient Data Simulation and an Open Benchmark Dataset

    Authors: Yingjie Shi, Jinye Miao, Taotao Qin, Fuyao Cai, Yi Wei, Lingfeng Liu, Tongyao Li, Chenyang Wu, Huan Liang, Yuyang Yin, Lianfa Bai, Enlai Guo, Jing Han

    Abstract: Non-Line-of-Sight (NLOS) imaging reconstructs the shape and depth of hidden objects from picosecond-resolved transient signals, offering potential applications in autonomous driving, security, and medical diagnostics. However, current NLOS experiments rely on expensive hardware and complex system alignment, limiting their scalability. This manuscript presents a simplified simulation method that ge… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  16. arXiv:2505.15054  [pdf, ps, other

    cs.CL cs.AI cs.LG q-bio.BM

    MolLangBench: A Comprehensive Benchmark for Language-Prompted Molecular Structure Recognition, Editing, and Generation

    Authors: Feiyang Cai, Jiahui Bai, Tao Tang, Guijuan He, Joshua Luo, Tianyu Zhu, Srikanth Pilla, Gang Li, Ling Liu, Feng Luo

    Abstract: Precise recognition, editing, and generation of molecules are essential prerequisites for both chemists and AI systems tackling various chemical tasks. We present MolLangBench, a comprehensive benchmark designed to evaluate fundamental molecule-language interface tasks: language-prompted molecular structure recognition, editing, and generation. To ensure high-quality, unambiguous, and deterministi… ▽ More

    Submitted 23 March, 2026; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: ICLR-2026 Camera-Ready version

  17. arXiv:2504.18869  [pdf

    cond-mat.mes-hall

    Micro-tip manipulated origami for robust twisted few-layer graphene

    Authors: Ruo-Jue Zou, Long Deng, Si-Min Xue, Feng-Fei Cai, Ling-Hui Tong, Yang Zhang, Yuan Tian, Li Zhang, Lijie Zhang, Zhihui Qin, Long-Jing Yin

    Abstract: Twisted few-layer graphene (tFLG) has emerged as an ideal model system for investigating novel strongly correlated and topological phenomena. However, the experimental construction of tFLG with high structural stability is still challenging. Here, we introduce a highly accessible method for fabricating robust tFLG by polymer micro-tip manipulated origami. Through using a self-prepared polymer micr… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

    Comments: 17 pages, 5 figures

    Journal ref: Appl. Phys. Lett. 126, 163105 (2025) Featured Article & AIP Scilight

  18. arXiv:2503.14530   

    cs.CV cs.AI

    SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders

    Authors: Qing Li, Jiahui Geng, Derui Zhu, Fengyu Cai, Chenyang Lyu, Fakhri Karray

    Abstract: Unlearning methods for vision-language models (VLMs) have primarily adapted techniques from large language models (LLMs), relying on weight updates that demand extensive annotated forget sets. Moreover, these methods perform unlearning at a coarse granularity, often leading to excessive forgetting and reduced model utility. To address this issue, we introduce SAUCE, a novel method that leverages s… ▽ More

    Submitted 20 March, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

    Comments: More comparative experiments are needed

  19. arXiv:2503.11279  [pdf, ps, other

    hep-ph

    Revisiting $B_{c}^-\to J/ψ(η_c) L^-$ decays within the SM and beyond in QCD factorization

    Authors: Wei-Jun Deng, Fang-Min Cai, Xin-Qiang Li, Yan Shi, Ya-Dong Yang

    Abstract: Motivated by the deviations observed between the data and the SM predictions of $\mathcal{B}(\bar{B}_s^0\to D_s^+ π^-)$ and $\mathcal{B}(\bar{B}_d^0\to D^+ K^-)$, we revisit the $B_{c}^{-}\to J/ψ(η_{c}) L^{-}$ decays, with $L=π, K^{(*)}, ρ$, both within the SM and beyond. Since these processes are also mediated by $b\to c \bar{u} d(s)$ transitions and hence dominated by the colour-allowed tree top… ▽ More

    Submitted 12 August, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 31 pages, 9 figures and 6 tables; final version published in the journal

  20. arXiv:2503.01854  [pdf, ps, other

    cs.CL cs.AI

    A Comprehensive Survey of Machine Unlearning Techniques for Large Language Models

    Authors: Jiahui Geng, Qing Li, Herbert Woisetschlaeger, Zongxiong Chen, Fengyu Cai, Yuxia Wang, Preslav Nakov, Hans-Arno Jacobsen, Fakhri Karray

    Abstract: This study investigates the machine unlearning techniques within the context of large language models (LLMs), referred to as \textit{LLM unlearning}. LLM unlearning offers a principled approach to removing the influence of undesirable data (e.g., sensitive or illegal information) from LLMs, while preserving their overall utility without requiring full retraining. Despite growing research interest,… ▽ More

    Submitted 31 May, 2025; v1 submitted 22 February, 2025; originally announced March 2025.

  21. arXiv:2410.21422  [pdf, ps, other

    cs.CE

    ChemFM as a Scaling Law Guided Foundation Model Pre-trained on Informative Chemicals

    Authors: Feiyang Cai, Katelin Zacour, Tianyu Zhu, Tzuen-Rong Tzeng, Yongping Duan, Ling Liu, Srikanth Pilla, Gang Li, Feng Luo

    Abstract: Traditional AI methods often rely on task-specific model designs and training, which constrain both the scalability of model size and generalization across different tasks. Here, we introduce ChemFM, a large foundation model specifically developed for chemicals. By conducting a series of scaling experiments, we identify UniChem as the informative molecular database for pre-training the foundation… ▽ More

    Submitted 5 November, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  22. arXiv:2410.05691  [pdf, ps, other

    hep-ph

    Exotic hybrid pseudopotentials at finite temperature and chemical potential

    Authors: Le Zhang, Fei-Yang Cai, Xun Chen

    Abstract: Using gauge/gravity duality, we study the exotic hybrid pseudopotentials at finite temperature and chemical potential. The $Σ$ hybrid meson can be described by a model including an object called ``defect'' on a string linking the quark and antiquark. It was first proposed by Andreev and perfectly described the $Σ_u^-$ hybrid potential at zero temperature and chemical potential. In this paper, we w… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  23. arXiv:2409.04179  [pdf, other

    hep-ph

    CP asymmetries of $t \to c γ$ and $t \to cg$ decays in the aligned two-Higgs-doublet model

    Authors: Fang-Min Cai, Rui-Lin Fan, Xin-Qiang Li, Ya-Dong Yang

    Abstract: We study the CP asymmetries of the rare top-quark decays $t \to c γ$ and $t \to cg$ in the aligned two-Higgs-doublet model (A2HDM), which is generically characterized by new sources of CP violation beyond the Standard Model (SM). Specifically, the branching ratios and CP asymmetries of these rare top-quark decays are explicitly formulated, with an emphasis on the origins of weak and strong phases… ▽ More

    Submitted 14 March, 2025; v1 submitted 6 September, 2024; originally announced September 2024.

    Comments: 46 pages, 9 figures, and 3 tables; figure presentations changed and more references added, final version published in the journal

  24. arXiv:2409.02390  [pdf, other

    cs.NE cs.AI cs.CV q-bio.NC

    Neural Dynamics Model of Visual Decision-Making: Learning from Human Experts

    Authors: Jie Su, Fang Cai, Shu-Kuo Zhao, Xin-Yi Wang, Tian-Yi Qian, Da-Hui Wang, Bo Hong

    Abstract: Uncovering the fundamental neural correlates of biological intelligence, developing mathematical models, and conducting computational simulations are critical for advancing new paradigms in artificial intelligence (AI). In this study, we implemented a comprehensive visual decision-making model that spans from visual input to behavioral output, using a neural dynamics modeling approach. Drawing ins… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  25. arXiv:2408.13711  [pdf, other

    cs.CV cs.MM

    SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting

    Authors: Wenrui Li, Fucheng Cai, Yapeng Mi, Zhe Yang, Wangmeng Zuo, Xingtao Wang, Xiaopeng Fan

    Abstract: Text-driven 3D scene generation has seen significant advancements recently. However, most existing methods generate single-view images using generative models and then stitch them together in 3D space. This independent generation for each view often results in spatial inconsistency and implausibility in the 3D scenes. To address this challenge, we proposed a novel text-driven 3D-consistent scene g… ▽ More

    Submitted 13 October, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

  26. arXiv:2407.12512  [pdf, other

    cs.CL

    $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics

    Authors: Fengyu Cai, Xinran Zhao, Hongming Zhang, Iryna Gurevych, Heinz Koeppl

    Abstract: Recent advances in measuring hardness-wise properties of data guide language models in sample selection within low-resource scenarios. However, class-specific properties are overlooked for task setup and learning. How will these properties influence model learning and is it generalizable across datasets? To answer this question, this work formally initiates the concept of… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Findings of ACL 2024

  27. arXiv:2407.10691  [pdf, other

    cs.IR cs.CL

    $\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity

    Authors: Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl

    Abstract: Recent studies show the growing significance of document retrieval in the generation of LLMs, i.e., RAG, within the scientific domain by bridging their knowledge gap. However, dense retrievers often struggle with domain-specific retrieval and complex query-document relationships, particularly when query segments correspond to various parts of a document. To alleviate such prevalent challenges, thi… ▽ More

    Submitted 1 November, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: EMNLP 2024 Main Conference

  28. arXiv:2406.14474  [pdf, ps, other

    eess.SY

    Spatio-temporal Patterns between ENSO and Weather-related Power Outages in the Continental United States

    Authors: Long Huo, Xin Chen, Kaiwen Li, Fengying Cai, Jürgen Kurths

    Abstract: El Niño-Southern Oscillation (ENSO) exhibits significant impacts on the frequency of extreme weather events and its socio-economic implications prevail on a global scale. However, a fundamental gap still exists in understanding the relationship between the ENSO and weather-related power outages in the continental United States. Through 24-year (2000-2023) composite and statistical analysis, our st… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  29. arXiv:2406.14079  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    Nano-Patterned Pt-Based Metallic Glass Electrocatalysts with In-Situ Copper Oxide Foam for Enhanced Hydrogen Evolution

    Authors: Fei-Fan Cai, Baran Sarac, Adnan Akman, Juan J. Londoño, Selin Gümrükcü, Lukas Schweiger, Martin Hantusch, Jan Schroers, Andreas Blatter, Annett Gebert, Florian Spieckermann, Jürgen Eckert

    Abstract: Hydrogen is a promising energy carrier for replacing fossil fuels, and hydrogen production via hydrogen evolution reaction (HER) is an environmentally friendly option if electrocatalysts with low overpotentials and long-term stability are used. In this work, the electrocatalytic performance of $\mathrm{Pt_{57.5}Cu_{14.7}Ni_{5.3}P_{22.5}}$ bulk metallic glass (BMG) with flat, micro-patterned, and n… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 28 pages, 9 figures (including supplementary information)

    Journal ref: Materials & Design Volume 249, January 2025, 113530

  30. arXiv:2405.18554  [pdf, other

    cs.LG cs.RO eess.SY

    Scalable Surrogate Verification of Image-based Neural Network Control Systems using Composition and Unrolling

    Authors: Feiyang Cai, Chuchu Fan, Stanley Bak

    Abstract: Verifying safety of neural network control systems that use images as input is a difficult problem because, from a given system state, there is no known way to mathematically model what images are possible in the real-world. We build on recent work that considers a surrogate verification approach, training a conditional generative adversarial network (cGAN) as an image generator in place of the re… ▽ More

    Submitted 28 April, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by AAAI-25

  31. arXiv:2405.16127  [pdf, other

    cs.IR

    Finetuning Large Language Model for Personalized Ranking

    Authors: Zhuoxi Bai, Ning Wu, Fengyu Cai, Xinyi Zhu, Yun Xiong

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various domains, motivating researchers to investigate their potential use in recommendation systems. However, directly applying LLMs to recommendation tasks has proven challenging due to the significant disparity between the data used for pre-training LLMs and the specific requirements of recommendation tasks. In this st… ▽ More

    Submitted 20 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  32. arXiv:2403.09062  [pdf

    eess.IV cs.CV

    TBI Image/Text (TBI-IT): Comprehensive Text and Image Datasets for Traumatic Brain Injury Research

    Authors: Jie Li, Jiaying Wen, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang

    Abstract: In this paper, we introduce a new dataset in the medical field of Traumatic Brain Injury (TBI), called TBI-IT, which includes both electronic medical records (EMRs) and head CT images. This dataset is designed to enhance the accuracy of artificial intelligence in the diagnosis and treatment of TBI. This dataset, built upon the foundation of standard text and image data, incorporates specific annot… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2401.15934

  33. arXiv:2403.07225  [pdf, other

    cs.RO

    Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints

    Authors: Weihan Wang, Chieh Chou, Ganesh Sevagamoorthy, Kevin Chen, Zheng Chen, Ziyue Feng, Youjie Xia, Feiyang Cai, Yi Xu, Philippos Mordohai

    Abstract: We propose an accurate and robust initialization approach for stereo visual-inertial SLAM systems. Unlike the current state-of-the-art method, which heavily relies on the accuracy of a pure visual SLAM system to estimate inertial variables without updating camera poses, potentially compromising accuracy and robustness, our approach offers a different solution. We realize the crucial impact of prec… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  34. arXiv:2401.15934  [pdf, other

    cs.CV

    HICH Image/Text (HICH-IT): Comprehensive Text and Image Datasets for Hypertensive Intracerebral Hemorrhage Research

    Authors: Jie Li, Yulong Xia, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang

    Abstract: In this paper, we introduce a new dataset in the medical field of hypertensive intracerebral hemorrhage (HICH), called HICH-IT, which includes both electronic medical records (EMRs) and head CT images. This dataset is designed to enhance the accuracy of artificial intelligence in the diagnosis and treatment of HICH. This dataset, built upon the foundation of standard text and image data, incorpora… ▽ More

    Submitted 5 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  35. arXiv:2312.15879  [pdf, ps, other

    math.CV

    Optimal estimates for mappings admitting general Poisson representations in the unit ball

    Authors: Deguang Zhong, Fangming Cai, Dongping Wei

    Abstract: Suppose that $1<p\leq\infty$ and $\varphi\in L^{p}(\mathbb{B}^{n},\mathbb{R}^{n}).$ In this note, we use Hölder inequality and some basic properties of hypergeometric functions to establish the sharp constant $C_{p}$ and function $C_{p}(x)$ in the following inequalities $$|u(x)|\leq \frac{C_{p}}{(1-|x|^{2})^{(n-1)/p}}\cdot||\varphi||_{L^{p}}$$ and… ▽ More

    Submitted 4 October, 2025; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: The extremum function $\varphi_ {0} (\ eta)$ given in arXiv: 2312.15879 is incorrect. In this new version, we have rephrased the main theorems and obtained the correct expression for the extreme value function $\varphi_{0}(η)=\left(0,0,\ldots,\left[\frac{(1-|x|^{2})^{β-\frac{n-1}{q}}}{|x-η|^β}\right]^{q/p}\right)$

    MSC Class: 31B05; 31B10; 42B30

  36. Debiasing Sequential Recommenders through Distributionally Robust Optimization over System Exposure

    Authors: Jiyuan Yang, Yue Ding, Yidan Wang, Pengjie Ren, Zhumin Chen, Fei Cai, Jun Ma, Rui Zhang, Zhaochun Ren, Xin Xin

    Abstract: Sequential recommendation (SR) models are typically trained on user-item interactions which are affected by the system exposure bias, leading to the user preference learned from the biased SR model not being fully consistent with the true user preference. Exposure bias refers to the fact that user interactions are dependent upon the partial items exposed to the user. Existing debiasing methods do… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Accept by WSDM 2024

  37. arXiv:2311.12626  [pdf

    physics.app-ph

    Acoustic Vortex in Waveguide with Chiral Gradient Sawtooth Metasurface

    Authors: Zeliang Song, Shuhuan Xie, Yong Li, Hua Ding, Feiyan Cai, Yugui Peng, Xuefeng Zhu, Degang Zhao

    Abstract: The acoustic vortex states with spiral phase dislocation that can carry orbital angular moment (OAM) have aroused many research interests in recent years. The mainstream methods of generating acoustic vortex are based on Huygens-Fresnel principle to modulate the wavefront to create spatial spiral phase dislocation. In this work, we propose an entirely new scenario to generate acoustic vortex in a… ▽ More

    Submitted 14 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  38. arXiv:2311.08298  [pdf, other

    cs.CL cs.AI

    A Survey of Confidence Estimation and Calibration in Large Language Models

    Authors: Jiahui Geng, Fengyu Cai, Yuxia Wang, Heinz Koeppl, Preslav Nakov, Iryna Gurevych

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across a wide range of tasks in various domains. Despite their impressive performance, they can be unreliable due to factual errors in their generations. Assessing their confidence and calibrating them across different tasks can help mitigate risks and enable LLMs to produce better generations. There has been a lot of recent re… ▽ More

    Submitted 25 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 16 pages, 1 page, 1 table

  39. arXiv:2309.02254  [pdf

    physics.flu-dyn physics.ins-det

    Seeing the Unheard: dynamics of thin liquid film in holographic ultrasonic field revealed by time-resolved Schlieren imaging

    Authors: Weitao Sun, Diyao Wang, Yuheng Yang, Fangyu Cai, Mingchen Gao, Sirui Guo

    Abstract: In this study, we introduce a unique approach that employs time-resolved Schlieren imaging to capture and visualize the dynamic changes of a thin liquid (mixture of water, soap and glycerin) film in ultrasonic wave field with high spatial and temporal resolution. By placing a soap film spanning a wire frame vertically in the path of light, we harnessed the vibrations induced by the ultrasonic wave… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 pages

  40. arXiv:2307.15526  [pdf

    physics.flu-dyn

    Enhanced boiling heat transfer using conducting-insulating microcavity surfaces in an electric field: A lattice Boltzmann study

    Authors: Fanming Cai, Zhaomiao Liu, Nan Zheng, Yan Pang

    Abstract: The field trap effect on the microcavity surface under the action of an electric field is not conducive to boiling heat transfer. This numerical study found that using conducting-insulating microcavity surfaces in an electric field removes the field trap effect, increasing the critical heat flux by more than 200%. Bubble behavior and heat transfer mechanisms on heating surfaces were further explor… ▽ More

    Submitted 17 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Journal ref: Physics of Fluids 35, 107126 (2023)

  41. arXiv:2307.08021  [pdf, ps, other

    math.DS

    Local weighted topological pressure

    Authors: Fangzhou Cai

    Abstract: In [D. Feng, W. Huang, Variational principle for weighted topological pressure. J. Math. Pures Appl. (2016)], the authors studied weighted topological pressure and established a variational principle for it. In this paper, we introduce the notion of local weighted topological pressure and generalize Feng and Huang's main results to localized version.

    Submitted 16 July, 2023; originally announced July 2023.

  42. arXiv:2307.05087  [pdf, other

    cs.CV eess.IV

    SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation

    Authors: Zhengxin Lei, Feng Xu, Jiangtao Wei, Feng Cai, Feng Wang, Ya-Qiu Jin

    Abstract: SAR images are highly sensitive to observation configurations, and they exhibit significant variations across different viewing angles, making it challenging to represent and learn their anisotropic features. As a result, deep learning methods often generalize poorly across different view angles. Inspired by the concept of neural radiance fields (NeRF), this study combines SAR imaging mechanisms w… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  43. arXiv:2306.15961  [pdf, other

    cs.IR

    Disentangled Variational Auto-encoder Enhanced by Counterfactual Data for Debiasing Recommendation

    Authors: Yupu Guo, Fei Cai, Xin Zhanga, Jianming Zhenga, Honghui Chena

    Abstract: Recommender system always suffers from various recommendation biases, seriously hindering its development. In this light, a series of debias methods have been proposed in the recommender system, especially for two most common biases, i.e., popularity bias and amplified subjective bias. However, exsisting debias methods usually concentrate on correcting a single bias. Such single-functionality debi… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  44. arXiv:2306.14397  [pdf, other

    cs.SE cs.CY

    Discriminating Human-authored from ChatGPT-Generated Code Via Discernable Feature Analysis

    Authors: Li Ke, Hong Sheng, Fu Cai, Zhang Yunhe, Liu Ming

    Abstract: The ubiquitous adoption of Large Language Generation Models (LLMs) in programming has underscored the importance of differentiating between human-written code and code generated by intelligent models. This paper specifically aims to distinguish code generated by ChatGPT from that authored by humans. Our investigation reveals disparities in programming style, technical level, and readability betwee… ▽ More

    Submitted 4 July, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: 11 pages, 8 figures, 3 tables

  45. arXiv:2305.09335  [pdf, other

    cs.CL

    MsPrompt: Multi-step Prompt Learning for Debiasing Few-shot Event Detection

    Authors: Siyuan Wang, Jianming Zheng, Xuejun Hu, Fei Cai, Chengyu Song, Xueshan Luo

    Abstract: Event detection (ED) is aimed to identify the key trigger words in unstructured text and predict the event types accordingly. Traditional ED models are too data-hungry to accommodate real applications with scarce labeled data. Besides, typical ED models are facing the context-bypassing and disabled generalization issues caused by the trigger bias stemming from ED datasets. Therefore, we focus on t… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  46. arXiv:2303.15790  [pdf, other

    hep-ex hep-ph physics.ins-det

    STCF Conceptual Design Report: Volume 1 -- Physics & Detector

    Authors: M. Achasov, X. C. Ai, R. Aliberti, L. P. An, Q. An, X. Z. Bai, Y. Bai, O. Bakina, A. Barnyakov, V. Blinov, V. Bobrovnikov, D. Bodrov, A. Bogomyagkov, A. Bondar, I. Boyko, Z. H. Bu, F. M. Cai, H. Cai, J. J. Cao, Q. H. Cao, Z. Cao, Q. Chang, K. T. Chao, D. Y. Chen, H. Chen , et al. (413 additional authors not shown)

    Abstract: The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,… ▽ More

    Submitted 5 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Front. Phys. 19(1), 14701 (2024)

  47. On the properties of the mean orbital pseudo-metric

    Authors: Fangzhou Cai, Dominik Kwietniak, Jian Li, Habibeh Pourmand

    Abstract: Given a topological dynamical system $(X,T)$, we study properties of the mean orbital pseudo-metric $\bar E$ defined by \[ \bar E(x,y)= \limsup_{n\to\infty } \min_{σ\in S_n}\frac{1}{n}\sum_{k=0}^{n-1}d(T^k(x),T^{σ(k)}(y)), \] where $x,y\in X$ and $S_n$ is the permutation group of $\{0,1,\ldots,n-1\}$. Let $\hatω_T(x)$ denote the set of measures quasi-generated by a point $x\in X$. We show that the… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Journal ref: Journal of Differential Equations 318 (2022), 1-19

  48. arXiv:2211.16092  [pdf, other

    cs.CV

    Unsupervised Visual Defect Detection with Score-Based Generative Model

    Authors: Yapeng Teng, Haoyang Li, Fuzhen Cai, Ming Shao, Siyu Xia

    Abstract: Anomaly Detection (AD), as a critical problem, has been widely discussed. In this paper, we specialize in one specific problem, Visual Defect Detection (VDD), in many industrial applications. And in practice, defect image samples are very rare and difficult to collect. Thus, we focus on the unsupervised visual defect detection and localization tasks and propose a novel framework based on the recen… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  49. arXiv:2204.12787  [pdf, other

    math.CV eess.SP

    3-D generalized analytic signal associated with linear canonical transform in Clifford biquaternion domain

    Authors: Zhen Feng Cai, Kit Ian Kou

    Abstract: The analytic signal is a useful mathematical tool. It separates qualitative and quantitative information of a signal in form of the local phase and local amplitude. The Clifford Fourier transform (CFT) plays a vital role in the representation of multidimensional signals. By generalizing the CFT to the Clifford linear canonical transform (CLCT), we present a new type of Clifford biquaternionic anal… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: 19 pages and 5 figures

    MSC Class: 45P05

  50. arXiv:2203.08441  [pdf, other

    cs.CV cs.AI

    Open Set Recognition using Vision Transformer with an Additional Detection Head

    Authors: Feiyang Cai, Zhenkai Zhang, Jie Liu, Xenofon Koutsoukos

    Abstract: Deep neural networks have demonstrated prominent capacities for image classification tasks in a closed set setting, where the test data come from the same distribution as the training data. However, in a more realistic open set scenario, traditional classifiers with incomplete knowledge cannot tackle test data that are not from the training classes. Open set recognition (OSR) aims to address this… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: under review