Skip to main content

Showing 1–50 of 113 results for author: Yin, T

.
  1. arXiv:2604.09089  [pdf, ps, other

    cs.SE cs.AI cs.CR

    DeepGuard: Secure Code Generation via Multi-Layer Semantic Aggregation

    Authors: Li Huang, Zhongxin Liu, Yifan Wu, Tao Yin, Dong Li, Jichao Bi, Nankun Mu, Hongyu Zhang, Meng Yan

    Abstract: Large Language Models (LLMs) for code generation can replicate insecure patterns from their training data. To mitigate this, a common strategy for security hardening is to fine-tune models using supervision derived from the final transformer layer. However, this design may suffer from a final-layer bottleneck: vulnerability-discriminative cues can be distributed across layers and become less detec… ▽ More

    Submitted 10 April, 2026; originally announced April 2026.

    Comments: ACL 2026 main conference

  2. arXiv:2603.16271  [pdf, ps, other

    cs.CV

    VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment

    Authors: Tengjiao Yin, Jinglei Shi, Heng Guo, Xi Wang

    Abstract: Video diffusion models lack explicit geometric supervision during training, leading to inconsistency artifacts such as object deformation, spatial drift, and depth violations in generated videos. To address this limitation, we propose a geometry-based reward model that leverages pretrained geometric foundation models to evaluate multi-view consistency through cross-frame reprojection error. Unlike… ▽ More

    Submitted 21 March, 2026; v1 submitted 17 March, 2026; originally announced March 2026.

    Comments: Project Page: https://vigor-geometry-reward.com/

  3. arXiv:2603.09030  [pdf, ps, other

    cs.RO cs.AI

    PlayWorld: Learning Robot World Models from Autonomous Play

    Authors: Tenny Yin, Zhiting Mei, Zhonghe Zheng, Miyu Yamane, David Wang, Jade Sceats, Samuel M. Bateman, Lihan Zha, Apurva Badithela, Ola Shorinwa, Anirudha Majumdar

    Abstract: Action-conditioned video models offer a promising path to building general-purpose robot simulators that can improve directly from data. Yet, despite training on large-scale robot datasets, current state-of-the-art video models still struggle to predict physically consistent robot-object interactions that are crucial in robotic manipulation. To close this gap, we present PlayWorld, a simple, scala… ▽ More

    Submitted 5 April, 2026; v1 submitted 9 March, 2026; originally announced March 2026.

    Comments: Website: https://robot-playworld.github.io/

  4. arXiv:2602.10556  [pdf, ps, other

    cs.RO cs.AI

    LAP: Language-Action Pre-Training Enables Zero-shot Cross-Embodiment Transfer

    Authors: Lihan Zha, Asher J. Hancock, Mingtong Zhang, Tenny Yin, Yixuan Huang, Dhruv Shah, Allen Z. Ren, Anirudha Majumdar

    Abstract: A long-standing goal in robotics is a generalist policy that can be deployed zero-shot on new robot embodiments without per-embodiment adaptation. Despite large-scale multi-embodiment pre-training, existing Vision-Language-Action models (VLAs) remain tightly coupled to their training embodiments and typically require costly fine-tuning. We introduce Language-Action Pre-training (LAP), a simple rec… ▽ More

    Submitted 15 February, 2026; v1 submitted 11 February, 2026; originally announced February 2026.

    Comments: Project website: https://lap-vla.github.io

  5. arXiv:2602.08559  [pdf, ps, other

    cs.IR

    QARM V2: Quantitative Alignment Multi-Modal Recommendation for Reasoning User Sequence Modeling

    Authors: Tian Xia, Jiaqi Zhang, Yueyang Liu, Hongjian Dou, Tingya Yin, Jiangxia Cao, Xulei Liang, Tianlu Xie, Lihao Liu, Xiang Chen, Shen Wang, Changxin Lao, Haixiang Gan, Jinkai Yu, Keting Cen, Lu Hao, Xu Zhang, Qiqiang Zhong, Zhongbo Sun, Yiyu Wang, Shuang Yang, Mingxin Wen, Xiangyu Wu, Shaoguo Liu, Tingting Gao , et al. (3 additional authors not shown)

    Abstract: With the evolution of large language models (LLMs), there is growing interest in leveraging their rich semantic understanding to enhance industrial recommendation systems (RecSys). Traditional RecSys relies on ID-based embeddings for user sequence modeling in the General Search Unit (GSU) and Exact Search Unit (ESU) paradigm, which suffers from low information density, knowledge isolation, and wea… ▽ More

    Submitted 9 February, 2026; originally announced February 2026.

    Comments: Work in progress

  6. arXiv:2602.05723  [pdf, ps, other

    cs.AI

    Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification

    Authors: Taoye Yin, Haoyuan Hu, Yaxin Fan, Xinhao Chen, Xinya Wu, Kai Deng, Kezun Zhang, Feng Wang

    Abstract: In financial Retrieval-Augmented Generation (RAG) systems, models frequently rely on retrieved documents to generate accurate responses due to the time-sensitive nature of the financial domain. While retrieved documents help address knowledge gaps, model-generated responses still suffer from hallucinations that contradict the retrieved information. To mitigate this inconsistency, we propose a Rein… ▽ More

    Submitted 5 February, 2026; originally announced February 2026.

    Comments: accepted by ICASSP 2026

  7. arXiv:2601.07823  [pdf, ps, other

    eess.SY cs.RO

    Video Generation Models in Robotics -- Applications, Research Challenges, Future Directions

    Authors: Zhiting Mei, Tenny Yin, Ola Shorinwa, Apurva Badithela, Zhonghe Zheng, Joseph Bruno, Madison Bland, Lihan Zha, Asher Hancock, Jaime Fernández Fisac, Philip Dames, Anirudha Majumdar

    Abstract: Video generation models have emerged as high-fidelity models of the physical world, capable of synthesizing high-quality videos capturing fine-grained interactions between agents and their environments conditioned on multi-modal user inputs. Their impressive capabilities address many of the long-standing challenges faced by physics-based simulators, driving broad adoption in many problem domains,… ▽ More

    Submitted 12 January, 2026; originally announced January 2026.

  8. arXiv:2512.14048  [pdf, ps, other

    cs.AI

    Intention Chain-of-Thought Prompting with Dynamic Routing for Code Generation

    Authors: Shen Li, Li Huang, Shaoxiong Zhan, Weifeng Sun, Tao Yin, Zhongxin Liu, Meng Yan

    Abstract: Large language models (LLMs) exhibit strong generative capabilities and have shown great potential in code generation. Existing chain-of-thought (CoT) prompting methods enhance model reasoning by eliciting intermediate steps, but suffer from two major limitations: First, their uniform application tends to induce overthinking on simple tasks. Second, they lack intention abstraction in code generati… ▽ More

    Submitted 15 December, 2025; originally announced December 2025.

    Comments: Accepted at AAAI-2026

  9. arXiv:2512.13024  [pdf

    physics.optics

    Theory of Remaining Exceptional Points from Nongeneric Splitting in Non-Hermitian Systems

    Authors: Teng Yin, Hao Zhang

    Abstract: In non-Hermitian physics, high-order exceptional points(HOEPs) with eigenvalues and eigenvectors coalesce are known for their enhanced sensitivity to perturbations. Typically, they exhibit eigenvalue splitting that scales as ε^(1/n), which is referred to as the generic response. However, under certain conditions, a nongeneric response of HOEPs occurs where the splitting follows a lower order ε^(1/… ▽ More

    Submitted 15 December, 2025; originally announced December 2025.

  10. arXiv:2512.05927  [pdf, ps, other

    cs.CV cs.AI cs.RO

    World Models That Know When They Don't Know - Controllable Video Generation with Calibrated Uncertainty

    Authors: Zhiting Mei, Tenny Yin, Micah Baker, Ola Shorinwa, Anirudha Majumdar

    Abstract: Recent advances in generative video models have led to significant breakthroughs in high-fidelity video synthesis, specifically in controllable video generation where the generated video is conditioned on text and action inputs, e.g., in instruction-guided video editing and world modeling in robotics. Despite these exceptional capabilities, controllable video models often hallucinate - generating… ▽ More

    Submitted 10 March, 2026; v1 submitted 5 December, 2025; originally announced December 2025.

  11. arXiv:2511.11685  [pdf, ps, other

    cs.LG

    R-Tuning: Wavelet-Decomposed Replay and Semantic Alignment for Continual Adaptation of Pretrained Time-Series Models

    Authors: Tianyi Yin, Jingwei Wang, Chenze Wang, Han Wang, Jiexuan Cai, Min Liu, Yunlong Ma, Kun Gao, Yuting Song, Weiming Shen

    Abstract: Pre-trained models have demonstrated exceptional generalization capabilities in time-series forecasting; however, adapting them to evolving data distributions remains a significant challenge. A key hurdle lies in accessing the original training data, as fine-tuning solely on new data often leads to catastrophic forgetting. To address this issue, we propose Replay Tuning (R-Tuning), a novel framewo… ▽ More

    Submitted 12 November, 2025; originally announced November 2025.

  12. arXiv:2510.12245  [pdf, ps, other

    cs.LG cs.AI

    MoRA: On-the-fly Molecule-aware Low-Rank Adaptation Framework for LLM-based Multi-Modal Molecular Assistant

    Authors: Tao Yin, Xiaohong Zhang, Jiacheng Zhang, Li Huang, Zhibin Zhang, Yuansong Zeng, Jin Xie, Meng Yan

    Abstract: Effectively integrating molecular graph structures with Large Language Models (LLMs) is a key challenge in drug discovery. Most existing multi-modal alignment methods typically process these structures by fine-tuning the LLM or adding a static adapter simultaneously. However, these approaches have two main limitations: (1) it optimizes a shared parameter space across all molecular inputs, limiting… ▽ More

    Submitted 14 October, 2025; originally announced October 2025.

  13. arXiv:2509.24414  [pdf, ps, other

    cs.LG cs.AI

    ScatterAD: Temporal-Topological Scattering Mechanism for Time Series Anomaly Detection

    Authors: Tao Yin, Xiaohong Zhang, Shaochen Fu, Zhibin Zhang, Li Huang, Yiyuan Yang, Kaixiang Yang, Meng Yan

    Abstract: One main challenge in time series anomaly detection for industrial IoT lies in the complex spatio-temporal couplings within multivariate data. However, traditional anomaly detection methods focus on modeling spatial or temporal dependencies independently, resulting in suboptimal representation learning and limited sensitivity to anomalous dispersion in high-dimensional spaces. In this work, we con… ▽ More

    Submitted 29 September, 2025; originally announced September 2025.

    Comments: 39th Conference on Neural Information Processing Systems (NeurIPS 2025)

  14. arXiv:2508.19325  [pdf, ps, other

    cs.CV

    PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI

    Authors: Haoyang Su, Jin-Yi Xiang, Shaohao Rui, Yifan Gao, Xingyu Chen, Tingxuan Yin, Shaoting Zhang, Xiaosong Wang, Lian-Ming Wu

    Abstract: Accurate prediction of major adverse cardiac events (MACE) remains a central challenge in cardiovascular prognosis. We present PRISM (Prompt-guided Representation Integration for Survival Modeling), a self-supervised framework that integrates visual representations from non-contrast cardiac cine magnetic resonance imaging with structured electronic health records (EHRs) for survival analysis. PRIS… ▽ More

    Submitted 29 January, 2026; v1 submitted 26 August, 2025; originally announced August 2025.

  15. arXiv:2507.14480  [pdf, ps, other

    math.AP math-ph

    On the direct and inverse electromagnetic scattering in a parallel-plate waveguide

    Authors: Jiawei Liang, Maojun Li, Tao Yin

    Abstract: This paper devotes to providing rigorous theoretical analysis of the wellposedness of the direct problem and the uniqueness of the inverse problem of electromagnetic scattering in a parallel-plate waveguide. The direct problem is reduced to an equivalent boundary value problem on a bounded domain by introducing an exact transparent boundary condition in terms of the electric-to-magnetic Calderón o… ▽ More

    Submitted 19 July, 2025; originally announced July 2025.

  16. arXiv:2507.11310  [pdf, ps, other

    cs.CR cs.CL

    LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification

    Authors: Fengxiao Tang, Huan Li, Ming Zhao, Zongzong Wu, Shisong Peng, Tao Yin

    Abstract: Verifying the credibility of Cyber Threat Intelligence (CTI) is essential for reliable cybersecurity defense. However, traditional approaches typically treat this task as a static classification problem, relying on handcrafted features or isolated deep learning models. These methods often lack the robustness needed to handle incomplete, heterogeneous, or noisy intelligence, and they provide limite… ▽ More

    Submitted 15 July, 2025; originally announced July 2025.

  17. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3410 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 19 December, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  18. arXiv:2507.05747  [pdf, ps, other

    math.NA

    Regularized boundary integral equation methods for open-arc scattering problems in thermoelasticity

    Authors: Yixuan X. Kong, José Pinto, Tao Yin

    Abstract: This paper devotes to developing novel boundary integral equation (BIE) solvers for the problem of thermoelastic scattering by open-arcs with four different boundary conditions in two dimensions. The proposed methodology is inspired by the Calderón formulas, whose eigenvalues are shown to accumulate at particular points depending only on Lamé parameters, satisfied by the thermoelastic boundary int… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  19. arXiv:2507.05725  [pdf, ps, other

    math.NA

    Multi-patch/multiple-scattering frequency-time hybrid solver for interior and exterior wave equation problems

    Authors: Shuai Pan, Gang Bao, Tao Yin, Oscar P. Bruno

    Abstract: This paper proposes a new multiple-scattering frequency-time hybrid (FTH-MS) integral equation solver for problems of wave scattering by obstacles in two dimensional space, including interior problems in closed cavities and problems exterior to a set of disconnected open or closed scattering obstacles. The multiple-scattering FTH-MS method is based on a partition of the domain boundary into a user… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: 28 pages, 17 figures

  20. arXiv:2506.18183  [pdf, ps, other

    cs.AI cs.CL

    Reasoning about Uncertainty: Do Reasoning Models Know When They Don't Know?

    Authors: Zhiting Mei, Christina Zhang, Tenny Yin, Justin Lidard, Ola Shorinwa, Anirudha Majumdar

    Abstract: Reasoning language models have set state-of-the-art (SOTA) records on many challenging benchmarks, enabled by multi-step reasoning induced using reinforcement learning. However, like previous language models, reasoning models are prone to generating confident, plausible responses that are incorrect (hallucinations). Knowing when and how much to trust these models is critical to the safe deployment… ▽ More

    Submitted 17 July, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

  21. arXiv:2506.01600  [pdf, ps, other

    cs.RO cs.AI cs.CV

    WoMAP: World Models For Embodied Open-Vocabulary Object Localization

    Authors: Tenny Yin, Zhiting Mei, Tao Sun, Lihan Zha, Emily Zhou, Jeremy Bao, Miyu Yamane, Ola Shorinwa, Anirudha Majumdar

    Abstract: Language-instructed active object localization is a critical challenge for robots, requiring efficient exploration of partially observable environments. However, state-of-the-art approaches either struggle to generalize beyond demonstration datasets (e.g., imitation learning methods) or fail to generate physically grounded actions (e.g., VLMs). To address these limitations, we introduce WoMAP (Wor… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  22. arXiv:2505.24288  [pdf, ps, other

    math.AP math.NA

    Factorization method for near-field inverse scattering problems in elastodynamics

    Authors: Chun Liu, Guanghui Hu, Tao Yin, Bo Zhang

    Abstract: Consider a time-harmonic elastic point source incident on a bounded obstacle which is embedded in an open space filled with a homogeneous and isotropic elastic medium. This paper is concerned with the inverse problem of recovering the location and shape of the obstacle from near-field data generated by infinitely many incident point source waves at a fixed energy. The incident point sources and th… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: 20 pages, 20 figures

    MSC Class: 35R30; 65N21; 35P25

  23. arXiv:2505.19400  [pdf, ps, other

    gr-qc

    Highly-accurate neutron star modeling in the Hartle-Thorne Approximation

    Authors: Carlos Conde-Ocazionez, Tuojin Yin, Jaquelyn Noronha-Hostler, Nicolás Yunes

    Abstract: Future X-ray missions, such as NICER and LOFT, together with gravitational-wave observations from ground-based detectors, will provide new insights into neutron stars. Interpreting accurate observations in the future will require accurate models of their gravitational fields. In this first paper of a two-part series, we construct the perturbation equations for slowly-rotating, isolated, and unmagn… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 36 pages, 1 figure

  24. arXiv:2505.12074  [pdf, ps, other

    cs.CV

    Denoising Mutual Knowledge Distillation in Bi-Directional Multiple Instance Learning

    Authors: Chen Shu, Boyu Fu, Yiman Li, Ting Yin, Wenchuan Zhang, Jie Chen, Yuhao Yi, Hong Bu

    Abstract: Multiple Instance Learning is the predominant method for Whole Slide Image classification in digital pathology, enabling the use of slide-level labels to supervise model training. Although MIL eliminates the tedious fine-grained annotation process for supervised learning, whether it can learn accurate bag- and instance-level classifiers remains a question. To address the issue, instance-level clas… ▽ More

    Submitted 27 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

    Comments: 15 pages, 3 figures

  25. arXiv:2505.03774  [pdf, other

    cs.LG cs.SI

    Out-of-Distribution Detection in Heterogeneous Graphs via Energy Propagation

    Authors: Tao Yin, Chen Zhao, Xiaoyan Liu, Minglai Shao

    Abstract: Graph neural networks (GNNs) are proven effective in extracting complex node and structural information from graph data. While current GNNs perform well in node classification tasks within in-distribution (ID) settings, real-world scenarios often present distribution shifts, leading to the presence of out-of-distribution (OOD) nodes. OOD detection in graphs is a crucial and challenging task. Most… ▽ More

    Submitted 29 April, 2025; originally announced May 2025.

    Comments: Knowledge-Based Systems 2025

  26. arXiv:2504.07997  [pdf, ps, other

    cs.CL

    BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models

    Authors: Tian Xie, Tongxin Yin, Vaishakh Keshava, Xueru Zhang, Siddhartha Reddy Jonnalagadda

    Abstract: While large language models (LLMs) play increasingly significant roles in society, research shows they continue to generate content that reflects social bias against sensitive groups. Existing benchmarks effectively identify these biases, but a critical gap remains in understanding the underlying reasoning processes that produce them. This paper addresses this gap by evaluating the causal reasonin… ▽ More

    Submitted 10 March, 2026; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: This work has been done when the first author is at Google. The first author is a student at the Ohio State University

  27. arXiv:2503.09580  [pdf, ps, other

    math.NA math.AP

    A fast Fourier spectral method for the linearized Boltzmann collision operator

    Authors: Tianai Yin, Zhenning Cai, Yanli Wang

    Abstract: We introduce a fast Fourier spectral method to compute linearized collision operators of the Boltzmann equation for variable hard-sphere gases. While the state-of-the-art method provides a computational cost O(MN^4 log N), with N being the number of modes in each direction and M being the number of quadrature points on a hemisphere, our method reduces the cost to O(N^4 log N), removing the factor… ▽ More

    Submitted 14 September, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

    MSC Class: 65M70; 76P05; 35Q20

  28. arXiv:2503.04836  [pdf, ps, other

    eess.IV cs.AI cs.CV

    PGAD: Prototype-Guided Adaptive Distillation for Multi-Modal Learning in AD Diagnosis

    Authors: Yanfei Li, Teng Yin, Wenyi Shang, Jingyu Liu, Xi Wang, Kaiyang Zhao

    Abstract: Missing modalities pose a major issue in Alzheimer's Disease (AD) diagnosis, as many subjects lack full imaging data due to cost and clinical constraints. While multi-modal learning leverages complementary information, most existing methods train only on complete data, ignoring the large proportion of incomplete samples in real-world datasets like ADNI. This reduces the effective training set and… ▽ More

    Submitted 26 August, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  29. arXiv:2412.12226  [pdf, other

    cs.LG cs.AI

    Apollo-Forecast: Overcoming Aliasing and Inference Speed Challenges in Language Models for Time Series Forecasting

    Authors: Tianyi Yin, Jingwei Wang, Yunlong Ma, Han Wang, Chenze Wang, Yukai Zhao, Min Liu, Weiming Shen, Yufeng Chen

    Abstract: Encoding time series into tokens and using language models for processing has been shown to substantially augment the models' ability to generalize to unseen tasks. However, existing language models for time series forecasting encounter several obstacles, including aliasing distortion and prolonged inference times, primarily due to the limitations of quantization processes and the computational de… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  30. arXiv:2412.07772  [pdf, ps, other

    cs.CV

    From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

    Authors: Tianwei Yin, Qiang Zhang, Richard Zhang, William T. Freeman, Fredo Durand, Eli Shechtman, Xun Huang

    Abstract: Current video diffusion models achieve impressive generation quality but struggle in interactive applications due to bidirectional attention dependencies. The generation of a single frame requires the model to process the entire sequence, including the future. We address this limitation by adapting a pretrained bidirectional diffusion transformer to an autoregressive transformer that generates fra… ▽ More

    Submitted 23 September, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: CVPR 2025. Project Page: https://causvid.github.io/

  31. arXiv:2412.04470  [pdf, other

    cs.CV

    Turbo3D: Ultra-fast Text-to-3D Generation

    Authors: Hanzhe Hu, Tianwei Yin, Fujun Luan, Yiwei Hu, Hao Tan, Zexiang Xu, Sai Bi, Shubham Tulsiani, Kai Zhang

    Abstract: We present Turbo3D, an ultra-fast text-to-3D system capable of generating high-quality Gaussian splatting assets in under one second. Turbo3D employs a rapid 4-step, 4-view diffusion generator and an efficient feed-forward Gaussian reconstructor, both operating in latent space. The 4-step, 4-view generator is a student model distilled through a novel Dual-Teacher approach, which encourages the stu… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: project page: https://turbo-3d.github.io/

  32. arXiv:2412.04408  [pdf, other

    cs.IT cs.LG

    Providing Differential Privacy for Federated Learning Over Wireless: A Cross-layer Framework

    Authors: Jiayu Mao, Tongxin Yin, Aylin Yener, Mingyan Liu

    Abstract: Federated Learning (FL) is a distributed machine learning framework that inherently allows edge devices to maintain their local training data, thus providing some level of privacy. However, FL's model updates still pose a risk of privacy leakage, which must be mitigated. Over-the-air FL (OTA-FL) is an adapted FL design for wireless edge networks that leverages the natural superposition property of… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: submitted for an IEEE publication

  33. arXiv:2411.08043  [pdf, other

    physics.space-ph physics.geo-ph

    Graph-GIC: A Smart and Parallelized Geomagnetically Induced Current Modelling Algorithm Based on Graph Theory for Space Weather Applications

    Authors: Wen Chen, Ding Yuan, Xueshang Feng, Stefaan Poedts, Zhengyang Zou, Song Feng, Yuxuan Zhu, Tong Yin

    Abstract: Geomagnetically Induced Current (GIC) refers to the electromagnetic response of the Earth and its conductive modern infrastructures to space weather and would pose a significant threat to high-voltage power grids designed for the alternative current operation. To assess the impact of space weather on the power grid, one needs to calculate the GIC on a national or continental scale. In this study,… ▽ More

    Submitted 29 October, 2024; originally announced November 2024.

    Comments: 19 pages, 10 figures

  34. arXiv:2411.05626  [pdf, other

    physics.comp-ph math.NA

    Can Efficient Fourier-Transform Techniques Favorably Impact on Broadband Computational Electromagnetism?

    Authors: Thomas G. Anderson, Mark Lyon, Tao Yin, Oscar P. Bruno

    Abstract: In view of recently demonstrated joint use of novel Fourier-transform techniques and effective high-accuracy frequency domain solvers related to the Method of Moments, it is argued that a set of transformative innovations could be developed for the effective, accurate and efficient simulation of problems of wave propagation and scattering of broadband, time-dependent wavefields. This contribution… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

    MSC Class: 65R10; 65R20

  35. arXiv:2411.02248  [pdf, other

    eess.SY

    Advancing Cyber-Attack Detection in Power Systems: A Comparative Study of Machine Learning and Graph Neural Network Approaches

    Authors: Tianzhixi Yin, Syed Ahsan Raza Naqvi, Sai Pushpak Nandanoori, Soumya Kundu

    Abstract: This paper explores the detection and localization of cyber-attacks on time-series measurements data in power systems, focusing on comparing conventional machine learning (ML) like k-means, deep learning method like autoencoder, and graph neural network (GNN)-based techniques. We assess the detection accuracy of these approaches and their potential to pinpoint the locations of specific sensor meas… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  36. arXiv:2410.17037  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph

    Aluminum Scandium Nitride as a Functional Material at 1000°C

    Authors: Venkateswarlu Gaddam, Shaurya S. Dabas, Jinghan Gao, David J. Spry, Garrett Baucom, Nicholas G. Rudawski, Tete Yin, Ethan Angerhofer, Philip G. Neudeck, Honggyu Kim, Philip X. -L. Feng, Mark Sheplak, Roozbeh Tabrizian

    Abstract: Aluminum scandium nitride (AlScN) has emerged as a highly promising material for high-temperature applications due to its robust piezoelectric, ferroelectric, and dielectric properties. This study investigates the behavior of Al0.7Sc0.3N thin films in extreme thermal environments, demonstrating functional stability up to 1000°C, making it suitable for use in aerospace, hypersonics, deep-well, and… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  37. arXiv:2409.05251  [pdf, other

    cs.RO

    Online Resynthesis of High-Level Collaborative Tasks for Robots with Changing Capabilities

    Authors: Amy Fang, Tenny Yin, Hadas Kress-Gazit

    Abstract: Given a collaborative high-level task and a team of heterogeneous robots and behaviors to satisfy it, this work focuses on the challenge of automatically, at runtime, adjusting the individual robot behaviors such that the task is still satisfied, when robots encounter changes to their abilities--either failures or additional actions they can perform. We consider tasks encoded in LTL^ψand minimize… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: Under review in IEEE Robotics and Automation Letters

  38. arXiv:2408.00539  [pdf, ps, other

    cs.CL cs.AI

    Intermittent Semi-Working Mask: A New Masking Paradigm for LLMs

    Authors: HaoYuan Hu, Mingcong Lu, Di Luo, XinYa Wu, Jiangcai Zhu, Taoye Yin, Zheng Li, Hao Wang, Shusheng Zhang, KeZun Zhang, KaiLai Shao, Chao Chen, Feng Wang

    Abstract: Multi-turn dialogues and context-intensive tasks challenge Large Language Models (LLMs) to integrate long histories without sacrificing generation quality. Although prefix LLMs can better exploit historical context via bidirectional attention on prefix tokens, they are rarely used in practice because multi-turn training requires many duplicated triplets, and its bidirectional prefix prevents KV-ca… ▽ More

    Submitted 17 February, 2026; v1 submitted 1 August, 2024; originally announced August 2024.

  39. Cascaded two-stage feature clustering and selection via separability and consistency in fuzzy decision systems

    Authors: Yuepeng Chen, Weiping Ding, Hengrong Ju, Jiashuang Huang, Tao Yin

    Abstract: Feature selection is a vital technique in machine learning, as it can reduce computational complexity, improve model performance, and mitigate the risk of overfitting. However, the increasing complexity and dimensionality of datasets pose significant challenges in the selection of features. Focusing on these challenges, this paper proposes a cascaded two-stage feature clustering and selection algo… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by IEEE Transactions on Fuzzy Systems for publication. Permission from IEEE must be obtained for all other uses, in any current or future media. The final version is available at [10.1109/TFUZZ.2024.3420963]

    Journal ref: IEEE Transactions on Fuzzy Systems 2024

  40. arXiv:2407.15756  [pdf, other

    cs.LG cs.AI

    Model editing for distribution shifts in uranium oxide morphological analysis

    Authors: Davis Brown, Cody Nizinski, Madelyn Shapiro, Corey Fallon, Tianzhixi Yin, Henry Kvinge, Jonathan H. Tu

    Abstract: Deep learning still struggles with certain kinds of scientific data. Notably, pretraining data may not provide coverage of relevant distribution shifts (e.g., shifts induced via the use of different measurement instruments). We consider deep learning models trained to classify the synthesis conditions of uranium ore concentrates (UOCs) and show that model editing is particularly effective for impr… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Presented at CV4MS @ CVPR 2024

  41. arXiv:2406.18019  [pdf, other

    cs.RO

    Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

    Authors: Amy Fang, Tenny Yin, Jiawei Lin, Hadas Kress-Gazit

    Abstract: We propose a control synthesis framework for a heterogeneous multi-robot system to satisfy collaborative tasks, where actions may take varying duration of time to complete. We encode tasks using the discrete logic LTL^ψ, which uses the concept of bindings to interleave robot actions and express information about relationship between specific task requirements and robot assignments. We present a sy… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Under review in IEEE Transactions on Robotics

  42. arXiv:2405.14867  [pdf, other

    cs.CV

    Improved Distribution Matching Distillation for Fast Image Synthesis

    Authors: Tianwei Yin, Michaël Gharbi, Taesung Park, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman

    Abstract: Recent approaches have shown promises distilling diffusion models into efficient one-step generators. Among them, Distribution Matching Distillation (DMD) produces one-step generators that match their teacher in distribution, without enforcing a one-to-one correspondence with the sampling trajectories of their teachers. However, to ensure stable training, DMD requires an additional regression loss… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Code, model, and dataset are available at https://tianweiy.github.io/dmd2

  43. arXiv:2401.15862  [pdf, other

    math.NA

    PML-based boundary integral equation method for electromagnetic scattering problems in a layered-medium

    Authors: Gang Bao, Wangtao Lu, Tao Yin, Lu Zhang

    Abstract: This paper proposes a new boundary integral equation (BIE) methodology based on the perfectly matched layer (PML) truncation technique for solving the electromagnetic scattering problems in a multi-layered medium. Instead of using the original PML stretched fields, artificial fields which are also equivalent to the solutions in the physical region are introduced. This significantly simplifies the… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 26 pages, 13 figures

  44. arXiv:2312.15460  [pdf, other

    math.NA

    On a Robin-type non-singular coupling scheme for solving the wave scattering problems

    Authors: Xiaojuan Liu, Maojun Li, Tao Yin

    Abstract: This paper studies a non-singular coupling scheme for solving the acoustic and elastic wave scattering problems and its extension to the problems of Laplace and Lamé equations and the problem with a compactly supported inhomogeneity is also briefly discussed. Relying on the solution representation of the wave scattering problem, a Robin-type artificial boundary condition in terms of layer potentia… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  45. arXiv:2312.15189  [pdf, other

    physics.comp-ph math.NA

    Helmholtz decomposition based windowed Green function methods for elastic scattering problems on a half-space

    Authors: Tao Yin, Lu Zhang, Weiying Zheng, Xiaopeng Zhu

    Abstract: This paper proposes a new Helmholtz decomposition based windowed Green function (HD-WGF) method for solving the time-harmonic elastic scattering problems on a half-space with Dirichlet boundary conditions in both 2D and 3D. The Helmholtz decomposition is applied to separate the pressure and shear waves, which satisfy the Helmholtz and Helmholtz/Maxwell equations, respectively, and the correspondin… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: 20 pages, 5 figures

  46. arXiv:2312.06302  [pdf

    physics.app-ph

    Non-iterative Methods in Inhomogeneous Background Inverse Scattering Imaging Problem Assisted by Swin Transformer Network

    Authors: Naike Du, Tiantian Yin, Jing Wang, Rencheng Song, Kuiwen Xu, Bingyuan Liang, Sheng Sun, Xiuzhu Ye

    Abstract: A deep learning-assisted inversion method is proposed to solve the inhomogeneous background imaging problem. Three non-iterative methods, namely the distorted-Born (DB) major current coefficients method, the DB modified Born approximation method, and the DB connection method, are introduced to address the inhomogeneous background inverse scattering problem. These methods retain the multiple scatte… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: We have submitted this paper to TGRS(IEEE Transactionson Geoscience andRemote Sensing) on 29-Jan-2023; and resubmitted on 12-Jul-2023

  47. arXiv:2311.18828  [pdf, other

    cs.CV

    One-step Diffusion with Distribution Matching Distillation

    Authors: Tianwei Yin, Michaël Gharbi, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman, Taesung Park

    Abstract: Diffusion models generate high-quality images but require dozens of forward passes. We introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model into a one-step image generator with minimal impact on image quality. We enforce the one-step image generator match the diffusion model at distribution level, by minimizing an approximate KL divergence whose gradient c… ▽ More

    Submitted 4 October, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CVPR 2024, Project page: https://tianweiy.github.io/dmd/

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  48. arXiv:2311.14138  [pdf, ps, other

    physics.comp-ph math.NA

    A symmetric Gauss-Seidel method for the steady-state Boltzmann equation

    Authors: Tianai Yin, Zhenning Cai, Yanli Wang

    Abstract: We introduce numerical solvers for the steady-state Boltzmann equation based on the symmetric Gauss-Seidel (SGS) method. Due to the quadratic collision operator in the Boltzmann equation, the SGS method requires solving a nonlinear system on each grid cell, and we consider two methods, namely Newton's method and the fixed-point iteration, in our numerical tests. For small Knudsen numbers, our meth… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  49. arXiv:2311.12264  [pdf, other

    eess.SY cs.AI cs.LG

    Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations

    Authors: Sayak Mukherjee, Ramij R. Hossain, Sheik M. Mohiuddin, Yuan Liu, Wei Du, Veronica Adetola, Rohit A. Jinsiwale, Qiuhua Huang, Tianzhixi Yin, Ankit Singhal

    Abstract: Improving system-level resiliency of networked microgrids is an important aspect with increased population of inverter-based resources (IBRs). This paper (1) presents resilient control design in presence of adversarial cyber-events, and proposes a novel federated reinforcement learning (Fed-RL) approach to tackle (a) model complexities, unknown dynamical behaviors of IBR devices, (b) privacy issue… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 7 figures

    Journal ref: IEEE Transactions on Smart Grid ( Volume: 16, Issue: 2, March 2025)

  50. arXiv:2310.06341  [pdf, other

    cs.LG

    Federated Learning with Reduced Information Leakage and Computation

    Authors: Tongxin Yin, Xuwei Tan, Xueru Zhang, Mohammad Mahdi Khalili, Mingyan Liu

    Abstract: Federated learning (FL) is a distributed learning paradigm that allows multiple decentralized clients to collaboratively learn a common model without sharing local data. Although local data is not exposed directly, privacy concerns nonetheless exist as clients' sensitive information can be inferred from intermediate computations. Moreover, such information leakage accumulates substantially over ti… ▽ More

    Submitted 1 October, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted by Transactions on Machine Learning Research (TMLR)