Skip to main content

Showing 1–50 of 198 results for author: Hou, H

.
  1. arXiv:2604.12512  [pdf, ps, other

    cs.CV cs.AI

    NTIRE 2026 The 3rd Restore Any Image Model (RAIM) Challenge: Professional Image Quality Assessment (Track 1)

    Authors: Guanyi Qin, Jie Liang, Bingbing Zhang, Lishen Qu, Ya-nan Guan, Hui Zeng, Lei Zhang, Radu Timofte, Jianhui Sun, Xinli Yue, Tao Shao, Huan Hou, Wenjie Liao, Shuhao Han, Jieyu Yuan, Chunle Guo, Chongyi Li, Zewen Chen, Yunze Liu, Jian Guo, Juan Wang, Yun Zeng, Bing Li, Weiming Hu, Hesong Li , et al. (28 additional authors not shown)

    Abstract: In this paper, we present an overview of the NTIRE 2026 challenge on the 3rd Restore Any Image Model in the Wild, specifically focusing on Track 1: Professional Image Quality Assessment. Conventional Image Quality Assessment (IQA) typically relies on scalar scores. By compressing complex visual characteristics into a single number, these methods fundamentally struggle to distinguish subtle differe… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: NTIRE Challenge Report. Accepted by CVPRW 2026

  2. arXiv:2604.12282  [pdf, ps, other

    cs.CL

    Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning

    Authors: Houxing Ren, Mingjie Zhan, Zimu Lu, Ke Wang, Yunqiao Yang, Haotian Hou, Hongsheng Li

    Abstract: Spreadsheets are central to real-world applications such as enterprise reporting, auditing, and scientific data management. Despite their ubiquity, existing large language model based approaches typically treat tables as plain text, overlooking critical layout cues and visual semantics. Moreover, real-world spreadsheets are often massive in scale, exceeding the input length that LLMs can efficient… ▽ More

    Submitted 14 April, 2026; originally announced April 2026.

    Comments: Accepted to ACL 2026 (main conference)

  3. arXiv:2604.10931  [pdf, ps, other

    eess.SP

    Reliable Online Resource Allocation for Multi-User Semantic Communications: A Constraint Bayesian Optimization Approach

    Authors: Huawei Hou, Suzhi Bi, Xian Li, Haixia Zhang, Zhi Quan

    Abstract: Semantic communication has been increasingly integrated into edge computing systems for reconstruction tasks, owing to its advantages in source compression, robustness to channel noise, and task execution efficiency. However, the black-box nature of neural-network (NN)-based semantic codecs, together with the noisy transmission of semantic features, makes it difficult to allocate transmission reso… ▽ More

    Submitted 12 April, 2026; originally announced April 2026.

    Comments: 13 pages, 9 figures. The paper has been submitted for potential journal publications

  4. arXiv:2603.20621  [pdf, ps, other

    eess.SP

    A Channel Knowledge Map-Driven Two-Stage Coordinated User Scheduling in Multi-Cell Massive MIMO Systems

    Authors: Jiayang Wan, Hongwei Hou, Jiawei Zhuang, Wenjin Wang, Shi Jin

    Abstract: This paper investigates narrowband coordinated user scheduling in multi-cell massive multiple-input multiple-output (MIMO) systems. We formulate the problem under a spectral-efficiency maximization criterion, revealing inherent challenges in computational complexity and signaling overhead. To address these, we develop a user-scheduling-oriented CKM (US-CKM) and a US-CKM-driven two-stage coordinate… ▽ More

    Submitted 20 March, 2026; originally announced March 2026.

    Comments: This work has been submitted to the IEEE for possible publication

  5. arXiv:2603.15707  [pdf, ps, other

    cs.SE cs.AI

    SEMAG: Self-Evolutionary Multi-Agent Code Generation

    Authors: Yulin Peng, Haowen Hou, Xinxin Zhu, Ying Tiffany He, F. Richard Yu

    Abstract: Large Language Models (LLMs) have made significant progress in handling complex programming tasks. However, current methods rely on manual model selection and fixed workflows, which limit their ability to adapt to changing task complexities. To address this, we propose SEMAG, a Self-Evolutionary Multi-Agent code Generation framework that mimics human coding practices. It decomposes programming tas… ▽ More

    Submitted 16 March, 2026; originally announced March 2026.

  6. arXiv:2603.09803  [pdf, ps, other

    cs.LG

    Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning

    Authors: Tiehua Mei, Minxuan Lv, Leiyu Pan, Zhenpeng Su, Hongru Hou, Hengrui Chen, Ao Xu, Deqing Yang

    Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) improves reasoning in large language models but treats all correct solutions equally, potentially reinforcing flawed traces that get correct answers by chance. We observe that better reasoning are better teachers: high-quality solutions serve as more effective demonstrations than low-quality ones. We term this teaching ability Demonstration Uti… ▽ More

    Submitted 10 March, 2026; originally announced March 2026.

  7. arXiv:2603.03666  [pdf, ps, other

    math.AP

    On non-uniqueness of mild solutions and stationary singular solutions to the Navier-Stokes equations

    Authors: Alexey Cheskidov, Hedong Hou

    Abstract: We prove that the unconditional uniqueness of mild solutions to the Navier-Stokes equations fails in all the Besov spaces with negative regularity index, by constructing non-trivial stationary singular solutions via convex integration. We also establish uniqueness of stationary weak solutions in an endpoint critical space. Similar results are proved for the fractional Navier-Stokes equations with… ▽ More

    Submitted 3 March, 2026; originally announced March 2026.

    Comments: 41 pages. Comments are welcome

    MSC Class: 35A02; 35K55; 35Q30; 42B37; 76D05

  8. arXiv:2603.03009  [pdf, ps, other

    math.PR

    Susceptible-Infected Epidemics on Evolving Graphs at Critical Infection Rate

    Authors: Wenze Chen, Haojie Hou, Ruibo Ma, Dong Yao

    Abstract: Consider an SI process on a graph $G$ where each S--I connection becomes I--I at rate $λ$. Here S and I stand for ``susceptible'' and ``infected'' respectively. The evoSI model is a modification of the SI model in which S--I edges are broken at rate $ρ$ and the ``S'' connects to a randomly chosen vertex. It is proven in Durrett and Yao [2022, Electron. J. Probab.] that, for the supercritical evoSI… ▽ More

    Submitted 3 March, 2026; originally announced March 2026.

    Comments: 45 pages

  9. arXiv:2602.13548  [pdf, ps, other

    cs.IT

    Redundancy-Optimal Constructions of $(1,1)$-Criss-Cross Deletion Correcting Codes with Efficient Encoding/Decoding Algorithms

    Authors: Wenhao Liu, Zhengyi Jiang, Zhongyi Huang, Hanxu Hou

    Abstract: Two-dimensional error-correcting codes, where codewords are represented as $n \times n$ arrays over a $q$-ary alphabet, find important applications in areas such as QR codes, DNA-based storage, and racetrack memories. Among the possible error patterns, $(t_r,t_c)$-criss-cross deletions-where $t_r$ rows and $t_c$ columns are simultaneously deleted-are of particular significance. In this paper, we f… ▽ More

    Submitted 13 February, 2026; originally announced February 2026.

    Comments: 18 pages, 1 figure

  10. arXiv:2602.12922  [pdf, ps, other

    cs.CV

    Beyond Benchmarks of IUGC: Rethinking Requirements of Deep Learning Methods for Intrapartum Ultrasound Biometry from Fetal Ultrasound Videos

    Authors: Jieyun Bai, Zihao Zhou, Yitong Tang, Jie Gan, Zhuonan Liang, Jianan Fan, Lisa B. Mcguire, Jillian L. Clarke, Weidong Cai, Jacaueline Spurway, Yubo Tang, Shiye Wang, Wenda Shen, Wangwang Yu, Yihao Li, Philippe Zhang, Weili Jiang, Yongjie Li, Salem Muhsin Ali Binqahal Al Nasim, Arsen Abzhanov, Numan Saeed, Mohammad Yaqub, Zunhui Xian, Hongxing Lin, Libin Lan , et al. (38 additional authors not shown)

    Abstract: A substantial proportion (45\%) of maternal deaths, neonatal deaths, and stillbirths occur during the intrapartum phase, with a particularly high burden in low- and middle-income countries. Intrapartum biometry plays a critical role in monitoring labor progression; however, the routine use of ultrasound in resource-limited settings is hindered by a shortage of trained sonographers. To address this… ▽ More

    Submitted 13 February, 2026; originally announced February 2026.

  11. arXiv:2602.12861  [pdf, ps, other

    math.GN

    Stone duality of Lawson compact algebraic L-domain

    Authors: Huijun Hou, Ao Shen

    Abstract: In this paper, a subclass of bounded distributive lattices, that is, finitely disjunctive distributive lattices (FDD-lattices) have been introduced. Then we apply it to establish a Stone duality for Lawson compact algebraic L-domains. Furthermore, we develop a dual equivalence between the category of FDD-lattices with lattice homomorphisms and that of Lawson compact algebraic L-domains with spectr… ▽ More

    Submitted 13 February, 2026; originally announced February 2026.

  12. arXiv:2602.12575  [pdf, ps, other

    cs.CL cs.LG

    Discovering Semantic Latent Structures in Psychological Scales: A Response-Free Pathway to Efficient Simplification

    Authors: Bo Wang, Yuxuan Zhang, Yueqin Hu, Hanchao Hou, Kaiping Peng, Shiguang Ni

    Abstract: Psychological scale refinement traditionally relies on response-based methods such as factor analysis, item response theory, and network psychometrics to optimize item composition. Although rigorous, these approaches require large samples and may be constrained by data availability and cross-cultural comparability. Recent advances in natural language processing suggest that the semantic structure… ▽ More

    Submitted 8 March, 2026; v1 submitted 12 February, 2026; originally announced February 2026.

    Comments: 79 pages, 20 figures; parameter perturbation result of epoch-cn updated; minor revisions on grammars

  13. arXiv:2601.07861  [pdf, ps, other

    cs.CL cs.IR

    EmbeddingRWKV: State-Centric Retrieval with Reusable States

    Authors: Haowen Hou, Jie Yang

    Abstract: Current Retrieval-Augmented Generation (RAG) systems typically employ a traditional two-stage pipeline: an embedding model for initial retrieval followed by a reranker for refinement. However, this paradigm suffers from significant inefficiency due to the lack of shared information between stages, leading to substantial redundant computation. To address this limitation, we propose \textbf{State-Ce… ▽ More

    Submitted 9 January, 2026; originally announced January 2026.

    Comments: 23 pages, 3 figures, 6 tables

  14. arXiv:2601.07129  [pdf, ps, other

    math.PR

    Minimum and extremal process for a branching random walk outside the boundary case

    Authors: Xinxin Chen, Haojie Hou

    Abstract: This work extends the studies on the minimum and extremal process of a supercritical branching random walk outside the boundary case which cannot be reduced to the boundary case. We study here the situation where the log-generating function explodes at $1$ and the random walk associated to the spine possesses a stretched exponential tail with exponent $b\in(0,\frac12)$. Under suitable conditions,… ▽ More

    Submitted 13 January, 2026; v1 submitted 11 January, 2026; originally announced January 2026.

    Comments: 42 pages. Update a reference

  15. arXiv:2601.05513  [pdf, ps, other

    cs.IR

    LEAPS: An LLM-Empowered Adaptive Plugin for Taobao AI Search

    Authors: Lei Wang, Jinhang Wu, Zhibin Wang, Biye Li, Haiping Hou

    Abstract: The rapid advancement of large language models has reshaped user search cognition, driving a paradigm shift from discrete keyword-based search to high-dimensional conversational interaction. However, existing e-commerce search architectures face a critical capability deficit in adapting to this change. Users are often caught in a dilemma: precise natural language descriptions frequently trigger ze… ▽ More

    Submitted 8 January, 2026; originally announced January 2026.

  16. arXiv:2601.04524  [pdf, ps, other

    cs.AI

    BioPIE: A Biomedical Protocol Information Extraction Dataset for High-Reasoning-Complexity Experiment Question Answer

    Authors: Haofei Hou, Shunyi Zhao, Fanxu Meng, Kairui Yang, Lecheng Ruan, Qining Wang

    Abstract: Question Answer (QA) systems for biomedical experiments facilitate cross-disciplinary communication, and serve as a foundation for downstream tasks, e.g., laboratory automation. High Information Density (HID) and Multi-Step Reasoning (MSR) pose unique challenges for biomedical experimental QA. While extracting structured knowledge, e.g., Knowledge Graphs (KGs), can substantially benefit biomedical… ▽ More

    Submitted 7 January, 2026; originally announced January 2026.

  17. arXiv:2512.05665  [pdf, ps, other

    cs.CL cs.CV

    Interleaved Latent Visual Reasoning with Selective Perceptual Modeling

    Authors: Shuai Dong, Siyuan Wang, Xingyu Liu, Chenglin Li, Haowen Hou, Zhongyu Wei

    Abstract: Interleaved reasoning paradigms enhance Multimodal Large Language Models (MLLMs) with visual feedback but are hindered by the prohibitive computational cost of re-encoding pixel-dense images. A promising alternative, latent visual reasoning, circumvents this bottleneck yet faces limitations: methods either fail to capture intermediate state evolution due to single-step, non-interleaved structures,… ▽ More

    Submitted 21 January, 2026; v1 submitted 5 December, 2025; originally announced December 2025.

    Comments: 18 pages, 11 figures. Code available at https://github.com/XD111ds/ILVR

  18. arXiv:2512.02557  [pdf, ps, other

    eess.SP

    Deep Learning-Based Joint Uplink-Downlink CSI Acquisition for Next-Generation Upper Mid-Band Systems

    Authors: Xuan He, Hongwei Hou, Yafei Wang, Wenjin Wang, Shi Jin, Symeon Chatzinotas, Björn Ottersten

    Abstract: In next-generation wireless communication systems, the newly designated upper mid-band has attracted considerable attention, also called frequency range 3 (FR3), highlighting the need for downlink (DL) transmission design, which fundamentally relies on accurate CSI. However, CSI acquisition in FR3 systems faces significant challenges: the increased number of antennas and wider transmission bandwid… ▽ More

    Submitted 2 December, 2025; originally announced December 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  19. arXiv:2512.02515  [pdf, ps, other

    cs.SD

    VibOmni: Towards Scalable Bone-conduction Speech Enhancement on Earables

    Authors: Lixing He, Yunqi Guo, Haozheng Hou, Zhenyu Yan

    Abstract: Earables, such as True Wireless Stereo earphones and VR/AR headsets, are increasingly popular, yet their compact design poses challenges for robust voice-related applications like telecommunication and voice assistant interactions in noisy environments. Existing speech enhancement systems, reliant solely on omnidirectional microphones, struggle with ambient noise like competing speakers. To addres… ▽ More

    Submitted 2 December, 2025; originally announced December 2025.

    Comments: Submitted to TMC

  20. arXiv:2511.20090  [pdf, ps, other

    cs.AR cs.AI

    R3A: Reliable RTL Repair Framework with Multi-Agent Fault Localization and Stochastic Tree-of-Thoughts Patch Generation

    Authors: Zizhang Luo, Fan Cui, Kexing Zhou, Runlin Guo, Mile Xia, Hongyuan Hou, Yun Liang

    Abstract: Repairing RTL bugs is crucial for hardware design and verification. Traditional automatic program repair (APR) methods define dedicated search spaces to locate and fix bugs with program synthesis. However, they heavily rely on fixed templates and can only deal with limited bugs. As an alternative, Large Language Models with the ability to understand code semantics can be explored for RTL repair. H… ▽ More

    Submitted 25 November, 2025; v1 submitted 25 November, 2025; originally announced November 2025.

    ACM Class: B.5.3; I.2.2

  21. arXiv:2511.18135  [pdf, ps, other

    physics.soc-ph

    Evaluating Parametric Car-Following Models in Naturalistic Congestion: Insights in Driver Behavior and Model Limitations

    Authors: Huaidian Hou, Arpan Kusari, Brian T. W. Lin

    Abstract: Car-Following is a broadly studied state of driving, and many modeling approaches through various heuristics and engineering methods have been proposed. Congestion is a common traffic phenomenon also widely investigated, both from macroscopic and microscopic perspectives. Yet, current literature lack a unified evaluation of Car-Following models with naturalistic congestion data. This paper compare… ▽ More

    Submitted 22 November, 2025; originally announced November 2025.

    Comments: Presented at the 104th Transportation Research Board Annual Meeting, Washington D.C., 2025. Paper on TRB Archive: https://annualmeeting.mytrb.org/FileUpload/FullPaper?ID=61095&SessionID=23204&ConferenceID=13. Source code available at https://github.com/DanielHou315/Car_Following_Eval

  22. arXiv:2511.14190  [pdf

    physics.optics

    Experimental realization of a full-band wave antireflection based on temporal taper metamaterials

    Authors: Haonan Hou, Kai Peng, Yangkai Wang, Jiarui Wang, Xudong Zhang, Ren Wang, Hao Hu, Jiang Xiong

    Abstract: As time can be introduced as an additional degree of freedom, temporal metamaterials nowadays open up new avenues for wave control and manipulation. Among these advancements, temporal metamaterial-based antireflection coatings have recently emerged as an innovative method that inherently avoids additional spatial insertions. However, prior temporal antireflection models with finite inserted tempor… ▽ More

    Submitted 18 November, 2025; originally announced November 2025.

  23. arXiv:2511.04812  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Multimodal Diffusion Forcing for Forceful Manipulation

    Authors: Zixuan Huang, Huaidian Hou, Dmitry Berenson

    Abstract: Given a dataset of expert trajectories, standard imitation learning approaches typically learn a direct mapping from observations (e.g., RGB images) to actions. However, such methods often overlook the rich interplay between different modalities, i.e., sensory inputs, actions, and rewards, which is crucial for modeling robot behavior and understanding task outcomes. In this work, we propose Multim… ▽ More

    Submitted 13 April, 2026; v1 submitted 6 November, 2025; originally announced November 2025.

    Comments: Project website: https://unified-df.github.io

  24. arXiv:2510.24350  [pdf, ps, other

    eess.SP

    Achieving Constant-Envelope Waveform in CP-OFDMA Framework

    Authors: Yiming Zhu, Zhuhong Zhu, Xiaodong Xu, Hongwei Hou, Wenjin Wang, Rui Ding

    Abstract: OFDM is widely adopted in modern wireless communication systems, but its power efficiency is limited by high envelope fluctuations. Although various high power-efficiency waveforms have been proposed, most are incompatible with the CP-OFDMA framework and remain ineffective in multi-user downlink transmissions. To address this issue, we propose a constant-envelope (CE) waveform design, which enable… ▽ More

    Submitted 28 October, 2025; originally announced October 2025.

    Comments: This work will be submitted to the IEEE for possible publication

  25. arXiv:2510.22039  [pdf, ps, other

    cs.AI q-bio.NC

    Predictive Coding Enhances Meta-RL To Achieve Interpretable Bayes-Optimal Belief Representation Under Partial Observability

    Authors: Po-Chen Kuo, Han Hou, Will Dabney, Edgar Y. Walker

    Abstract: Learning a compact representation of history is critical for planning and generalization in partially observable environments. While meta-reinforcement learning (RL) agents can attain near Bayes-optimal policies, they often fail to learn the compact, interpretable Bayes-optimal belief states. This representational inefficiency potentially limits the agent's adaptability and generalization capacity… ▽ More

    Submitted 24 October, 2025; originally announced October 2025.

    Comments: Accepted to Annual Conference on Neural Information Processing Systems (NeurIPS) 2025

  26. arXiv:2510.17136  [pdf, ps, other

    cs.LG

    In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models

    Authors: Enhao Gu, Haolin Hou

    Abstract: The generation of high-quality, diverse, and prompt-aligned images is a central goal in image-generating diffusion models. The popular classifier-free guidance (CFG) approach improves quality and alignment at the cost of reduced variation, creating an inherent entanglement of these effects. Recent work has successfully disentangled these properties by guiding a model with a separately trained, inf… ▽ More

    Submitted 20 October, 2025; originally announced October 2025.

    Comments: 6 pages, 3 figures. ICML 2025 Workshop submission

    ACM Class: I.2.6; I.5.1; I.5.4

  27. arXiv:2509.19358  [pdf, ps, other

    cs.CL cs.AI

    Benchmarking and Improving LLM Robustness for Personalized Generation

    Authors: Chimaobi Okite, Naihao Deng, Kiran Bodipati, Huaidian Hou, Joyce Chai, Rada Mihalcea

    Abstract: Recent years have witnessed a growing interest in personalizing the responses of large language models (LLMs). While existing evaluations primarily focus on whether a response aligns with a user's preferences, we argue that factuality is an equally important yet often overlooked dimension. In the context of personalization, we define a model as robust if its responses are both factually accurate a… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: First draft. First camera-ready version

  28. arXiv:2509.17743  [pdf, ps, other

    cs.CV

    VideoPro: Adaptive Program Reasoning for Long Video Understanding

    Authors: Chenglin Li, Feng Han, Yikun Wang, Ruilin Li, Shuai Dong, Haowen Hou, Haitao Li, Qianglong Chen, Feng Tao, Jingqi Tong, Yin Zhang, Jiaqi Wang

    Abstract: Large language models (LLMs) have shown promise in generating program workflows for visual tasks. However, previous approaches often rely on closed-source models, lack systematic reasoning, and struggle with long-form video question answering (videoQA). To address these challenges, we introduce the FS-VisPR framework, an adaptive visual program reasoning approach that balances fast reasoning for s… ▽ More

    Submitted 25 January, 2026; v1 submitted 22 September, 2025; originally announced September 2025.

  29. arXiv:2509.16686  [pdf, ps, other

    cs.CL

    EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs

    Authors: Zhengge Cai, Haowen Hou

    Abstract: Reducing the key-value (KV) cache size is a crucial step toward enabling efficient inference in large language models (LLMs), especially under latency and memory constraints. While Multi-Head Attention (MHA) offers strong representational power, it incurs significant memory overhead. Recent work on Multi-head Latent Attention (MLA) mitigates this by compressing KV representations into a shared lat… ▽ More

    Submitted 20 September, 2025; originally announced September 2025.

  30. arXiv:2509.08381  [pdf

    cs.CL cs.AI

    Low-Resource Fine-Tuning for Multi-Task Structured Information Extraction with a Billion-Parameter Instruction-Tuned Model

    Authors: Yu Cheng Chih, Yong Hao Hou

    Abstract: Deploying large language models (LLMs) for structured data extraction in domains such as financial compliance reporting, legal document analytics, and multilingual knowledge base construction is often impractical for smaller teams due to the high cost of running large architectures and the difficulty of preparing large, high-quality datasets. Most recent instruction-tuning studies focus on seven-b… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

    Comments: 13 pages, 8 figures, includes experiments on JSON extraction, knowledge graph extraction, and NER

  31. arXiv:2509.04190  [pdf

    cs.DL

    The changing role of cited papers over time: An analysis of highly cited papers based on a large full-text dataset

    Authors: Gege Lin, Nees Jan van Eck, Haiyan Hou, Zhigang Hu

    Abstract: This paper examines how the role of cited papers evolves over time by analyzing nearly 900 highly cited papers (HCPs) published between 2000 and 2016 and the full text of over 220,000 papers citing them. We investigate multiple citation characteristics, including citation location within the full text, reference and in-text citation types, citation sentiment, and textual and bibliographic relatedn… ▽ More

    Submitted 4 September, 2025; originally announced September 2025.

  32. arXiv:2508.19532  [pdf, ps, other

    cs.CL

    Alignment with Fill-In-the-Middle for Enhancing Code Generation

    Authors: Houxing Ren, Zimu Lu, Weikang Shi, Haotian Hou, Yunqiao Yang, Ke Wang, Aojun Zhou, Junting Pan, Mingjie Zhan, Hongsheng Li

    Abstract: The code generation capabilities of Large Language Models (LLMs) have advanced applications like tool invocation and problem-solving. However, improving performance in code-related tasks remains challenging due to limited training data that is verifiable with accurate test cases. While Direct Preference Optimization (DPO) has shown promise, existing methods for generating test cases still face lim… ▽ More

    Submitted 26 August, 2025; originally announced August 2025.

    Comments: Accepted to EMNLP 2025 (main conference)

  33. arXiv:2508.15156  [pdf, ps, other

    math.PR

    On the maximal displacement of subcritical branching random walks with or without killing

    Authors: Haojie Hou, Shuxiong Zhang

    Abstract: Consider a subcritical branching random walk $\{Z_k\}_{k\geq 0}$ with offspring distribution $\{p_k\}_{k\geq 0}$ and step size $X$. Let $M_n$ denote the rightmost position reached by $\{Z_k\}_{k\geq 0}$ up to generation $n$, and define $M := \sup_{n\geq 0} M_n$. In this paper we give asymptotics of tail probability of $M$ under optimal assumptions $\sum^{\infty}_{k=1}(k\log k) p_k<\infty$ and… ▽ More

    Submitted 20 August, 2025; originally announced August 2025.

    Comments: 34 pages

  34. arXiv:2508.13517  [pdf

    cs.IR cs.AI cs.LG cs.SI

    Heterogeneous Influence Maximization in User Recommendation

    Authors: Hongru Hou, Jiachen Sun, Wenqing Lin, Wendong Bi, Xiangrong Wang, Deqing Yang

    Abstract: User recommendation systems enhance user engagement by encouraging users to act as inviters to interact with other users (invitees), potentially fostering information propagation. Conventional recommendation methods typically focus on modeling interaction willingness. Influence-Maximization (IM) methods focus on identifying a set of users to maximize the information propagation. However, existing… ▽ More

    Submitted 19 August, 2025; originally announced August 2025.

    Comments: Accepted in CIKM 2025

  35. arXiv:2508.12772  [pdf, ps, other

    math.PR

    Law of the iterated logarithm for supercritical non-local spatial branching processes

    Authors: Haojie Hou, Ting Yang

    Abstract: Suppose that $X=(X_{t})_{t\ge 0}$ is either a general supercritical non-local branching Markov process, or a general supercritical non-local superprocess, on a Luzin space. Here, by ``supercritical" we mean that the mean semigroup of $X$ exhibits a Perron-Frobenius type behaviour with a positive principal eigenvalue. In this paper, we study the almost sure behaviour of a family of martingales natu… ▽ More

    Submitted 16 September, 2025; v1 submitted 18 August, 2025; originally announced August 2025.

    MSC Class: 60J80; 60J68; 60F15; 60J35

  36. arXiv:2508.08491  [pdf, ps, other

    eess.SP

    Tensor-Structured Bayesian Channel Prediction for Upper Mid-Band XL-MIMO Systems

    Authors: Hongwei Hou, Yafei Wang, Xinping Yi, Wenjin Wang, Dirk T. M. Slock, Shi Jin

    Abstract: The upper mid-band balances coverage and capacity for the future cellular systems and also embraces XL-MIMO systems, offering enhanced spectral and energy efficiency. However, these benefits are significantly degraded under mobility due to channel aging, and further exacerbated by the unique near-field (NF) and spatial non-stationarity (SnS) propagation in such systems. To address this challenge,… ▽ More

    Submitted 11 August, 2025; originally announced August 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  37. arXiv:2508.04355  [pdf, ps, other

    cs.IT

    Grid-like Error-Correcting Codes for Matrix Multiplication with Better Correcting Capability

    Authors: Hao Shi, Zhengyi Jiang, Zhongyi Huang, Bo Bai, Gong Zhang, Hanxu Hou

    Abstract: Matrix multiplication over the real field constitutes a foundational operation in the training of deep learning models, serving as a computational cornerstone for both forward and backward propagation processes. However, the presence of silent data corruption (SDC) in large-scale distributed training environments poses a significant threat to model convergence and predictive accuracy, particularly… ▽ More

    Submitted 6 August, 2025; originally announced August 2025.

  38. arXiv:2508.00379  [pdf, ps, other

    cs.IT eess.SP

    Active IRS-Enabled Integrated Sensing and Communications with Extended Targets

    Authors: Yuan Fang, Xianxin Song, Huazhou Hou, Ziguo Zhong, Xianghao Yu, Jie Xu, Yongming Huang

    Abstract: This paper studies the active intelligent reflecting surface (IRS)-enabled integrated sensing and communications (ISAC), in which an active IRS is deployed to assist the base station (BS) in serving multiple communication users (CUs) and simultaneously sensing an \emph{extended} target at the non-line-of-sight (NLoS) area of the BS. The active IRS has the capability of amplifying the reflected sig… ▽ More

    Submitted 1 August, 2025; originally announced August 2025.

  39. arXiv:2507.19993  [pdf, ps, other

    cs.CV

    FROSS: Faster-than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images

    Authors: Hao-Yu Hou, Chun-Yi Lee, Motoharu Sonogashira, Yasutomo Kawanishi

    Abstract: The ability to abstract complex 3D environments into simplified and structured representations is crucial across various domains. 3D semantic scene graphs (SSGs) achieve this by representing objects as nodes and their interrelationships as edges, facilitating high-level scene understanding. Existing methods for 3D SSG generation, however, face significant challenges, including high computational d… ▽ More

    Submitted 10 August, 2025; v1 submitted 26 July, 2025; originally announced July 2025.

    Comments: International Conference on Computer Vision (ICCV 2025)

  40. arXiv:2507.19874  [pdf, ps, other

    cs.CV

    All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior

    Authors: Haowei Chen, Zhiwen Yang, Haotian Hou, Hui Zhang, Bingzheng Wei, Gang Zhou, Yan Xu

    Abstract: All-in-one medical image restoration (MedIR) aims to address multiple MedIR tasks using a unified model, concurrently recovering various high-quality (HQ) medical images (e.g., MRI, CT, and PET) from low-quality (LQ) counterparts. However, all-in-one MedIR presents significant challenges due to the heterogeneity across different tasks. Each task involves distinct degradations, leading to diverse i… ▽ More

    Submitted 26 July, 2025; originally announced July 2025.

    Comments: 11pages, 3figures, MICCAI 2025

  41. arXiv:2507.18058  [pdf, ps, other

    physics.optics

    Multicolor interband solitons in microcombs

    Authors: Qing-Xin Ji, Hanfei Hou, Jinhao Ge, Yan Yu, Maodong Gao, Warren Jin, Joel Guo, Lue Wu, Peng Liu, Avi Feshali, Mario Paniccia, John Bowers, Kerry Vahala

    Abstract: In microcombs, solitons can drive non-soliton-forming modes to induce optical gain. Under specific conditions, a regenerative secondary temporal pulse coinciding in time and space with the exciting soliton pulse will form at a new spectral location. A mechanism involving Kerr-induced pulse interactions has been proposed theoretically, leading to multicolor solitons containing constituent phase-loc… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  42. arXiv:2507.14485  [pdf, ps, other

    cs.CV cs.AI

    Benefit from Reference: Retrieval-Augmented Cross-modal Point Cloud Completion

    Authors: Hongye Hou, Liu Zhan, Yang Yang

    Abstract: Completing the whole 3D structure based on an incomplete point cloud is a challenging task, particularly when the residual point cloud lacks typical structural characteristics. Recent methods based on cross-modal learning attempt to introduce instance images to aid the structure feature learning. However, they still focus on each particular input class, limiting their generation abilities. In this… ▽ More

    Submitted 19 July, 2025; originally announced July 2025.

  43. arXiv:2507.02223  [pdf

    physics.optics quant-ph

    Observation of wave amplification and temporal topological state in a genuine photonic time crystal

    Authors: Jiang Xiong, Xudong Zhang, Longji Duan, Jiarui Wang, Yang Long, Haonan Hou, Letian Yu, Linyang Zou, Baile Zhang

    Abstract: Photonic time crystals (PTCs) are materials whose dielectric permittivity is periodically modulated in time, giving rise to bandgaps not in energy-as in conventional photonic crystals-but in momentum, known as k-gaps. These k-gaps enable wave amplification by extracting energy from temporal modulation, offering a mechanism for coherent light generation that bypasses traditional optical gain. PTCs… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

  44. arXiv:2506.13651  [pdf, ps, other

    cs.LG

    xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

    Authors: Kaiyuan Chen, Yixin Ren, Yang Liu, Xiaobo Hu, Haotong Tian, Tianbao Xie, Fangfu Liu, Haoye Zhang, Hongzhang Liu, Yuan Gong, Chen Sun, Han Hou, Hui Yang, James Pan, Jianan Lou, Jiayi Mao, Jizheng Liu, Jinpeng Li, Kangyi Liu, Kenkun Liu, Rui Wang, Run Li, Tong Niu, Wenlong Zhang, Wenqi Yan , et al. (8 additional authors not shown)

    Abstract: We introduce xbench, a dynamic, profession-aligned evaluation suite designed to bridge the gap between AI agent capabilities and real-world productivity. While existing benchmarks often focus on isolated technical skills, they may not accurately reflect the economic value agents deliver in professional settings. To address this, xbench targets commercially significant domains with evaluation tasks… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Project page: https://xbench.org

  45. arXiv:2506.11899  [pdf, ps, other

    eess.SP

    DMRS-Based Uplink Channel Estimation for MU-MIMO Systems with Location-Specific SCSI Acquisition

    Authors: Jiawei Zhuang, Hongwei Hou, Minjie Tang, Wenjin Wang, Shi Jin, Vincent K. N. Lau

    Abstract: With the growing number of users in multi-user multiple-input multiple-output (MU-MIMO) systems, demodulation reference signals (DMRSs) are efficiently multiplexed in the code domain via orthogonal cover codes (OCC) to ensure orthogonality and minimize pilot interference. In this paper, we investigate uplink DMRS-based channel estimation for MU-MIMO systems with Type II OCC pattern standardized in… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  46. arXiv:2506.07600  [pdf, ps, other

    cs.CV cs.AI

    SceneRAG: Scene-level Retrieval-Augmented Generation for Video Understanding

    Authors: Nianbo Zeng, Haowen Hou, Fei Richard Yu, Si Shi, Ying Tiffany He

    Abstract: Despite recent advances in retrieval-augmented generation (RAG) for video understanding, effectively understanding long-form video content remains underexplored due to the vast scale and high complexity of video data. Current RAG approaches typically segment videos into fixed-length chunks, which often disrupts the continuity of contextual information and fails to capture authentic scene boundarie… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  47. arXiv:2505.17571  [pdf, ps, other

    cs.CL

    Reasoning Meets Personalization: Unleashing the Potential of Large Reasoning Model for Personalized Generation

    Authors: Sichun Luo, Guanzhi Deng, Jian Xu, Xiaojie Zhang, Hanxu Hou, Linqi Song

    Abstract: Personalization is a critical task in modern intelligent systems, with applications spanning diverse domains, including interactions with large language models (LLMs). Recent advances in reasoning capabilities have significantly enhanced LLMs, enabling unprecedented performance in tasks such as mathematics and coding. However, their potential for personalization tasks remains underexplored. In t… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  48. arXiv:2505.16314  [pdf, ps, other

    cs.CV cs.AI

    NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

    Authors: Shuhao Han, Haotian Fan, Fangyuan Kong, Wenjie Liao, Chunle Guo, Chongyi Li, Radu Timofte, Liang Li, Tao Li, Junhui Cui, Yunqiu Wang, Yang Tai, Jingwei Sun, Jianhui Sun, Xinli Yue, Tianyi Wang, Huan Hou, Junda Lu, Xinyang Huang, Zitang Zhou, Zijian Zhang, Xuhui Zheng, Xuecheng Wu, Chong Peng, Xuezhi Cao , et al. (90 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2025 challenge on Text to Image (T2I) generation model quality assessment, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2025. The aim of this challenge is to address the fine-grained quality assessment of text-to-image generation models. This challenge evaluates text-to-image models from two aspe… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  49. arXiv:2505.12691  [pdf, ps, other

    math.PR

    Law of iterated logarithm for supercritical non-symmetric branching Markov process

    Authors: Haojie Hou, Yan-Xia Ren, Renming Song

    Abstract: Let $\{(X_t)_{t\geq 0}, \mathbb{P}_{δ_x}, x\in E\}$ be a supercritical branching Markov process (which is not necessary symmetric) on a locally compact metric measure space $(E,μ)$ with spatially dependent local branching mechanism. Under some assumptions on the semigroup of the spatial motion, we first prove law of iterated logarithm type results for $\langle f, X_t\rangle$ under the second momen… ▽ More

    Submitted 11 December, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: 49 pages

  50. arXiv:2505.09387  [pdf, ps, other

    math.AP math.CA

    On well-posedness for non-autonomous parabolic Cauchy problems with rough initial data

    Authors: Hedong Hou

    Abstract: We establish a complete picture for existence, uniqueness, and representation of weak solutions to non-autonomous parabolic Cauchy problems of divergence type. The coefficients are only assumed to be uniformly elliptic, bounded, measurable, and complex-valued, without any additional regularity or symmetry conditions. The initial data are tempered distributions taken in homogeneous Hardy--Sobolev s… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 51 pages, 2 figures. Comments are welcome

    MSC Class: Primary 35K15; Secondary 42B37; 35B45; 42B30; 46E35