Skip to main content

Showing 1–50 of 265 results for author: Shin, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.25943  [pdf, ps, other

    cs.IT

    Enormous Fluid Antenna Systems (E-FAS) under Correlated Surface-Wave Leakage: Physical Layer Security

    Authors: Farshad Rostami Ghadi, Kai-Kit Wong, Masoud Kaveh, Mohammad Javad Ahmadi, Kin-Fai Tong, Hyundong Shin

    Abstract: Enormous fluid antenna systems (E-FAS) have recently emerged as a surface-wave (SW)-enabled architecture that can induce controllable large-scale channel gains through guided electromagnetic routing. This paper develops a secrecy analysis framework for E-FAS-assisted downlink transmission with practical pilot-based channel estimation. We consider a multiple-input single-output (MISO) wiretap setti… ▽ More

    Submitted 26 March, 2026; originally announced March 2026.

  2. arXiv:2603.23489  [pdf, ps, other

    cs.CV

    AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation

    Authors: Woojeong Jin, Jaeho Lee, Heeseong Shin, Seungho Jang, Junhwan Heo, Seungryong Kim

    Abstract: Referring Video Object Segmentation (RVOS) aims to segment a target object throughout a video given a natural language query. Training-free methods for this task follow a common pipeline: a MLLM selects keyframes, grounds the referred object within those frames, and a video segmentation model propagates the results. While intuitive, this design asks the MLLM to make temporal decisions before any o… ▽ More

    Submitted 24 March, 2026; originally announced March 2026.

  3. arXiv:2603.17356  [pdf, ps, other

    cs.CL

    PACE-RAG: Patient-Aware Contextual and Evidence-based Policy RAG for Clinical Drug Recommendation

    Authors: Chaeyoung Huh, Hyunmin Hwang, Jung Hwan Shin, Jinse Park, Jong Chul Ye

    Abstract: Drug recommendation requires a deep understanding of individual patient context, especially for complex conditions like Parkinson's disease. While LLMs possess broad medical knowledge, they fail to capture the subtle nuances of actual prescribing patterns. Existing RAG methods also struggle with these complexities because guideline-based retrieval remains too generic and similar-patient retrieval… ▽ More

    Submitted 18 March, 2026; originally announced March 2026.

    Comments: 26 pages, 15 figures

  4. arXiv:2603.09175  [pdf, ps, other

    cs.RO

    STONE Dataset: A Scalable Multi-Modal Surround-View 3D Traversability Dataset for Off-Road Robot Navigation

    Authors: Konyul Park, Daehun Kim, Jiyong Oh, Seunghoon Yu, Junseo Park, Jaehyun Park, Hongjae Shin, Hyungchan Cho, Jungho Kim, Jun Won Choi

    Abstract: Reliable off-road navigation requires accurate estimation of traversable regions and robust perception under diverse terrain and sensing conditions. However, existing datasets lack both scalability and multi-modality, which limits progress in 3D traversability prediction. In this work, we introduce STONE, a large-scale multi-modal dataset for off-road navigation. STONE provides (1) trajectory-guid… ▽ More

    Submitted 12 March, 2026; v1 submitted 10 March, 2026; originally announced March 2026.

    Comments: ICRA 2026

  5. arXiv:2603.02579  [pdf, ps, other

    cs.LG eess.SY

    Joint Optimization of Model Partitioning and Resource Allocation for Anti-Jamming Collaborative Inference Systems

    Authors: Mengru Wu, Jiawei Li, Jiaqi Wei, Bin Lyu, Kai-Kit Wong, Hyundong Shin

    Abstract: With the increasing computational demands of deep neural network (DNN) inference on resource-constrained devices, DNN partitioning-based device-edge collaborative inference has emerged as a promising paradigm. However, the transmission of intermediate feature data is vulnerable to malicious jamming, which significantly degrades the overall inference performance. To counter this threat, this letter… ▽ More

    Submitted 2 March, 2026; originally announced March 2026.

  6. arXiv:2603.02109  [pdf, ps, other

    eess.SP cs.LG

    Orchestrating Multimodal DNN Workloads in Wireless Neural Processing

    Authors: Sai Xu, Kai-Kit Wong, Yanan Du, Hyundong Shin

    Abstract: In edge inference, wireless resource allocation and accelerator-level deep neural network (DNN) scheduling have yet to be co-optimized in an end-to-end manner. The lack of coordination between wireless transmission and accelerator-level DNN execution prevents efficient overlap, leading to higher end-to-end inference latency. To address this issue, this paper investigates multimodal DNN workload or… ▽ More

    Submitted 2 March, 2026; originally announced March 2026.

  7. arXiv:2602.18887  [pdf, ps, other

    cs.CV

    SafeDrive: Fine-Grained Safety Reasoning for End-to-End Driving in a Sparse World

    Authors: Jungho Kim, Jiyong Oh, Seunghoon Yu, Hongjae Shin, Donghyuk Kwak, Jun Won Choi

    Abstract: The end-to-end (E2E) paradigm, which maps sensor inputs directly to driving decisions, has recently attracted significant attention due to its unified modeling capability and scalability. However, ensuring safety in this unified framework remains one of the most critical challenges. In this work, we propose SafeDrive, an E2E planning framework designed to perform explicit and interpretable safety… ▽ More

    Submitted 31 March, 2026; v1 submitted 21 February, 2026; originally announced February 2026.

    Comments: Accepted to CVPR 2026, 19 pages, 9 figures

  8. arXiv:2602.12706  [pdf, ps, other

    cs.LG

    Physics-Informed Laplace Neural Operator for Solving Partial Differential Equations

    Authors: Heechang Kim, Qianying Cao, Hyomin Shin, Seungchul Lee, George Em Karniadakis, Minseok Choi

    Abstract: Neural operators have emerged as fast surrogate solvers for parametric partial differential equations (PDEs). However, purely data-driven models often require extensive training data and can generalize poorly, especially in small-data regimes and under unseen (out-of-distribution) input functions that are not represented in the training data. To address these limitations, we propose the Physics-In… ▽ More

    Submitted 13 February, 2026; originally announced February 2026.

    Comments: 38 pages,19 figures

  9. arXiv:2602.10414  [pdf, ps, other

    cs.CL

    EVOKE: Emotion Vocabulary Of Korean and English

    Authors: Yoonwon Jung, Hagyeong Shin, Benjamin K. Bergen

    Abstract: This paper introduces EVOKE (Emotion Vocabulary of Korean and English), a Korean-English parallel dataset of emotion words. The dataset offers comprehensive coverage of emotion words in each language, in addition to many-to-many translations between words in the two languages and identification of language-specific emotion words. The dataset contains 1,426 Korean words and 1,397 English words, and… ▽ More

    Submitted 9 April, 2026; v1 submitted 10 February, 2026; originally announced February 2026.

    Comments: Workshop on Computational Affective Science, LREC 2026

  10. arXiv:2602.07628  [pdf, ps, other

    cs.AI cs.LG

    SleepMaMi: A Universal Sleep Foundation Model for Integrating Macro- and Micro-structures

    Authors: Keondo Park, Younghoon Na, Yourim Choi, Hyunwoo Ryu, Hyun-Woo Shin, Hyung-Sin Kim

    Abstract: While the shift toward unified foundation models has revolutionized many deep learning domains, sleep medicine remains largely restricted to task-specific models that focus on localized micro-structure features. These approaches often neglect the rich, multi-modal context of Polysomnography (PSG) and fail to capture the global macro-structure of a full night's sleep. To address this, we introduce… ▽ More

    Submitted 7 February, 2026; originally announced February 2026.

    Comments: 8 pages, Appendix 9 pages

  11. arXiv:2602.06247  [pdf, ps, other

    cs.IT

    AI-Limited Fluid Antenna-Aided Integrated Sensing and Communication Systems

    Authors: Farshad Rostami Ghadi, Kai-Kit Wong, F. Javier Lopez-Martinez, Zhentian Zhang, Hyundong Shin, Christos Masouros

    Abstract: This paper characterizes the fundamental limits of integrated sensing and communication (ISAC) when the transmitter is subject to an artificial intelligence (AI) representation bottleneck and the receiver employs a fluid antenna system (FAS). Specifically, the message is first encoded into an ideal Gaussian waveform and mapped by an AI encoder into a finite-capacity latent representation that cons… ▽ More

    Submitted 5 February, 2026; originally announced February 2026.

  12. Learning-based Adaptive Control of Quadruped Robots for Active Stabilization on Moving Platforms

    Authors: Minsung Yoon, Heechan Shin, Jeil Jeong, Sung-Eui Yoon

    Abstract: A quadruped robot faces balancing challenges on a six-degrees-of-freedom moving platform, like subways, buses, airplanes, and yachts, due to independent platform motions and resultant diverse inertia forces on the robot. To alleviate these challenges, we present the Learning-based Active Stabilization on Moving Platforms (\textit{LAS-MP}), featuring a self-balancing policy and system state estimat… ▽ More

    Submitted 8 February, 2026; v1 submitted 3 February, 2026; originally announced February 2026.

    Comments: Accepted at IROS 2024. Project Page: https://sgvr.kaist.ac.kr/~msyoon/papers/IROS24/

    Journal ref: In Proceedings of the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 701-708, 2024

  13. arXiv:2601.22620  [pdf, ps, other

    cs.CL

    Layer-wise Swapping for Generalizable Multilingual Safety

    Authors: Hyunseo Shin, Wonseok Hwang

    Abstract: Despite the rapid advancements of Large Language Models (LLMs), safety risks remain a critical challenge for low-resource languages. Existing safety datasets are predominantly English centric, limiting progress in multilingual safety alignment. As a result, low resource expert models, finetuned on their respective instruction datasets, tend to exhibit higher unsafety rates compared to their high r… ▽ More

    Submitted 13 February, 2026; v1 submitted 30 January, 2026; originally announced January 2026.

    Comments: EACL 2026 main

  14. arXiv:2601.21506  [pdf, ps, other

    cs.RO eess.SY

    IROS: A Dual-Process Architecture for Real-Time VLM-Based Indoor Navigation

    Authors: Joonhee Lee, Hyunseung Shin, Jeonggil Ko

    Abstract: Indoor mobile robot navigation requires fast responsiveness and robust semantic understanding, yet existing methods struggle to provide both. Classical geometric approaches such as SLAM offer reliable localization but depend on detailed maps and cannot interpret human-targeted cues (e.g., signs, room numbers) essential for indoor reasoning. Vision-Language-Action (VLA) models introduce semantic gr… ▽ More

    Submitted 29 January, 2026; originally announced January 2026.

  15. arXiv:2601.18471  [pdf, ps, other

    cs.IT

    Finite-Aperture Fluid Antenna Array Design: Analysis and Algorithm

    Authors: Zhentian Zhang, Kai-Kit Wong, Hao Jiang, Farshad Rostami Ghadi, Hyundong Shin, Yangyang Zhang

    Abstract: Finite-aperture constraints render array design nontrivial and can undermine the effectiveness of classical sparse geometries. This letter provides universal guidance for fluid antenna array (FAA) design under a fixed aperture. We derive a closed-form Cramér--Rao bound (CRB) that unifies conventional and reconfigurable arrays by explicitly linking the Fisher information to the geometric variance o… ▽ More

    Submitted 26 January, 2026; originally announced January 2026.

  16. arXiv:2601.17039  [pdf, ps, other

    cs.CV cs.AI

    MANGO: A Global Single-Date Paired Dataset for Mangrove Segmentation

    Authors: Junhyuk Heo, Beomkyu Choi, Hyunjin Shin, Darongsae Kwon

    Abstract: Mangroves are critical for climate-change mitigation, requiring reliable monitoring for effective conservation. While deep learning has emerged as a powerful tool for mangrove detection, its progress is hindered by the limitations of existing datasets. In particular, many resources provide only annual map products without curated single-date image-mask pairs, limited to specific regions rather tha… ▽ More

    Submitted 20 January, 2026; originally announced January 2026.

  17. Improving the Accuracy of Community Detection on Signed Networks via Community Refinement and Contrastive Learning

    Authors: Hyunuk Shin, Hojin Kim, Chanyoung Lee, Yeon-Chang Lee, David Yoon Suk Kang

    Abstract: Community detection (CD) on signed networks is crucial for understanding how positive and negative relations jointly shape network structure. However, existing CD methods often yield inconsistent communities due to noisy or conflicting edge signs. In this paper, we propose ReCon, a model-agnostic post-processing framework that progressively refines community structures through four iterative steps… ▽ More

    Submitted 22 January, 2026; originally announced January 2026.

    Journal ref: ACM WWW 2026

  18. arXiv:2601.15532  [pdf, ps, other

    cs.NI

    Resource Allocation and Sharing for UAV-Assisted Integrated TN-NTN with Multi-Connectivity

    Authors: Abd Ullah Khan, Wali Ullah Khan, Haejoon Jung, Hyundong Shin

    Abstract: Unmanned aerial vehicles (UAVs) with multi-connectivity (MC) capabilities efficiently and reliably transfer data between terrestrial networks (TNs) and non-terrestrial networks (NTNs). However, optimally sharing and allocating spectrum and power resources to maintain MC while ensuring reliable connectivity and optimal performance remains challenging in such networks. Channel variations induced by… ▽ More

    Submitted 25 January, 2026; v1 submitted 21 January, 2026; originally announced January 2026.

  19. arXiv:2601.09942  [pdf, ps, other

    cs.SI cs.CY

    How Diplomacy Reshapes Online Discourse:Asymmetric Persistence in Online Framing of North Korea

    Authors: Hunjun Shin, Hoonbae Moon, Mohit Singhal

    Abstract: Public opinion toward foreign adversaries shapes and constrains diplomatic options. Prior research has largely relied on sentiment analysis and survey based measures, providing limited insight into how sustained narrative changes (beyond transient emotional reactions) might follow diplomatic engagement. This study examines the extent to which high stakes diplomatic summits shape how adversaries ar… ▽ More

    Submitted 14 January, 2026; originally announced January 2026.

  20. arXiv:2601.06835  [pdf, ps, other

    cs.CV cs.AI

    OSCAR: Optical-aware Semantic Control for Aleatoric Refinement in Sar-to-Optical Translation

    Authors: Hyunseo Lee, Sang Min Kim, Ho Kyung Shin, Taeheon Kim, Woo-Jeoung Nam

    Abstract: Synthetic Aperture Radar (SAR) provides robust all-weather imaging capabilities; however, translating SAR observations into photo-realistic optical images remains a fundamentally ill-posed problem. Current approaches are often hindered by the inherent speckle noise and geometric distortions of SAR data, which frequently result in semantic misinterpretation, ambiguous texture synthesis, and structu… ▽ More

    Submitted 11 January, 2026; originally announced January 2026.

    Comments: main 15 pages, supplementary 5 pages

  21. arXiv:2601.01807  [pdf, ps, other

    cs.CV cs.AI

    Adaptive Hybrid Optimizer based Framework for Lumpy Skin Disease Identification

    Authors: Ubaidullah, Muhammad Abid Hussain, Mohsin Raza Jafri, Rozi Khan, Moid Sandhu, Abd Ullah Khan, Hyundong Shin

    Abstract: Lumpy Skin Disease (LSD) is a contagious viral infection that significantly deteriorates livestock health, thereby posing a serious threat to the global economy and food security. Owing to its rapid spread characteristics, early and precise identification is crucial to prevent outbreaks and ensure timely intervention. In this paper, we propose a hybrid deep learning-based approach called LUMPNet f… ▽ More

    Submitted 5 January, 2026; originally announced January 2026.

  22. arXiv:2601.01207  [pdf, ps, other

    cs.LG stat.ML

    Sparse Bayesian Message Passing under Structural Uncertainty

    Authors: Yoonhyuk Choi, Jiho Choi, Chanran Kim, Yumin Lee, Hawon Shin, Yeowon Jeon, Minjeong Kim, Jiwoo Kang

    Abstract: Semi-supervised learning on real-world graphs is frequently challenged by heterophily, where the observed graph is unreliable or label-disassortative. Many existing graph neural networks either rely on a fixed adjacency structure or attempt to handle structural noise through regularization. In this work, we explicitly capture structural uncertainty by modeling a posterior distribution over signed… ▽ More

    Submitted 3 January, 2026; originally announced January 2026.

  23. arXiv:2512.23400  [pdf, ps, other

    cs.SI cs.LG

    Quantum Intelligence Meets BD-RIS-Enabled AmBC: Challenges, Opportunities, and Practical Insights

    Authors: Abd Ullah Khan, Uman Khalid, Trung Q. Duong, Hyundong Shin

    Abstract: A beyond-diagonal reconfigurable intelligent surface (BD-RIS) is an innovative type of reconfigurable intelligent surface (RIS) that has recently been proposed and is considered a revolutionary advancement in wave manipulation. Unlike the mutually disconnected arrangement of elements in traditional RISs, BD-RIS creates cost-effective and simple inter-element connections, allowing for greater freed… ▽ More

    Submitted 31 December, 2025; v1 submitted 29 December, 2025; originally announced December 2025.

  24. arXiv:2512.21717  [pdf, ps, other

    cs.NI cs.AI cs.LG cs.SI

    Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities

    Authors: Abd Ullah Khan, Adnan Shahid, Haejoon Jung, Hyundong Shin

    Abstract: Space-air-ground-integrated network (SAGIN)-enabled multiconnectivity (MC) is emerging as a key enabler for next-generation networks, enabling users to simultaneously utilize multiple links across multi-layer non-terrestrial networks (NTN) and multi-radio access technology (multi-RAT) terrestrial networks (TN). However, the heterogeneity of TN and NTN introduces complex architectural challenges th… ▽ More

    Submitted 25 January, 2026; v1 submitted 25 December, 2025; originally announced December 2025.

  25. arXiv:2512.21043  [pdf, ps, other

    cs.RO

    Tracing Energy Flow: Learning Tactile-based Grasping Force Control to Prevent Slippage in Dynamic Object Interaction

    Authors: Cheng-Yu Kuo, Hirofumi Shin, Takamitsu Matsubara

    Abstract: Regulating grasping force to reduce slippage during dynamic object interaction remains a fundamental challenge in robotic manipulation, especially when objects are manipulated by multiple rolling contacts, have unknown properties (such as mass or surface conditions), and when external sensing is unreliable. In contrast, humans can quickly regulate grasping force by touch, even without visual cues.… ▽ More

    Submitted 24 December, 2025; originally announced December 2025.

    Comments: 8 pages. Accepted by IEEE Robotics and Automation Letters (RA-L)

  26. arXiv:2511.20696  [pdf, ps, other

    cs.LG cs.AI

    Prototype-Guided Non-Exemplar Continual Learning for Cross-subject EEG Decoding

    Authors: Dan Li, Hye-Bin Shin, Yeon-Woo Choi

    Abstract: Due to the significant variability in electroencephalo-gram (EEG) signals across individuals, knowledge acquired from previous subjects is often overwritten as new subjects are introduced in continual EEG decoding tasks. Existing methods mainly rely on storing historical data from seen subjects as replay buffers to mitigate forgetting, which is impractical under privacy or memory constraints. To a… ▽ More

    Submitted 15 January, 2026; v1 submitted 24 November, 2025; originally announced November 2025.

    Comments: 4 pages, 2 figures, 14th IEEE International Winter Conference on Brain-Computer Interface Conference 2026

  27. arXiv:2511.18878  [pdf, ps, other

    cs.RO cs.AI

    Accelerating Reinforcement Learning via Error-Related Human Brain Signals

    Authors: Suzie Kim, Hye-Bin Shin, Hyo-Jeong Jang

    Abstract: In this work, we investigate how implicit neural feed back can accelerate reinforcement learning in complex robotic manipulation settings. While prior electroencephalogram (EEG) guided reinforcement learning studies have primarily focused on navigation or low-dimensional locomotion tasks, we aim to understand whether such neural evaluative signals can improve policy learning in high-dimensional ma… ▽ More

    Submitted 24 November, 2025; originally announced November 2025.

  28. arXiv:2511.17897  [pdf, ps, other

    cs.IT

    Multi-Port Selection for FAMA: Massive Connectivity with Fewer RF Chains than Users

    Authors: Hanjiang Hong, Kai-Kit Wong, Xusheng Zhu, Hao Xu, Han Xiao, Farshad Rostami Ghadi, Hyundong Shin

    Abstract: Fluid antenna multiple access (FAMA) is an emerging technology in massive access designed to meet the demands of future wireless communication networks by naturally mitigating multiuser interference through the utilization of the fluid antenna system (FAS) at RF-chain-limited mobile device. The transition from single-active-port to multi-active-port on a shared RF chain for slow FAMA can greatly e… ▽ More

    Submitted 21 November, 2025; originally announced November 2025.

  29. arXiv:2511.17416  [pdf, ps, other

    cs.IT

    Fluid Antenna System-Enabled UAV-to-Ground Communications

    Authors: Xusheng Zhu, Kai-Kit Wong, Qingqing Wu, Hyundong Shin, Yangyang Zhang

    Abstract: Fluid antenna systems (FAS) have emerged as a revolutionary technology offering enhanced spatial diversity within a compact form factor. Concurrently, unmanned aerial vehicles (UAVs) are integral to future networks, necessitating channel models that capture both multipath fading and shadowing. This letter presents a novel performance analysis of a UAV-to-ground link, where the receiver is equipped… ▽ More

    Submitted 21 November, 2025; originally announced November 2025.

  30. arXiv:2511.15186  [pdf, ps, other

    cs.CV

    Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset

    Authors: Geon Choi, Hangyul Yoon, Hyunju Shin, Hyunki Park, Sang Hoon Seo, Eunho Yang, Edward Choi

    Abstract: The applicability of current lesion segmentation models for chest X-rays (CXRs) has been limited both by a small number of target labels and the reliance on complex, expert-level text inputs, creating a barrier to practical use. To address these limitations, we introduce instruction-guided lesion segmentation (ILS), a medical-domain adaptation of referring image segmentation (RIS) designed to segm… ▽ More

    Submitted 26 March, 2026; v1 submitted 19 November, 2025; originally announced November 2025.

    Comments: Camera-ready version for CVPR 2026

  31. arXiv:2511.15138  [pdf, ps, other

    cs.LG cs.HC

    Cross-Modal Consistency-Guided Active Learning for Affective BCI Systems

    Authors: Hyo-Jeong Jang, Hye-Bin Shin, Kang Yin

    Abstract: Deep learning models perform best with abundant, high-quality labels, yet such conditions are rarely achievable in EEG-based emotion recognition. Electroencephalogram (EEG) signals are easily corrupted by artifacts and individual variability, while emotional labels often stem from subjective and inconsistent reports-making robust affective decoding particularly difficult. We propose an uncertainty… ▽ More

    Submitted 19 November, 2025; originally announced November 2025.

  32. arXiv:2511.12851  [pdf, ps, other

    cs.CL cs.AI

    NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation

    Authors: Kang Yin, Hye-Bin Shin

    Abstract: Clinical electroencephalogram (EEG) reports encode domain-specific linguistic conventions that general-purpose language models (LMs) fail to capture. We introduce NeuroLex, a lightweight domain-adaptive language model trained purely on EEG report text from the Harvard Electroencephalography Database. Unlike existing biomedical LMs, NeuroLex is tailored to the linguistic and diagnostic characterist… ▽ More

    Submitted 16 November, 2025; originally announced November 2025.

  33. arXiv:2511.07891  [pdf, ps, other

    eess.SP cs.AI

    Toward Adaptive BCIs: Enhancing Decoding Stability via User State-Aware EEG Filtering

    Authors: Yeon-Woo Choi, Hye-Bin Shin, Dan Li

    Abstract: Brain-computer interfaces (BCIs) often suffer from limited robustness and poor long-term adaptability. Model performance rapidly degrades when user attention fluctuates, brain states shift over time, or irregular artifacts appear during interaction. To mitigate these issues, we introduce a user state-aware electroencephalogram (EEG) filtering framework that refines neural representations before de… ▽ More

    Submitted 11 November, 2025; originally announced November 2025.

    Comments: 4 pages, 3 figures, conference

  34. arXiv:2510.26193  [pdf, ps, other

    cs.CL

    RCScore: Quantifying Response Consistency in Large Language Models

    Authors: Dongjun Jang, Youngchae Ahn, Hyopil Shin

    Abstract: Current LLM evaluations often rely on a single instruction template, overlooking models' sensitivity to instruction style-a critical aspect for real-world deployments. We present RCScore, a multi-dimensional framework quantifying how instruction formulation affects model responses. By systematically transforming benchmark problems into multiple instruction styles, RCScore reveals performance varia… ▽ More

    Submitted 30 October, 2025; originally announced October 2025.

    Journal ref: EMNLP 2025 Main Conference

  35. arXiv:2510.26014  [pdf, ps, other

    cs.LG cs.AI

    Dual Mixture-of-Experts Framework for Discrete-Time Survival Analysis

    Authors: Hyeonjun Lee, Hyungseob Shin, Gunhee Nam, Hyeonsoo Lee

    Abstract: Survival analysis is a task to model the time until an event of interest occurs, widely used in clinical and biomedical research. A key challenge is to model patient heterogeneity while also adapting risk predictions to both individual characteristics and temporal dynamics. We propose a dual mixture-of-experts (MoE) framework for discrete-time survival analysis. Our approach combines a feature-enc… ▽ More

    Submitted 29 October, 2025; originally announced October 2025.

    Comments: Accepted to NeurIPS 2025 workshop Learning from Time Series for Health (TS4H)

  36. arXiv:2510.24774  [pdf, ps, other

    cs.CY cs.CL

    PANORAMA: A Dataset and Benchmarks Capturing Decision Trails and Rationales in Patent Examination

    Authors: Hyunseung Lim, Sooyohn Nam, Sungmin Na, Ji Yong Cho, June Yong Yang, Hyungyu Shin, Yoonjoo Lee, Juho Kim, Moontae Lee, Hwajung Hong

    Abstract: Patent examination remains an ongoing challenge in the NLP literature even after the advent of large language models (LLMs), as it requires an extensive yet nuanced human judgment on whether a submitted claim meets the statutory standards of novelty and non-obviousness against previously granted claims -- prior art -- in expert domains. Previous NLP studies have approached this challenge as a pred… ▽ More

    Submitted 24 October, 2025; originally announced October 2025.

  37. arXiv:2510.16432  [pdf, ps, other

    cs.IT

    Cluster-wise processing in fronthaul-aware cell-free massive MIMO systems

    Authors: Zahra Mobini, Ahmet Hasim Gokceoglu, Li Wang, Gunnar Peters, Hyundong Shin, Hien Quoc Ngo

    Abstract: We exploit a general cluster-based network architecture for a fronthaul-limited user-centric cell-free massive multiple-input multiple-output (CF-mMIMO) system under different degrees of cooperation among the access points (APs) to achieve scalable implementation. In particular, we consider a CF-mMIMO system wherein the available APs are grouped into multiple processing clusters (PCs) to share cha… ▽ More

    Submitted 18 October, 2025; originally announced October 2025.

  38. arXiv:2510.15510  [pdf, ps, other

    cs.CV cs.RO

    Exploring Conditions for Diffusion models in Robotic Control

    Authors: Heeseong Shin, Byeongho Heo, Dongyoon Han, Seungryong Kim, Taekyung Kim

    Abstract: While pre-trained visual representations have significantly advanced imitation learning, they are often task-agnostic as they remain frozen during policy learning. In this work, we explore leveraging pre-trained text-to-image diffusion models to obtain task-adaptive visual representations for robotic control, without fine-tuning the model itself. However, we find that naively applying textual cond… ▽ More

    Submitted 8 April, 2026; v1 submitted 17 October, 2025; originally announced October 2025.

    Comments: Accepted to CVPR 2026. Project page: https://orca-rc.github.io/

  39. arXiv:2510.11036  [pdf, ps, other

    cs.RO cs.AI

    XGrasp: Gripper-Aware Grasp Detection with Multi-Gripper Data Generation

    Authors: Yeonseo Lee, Jungwook Mun, Hyosup Shin, Guebin Hwang, Junhee Nam, Taeyeop Lee, Sungho Jo

    Abstract: Real-world robotic systems frequently require diverse end-effectors for different tasks, however most existing grasp detection methods are optimized for a single gripper type, demanding retraining or optimization for each novel gripper configuration. This gripper-specific retraining paradigm is neither scalable nor practical. We propose XGrasp, a real-time gripper-aware grasp detection framework t… ▽ More

    Submitted 12 March, 2026; v1 submitted 13 October, 2025; originally announced October 2025.

    Comments: 9 pages, 10 figures

  40. arXiv:2509.23284  [pdf, ps, other

    cs.IT

    RIS-Assisted XL-MIMO for Near-Field and Far-Field Communications

    Authors: Xiaomin Cao, Mohammadali Mohammadi, Hien Quoc Ngo, Hyundong Shin, Michail Matthaiou

    Abstract: We consider a reconfigurable intelligent surface (RIS)-assisted extremely large-scale multiple-input multiple-output (XL-MIMO) downlink system, where an XL-MIMO array serves two groups of single-antennas users, namely near-field users (NFUEs) and far-field users (FFUEs). FFUEs are subject to blockage, and their communication is facilitated through the RIS. We consider three precoding schemes at th… ▽ More

    Submitted 27 September, 2025; originally announced September 2025.

    Comments: The manuscript has been accepted for publication in IEEE TWC

  41. arXiv:2509.21319  [pdf, ps, other

    cs.CL cs.AI cs.LG

    RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

    Authors: Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Ellie Evans, Daniel Egert, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev

    Abstract: Reinforcement Learning with Human Feedback (RLHF) and Reinforcement Learning with Verifiable Rewards (RLVR) are the main RL paradigms used in LLM post-training, each offering distinct advantages. However, RLHF struggles with interpretability and reward hacking because it relies on human judgments that usually lack explicit criteria, whereas RLVR is limited in scope by its focus on correctness-base… ▽ More

    Submitted 30 October, 2025; v1 submitted 25 September, 2025; originally announced September 2025.

    Comments: Added link to access models: https://huggingface.co/collections/nvidia/reward-models-10-2025

  42. arXiv:2509.18096  [pdf, ps, other

    cs.CV

    Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers

    Authors: Chaehyun Kim, Heeseong Shin, Eunbeen Hong, Heeji Yoon, Anurag Arnab, Paul Hongsuck Seo, Sunghwan Hong, Seungryong Kim

    Abstract: Text-to-image diffusion models excel at translating language prompts into photorealistic images by implicitly grounding textual concepts through their cross-modal attention mechanisms. Recent multi-modal diffusion transformers extend this by introducing joint self-attention over concatenated image and text tokens, enabling richer and more scalable cross-modal alignment. However, a detailed underst… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: NeurIPS 2025. Project page: https://cvlab-kaist.github.io/Seg4Diff/

  43. arXiv:2509.17462  [pdf, ps, other

    cs.CV

    MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception

    Authors: Changwon Kang, Jisong Kim, Hongjae Shin, Junseo Park, Jun Won Choi

    Abstract: The goal of multi-task learning is to learn to conduct multiple tasks simultaneously based on a shared data representation. While this approach can improve learning efficiency, it may also cause performance degradation due to task conflicts that arise when optimizing the model for different objectives. To address this challenge, we introduce MAESTRO, a structured framework designed to generate tas… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Accepted to ICCV 2025

  44. arXiv:2509.14632  [pdf, ps, other

    eess.AS cs.AI eess.SP

    Mitigating Intra-Speaker Variability in Diarization with Style-Controllable Speech Augmentation

    Authors: Miseul Kim, Soo Jin Park, Kyungguen Byun, Hyeon-Kyeong Shin, Sunkuk Moon, Shuhua Zhang, Erik Visser

    Abstract: Speaker diarization systems often struggle with high intrinsic intra-speaker variability, such as shifts in emotion, health, or content. This can cause segments from the same speaker to be misclassified as different individuals, for example, when one raises their voice or speaks faster during conversation. To address this, we propose a style-controllable speech generation model that augments speec… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: Submitted to ICASSP 2026

  45. arXiv:2509.13934  [pdf, ps, other

    eess.SY cs.LG

    Large Language Model-Empowered Decision Transformer for UAV-Enabled Data Collection

    Authors: Zhixion Chen, Jiangzhou Wang, Hyundong Shin, Arumugam Nallanathan

    Abstract: The deployment of unmanned aerial vehicles (UAVs) for reliable and energy-efficient data collection from spatially distributed devices holds great promise in supporting diverse Internet of Things (IoT) applications. Nevertheless, the limited endurance and communication range of UAVs necessitate intelligent trajectory planning. While reinforcement learning (RL) has been extensively explored for UAV… ▽ More

    Submitted 19 September, 2025; v1 submitted 17 September, 2025; originally announced September 2025.

    Comments: 14pages, 8 figures

  46. arXiv:2509.08815  [pdf, ps, other

    cs.IT

    Fluid Antenna Systems: A Geometric Approach to Error Probability and Fundamental Limits

    Authors: Xusheng Zhu, Kai-Kit Wong, Hao Xu, Han Xiao, Hanjiang Hong, Hyundong Shin, Yangyang Zhang

    Abstract: The fluid antenna system (FAS) concept is an emerging paradigm that promotes the utilization of the feature of shape and position reconfigurability in antennas to broaden the design of wireless communication systems. This also means that spatial diversity can be exploited in an unconventional way. However, a rigorous framework for error probability analysis of FAS under realistic spatially correla… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

  47. arXiv:2509.07979  [pdf, ps, other

    cs.CV

    Visual Representation Alignment for Multimodal Large Language Models

    Authors: Heeji Yoon, Jaewoo Jung, Junwan Kim, Hyungyu Choi, Heeseong Shin, Sangbeom Lim, Honggyu An, Chaehyun Kim, Jisang Han, Donghyun Kim, Chanho Eom, Sunghwan Hong, Seungryong Kim

    Abstract: Multimodal large language models (MLLMs) trained with visual instruction tuning have achieved strong performance across diverse tasks, yet they remain limited in vision-centric tasks such as object counting or spatial reasoning. We attribute this gap to the prevailing text-only supervision paradigm, which provides only indirect guidance for the visual pathway and often leads MLLMs to discard fine-… ▽ More

    Submitted 10 October, 2025; v1 submitted 9 September, 2025; originally announced September 2025.

    Comments: Project Page: https://cvlab-kaist.github.io/VIRAL/

  48. arXiv:2508.15693  [pdf, ps, other

    cs.AI

    NiceWebRL: a Python library for human subject experiments with reinforcement learning environments

    Authors: Wilka Carvalho, Vikram Goddla, Ishaan Sinha, Hoon Shin, Kunal Jha

    Abstract: We present NiceWebRL, a research tool that enables researchers to use machine reinforcement learning (RL) environments for online human subject experiments. NiceWebRL is a Python library that allows any Jax-based environment to be transformed into an online interface, supporting both single-agent and multi-agent environments. As such, NiceWebRL enables AI researchers to compare their algorithms to… ▽ More

    Submitted 21 August, 2025; originally announced August 2025.

  49. arXiv:2508.14561  [pdf

    cs.CV cs.RO

    Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization

    Authors: Sukhyun Jeong, Hong-Gi Shin, Yong-Hoon Choi

    Abstract: Recent progress in text-to-motion has advanced both 3D human motion generation and text-based motion control. Controllable motion generation (CoMo), which enables intuitive control, typically relies on pose code representations, but discrete pose codes alone cannot capture fine-grained motion details, limiting expressiveness. To overcome this, we propose a method that augments pose code-based late… ▽ More

    Submitted 20 August, 2025; originally announced August 2025.

  50. arXiv:2508.14444  [pdf, ps, other

    cs.CL cs.AI cs.LG

    NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

    Authors: NVIDIA, :, Aarti Basant, Abhijit Khairnar, Abhijit Paithankar, Abhinav Khattar, Adithya Renduchintala, Aditya Malte, Akhiad Bercovich, Akshay Hazare, Alejandra Rico, Aleksander Ficek, Alex Kondratenko, Alex Shaposhnikov, Alexander Bukharin, Ali Taghibakhshi, Amelia Barton, Ameya Sunil Mahabaleshwarkar, Amy Shen, Andrew Tao, Ann Guan, Anna Shors, Anubhav Mandarwal, Arham Mehta, Arun Venkatesan , et al. (192 additional authors not shown)

    Abstract: We introduce Nemotron-Nano-9B-v2, a hybrid Mamba-Transformer language model designed to increase throughput for reasoning workloads while achieving state-of-the-art accuracy compared to similarly-sized models. Nemotron-Nano-9B-v2 builds on the Nemotron-H architecture, in which the majority of the self-attention layers in the common Transformer architecture are replaced with Mamba-2 layers, to achi… ▽ More

    Submitted 2 September, 2025; v1 submitted 20 August, 2025; originally announced August 2025.