Skip to main content

Showing 1–50 of 168 results for author: Jung, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2604.02745  [pdf, ps, other

    cs.RO

    Geometrically-Constrained Radar-Inertial Odometry via Continuous Point-Pose Uncertainty Modeling

    Authors: Wooseong Yang, Dongjae Lee, Minwoo Jung, Ayoung Kim

    Abstract: Radar odometry is crucial for robust localization in challenging environments; however, the sparsity of reliable returns and distinctive noise characteristics impede its performance. This paper introduces geometrically-constrained radar-inertial odometry and mapping that jointly consolidates point and pose uncertainty. We employ the continuous trajectory model to estimate the pose uncertainty at a… ▽ More

    Submitted 3 April, 2026; originally announced April 2026.

    Comments: 8 pages, 8 figures, 6 tables, accepted to RA-L

  2. arXiv:2603.14892  [pdf, ps, other

    cs.CV

    Balancing Saliency and Coverage: Semantic Prominence-Aware Budgeting for Visual Token Compression in VLMs

    Authors: Jaehoon Lee, Mingi Jung, Soohyuk Jang, Seungryong Yoo, Dahuin Jung, Sungroh Yoon

    Abstract: Large Vision-Language Models (VLMs) achieve strong multimodal understanding capabilities by leveraging high-resolution visual inputs, but the resulting large number of visual tokens creates a major computational bottleneck. Recent work mitigates this issue through visual token compression, typically compressing tokens based on saliency, diversity, or a fixed combination of both. We observe that th… ▽ More

    Submitted 16 March, 2026; originally announced March 2026.

  3. arXiv:2603.05889  [pdf, ps, other

    cs.HC cs.CY

    Measuring Perceptions of Fairness in AI Systems: The Effects of Infra-marginality

    Authors: Schrasing Tong, Minseok Jung, Ilaria Liccardi, Lalana Kagal

    Abstract: Differences in data distributions between demographic groups, known as the problem of infra-marginality, complicate how people evaluate fairness in machine learning models. We present a user study with 85 participants in a hypothetical medical decision-making scenario to examine two treatments: group-specific model performance and training data availability. Our results show that participants did… ▽ More

    Submitted 5 March, 2026; originally announced March 2026.

  4. arXiv:2603.03695  [pdf, ps, other

    cs.RO

    TreeLoc++: Robust 6-DoF LiDAR Localization in Forests with a Compact Digital Forest Inventory

    Authors: Minwoo Jung, Dongjae Lee, Nived Chebrolu, Haedam Oh, Maurice Fallon, Ayoung Kim

    Abstract: Reliable localization is essential for sustainable forest management, as it allows robots or sensor systems to revisit and monitor the status of individual trees over long periods. In modern forestry, this management is structured around Digital Forest Inventories (DFIs), which encode stems using compact geometric attributes rather than raw data. Despite their central role, DFIs have been overlook… ▽ More

    Submitted 3 March, 2026; originally announced March 2026.

    Comments: 25 pages, 27 figures and 15 tables

  5. arXiv:2602.22508  [pdf, ps, other

    cs.AI

    Mirroring the Mind: Distilling Human-Like Metacognitive Strategies into Large Language Models

    Authors: Ik-hwan Kim, Hyeongrok Han, Mingi Jung, Sangwon Yu, Jinseok Hong, Sang Hun Kim, Yoonyoung Choi, Sungroh Yoon

    Abstract: Large Reasoning Models (LRMs) often exhibit structural fragility in complex reasoning tasks, failing to produce correct answers even after successfully deriving valid intermediate steps. Through systematic analysis, we observe that these failures frequently stem not from a lack of reasoning capacity, but from a deficiency in self-regulatory control, where valid logic is destabilized by uncontrolle… ▽ More

    Submitted 25 February, 2026; originally announced February 2026.

    Comments: 31 pages

  6. arXiv:2602.15863  [pdf, ps, other

    cs.CL cs.AI

    Not the Example, but the Process: How Self-Generated Examples Enhance LLM Reasoning

    Authors: Daehoon Gwak, Minseo Jung, Junwoo Park, Minho Park, ChaeHun Park, Junha Hyung, Jaegul Choo

    Abstract: Recent studies have shown that Large Language Models (LLMs) can improve their reasoning performance through self-generated few-shot examples, achieving results comparable to manually curated in-context examples. However, the underlying mechanism behind these gains remains unclear, making it hard to decide when and how to apply the technique effectively. In this work, we argue that the key benefit… ▽ More

    Submitted 26 January, 2026; originally announced February 2026.

    Comments: Presented at AACL-IJCNLP 2025

  7. arXiv:2602.10654  [pdf, ps, other

    cs.AR cs.FL

    DRAMPyML: A Formal Description of DRAM Protocols with Timed Petri Nets

    Authors: Derek Christ, Thomas Zimmermann, Philippe Barbie, Dmitri Saberi, Yao Yin, Matthias Jung

    Abstract: The JEDEC committee defines various domain-specific DRAM standards. These standards feature increasingly complex and evolving protocol specifications, which are detailed in timing diagrams and command tables. Understanding these protocols is becoming progressively challenging as new features and complex device hierarchies are difficult to comprehend without an expressive model. While each JEDEC st… ▽ More

    Submitted 11 February, 2026; originally announced February 2026.

  8. arXiv:2602.06807  [pdf, ps, other

    cs.RO cs.AI cs.LG

    SuReNav: Superpixel Graph-based Constraint Relaxation for Navigation in Over-constrained Environments

    Authors: Keonyoung Koh, Moonkyeong Jung, Samuel Seungsup Lee, Daehyung Park

    Abstract: We address the over-constrained planning problem in semi-static environments. The planning objective is to find a best-effort solution that avoids all hard constraint regions while minimally traversing the least risky areas. Conventional methods often rely on pre-defined area costs, limiting generalizations. Further, the spatial continuity of navigation spaces makes it difficult to identify region… ▽ More

    Submitted 6 February, 2026; originally announced February 2026.

    Comments: Accepted by ICRA 2026. Code and videos are available at https://sure-nav.github.io/

  9. Understanding How Accessibility Practices Impact Teamwork in Mixed-Ability Teams that Collaborate Virtually

    Authors: Crescentia Jung, Kexin Cheng, Sharon Heung, Malte F. Jung, Shiri Azenkot

    Abstract: Virtual collaboration has transformed how people in mixed-ability teams, composed of disabled and non-disabled people, work together by offering greater flexibility. In these settings, accessibility practices, such as accommodations and inclusive norms, are essential for providing access to disabled people. However, we do not yet know how these practices shape broader facets of teamwork, such as p… ▽ More

    Submitted 3 February, 2026; originally announced February 2026.

  10. arXiv:2602.01501  [pdf, ps, other

    cs.RO cs.CV

    TreeLoc: 6-DoF LiDAR Global Localization in Forests via Inter-Tree Geometric Matching

    Authors: Minwoo Jung, Nived Chebrolu, Lucas Carvalho de Lima, Haedam Oh, Maurice Fallon, Ayoung Kim

    Abstract: Reliable localization is crucial for navigation in forests, where GPS is often degraded and LiDAR measurements are repetitive, occluded, and structurally complex. These conditions weaken the assumptions of traditional urban-centric localization methods, which assume that consistent features arise from unique structural patterns, necessitating forest-centric solutions to achieve robustness in these… ▽ More

    Submitted 12 February, 2026; v1 submitted 1 February, 2026; originally announced February 2026.

    Comments: An 8-page paper with 7 tables and 8 figures, accepted to ICRA 2026

  11. arXiv:2602.00803  [pdf, ps, other

    cs.AR

    AutoGNN: End-to-End Hardware-Driven Graph Preprocessing for Enhanced GNN Performance

    Authors: Seungkwan Kang, Seungjun Lee, Donghyun Gouk, Miryeong Kwon, Hyunkyu Choi, Junhyeok Jang, Sangwon Lee, Huiwon Choi, Jie Zhang, Wonil Choi, Mahmut Taylan Kandemir, Myoungsoo Jung

    Abstract: Graph neural network (GNN) inference faces significant bottlenecks in preprocessing, which often dominate overall inference latency. We introduce AutoGNN, an FPGA-based accelerator designed to address these challenges by leveraging FPGA's reconfigurability and specialized components. AutoGNN adapts to diverse graph inputs, efficiently performing computationally intensive tasks such as graph conver… ▽ More

    Submitted 31 January, 2026; originally announced February 2026.

  12. arXiv:2601.14012  [pdf, ps, other

    eess.AS cs.AI

    MATE: Matryoshka Audio-Text Embeddings for Open-Vocabulary Keyword Spotting

    Authors: Youngmoon Jung, Myunghun Jung, Joon-Young Yang, Yong-Hyeok Lee, Jaeyoung Roh, Hoon-Young Cho

    Abstract: Open-vocabulary keyword spotting (KWS) with text-based enrollment has emerged as a flexible alternative to fixed-phrase triggers. Prior utterance-level matching methods, from an embedding-learning standpoint, learn embeddings at a single fixed dimensionality. We depart from this design and propose Matryoshka Audio-Text Embeddings (MATE), a dual-encoder framework that encodes multiple embedding gra… ▽ More

    Submitted 20 January, 2026; originally announced January 2026.

    Comments: 5 pages, 1 figure, Accepted at ICASSP 2026

  13. arXiv:2601.01708  [pdf, ps, other

    cs.CL

    A Training-Free Large Reasoning Model-based Knowledge Tracing Framework for Unified Prediction and Prescription

    Authors: Unggi Lee, Joo Young Kim, Ran Ju, Minyoung Jung, Jeyeon Eo

    Abstract: Knowledge Tracing (KT) aims to estimate a learner's evolving mastery based on interaction histories. Recent studies have explored Large Language Models (LLMs) for KT via autoregressive nature, but such approaches typically require fine-tuning and exhibit unstable or near-random performance. Moreover, prior KT systems primarily focus on prediction and rely on multi-stage pipelines for feedback and… ▽ More

    Submitted 4 January, 2026; originally announced January 2026.

  14. IM HERE: Interaction Model for Human Effort Based Robot Engagement

    Authors: Dominykas Strazdas, Magnus Jung, Jan Marquenie, Ingo Siegert, Ayoub Al-Hamadi

    Abstract: The effectiveness of human-robot interaction often hinges on the ability to cultivate engagement - a dynamic process of cognitive involvement that supports meaningful exchanges. Many existing definitions and models of engagement are either too vague or lack the ability to generalize across different contexts. We introduce IM HERE, a novel framework that models engagement effectively in human-human… ▽ More

    Submitted 3 December, 2025; originally announced December 2025.

    Comments: 8 pages, 5 figures

    Journal ref: 2025 IEEE Conference on Cognitive and Computational Aspects of Situation Management (CogSIMA)

  15. arXiv:2511.09072  [pdf, ps, other

    cs.RO cs.CV

    SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields

    Authors: Sangheon Yang, Yeongin Yoon, Hong Mo Jung, Jongwoo Lim

    Abstract: Traditional Visual Odometry (VO) and Visual Inertial Odometry (VIO) methods rely on a 'pose-centric' paradigm, which computes absolute camera poses from the local map thus requires large-scale landmark maintenance and continuous map optimization. This approach is computationally expensive, limiting their real-time performance on resource-constrained devices. To overcome these limitations, we intro… ▽ More

    Submitted 12 November, 2025; originally announced November 2025.

  16. arXiv:2510.27432  [pdf, ps, other

    cs.CV cs.AI

    Mitigating Semantic Collapse in Partially Relevant Video Retrieval

    Authors: WonJun Moon, MinSeok Jung, Gilhan Park, Tae-Young Kim, Cheol-Ho Cho, Woojin Jun, Jae-Pil Heo

    Abstract: Partially Relevant Video Retrieval (PRVR) seeks videos where only part of the content matches a text query. Existing methods treat every annotated text-video pair as a positive and all others as negatives, ignoring the rich semantic variation both within a single video and across different videos. Consequently, embeddings of both queries and their corresponding video-clip segments for distinct eve… ▽ More

    Submitted 31 October, 2025; originally announced October 2025.

    Comments: Accpeted to NeurIPS 2025. Code is available at https://github.com/admins97/MSC_PRVR

  17. arXiv:2510.26113  [pdf, ps, other

    cs.CV cs.AI

    EgoExo-Con: Exploring View-Invariant Video Temporal Understanding

    Authors: Minjoon Jung, Junbin Xiao, Junghyun Kim, Byoung-Tak Zhang, Angela Yao

    Abstract: Can Video-LLMs achieve consistent temporal understanding when videos capture the same event from different viewpoints? To study this, we introduce EgoExo-Con (Consistency), a benchmark of comprehensively synchronized egocentric and exocentric video pairs with human-refined queries in natural language. EgoExo-Con emphasizes two temporal understanding tasks: Temporal Verification and Temporal Ground… ▽ More

    Submitted 29 October, 2025; originally announced October 2025.

    Comments: project page: \url{https://minjoong507.github.io/projects/EgoExo-Con/}

  18. arXiv:2510.18905  [pdf, ps, other

    cs.LG cs.AI

    3D Optimization for AI Inference Scaling: Balancing Accuracy, Cost, and Latency

    Authors: Minseok Jung, Abhas Ricky, Muhammad Rameez Chatni

    Abstract: AI inference scaling is often tuned through 1D heuristics (a fixed reasoning pass) or 2D bivariate trade-offs (e.g., accuracy vs. compute), which fail to consider cost and latency constraints. We introduce a 3D optimization framework that jointly calibrates accuracy, cost, and latency within a unified decision space, enabling constraints-aware inference scaling. Using Monte Carlo simulations acros… ▽ More

    Submitted 15 November, 2025; v1 submitted 20 October, 2025; originally announced October 2025.

  19. Architecture, Simulation and Software Stack to Support Post-CMOS Accelerators: The ARCHYTAS Project

    Authors: Giovanni Agosta, Stefano Cherubin, Derek Christ, Francesco Conti, Asbjørn Djupdal, Matthias Jung, Georgios Keramidas, Roberto Passerone, Paolo Rech, Elisa Ricci, Philippe Velha, Flavio Vella, Kasim Sinan Yildirim, Nils Wilbert

    Abstract: ARCHYTAS aims to design and evaluate non-conventional hardware accelerators, in particular, optoelectronic, volatile and non-volatile processing-in-memory, and neuromorphic, to tackle the power, efficiency, and scalability bottlenecks of AI with an emphasis on defense use cases (e.g., autonomous vehicles, surveillance drones, maritime and space platforms). In this paper, we present the system arch… ▽ More

    Submitted 18 October, 2025; originally announced October 2025.

    Journal ref: 2025 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

  20. arXiv:2510.14622  [pdf, ps, other

    cs.DC

    MPI-over-CXL: Enhancing Communication Efficiency in Distributed HPC Systems

    Authors: Miryeong Kwon, Donghyun Gouk, Hyein Woo, Junhee Kim, Jinwoo Baek, Kyungkuk Nam, Sangyoon Ji, Jiseon Kim, Hanyeoreum Bae, Junhyeok Jang, Hyunwoo You, Junseok Moon, Myoungsoo Jung

    Abstract: MPI implementations commonly rely on explicit memory-copy operations, incurring overhead from redundant data movement and buffer management. This overhead notably impacts HPC workloads involving intensive inter-processor communication. In response, we introduce MPI-over-CXL, a novel MPI communication paradigm leveraging CXL, which provides cache-coherent shared memory across multiple hosts. MPI-ov… ▽ More

    Submitted 16 October, 2025; originally announced October 2025.

  21. arXiv:2510.14580  [pdf, ps, other

    cs.DC

    ScalePool: Hybrid XLink-CXL Fabric for Composable Resource Disaggregation in Unified Scale-up Domains

    Authors: Hyein Woo, Miryeong Kwon, Jiseon Kim, Eunjee Na, Hanjin Choi, Seonghyeon Jang, Myoungsoo Jung

    Abstract: This paper proposes ScalePool, a novel cluster architecture designed to interconnect numerous accelerators using unified hardware interconnects rather than traditional long-distance networking. ScalePool integrates Accelerator-Centric Links (XLink) and Compute Express Link (CXL) into a unified XLink-CXL hybrid fabric. Specifically, ScalePool employs XLink for intra-cluster, low-latency accelerator… ▽ More

    Submitted 16 October, 2025; originally announced October 2025.

  22. arXiv:2510.14391  [pdf, ps, other

    cs.SD cs.AI cs.LG

    Beat Tracking as Object Detection

    Authors: Jaehoon Ahn, Moon-Ryul Jung

    Abstract: Recent beat and downbeat tracking models (e.g., RNNs, TCNs, Transformers) output frame-level activations. We propose reframing this task as object detection, where beats and downbeats are modeled as temporal "objects." Adapting the FCOS detector from computer vision to 1D audio, we replace its original backbone with WaveBeat's temporal feature extractor and add a Feature Pyramid Network to capture… ▽ More

    Submitted 16 October, 2025; v1 submitted 16 October, 2025; originally announced October 2025.

    Comments: 11 pages, 4 figures, 5 tables

  23. arXiv:2510.11110  [pdf, ps, other

    cs.LG cs.AI

    PhysioME: A Robust Multimodal Self-Supervised Framework for Physiological Signals with Missing Modalities

    Authors: Cheol-Hui Lee, Hwa-Yeon Lee, Min-Kyung Jung, Dong-Joo Kim

    Abstract: Missing or corrupted modalities are common in physiological signal-based medical applications owing to hardware constraints or motion artifacts. However, most existing methods assume the availability of all modalities, resulting in substantial performance degradation in the absence of any modality. To overcome this limitation, this study proposes PhysioME, a robust framework designed to ensure rel… ▽ More

    Submitted 13 October, 2025; originally announced October 2025.

    Comments: 9 pages, 2 figures

  24. arXiv:2509.20750  [pdf, ps, other

    cs.CL cs.AI

    Confidence-guided Refinement Reasoning for Zero-shot Question Answering

    Authors: Youwon Jang, Woo Suk Choi, Minjoon Jung, Minsu Lee, Byoung-Tak Zhang

    Abstract: We propose Confidence-guided Refinement Reasoning (C2R), a novel training-free framework applicable to question-answering (QA) tasks across text, image, and video domains. C2R strategically constructs and refines sub-questions and their answers (sub-QAs), deriving a better confidence score for the target answer. C2R first curates a subset of sub-QAs to explore diverse reasoning paths, then compare… ▽ More

    Submitted 25 September, 2025; originally announced September 2025.

    Comments: 18 pages (including references and appendix)

  25. arXiv:2509.14834  [pdf, ps, other

    cs.CL

    LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring

    Authors: Jinhee Jang, Ayoung Moon, Minkyoung Jung, YoungBin Kim, Seung Jin Lee

    Abstract: The emergence of large language models (LLMs) has brought a new paradigm to automated essay scoring (AES), a long-standing and practical application of natural language processing in education. However, achieving human-level multi-perspective understanding and judgment remains a challenge. In this work, we propose Roundtable Essay Scoring (RES), a multi-agent evaluation framework designed to perfo… ▽ More

    Submitted 18 September, 2025; v1 submitted 18 September, 2025; originally announced September 2025.

  26. arXiv:2509.08765  [pdf, ps, other

    physics.comp-ph cs.LG math.NA stat.ML

    One-shot acceleration of transient PDE solvers via online-learned preconditioners

    Authors: Mikhail Khodak, Min Ki Jung, Brian Wynne, Edmond Chow, Egemen Kolemen

    Abstract: Data-driven acceleration of scientific computing workflows has been a high-profile aim of machine learning (ML) for science, with numerical simulation of transient partial differential equations (PDEs) being one of the main applications. The focus thus far has been on methods that require classical simulations to train, which when combined with the data-hungriness and optimization challenges of ne… ▽ More

    Submitted 3 December, 2025; v1 submitted 10 September, 2025; originally announced September 2025.

    Comments: code available at https://github.com/mkhodak/PCGBandit

  27. arXiv:2509.08640  [pdf

    eess.IV cs.AI cs.CV

    RoentMod: A Synthetic Chest X-Ray Modification Model to Identify and Correct Image Interpretation Model Shortcuts

    Authors: Lauren H. Cooke, Matthias Jung, Jan M. Brendel, Nora M. Kerkovits, Borek Foldyna, Michael T. Lu, Vineet K. Raghu

    Abstract: Chest radiographs (CXRs) are among the most common tests in medicine. Automated image interpretation may reduce radiologists\' workload and expand access to diagnostic expertise. Deep learning multi-task and foundation models have shown strong performance for CXR interpretation but are vulnerable to shortcut learning, where models rely on spurious and off-target correlations rather than clinically… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

    Comments: 25 + 8 pages, 4 + 7 figures

    MSC Class: I.4; I.2; J.3

  28. arXiv:2509.00707  [pdf, ps, other

    cs.CL cs.AI

    Reward-Weighted Sampling: Enhancing Non-Autoregressive Characteristics in Masked Diffusion LLMs

    Authors: Daehoon Gwak, Minseo Jung, Junwoo Park, Minho Park, ChaeHun Park, Junha Hyung, Jaegul Choo

    Abstract: Masked diffusion models (MDMs) offer a promising non-autoregressive alternative for large language modeling. Standard decoding methods for MDMs, such as confidence-based sampling, select tokens independently based on individual token confidences at each diffusion step. However, we observe that this independent token selection often results in generation orders resembling sequential autoregressive… ▽ More

    Submitted 20 September, 2025; v1 submitted 31 August, 2025; originally announced September 2025.

    Comments: EMNLP 2025 Main Paper (Long)

  29. arXiv:2507.18956  [pdf, ps, other

    cs.CL

    A Similarity Measure for Comparing Conversational Dynamics

    Authors: Sang Min Jung, Kaixiang Zhang, Cristian Danescu-Niculescu-Mizil

    Abstract: The quality of a conversation goes beyond the individual quality of each reply, and instead emerges from how these combine into interactional dynamics that give the conversation its distinctive overall "shape". However, there is no robust automated method for comparing conversations in terms of their overall dynamics. Such methods could enhance the analysis of conversational data and help evaluate… ▽ More

    Submitted 20 September, 2025; v1 submitted 25 July, 2025; originally announced July 2025.

    Comments: Proceedings of EMNLP 2025 (Findings). Code and demos available in ConvoKit (https://convokit.cornell.edu/)

  30. arXiv:2507.17174  [pdf, ps, other

    cs.GR cs.HC cs.LG

    GhostUMAP2: Measuring and Analyzing (r,d)-Stability of UMAP

    Authors: Myeongwon Jung, Takanori Fujiwara, Jaemin Jo

    Abstract: Despite the widespread use of Uniform Manifold Approximation and Projection (UMAP), the impact of its stochastic optimization process on the results remains underexplored. We observed that it often produces unstable results where the projections of data points are determined mostly by chance rather than reflecting neighboring structures. To address this limitation, we introduce (r,d)-stability to… ▽ More

    Submitted 22 July, 2025; originally announced July 2025.

  31. arXiv:2507.07223  [pdf, ps, other

    cs.DC cs.AR

    Compute Can't Handle the Truth: Why Communication Tax Prioritizes Memory and Interconnects in Modern AI Infrastructure

    Authors: Myoungsoo Jung

    Abstract: Modern AI workloads such as large language models (LLMs) and retrieval-augmented generation (RAG) impose severe demands on memory, communication bandwidth, and resource flexibility. Traditional GPU-centric architectures struggle to scale due to growing inter-GPU communication overheads. This report introduces key AI concepts and explains how Transformers revolutionized data representation in LLMs.… ▽ More

    Submitted 13 July, 2025; v1 submitted 9 July, 2025; originally announced July 2025.

    ACM Class: B.4.3; C.0; C.2.1; C.2.2

  32. arXiv:2506.15613  [pdf, ps, other

    cs.AR

    From Block to Byte: Transforming PCIe SSDs with CXL Memory Protocol and Instruction Annotation

    Authors: Miryeong Kwon, Donghyun Gouk, Junhyeok Jang, Jinwoo Baek, Hyunwoo You, Sangyoon Ji, Hongjoo Jung, Junseok Moon, Seungkwan Kang, Seungjun Lee, Myoungsoo Jung

    Abstract: This paper explores how Compute Express Link (CXL) can transform PCIe-based block storage into a scalable, byte-addressable working memory. We address the challenges of adapting block storage to CXL's memory-centric model by emphasizing cacheability as a key enabler and advocating for Type 3 endpoint devices, referred to as CXL-SSDs. To validate our approach, we prototype a CXL-SSD on a custom FPG… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  33. arXiv:2506.15601  [pdf, ps, other

    cs.AR

    CXL-GPU: Pushing GPU Memory Boundaries with the Integration of CXL Technologies

    Authors: Donghyun Gouk, Seungkwan Kang, Seungjun Lee, Jiseon Kim, Kyungkuk Nam, Eojin Ryu, Sangwon Lee, Dongpyung Kim, Junhyeok Jang, Hanyeoreum Bae, Myoungsoo Jung

    Abstract: This work introduces a GPU storage expansion solution utilizing CXL, featuring a novel GPU system design with multiple CXL root ports for integrating diverse storage media (DRAMs and/or SSDs). We developed and siliconized a custom CXL controller integrated at the hardware RTL level, achieving two-digit nanosecond roundtrip latency, the first in the field. This study also includes speculative read… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  34. arXiv:2506.15175  [pdf, ps, other

    cs.RO

    SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization

    Authors: Hanjun Kim, Minwoo Jung, Wooseong Yang, Ayoung Kim

    Abstract: Despite the growing adoption of radar in robotics, the majority of research has been confined to homogeneous sensor types, overlooking the integration and cross-modality challenges inherent in heterogeneous radar technologies. This leads to significant difficulties in generalizing across diverse radar data types, with modality-aware approaches that could leverage the complementary strengths of het… ▽ More

    Submitted 10 October, 2025; v1 submitted 18 June, 2025; originally announced June 2025.

    Comments: 9 pages, 9 figures, accepted to RA-L

  35. arXiv:2506.07471  [pdf, ps, other

    cs.CV cs.AI

    Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval

    Authors: CH Cho, WJ Moon, W Jun, MS Jung, JP Heo

    Abstract: Partially Relevant Video Retrieval~(PRVR) aims to retrieve a video where a specific segment is relevant to a given text query. Typical training processes of PRVR assume a one-to-one relationship where each text query is relevant to only one video. However, we point out the inherent ambiguity between text and video content based on their conceptual scope and propose a framework that incorporates th… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Accepted to AAAI 2025

  36. arXiv:2506.06769  [pdf, ps, other

    cs.AR

    Containerized In-Storage Processing and Computing-Enabled SSD Disaggregation

    Authors: Miryeong Kwon, Donghyun Gouk, Eunjee Na, Jiseon Kim, Junhee Kim, Hyein Woo, Eojin Ryu, Hyunkyu Choi, Jinwoo Baek, Hanyeoreum Bae, Mahmut Kandemir, Myoungsoo Jung

    Abstract: ISP minimizes data transfer for analytics but faces challenges in adaptation and disaggregation. We propose DockerSSD, an ISP model leveraging OS-level virtualization and lightweight firmware to enable containerized data processing directly on SSDs. Key features include Ethernet over NVMe for network-based ISP management and Virtual Firmware for secure, efficient container execution. DockerSSD sup… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  37. arXiv:2506.02835  [pdf, ps, other

    cs.RO

    High-speed control and navigation for quadrupedal robots on complex and discrete terrain

    Authors: Hyeongjun Kim, Hyunsik Oh, Jeongsoo Park, Yunho Kim, Donghoon Youm, Moonkyu Jung, Minho Lee, Jemin Hwangbo

    Abstract: High-speed legged navigation in discrete and geometrically complex environments is a challenging task because of the high-degree-of-freedom dynamics and long-horizon, nonconvex nature of the optimization problem. In this work, we propose a hierarchical navigation pipeline for legged robots that can traverse such environments at high speed. The proposed pipeline consists of a planner and tracker mo… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Journal ref: Science Robotics 10.102 (2025): eads6192

  38. arXiv:2505.21033  [pdf, ps, other

    cs.CL

    Def-DTS: Deductive Reasoning for Open-domain Dialogue Topic Segmentation

    Authors: Seungmin Lee, Yongsang Yoo, Minhwa Jung, Min Song

    Abstract: Dialogue Topic Segmentation (DTS) aims to divide dialogues into coherent segments. DTS plays a crucial role in various NLP downstream tasks, but suffers from chronic problems: data shortage, labeling ambiguity, and incremental complexity of recently proposed solutions. On the other hand, Despite advances in Large Language Models (LLMs) and reasoning strategies, these have rarely been applied to DT… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 19 pages, 3 figures, Accepted to Findings of the ACL 2025

  39. arXiv:2505.18577  [pdf, ps, other

    cs.AR

    CXL Topology-Aware and Expander-Driven Prefetching: Unlocking SSD Performance

    Authors: Dongsuk Oh, Miryeong Kwon, Jiseon Kim, Eunjee Na, Junseok Moon, Hyunkyu Choi, Seonghyeon Jang, Hanjin Choi, Hongjoo Jung, Sangwon Lee, Myoungsoo Jung

    Abstract: Integrating compute express link (CXL) with SSDs allows scalable access to large memory but has slower speeds than DRAMs. We present ExPAND, an expander-driven CXL prefetcher that offloads last-level cache (LLC) prefetching from host CPU to CXL-SSDs. ExPAND uses a heterogeneous prediction algorithm for prefetching and ensures data consistency with CXL.mem's back-invalidation. We examine prefetch t… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  40. arXiv:2505.18364  [pdf, ps, other

    cs.RO

    ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models

    Authors: Minwoo Jung, Lanke Frank Tarimo Fu, Maurice Fallon, Ayoung Kim

    Abstract: LiDAR Place Recognition (LPR) is a key component in robotic localization, enabling robots to align current scans with prior maps of their environment. While Visual Place Recognition (VPR) has embraced Vision Foundation Models (VFMs) to enhance descriptor robustness, LPR has relied on task-specific models with limited use of pre-trained foundation-level knowledge. This is due to the lack of 3D foun… ▽ More

    Submitted 7 August, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: CoRL2025 Accepted, 23 Pages, 15 Figures and 14 Tables

  41. arXiv:2505.16735  [pdf, ps, other

    eess.AS cs.AI

    Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting

    Authors: Youngmoon Jung, Yong-Hyeok Lee, Myunghun Jung, Jaeyoung Roh, Chang Woo Han, Hoon-Young Cho

    Abstract: For text enrollment-based open-vocabulary keyword spotting (KWS), acoustic and text embeddings are typically compared at either the phoneme or utterance level. To facilitate this, we optimize acoustic and text encoders using deep metric learning (DML), enabling direct comparison of multi-modal embeddings in a shared embedding space. However, the inherent heterogeneity between audio and text modali… ▽ More

    Submitted 22 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 5 pages, 1 figure, Accepted at Interspeech 2025

  42. arXiv:2505.11406  [pdf, other

    cs.HC cs.AI cs.CL

    Large Language Model Use Impact Locus of Control

    Authors: Jenny Xiyu Fu, Brennan Antone, Kowe Kadoma, Malte Jung

    Abstract: As AI tools increasingly shape how we write, they may also quietly reshape how we perceive ourselves. This paper explores the psychological impact of co-writing with AI on people's locus of control. Through an empirical study with 462 participants, we found that employment status plays a critical role in shaping users' reliance on AI and their locus of control. Current results demonstrated that em… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  43. arXiv:2505.05076  [pdf, other

    cs.RO cs.CV

    The City that Never Settles: Simulation-based LiDAR Dataset for Long-Term Place Recognition Under Extreme Structural Changes

    Authors: Hyunho Song, Dongjae Lee, Seunghun Oh, Minwoo Jung, Ayoung Kim

    Abstract: Large-scale construction and demolition significantly challenge long-term place recognition (PR) by drastically reshaping urban and suburban environments. Existing datasets predominantly reflect limited or indoor-focused changes, failing to adequately represent extensive outdoor transformations. To bridge this gap, we introduce the City that Never Settles (CNS) dataset, a simulation-based dataset… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  44. arXiv:2504.06580  [pdf, ps, other

    cs.CV cs.AI

    Exploring Ordinal Bias in Action Recognition for Instructional Videos

    Authors: Joochan Kim, Minjoon Jung, Byoung-Tak Zhang

    Abstract: Action recognition models have achieved promising results in understanding instructional videos. However, they often rely on dominant, dataset-specific action sequences rather than true video comprehension, a problem that we define as ordinal bias. To address this issue, we propose two effective video manipulation methods: Action Masking, which masks frames of frequently co-occurring actions, and… ▽ More

    Submitted 5 December, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

    Comments: Accepted at SCSL @ ICLR 2025

  45. arXiv:2503.23685  [pdf, other

    cs.ET

    An In-Situ Spatial-Temporal Sequence Detector for Neuromorphic Vision Sensor Empowered by High Density Vertical NAND Storage

    Authors: Zijian Zhao, Varun Darshana Parekh, Po-Kai Hsu, Yixin Qin, Yiming Song, A N M Nafiul Islam, Ningyuan Cao, Siddharth Joshi, Thomas Kämpfe, Moonyoung Jung, Kwangyou Seo, Kwangsoo Kim, Wanki Kim, Daewon Ha, Sourav Dutta, Abhronil Sengupta, Xiao Gong, Shimeng Yu, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Neuromorphic vision sensors require efficient real-time pattern recognition, yet conventional architectures struggle with energy and latency constraints. Here, we present a novel in-situ spatiotemporal sequence detector that leverages vertical NAND storage to achieve massively parallel pattern detection. By encoding each cell with two single-transistor-based multi-level cell (MLC) memory elements,… ▽ More

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: 26 pages, 7 figures

  46. arXiv:2503.05777  [pdf, ps, other

    cs.CL cs.AI cs.CY

    Medical Hallucinations in Foundation Models and Their Impact on Healthcare

    Authors: Yubin Kim, Hyewon Jeong, Shan Chen, Shuyue Stella Li, Chanwoo Park, Mingyu Lu, Kumail Alhamoud, Jimin Mun, Cristina Grau, Minseok Jung, Rodrigo Gameiro, Lizhou Fan, Eugene Park, Tristan Lin, Joonsik Yoon, Wonjin Yoon, Maarten Sap, Yulia Tsvetkov, Paul Liang, Xuhai Xu, Xin Liu, Chunjong Park, Hyeonhoon Lee, Hae Won Park, Daniel McDuff , et al. (2 additional authors not shown)

    Abstract: Hallucinations in foundation models arise from autoregressive training objectives that prioritize token-likelihood optimization over epistemic accuracy, fostering overconfidence and poorly calibrated uncertainty. We define medical hallucination as any model-generated output that is factually incorrect, logically inconsistent, or unsupported by authoritative clinical evidence in ways that could alt… ▽ More

    Submitted 2 November, 2025; v1 submitted 25 February, 2025; originally announced March 2025.

  47. arXiv:2502.18688  [pdf, other

    cs.RO cs.HC

    Rapidly Built Medical Crash Cart! Lessons Learned and Impacts on High-Stakes Team Collaboration in the Emergency Room

    Authors: Angelique Taylor, Tauhid Tanjim, Michael Joseph Sack, Maia Hirsch, Kexin Cheng, Kevin Ching, Jonathan St. George, Thijs Roumen, Malte F. Jung, Hee Rin Lee

    Abstract: Designing robots to support high-stakes teamwork in emergency settings presents unique challenges, including seamless integration into fast-paced environments, facilitating effective communication among team members, and adapting to rapidly changing situations. While teleoperated robots have been successfully used in high-stakes domains such as firefighting and space exploration, autonomous robots… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: 10 pages, 6 figures, HRI conference

  48. arXiv:2502.08662  [pdf, ps, other

    cs.CL cs.AI

    RoToR: Towards More Reliable Responses for Order-Invariant Inputs

    Authors: Soyoung Yoon, Dongha Ahn, Youngwon Lee, Minkyu Jung, HyungJoo Jang, Seung-won Hwang

    Abstract: Mitigating positional bias of language models (LMs) for listwise inputs is a well-known and important problem (e.g., lost-in-the-middle). While zero-shot order-invariant LMs have been proposed to solve this issue, their success on practical listwise problems has been limited. In this work, as a first contribution, we identify and overcome two limitations to make zero-shot invariant LMs more practi… ▽ More

    Submitted 2 June, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted at ACL 2025 main

  49. arXiv:2502.07703  [pdf, other

    cs.RO

    GaRLIO: Gravity enhanced Radar-LiDAR-Inertial Odometry

    Authors: Chiyun Noh, Wooseong Yang, Minwoo Jung, Sangwoo Jung, Ayoung Kim

    Abstract: Recently, gravity has been highlighted as a crucial constraint for state estimation to alleviate potential vertical drift. Existing online gravity estimation methods rely on pose estimation combined with IMU measurements, which is considered best practice when direct velocity measurements are unavailable. However, with radar sensors providing direct velocity data-a measurement not yet utilized for… ▽ More

    Submitted 21 February, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  50. arXiv:2502.04528  [pdf, ps, other

    cs.CL cs.LG

    Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection

    Authors: Minseok Jung, Cynthia Fuertes Panizo, Liam Dugan, Yi R., Fung, Pin-Yu Chen, Paul Pu Liang

    Abstract: The advancement of large language models (LLMs) has made it difficult to differentiate human-written text from AI-generated text. Several AI-text detectors have been developed in response, which typically utilize a fixed global threshold (e.g., $θ= 0.5$) to classify machine-generated text. However, one universal threshold could fail to account for distributional variations by subgroups. For exampl… ▽ More

    Submitted 1 February, 2026; v1 submitted 6 February, 2025; originally announced February 2025.