Skip to main content

Showing 1–50 of 130 results for author: Ju, S

.
  1. arXiv:2604.11197  [pdf, ps, other

    cs.CV

    MedP-CLIP: Medical CLIP with Region-Aware Prompt Integration

    Authors: Jiahui Peng, He Yao, Jingwen Li, Yanzhou Su, Sibo Ju, Yujie Lu, Jin Ye, Hongchun Lu, Xue Li, Lincheng Jiang, Min Zhu, Junlong Cheng

    Abstract: Contrastive Language-Image Pre-training (CLIP) has demonstrated outstanding performance in global image understanding and zero-shot transfer through large-scale text-image alignment. However, the core of medical image analysis often lies in the fine-grained understanding of specific anatomical structures or lesion regions. Therefore, precisely comprehending region-of-interest (RoI) information pro… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

  2. arXiv:2604.03023  [pdf, ps, other

    cs.RO

    Behavior-Constrained Reinforcement Learning with Receding-Horizon Credit Assignment for High-Performance Control

    Authors: Siwei Ju, Jan Tauberschmidt, Oleg Arenz, Peter van Vliet, Jan Peters

    Abstract: Learning high-performance control policies that remain consistent with expert behavior is a fundamental challenge in robotics. Reinforcement learning can discover high-performing strategies but often departs from desirable human behavior, whereas imitation learning is limited by demonstration quality and struggles to improve beyond expert data. We propose a behavior-constrained reinforcement learn… ▽ More

    Submitted 3 April, 2026; originally announced April 2026.

  3. arXiv:2604.00514  [pdf, ps, other

    cs.CV cs.AI

    MAESIL: Masked Autoencoder for Enhanced Self-supervised Medical Image Learning

    Authors: Kyeonghun Kim, Hyeonseok Jung, Youngung Han, Junsu Lim, YeonJu Jean, Seongbin Park, Eunseob Choi, Hyunsu Go, SeoYoung Ju, Seohyoung Park, Gyeongmin Kim, MinJu Kwon, KyungSeok Yuh, Soo Yong Kim, Ken Ying-Kai Liao, Nam-Joon Kim, Hyuk-Jae Lee

    Abstract: Training deep learning models for three-dimensional (3D) medical imaging, such as Computed Tomography (CT), is fundamentally challenged by the scarcity of labeled data. While pre-training on natural images is common, it results in a significant domain shift, limiting performance. Self-Supervised Learning (SSL) on unlabeled medical data has emerged as a powerful solution, but prominent frameworks o… ▽ More

    Submitted 1 April, 2026; originally announced April 2026.

    Comments: 5 pages, 3 figures. Accepted at ICEIC 2026

  4. arXiv:2604.00402  [pdf, ps, other

    cs.CV cs.AI

    COTTA: Context-Aware Transfer Adaptation for Trajectory Prediction in Autonomous Driving

    Authors: Seohyoung Park, Jaeyeol Lim, Seoyoung Ju, Kyeonghun Kim, Nam-Joon Kim, Hyuk-Jae Lee

    Abstract: Developing robust models to accurately predict the trajectories of surrounding agents is fundamental to autonomous driving safety. However, most public datasets, such as the Waymo Open Motion Dataset and Argoverse, are collected in Western road environments and do not reflect the unique traffic patterns, infrastructure, and driving behaviors of other regions, including South Korea. This domain dis… ▽ More

    Submitted 31 March, 2026; originally announced April 2026.

    Comments: 4 pages, 2 figures. Accepted at ICEIC 2026

  5. arXiv:2603.29356  [pdf, ps, other

    cs.CV cs.AI

    CIPHER: Counterfeit Image Pattern High-level Examination via Representation

    Authors: Kyeonghun Kim, Youngung Han, Seoyoung Ju, Yeonju Jean, YooHyun Kim, Minseo Choi, SuYeon Lim, Kyungtae Park, Seungwoo Baek, Sieun Hyeon, Nam-Joon Kim, Hyuk-Jae Lee

    Abstract: The rapid progress of generative adversarial networks (GANs) and diffusion models has enabled the creation of synthetic faces that are increasingly difficult to distinguish from real images. This progress, however, has also amplified the risks of misinformation, fraud, and identity abuse, underscoring the urgent need for detectors that remain robust across diverse generative models. In this work,… ▽ More

    Submitted 31 March, 2026; originally announced March 2026.

    Comments: 6 pages, 2 figures. Accepted at IEEE-Asia 2025

  6. arXiv:2603.29343  [pdf, ps, other

    cs.CV

    FOSCU: Feasibility of Synthetic MRI Generation via Duo-Diffusion Models for Enhancement of 3D U-Nets in Hepatic Segmentation

    Authors: Youngung Han, Kyeonghun Kim, Seoyoung Ju, Yeonju Jean, Minkyung Cha, Seohyoung Park, Hyeonseok Jung, Nam-Joon Kim, Woo Kyoung Jeong, Ken Ying-Kai Liao, Hyuk-Jae Lee

    Abstract: Medical image segmentation faces fundamental challenges including restricted access, costly annotation, and data shortage to clinical datasets through Picture Archiving and Communication Systems (PACS). These systemic barriers significantly impede the development of robust segmentation algorithms. To address these challenges, we propose FOSCU, which integrates Duo-Diffusion, a 3D latent diffusion… ▽ More

    Submitted 31 March, 2026; originally announced March 2026.

    Comments: 10 pages, 5 figures. Accepted at IEEE APCCAS 2025

  7. arXiv:2603.27460  [pdf, ps, other

    cs.CV cs.AI

    Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

    Authors: Zhongying Deng, Cheng Tang, Ziyan Huang, Jiashi Lin, Ying Chen, Junzhi Ning, Chenglong Ma, Jiyao Liu, Wei Li, Yinghao Zhu, Shujian Gao, Yanyan Huang, Sibo Ju, Yanzhou Su, Pengcheng Chen, Wenhao Tang, Tianbin Li, Haoyu Wang, Yuanfeng Ji, Hui Sun, Shaobo Min, Liang Peng, Feilong Tang, Haochen Xue, Rulin Zhou , et al. (102 additional authors not shown)

    Abstract: Foundation models have demonstrated remarkable success across diverse domains and tasks, primarily due to the thrive of large-scale, diverse, and high-quality datasets. However, in the field of medical imaging, the curation and assembling of such medical datasets are highly challenging due to the reliance on clinical expertise and strict ethical and privacy constraints, resulting in a scarcity of… ▽ More

    Submitted 28 March, 2026; originally announced March 2026.

    Comments: 157 pages, 19 figures, 26 tables. Project repo: \url{https://github.com/uni-medical/Project-Imaging-X}

  8. arXiv:2603.23845  [pdf, ps, other

    cs.CV

    3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation

    Authors: Kyeonghun Kim, Jaehyeok Bae, Youngung Han, Joo Young Bae, Seoyoung Ju, Junsu Lim, Gyeongmin Kim, Nam-Joon Kim, Woo Kyoung Jeong, Ken Ying-Kai Liao, Won Jae Lee, Pa Hong, Hyuk-Jae Lee

    Abstract: Deep learning and generative models are advancing rapidly, with synthetic data increasingly being integrated into training pipelines for downstream analysis tasks. However, in medical imaging, their adoption remains constrained by the scarcity of reliable annotated datasets. To address this limitation, we propose 3D-LLDM, a label-guided 3D latent diffusion model that generates high-quality synthet… ▽ More

    Submitted 24 March, 2026; originally announced March 2026.

    Comments: Accepted to ISBI 2026 (Oral). Camera-ready version

  9. arXiv:2603.22911  [pdf, ps, other

    cs.CV cs.AI

    ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

    Authors: Shaobo Ju, Baiyang Song, Tao Chen, Jiapeng Zhang, Qiong Wu, Chao Chang, HuaiXi Wang, Yiyi Zhou, Rongrong Ji

    Abstract: Due to the great saving of computation and memory overhead, token compression has become a research hot-spot for MLLMs and achieved remarkable progress in image-language tasks. However, for the video, existing methods still fall short of high-ratio token compression. We attribute this shortcoming to the insufficient modeling of temporal and continual video content, and propose a novel and training… ▽ More

    Submitted 12 April, 2026; v1 submitted 24 March, 2026; originally announced March 2026.

  10. arXiv:2603.20392  [pdf

    cs.LG cs.AI stat.ML

    SymCircuit: Bayesian Structure Inference for Tractable Probabilistic Circuits via Entropy-Regularized Reinforcement Learning

    Authors: Y. Sungtaek Ju

    Abstract: Probabilistic circuit (PC) structure learning is hampered by greedy algorithms that make irreversible, locally optimal decisions. We propose SymCircuit, which replaces greedy search with a learned generative policy trained via entropy-regularized reinforcement learning. Instantiating the RL-as-inference framework in the PC domain, we show the optimal policy is a tempered Bayesian posterior, recove… ▽ More

    Submitted 20 March, 2026; originally announced March 2026.

    Comments: 17 pages

  11. arXiv:2602.19213  [pdf, ps, other

    cs.CV

    SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation

    Authors: Yujie Lu, Jingwen Li, Sibo Ju, Yanzhou Su, he yao, Yisong Liu, Min Zhu, Junlong Cheng

    Abstract: Medical image segmentation is vital for clinical diagnosis and quantitative analysis, yet remains challenging due to the heterogeneity of imaging modalities and the high cost of pixel-level annotations. Although general interactive segmentation models like SAM have achieved remarkable progress, their transfer to medical imaging still faces two key bottlenecks: (i) the lack of adaptive mechanisms f… ▽ More

    Submitted 22 February, 2026; originally announced February 2026.

  12. arXiv:2602.17708  [pdf

    physics.chem-ph astro-ph.IM cs.LG physics.ao-ph physics.comp-ph physics.plasm-ph

    Spectral Homogenization of the Radiative Transfer Equation via Low-Rank Tensor Train Decomposition

    Authors: Y. Sungtaek Ju

    Abstract: Radiative transfer in absorbing-scattering media requires solving a transport equation across a spectral domain with 10^5 - 10^6 molecular absorption lines. Line-by-line (LBL) computation is prohibitively expensive, while existing approximations sacrifice spectral fidelity. We show that the Young-measure homogenization framework produces solution tensors I that admit low-rank tensor-train (TT) dec… ▽ More

    Submitted 12 February, 2026; originally announced February 2026.

    Comments: 30 pages; submitted for publication

  13. arXiv:2602.03816  [pdf, ps, other

    cs.LG

    SymPlex: A Structure-Aware Transformer for Symbolic PDE Solving

    Authors: Yesom Park, Annie C. Lu, Shao-Ching Huang, Qiyang Hu, Y. Sungtaek Ju, Stanley Osher

    Abstract: We propose SymPlex, a reinforcement learning framework for discovering analytical symbolic solutions to partial differential equations (PDEs) without access to ground-truth expressions. SymPlex formulates symbolic PDE solving as tree-structured decision-making and optimizes candidate solutions using only the PDE and its boundary conditions. At its core is SymFormer, a structure-aware Transformer t… ▽ More

    Submitted 3 February, 2026; originally announced February 2026.

    Comments: 27 pages

  14. arXiv:2602.03668  [pdf, ps, other

    cs.RO cs.CV

    MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction

    Authors: Jung Min Lee, Dohyeok Lee, Seokhun Ju, Taehyun Cho, Jin Woo Koo, Li Zhao, Sangwoo Hong, Jungwoo Lee

    Abstract: Learning \emph{latent actions} from diverse human videos enables scaling robot learning beyond embodiment-specific robot datasets, and these latent actions have recently been used as pseudo-action labels for vision-language-action (VLA) model pretraining. To make VLA pretraining effective, latent actions should contain information about the underlying agent's actions despite the absence of ground-… ▽ More

    Submitted 3 February, 2026; originally announced February 2026.

  15. arXiv:2601.10148  [pdf, ps, other

    cs.AI

    DecisionLLM: Large Language Models for Long Sequence Decision Exploration

    Authors: Xiaowei Lv, Zhilin Zhang, Yijun Li, Yusen Huo, Siyuan Ju, Xuyan Li, Chunxiang Hong, Tianyu Wang, Yongcai Wang, Peng Sun, Chuan Yu, Jian Xu, Bo Zheng

    Abstract: Long-sequence decision-making, which is usually addressed through reinforcement learning (RL), is a critical component for optimizing strategic operations in dynamic environments, such as real-time bidding in computational advertising. The Decision Transformer (DT) introduced a powerful paradigm by framing RL as an autoregressive sequence modeling problem. Concurrently, Large Language Models (LLMs… ▽ More

    Submitted 15 January, 2026; originally announced January 2026.

  16. Comparison of SCAN+U and r2SCAN+U for Charge Density Wave Instability and Lattice Dynamics in CuTe

    Authors: Seungha Ju, Sooran Kim

    Abstract: Identifying an appropriate exchange-correlation functional and computational conditions is essential for explaining the fundamental physics of materials and predicting their properties. Here, we investigate the performance of the meta-GGA functionals SCAN and r2SCAN, with and without a Hubbard U, for describing the charge density wave (CDW) in the quasi-one-dimensional material CuTe. By examining… ▽ More

    Submitted 15 January, 2026; originally announced January 2026.

  17. arXiv:2601.02197  [pdf, ps, other

    gr-qc physics.optics

    Optical echoes of light near a black hole

    Authors: Suting Ju, Jingxuan Zhang, Li-Gang Wang

    Abstract: The light deflection under a strong gravitational field, referred to as strong gravitational lensing, provides a powerful probe of spacetime geometry. Besides, laboratory analogue models are employed to study the effects of curved spacetime and explore the design of optical devices. Here, applying the framework of analogue gravity, we reveal the behavior of the optical echo from a pulsed point-lik… ▽ More

    Submitted 5 January, 2026; originally announced January 2026.

    Comments: 16 pages, 7 figures

  18. arXiv:2512.22426  [pdf

    physics.flu-dyn cs.LG

    Uncertainty-Aware Flow Field Reconstruction Using SVGP Kolmogorov-Arnold Networks

    Authors: Y. Sungtaek Ju

    Abstract: Reconstructing time-resolved flow fields from temporally sparse velocimetry measurements is critical for characterizing many complex thermal-fluid systems. We introduce a machine learning framework for uncertainty-aware flow reconstruction using sparse variational Gaussian processes in the Kolmogorov-Arnold network topology (SVGP-KAN). This approach extends the classical foundations of Linear Stoc… ▽ More

    Submitted 26 December, 2025; originally announced December 2025.

    Comments: 36 pages, 11 figures, submitted for publication in a journal

  19. arXiv:2512.20344  [pdf

    cs.AI

    A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice

    Authors: Yaowei Bai, Ruiheng Zhang, Yu Lei, Xuhua Duan, Jingfeng Yao, Shuguang Ju, Chaoyang Wang, Wei Yao, Yiwan Guo, Guilin Zhang, Chao Wan, Qian Yuan, Lei Chen, Wenjuan Tang, Biqiang Zhu, Xinggang Wang, Tao Sun, Wei Zhou, Dacheng Tao, Yongchao Xu, Chuansheng Zheng, Huangxuan Zhao, Bo Du

    Abstract: A global shortage of radiologists has been exacerbated by the significant volume of chest X-ray workloads, particularly in primary care. Although multimodal large language models show promise, existing evaluations predominantly rely on automated metrics or retrospective analyses, lacking rigorous prospective clinical validation. Janus-Pro-CXR (1B), a chest X-ray interpretation system based on Deep… ▽ More

    Submitted 23 December, 2025; originally announced December 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2507.19493

  20. arXiv:2512.08410  [pdf, ps, other

    cs.CV

    Towards Effective Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval

    Authors: Tao Chen, Shaobo Ju, Qiong Wu, Chenxin Fang, Kun Zhang, Jun Peng, Hui Li, Yiyi Zhou, Rongrong Ji

    Abstract: Due to excessive memory overhead, most Multimodal Large Language Models (MLLMs) can only process videos of limited frames. In this paper, we propose an effective and efficient paradigm to remedy this shortcoming, termed One-shot video-Clip based Retrieval-Augmented Generation (OneClip-RAG). Compared with existing video RAG methods, OneClip-RAG makes full use of the merits of video clips for augmen… ▽ More

    Submitted 9 April, 2026; v1 submitted 9 December, 2025; originally announced December 2025.

  21. arXiv:2512.05306  [pdf

    cs.LG stat.ML

    Uncertainty Quantification for Scientific Machine Learning using Sparse Variational Gaussian Process Kolmogorov-Arnold Networks (SVGP KAN)

    Authors: Y. Sungtaek Ju

    Abstract: Kolmogorov-Arnold Networks have emerged as interpretable alternatives to traditional multi-layer perceptrons. However, standard implementations lack principled uncertainty quantification capabilities essential for many scientific applications. We present a framework integrating sparse variational Gaussian process inference with the Kolmogorov-Arnold topology, enabling scalable Bayesian inference w… ▽ More

    Submitted 9 December, 2025; v1 submitted 4 December, 2025; originally announced December 2025.

    Comments: 20 pages, 3 figures

    ACM Class: I.2.6; I.2.8; G.3; G.1.2; J.2

  22. arXiv:2512.00260  [pdf

    cs.LG stat.ML

    Scalable and Interpretable Scientific Discovery via Sparse Variational Gaussian Process Kolmogorov-Arnold Networks (SVGP KAN)

    Authors: Y. Sungtaek Ju

    Abstract: Kolmogorov-Arnold Networks (KANs) offer a promising alternative to Multi-Layer Perceptron (MLP) by placing learnable univariate functions on network edges, enhancing interpretability. However, standard KANs lack probabilistic outputs, limiting their utility in applications requiring uncertainty quantification. While recent Gaussian Process (GP) extensions to KANs address this, they utilize exact i… ▽ More

    Submitted 28 November, 2025; originally announced December 2025.

    Comments: 7 pages, 3 figures

    ACM Class: I.2.6; I.2.8; G.3; G.1.2; J.2

  23. arXiv:2510.27114  [pdf, ps, other

    cs.RO cs.LG

    Learning Generalizable Visuomotor Policy through Dynamics-Alignment

    Authors: Dohyeok Lee, Jung Min Lee, Munkyung Kim, Seokhun Ju, Jin Woo Koo, Kyungjae Lee, Dohyeong Kim, TaeHyun Cho, Jungwoo Lee

    Abstract: Behavior cloning methods for robot learning suffer from poor generalization due to limited data support beyond expert demonstrations. Recent approaches leveraging video prediction models have shown promising results by learning rich spatiotemporal representations from large-scale datasets. However, these models learn action-agnostic dynamics that cannot distinguish between different control inputs… ▽ More

    Submitted 30 October, 2025; originally announced October 2025.

    Comments: 9 pages, 6 figures

  24. arXiv:2510.11241  [pdf, ps, other

    cond-mat.mtrl-sci

    Optimizing Cross-Domain Transfer for Universal Machine Learning Interatomic Potentials

    Authors: Jaesun Kim, Jinmu You, Yutack Park, Yunsung Lim, Yujin Kang, Jisu Kim, Haekwan Jeon, Suyeon Ju, Deokgi Hong, Seung Yul Lee, Saerom Choi, Yongdeok Kim, Jae W. Lee, Seungwu Han

    Abstract: Accurate yet transferable machine-learning interatomic potentials (MLIPs) are essential for accelerating materials and chemical discovery. However, most universal MLIPs overfit to narrow datasets or computational protocols, limiting their reliability across chemical and functional domains. We introduce a transferable multi-domain training strategy that jointly optimizes universal and task-specific… ▽ More

    Submitted 6 November, 2025; v1 submitted 13 October, 2025; originally announced October 2025.

    Comments: 23 pages, 5 figures, 2 tables, Supplementary information included as ancillary file (+32 pages)

  25. arXiv:2509.12777  [pdf, ps, other

    cs.CV cs.AI

    CECT-Mamba: a Hierarchical Contrast-enhanced-aware Model for Pancreatic Tumor Subtyping from Multi-phase CECT

    Authors: Zhifang Gong, Shuo Gao, Ben Zhao, Yingjing Xu, Yijun Yang, Shenghong Ju, Guangquan Zhou

    Abstract: Contrast-enhanced computed tomography (CECT) is the primary imaging technique that provides valuable spatial-temporal information about lesions, enabling the accurate diagnosis and subclassification of pancreatic tumors. However, the high heterogeneity and variability of pancreatic tumors still pose substantial challenges for precise subtyping diagnosis. Previous methods fail to effectively explor… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

  26. arXiv:2507.19493  [pdf

    cs.HC eess.IV

    From Bench to Bedside: A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice

    Authors: Yaowei Bai, Ruiheng Zhang, Yu Lei, Jingfeng Yao, Shuguang Ju, Chaoyang Wang, Wei Yao, Yiwan Guo, Guilin Zhang, Chao Wan, Qian Yuan, Xuhua Duan, Xinggang Wang, Tao Sun, Yongchao Xu, Chuansheng Zheng, Huangxuan Zhao, Bo Du

    Abstract: A global shortage of radiologists has been exacerbated by the significant volume of chest X-ray workloads, particularly in primary care. Although multimodal large language models show promise, existing evaluations predominantly rely on automated metrics or retrospective analyses, lacking rigorous prospective clinical validation. Janus-Pro-CXR (1B), a chest X-ray interpretation system based on Deep… ▽ More

    Submitted 31 May, 2025; originally announced July 2025.

  27. arXiv:2505.22379  [pdf, other

    physics.flu-dyn

    Dynamics of thin film flows on a vertical fibre with vapor absorption

    Authors: Souradip Chattopadhyay, Zihao Yu, Y. Sungtaek Ju, Hangjie Ji

    Abstract: Water vapor capture through free surface flows plays a crucial role in various industrial applications, such as liquid desiccant air conditioning systems, water harvesting, and dewatering. This paper studies the dynamics of a silicone liquid sorbent (also known as water-absorbing silicone oil) flowing down a vertical cylindrical fibre while absorbing water vapor. We propose a one-sided thin-film-t… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 32 pages, 13 figures

  28. arXiv:2505.18831  [pdf, ps, other

    cs.IR

    Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning

    Authors: Jinzheng Li, Sibo Ju, Yanzhou Su, Hongguang Li, Yiqing Shen

    Abstract: Existing large language models (LLMs) driven search agents typically rely on prompt engineering to decouple the user queries into search plans, limiting their effectiveness in complex scenarios requiring reasoning. Furthermore, they suffer from excessive token consumption due to Python-based search plan representations and inadequate integration of multimedia elements for both input processing and… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  29. arXiv:2505.06273  [pdf, ps, other

    cs.LG cs.AI

    Policy-labeled Preference Learning: Is Preference Enough for RLHF?

    Authors: Taehyun Cho, Seokhun Ju, Seungyub Han, Dohyeong Kim, Kyungjae Lee, Jungwoo Lee

    Abstract: To design rewards that align with human goals, Reinforcement Learning from Human Feedback (RLHF) has emerged as a prominent technique for learning reward functions from human preferences and optimizing policies via reinforcement learning algorithms. However, existing RLHF methods often misinterpret trajectories as being generated by an optimal policy, causing inaccurate likelihood estimation and s… ▽ More

    Submitted 13 May, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

  30. arXiv:2504.01886  [pdf, other

    cs.CV

    GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning

    Authors: Yanzhou Su, Tianbin Li, Jiyao Liu, Chenglong Ma, Junzhi Ning, Cheng Tang, Sibo Ju, Jin Ye, Pengcheng Chen, Ming Hu, Shixiang Tang, Lihao Liu, Bin Fu, Wenqi Shao, Xiaowei Hu, Xiangwen Liao, Yuanfeng Ji, Junjun He

    Abstract: Recent advances in general medical AI have made significant strides, but existing models often lack the reasoning capabilities needed for complex medical decision-making. This paper presents GMAI-VL-R1, a multimodal medical reasoning model enhanced by reinforcement learning (RL) to improve its reasoning abilities. Through iterative training, GMAI-VL-R1 optimizes decision-making, significantly boos… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  31. arXiv:2503.07032  [pdf, other

    cs.CL cs.CV

    Multimodal Human-AI Synergy for Medical Imaging Quality Control: A Hybrid Intelligence Framework with Adaptive Dataset Curation and Closed-Loop Evaluation

    Authors: Zhi Qin, Qianhui Gui, Mouxiao Bian, Rui Wang, Hong Ge, Dandan Yao, Ziying Sun, Yuan Zhao, Yu Zhang, Hui Shi, Dongdong Wang, Chenxin Song, Shenghong Ju, Lihao Liu, Junjun He, Jie Xu, Yuan-Cheng Wang

    Abstract: Medical imaging quality control (QC) is essential for accurate diagnosis, yet traditional QC methods remain labor-intensive and subjective. To address this challenge, in this study, we establish a standardized dataset and evaluation framework for medical imaging QC, systematically assessing large language models (LLMs) in image quality assessment and report standardization. Specifically, we first… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  32. arXiv:2502.12171  [pdf, ps, other

    cs.LG cs.AI cs.CL

    GoRA: Gradient-driven Adaptive Low Rank Adaptation

    Authors: Haonan He, Peng Ye, Yuchen Ren, Yuan Yuan, Luyang Zhou, Shucun Ju, Lei Chen

    Abstract: Low-Rank Adaptation (LoRA) is a crucial method for efficiently fine-tuning large language models (LLMs), with its effectiveness influenced by two key factors: rank selection and weight initialization. While numerous LoRA variants have been proposed to improve performance by addressing one of these aspects, they often compromise usability or computational efficiency. In this paper, we analyze and i… ▽ More

    Submitted 24 October, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: NeurIPS 2025

  33. arXiv:2501.05211  [pdf, other

    cond-mat.mtrl-sci

    Application of pretrained universal machine-learning interatomic potential for physicochemical simulation of liquid electrolytes in Li-ion battery

    Authors: Suyeon Ju, Jinmu You, Gijin Kim, Yutack Park, Hyungmin An, Seungwu Han

    Abstract: Achieving higher operational voltages, faster charging, and broader temperature ranges for Li-ion batteries necessitates advancements in electrolyte engineering. However, the complexity of optimizing combinations of solvents, salts, and additives has limited the effectiveness of both experimental and computational screening methods for liquid electrolytes. Recently, pretrained universal machine-le… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: 14 pages, 6 figures, Supplementary information included as ancillary file (+33 pages)

  34. arXiv:2412.00859  [pdf, other

    cond-mat.mes-hall

    Magnetically tuned topological phase in graphene nanoribbon heterojunctions

    Authors: Wei-Jian Li, Da-Fei Sun, Sheng Ju, Ai-Lei He, Yuan Zhou

    Abstract: The interplay between topology and magnetism often triggers the exotic quantum phases. Here, we report an accessible scheme to engineer the robust $\mathbb{Z}_{2}$ topology by intrinsic magnetism, originating from the zigzag segment connecting two armchair segments with different width, in one-dimensional graphene nanoribbon heterojunctions. Our first-principle and model simulations reveal that th… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

    Comments: 5 pages, 5 figures

    Journal ref: Phys.Rev.B 112, 115401 (2025)

  35. arXiv:2410.20017  [pdf, other

    cs.LG cs.AI cs.HC

    Off-Policy Selection for Initiating Human-Centric Experimental Design

    Authors: Ge Gao, Xi Yang, Qitong Gao, Song Ju, Miroslav Pajic, Min Chi

    Abstract: In human-centric tasks such as healthcare and education, the heterogeneity among patients and students necessitates personalized treatments and instructional interventions. While reinforcement learning (RL) has been utilized in those tasks, off-policy selection (OPS) is pivotal to close the loop by offline evaluating and selecting policies without online interactions, yet current OPS methods often… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  36. arXiv:2410.16199  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.other

    Momentum-Resolved Fingerprint of Mottness in Layer-Dimerized Nb$_3$Br$_8$

    Authors: Mihir Date, Francesco Petocchi, Yun Yen, Jonas A. Krieger, Banabir Pal, Vicky Hasse, Emily C. McFarlane, Chris Körner, Jiho Yoon, Matthew D. Watson, Vladimir N. Strocov, Yuanfeng Xu, Ilya Kostanovski, Mazhar N. Ali, Sailong Ju, Nicholas C. Plumb, Michael A. Sentef, Georg Woltersdorf, Michael Schüler, Philipp Werner, Claudia Felser, Stuart S. P. Parkin, Niels B. M. Schröter

    Abstract: In a well-ordered crystalline solid, insulating behaviour can arise from two mechanisms: electrons can either scatter off a periodic potential, thus forming band gaps that can lead to a band insulator, or they localize due to strong interactions, resulting in a Mott insulator. For an even number of electrons per unit cell, either band- or Mott-insulators can theoretically occur. However, unambiguo… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 9 pages, 3 figures

  37. arXiv:2410.16032  [pdf, other

    cs.LG cs.AI

    TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis

    Authors: Shiyu Wang, Jiawei Li, Xiaoming Shi, Zhou Ye, Baichuan Mo, Wenze Lin, Shengtong Ju, Zhixuan Chu, Ming Jin

    Abstract: Time series analysis plays a critical role in numerous applications, supporting tasks such as forecasting, classification, anomaly detection, and imputation. In this work, we present the time series pattern machine (TSPM), a model designed to excel in a broad range of time series tasks through powerful representation and pattern extraction capabilities. Traditional time series models often struggl… ▽ More

    Submitted 19 May, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted by the 13th International Conference on Learning Representations (ICLR 2025)

  38. arXiv:2410.14196  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Quantum-Confined Tunable Ferromagnetism on the Surface of a van der Waals Antiferromagnet NaCrTe2

    Authors: Yidian Li, Xian Du, Junjie Wang, Runzhe Xu, Wenxuan Zhao, Kaiyi Zhai, Jieyi Liu, Houke Chen, Yiheng Yang, Nicolas C. Plumb, Sailong Ju, Ming Shi, Zhongkai Liu, Jiangang Guo, Xiaolong Chen, Yulin Chen, Lexian Yang

    Abstract: The surface of three-dimensional materials provides an ideal and versatile platform to explore quantum-confined physics. Here, we systematically investigate the electronic structure of Na-intercalated CrTe2, a van der Waals antiferromagnet, using angle-resolved photoemission spectroscopy and ab-initio calculations. The measured band structure deviates from the calculation of bulk NaCrTe2 but agree… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Journal ref: Nano Lett. 24, 9832-9838 (2024)

  39. arXiv:2410.03984  [pdf, other

    eess.IV

    Shadow Augmentation for Handwashing Action Recognition: from Synthetic to Real Datasets

    Authors: Shengtai Ju, Amy R. Reibman

    Abstract: Video analytics systems designed for deployment in outdoor conditions can be vulnerable to many environmental changes, particularly changes in shadow. Existing works have shown that shadow and its introduced distribution shift can cause system performance to degrade sharply. In this paper, we explore mitigation strategies to shadow-induced breakdown points of an action recognition system, using th… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  40. arXiv:2407.21260  [pdf, other

    cs.LG cs.AI stat.ML

    Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation

    Authors: Taehyun Cho, Seungyub Han, Seokhun Ju, Dohyeong Kim, Kyungjae Lee, Jungwoo Lee

    Abstract: Distributional reinforcement learning improves performance by capturing environmental stochasticity, but a comprehensive theoretical understanding of its effectiveness remains elusive. In addition, the intractable element of the infinite dimensionality of distributions has been overlooked. In this paper, we present a regret analysis of distributional reinforcement learning with general value funct… ▽ More

    Submitted 13 May, 2025; v1 submitted 30 July, 2024; originally announced July 2024.

  41. arXiv:2407.09520  [pdf, other

    cs.CV eess.IV

    Exploring the Impact of Hand Pose and Shadow on Hand-washing Action Recognition

    Authors: Shengtai Ju, Amy R. Reibman

    Abstract: In the real world, camera-based application systems can face many challenges, including environmental factors and distribution shift. In this paper, we investigate how pose and shadow impact a classifier's performance, using the specific application of handwashing action recognition. To accomplish this, we generate synthetic data with desired variations to introduce controlled distribution shift.… ▽ More

    Submitted 19 June, 2024; originally announced July 2024.

  42. arXiv:2404.19613  [pdf

    cond-mat.mtrl-sci physics.app-ph physics.comp-ph

    High-throughput discovery of metal oxides with high thermoelectric performance via interpretable feature engineering on small data

    Authors: Shengluo Ma, Yongchao Rao, Xiang Huang, Shenghong Ju

    Abstract: In this work, we have proposed a data-driven screening framework combining the interpretable machine learning with high-throughput calculations to identify a series of metal oxides that exhibit both high-temperature tolerance and high power factors. Aiming at the problem of weak generalization ability of small data with power factors at high temperatures, we employ symbolic regression for feature… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  43. arXiv:2404.06063  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Heuristic-enhanced Candidates Selection strategy for GPTs tackle Few-Shot Aspect-Based Sentiment Analysis

    Authors: Baoxing Jiang, Yujie Wan, Shenggen Ju

    Abstract: Few-Shot Aspect-Based Sentiment Analysis (FSABSA) is an indispensable and highly challenging task in natural language processing. However, methods based on Pre-trained Language Models (PLMs) struggle to accommodate multiple sub-tasks, and methods based on Generative Pre-trained Transformers (GPTs) perform poorly. To address the above issues, the paper designs a Heuristic-enhanced Candidates Select… ▽ More

    Submitted 19 August, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures

  44. arXiv:2403.15887  [pdf

    cond-mat.soft cond-mat.mtrl-sci physics.app-ph physics.chem-ph physics.comp-ph

    Tutorial: AI-assisted exploration and active design of polymers with high intrinsic thermal conductivity

    Authors: Xiang Huang, Shenghong Ju

    Abstract: Designing polymers with high intrinsic thermal conductivity (TC) is critically important for the thermal management of organic electronics and photonics. However, this is a challenging task owing to the diversity of the chemical space and the barriers to advanced synthetic experiments/characterization techniques for polymers. In this Tutorial, the fundamentals and implementation of combining class… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Journal ref: Journal of Applied Physics 135, 171101, 2024

  45. arXiv:2402.17437  [pdf, other

    cs.CL cs.AI

    Exploiting Emotion-Semantic Correlations for Empathetic Response Generation

    Authors: Zhou Yang, Zhaochun Ren, Yufeng Wang, Xiaofei Zhu, Zhihao Chen, Tiecheng Cai, Yunbing Wu, Yisong Su, Sibo Ju, Xiangwen Liao

    Abstract: Empathetic response generation aims to generate empathetic responses by understanding the speaker's emotional feelings from the language of dialogue. Recent methods capture emotional words in the language of communicators and construct them as static vectors to perceive nuanced emotions. However, linguistic research has shown that emotional words in language are dynamic and have correlations with… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 pages, 3 figures, Findings of EMNLP 2023

  46. arXiv:2402.11600  [pdf

    cond-mat.soft cond-mat.mtrl-sci physics.app-ph physics.comp-ph

    AI-assisted inverse design of sequence-ordered high intrinsic thermal conductivity polymers

    Authors: Xiang Huang, C. Y. Zhao, Hong Wang, Shenghong Ju

    Abstract: Artificial intelligence (AI) promotes the polymer design paradigm from a traditional trial-and-error approach to a data-driven style. Achieving high thermal conductivity (TC) for intrinsic polymers is urgent because of their importance in the thermal management of many industrial applications such as microelectronic devices and integrated circuits. In this work, we have proposed a robust AI-assist… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Journal ref: Materials Today Physics 44, 101438, 2024

  47. arXiv:2401.11522  [pdf

    cond-mat.soft cond-mat.mes-hall physics.chem-ph physics.comp-ph

    Tunable thermal conductivity of sustainable geopolymers by Si/Al ratio and moisture content: insights from atomistic simulations

    Authors: Wenkai Liu, Shenghong Ju

    Abstract: In this work, the effects of Si/Al ratio and moisture content on thermal transport in sustainable geopolymers has been comprehensively investigated by using the molecular dynamics simulation. The thermal conductivity of geopolymer systems increases with the increase of Si/Al ratio, and the phonon vibration frequency region which plays a major role in the main increase of its thermal conductivity i… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Journal ref: The Journal of Physical Chemistry B 128, 2972, 2024

  48. arXiv:2401.10524  [pdf, other

    physics.optics gr-qc

    Spectral Switches of Light in Curved Space

    Authors: Suting Ju, Chenni Xu, Li-Gang Wang

    Abstract: Acting as analog models of curved spacetime, surfaces of revolution employed for exploring novel optical effects are followed with great interest nowadays to enhance our comprehension of the universe. It is of general interest to understand the spectral effect of light propagating through a long distance in the universe. Here, we address the issue on how curved space affects the phenomenon of spec… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 9 pages,5 figures

  49. arXiv:2310.19350  [pdf, other

    cond-mat.mtrl-sci

    Disorder-dependent Li diffusion in $\mathrm{Li_6PS_5Cl}$ investigated by machine learning potential

    Authors: Jiho Lee, Suyeon Ju, Seungwoo Hwang, Jinmu You, Jisu Jung, Youngho Kang, Seungwu Han

    Abstract: Solid-state electrolytes with argyrodite structures, such as $\mathrm{Li_6PS_5Cl}$, have attracted considerable attention due to their superior safety compared to liquid electrolytes and higher ionic conductivity than other solid electrolytes. Although experimental efforts have been made to enhance conductivity by controlling the degree of disorder, the underlying diffusion mechanism is not yet fu… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 34 pages, 6 figures

  50. arXiv:2310.10977  [pdf, other

    math.NA

    A positivity-preserving numerical method for a thin liquid film on a vertical cylindrical fiber

    Authors: Bohyun Kim, Hangjie Ji, Andrea L. Bertozzi, Abolfazl Sadeghpour, Y. Sungtaek Ju

    Abstract: When a thin liquid film flows down on a vertical fiber, one can observe the complex and captivating interfacial dynamics of an unsteady flow. Such dynamics are applicable in various fluid experiments due to their high surface area-to-volume ratio. Recent studies verified that when the flow undergoes regime transitions, the magnitude of the film thickness changes dramatically, making numerical simu… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.