Skip to main content

Showing 1–50 of 67 results for author: Na, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2604.02032  [pdf, ps, other

    cs.CV cs.LG

    IndoorCrowd: A Multi-Scene Dataset for Human Detection, Segmentation, and Tracking with an Automated Annotation Pipeline

    Authors: Sebastian-Ion Nae, Radu Moldoveanu, Alexandra Stefania Ghita, Adina Magda Florea

    Abstract: Understanding human behaviour in crowded indoor environments is central to surveillance, smart buildings, and human-robot interaction, yet existing datasets rarely capture real-world indoor complexity at scale. We introduce IndoorCrowd, a multi-scene dataset for indoor human detection, instance segmentation, and multi-object tracking, collected across four campus locations (ACS-EC, ACS-EG, IE-Cent… ▽ More

    Submitted 2 April, 2026; originally announced April 2026.

    Comments: Accepted at Conference on Computer Vision and Pattern Recognition Workshops 2026

  2. arXiv:2603.20587  [pdf, ps, other

    cs.LG cs.IT math.MG

    Neural collapse in the orthoplex regime

    Authors: James Alcala, Rayna Andreeva, Vladimir A. Kobzar, Dustin G. Mixon, Sanghoon Na, Shashank Sule, Yangxinyu Xie

    Abstract: When training a neural network for classification, the feature vectors of the training set are known to collapse to the vertices of a regular simplex, provided the dimension $d$ of the feature space and the number $n$ of classes satisfies $n\leq d+1$. This phenomenon is known as neural collapse. For other applications like language models, one instead takes $n\gg d$. Here, the neural collapse phen… ▽ More

    Submitted 20 March, 2026; originally announced March 2026.

  3. arXiv:2603.10230  [pdf, ps, other

    math.OC cs.LG math.NA stat.ML

    A Trust-Region Interior-Point Stochastic Sequential Quadratic Programming Method

    Authors: Yuchen Fang, Jihun Kim, Sen Na, James Demmel, Javad Lavaei

    Abstract: In this paper, we propose a trust-region interior-point stochastic sequential quadratic programming (TR-IP-SSQP) method for solving optimization problems with a stochastic objective and deterministic nonlinear equality and inequality constraints. In this setting, exact evaluations of the objective function and its gradient are unavailable, but their stochastic estimates can be constructed. In part… ▽ More

    Submitted 10 March, 2026; originally announced March 2026.

  4. arXiv:2602.19124  [pdf, ps, other

    cs.HC

    Dark and Bright Side of Participatory Red-Teaming with Targets of Stereotyping for Eliciting Harmful Behaviors from Large Language Models

    Authors: Sieun Kim, Yeeun Jo, Sungmin Na, Hyunseung Lim, Eunchae Lee, Yu Min Choi, Soohyun Cho, Hwajung Hong

    Abstract: Red-teaming, where adversarial prompts are crafted to expose harmful behaviors and assess risks, offers a dynamic approach to surfacing underlying stereotypical bias in large language models. Because such subtle harms are best recognized by those with lived experience, involving targets of stereotyping as red-teamers is essential. However, critical challenges remain in leveraging their lived exper… ▽ More

    Submitted 22 February, 2026; originally announced February 2026.

    Comments: 20 pages, 4 tables, 3 figures. Accepted to CHI 2026, April 13-17, 2026, Barcelona, Spain

  5. arXiv:2602.13440  [pdf, ps, other

    cs.CV cs.RO

    Learning on the Fly: Replay-Based Continual Object Perception for Indoor Drones

    Authors: Sebastian-Ion Nae, Mihai-Eugen Barbu, Sebastian Mocanu, Marius Leordeanu

    Abstract: Autonomous agents such as indoor drones must learn new object classes in real-time while limiting catastrophic forgetting, motivating Class-Incremental Learning (CIL). However, most unmanned aerial vehicle (UAV) datasets focus on outdoor scenes and offer limited temporally coherent indoor videos. We introduce an indoor dataset of $14,400$ frames capturing inter-drone and ground vehicle footage, an… ▽ More

    Submitted 13 February, 2026; originally announced February 2026.

    Comments: Accepted at European Robotics Forum (ERF) 2026

  6. arXiv:2602.07543  [pdf, ps, other

    cs.AI cond-mat.mtrl-sci

    MSP-LLM: A Unified Large Language Model Framework for Complete Material Synthesis Planning

    Authors: Heewoong Noh, Gyoung S. Na, Namkyeong Lee, Chanyoung Park

    Abstract: Material synthesis planning (MSP) remains a fundamental and underexplored bottleneck in AI-driven materials discovery, as it requires not only identifying suitable precursor materials but also designing coherent sequences of synthesis operations to realize a target material. Although several AI-based approaches have been proposed to address isolated subtasks of MSP, a unified methodology for solvi… ▽ More

    Submitted 1 March, 2026; v1 submitted 7 February, 2026; originally announced February 2026.

  7. arXiv:2602.07087  [pdf, ps, other

    physics.chem-ph cs.AI cs.LG physics.comp-ph

    Electron-Informed Coarse-Graining Molecular Representation Learning for Real-World Molecular Physics

    Authors: Gyoung S. Na, Chanyoung Park

    Abstract: Various representation learning methods for molecular structures have been devised to accelerate data-driven chemistry. However, the representation capabilities of existing methods are essentially limited to atom-level information, which is not sufficient to describe real-world molecular physics. Although electron-level information can provide fundamental knowledge about chemical compounds beyond… ▽ More

    Submitted 6 February, 2026; originally announced February 2026.

    Comments: KDD 2025 Research Track

  8. arXiv:2601.01048  [pdf, ps, other

    cs.CR

    CuFuzz: Hardening CUDA Programs through Transformation and Fuzzing

    Authors: Saurabh Singh, Ruobing Han, Jaewon Lee, Seonjin Na, Yonghae Kim, Taesoo Kim, Hyesoon Kim

    Abstract: GPUs have gained significant popularity over the past decade, extending beyond their original role in graphics rendering. This evolution has brought GPU security and reliability to the forefront of concerns. Prior research has shown that CUDA's lack of memory safety can lead to serious vulnerabilities. While fuzzing is effective for finding such bugs on CPUs, equivalent tools for GPUs are lacking… ▽ More

    Submitted 2 January, 2026; originally announced January 2026.

    Comments: 16 pages, 7 figures, 2 tables

  9. arXiv:2512.08948  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    Online Inference of Constrained Optimization: Primal-Dual Optimality and Sequential Quadratic Programming

    Authors: Yihang Gao, Michael K. Ng, Michael W. Mahoney, Sen Na

    Abstract: We study online statistical inference for the solutions of stochastic optimization problems with equality and inequality constraints. Such problems are prevalent in statistics and machine learning, encompassing constrained $M$-estimation, physics-informed models, safe reinforcement learning, and algorithmic fairness. We develop a stochastic sequential quadratic programming (SSQP) method to solve t… ▽ More

    Submitted 27 November, 2025; originally announced December 2025.

    Comments: 80 pages, 5 figures, 5 tables

  10. arXiv:2511.20686  [pdf, ps, other

    cs.AI cs.CY cs.LG

    AssurAI: Experience with Constructing Korean Socio-cultural Datasets to Discover Potential Risks of Generative AI

    Authors: Chae-Gyun Lim, Seung-Ho Han, EunYoung Byun, Jeongyun Han, Soohyun Cho, Eojin Joo, Heehyeon Kim, Sieun Kim, Juhoon Lee, Hyunsoo Lee, Dongkun Lee, Jonghwan Hyeon, Yechan Hwang, Young-Jun Lee, Kyeongryul Lee, Minhyeong An, Hyunjun Ahn, Jeongwoo Son, Junho Park, Donggyu Yoon, Taehyung Kim, Jeemin Kim, Dasom Choi, Kwangyoung Lee, Hyunseung Lim , et al. (29 additional authors not shown)

    Abstract: The rapid evolution of generative AI necessitates robust safety evaluations. However, current safety datasets are predominantly English-centric, failing to capture specific risks in non-English, socio-cultural contexts such as Korean, and are often limited to the text modality. To address this gap, we introduce AssurAI, a new quality-controlled Korean multimodal dataset for evaluating the safety o… ▽ More

    Submitted 20 November, 2025; originally announced November 2025.

    Comments: 16 pages, HuggingFace: https://huggingface.co/datasets/TTA01/AssurAI

  11. arXiv:2511.17634  [pdf, ps, other

    cs.CV

    Efficient Score Pre-computation for Diffusion Models via Cross-Matrix Krylov Projection

    Authors: Kaikwan Lau, Andrew S. Na, Justin W. L. Wan

    Abstract: This paper presents a novel framework to accelerate score-based diffusion models. It first converts the standard stable diffusion model into the Fokker-Planck formulation which results in solving large linear systems for each image. For training involving many images, it can lead to a high computational cost. The core innovation is a cross-matrix Krylov projection method that exploits mathematical… ▽ More

    Submitted 19 November, 2025; originally announced November 2025.

  12. arXiv:2510.24774  [pdf, ps, other

    cs.CY cs.CL

    PANORAMA: A Dataset and Benchmarks Capturing Decision Trails and Rationales in Patent Examination

    Authors: Hyunseung Lim, Sooyohn Nam, Sungmin Na, Ji Yong Cho, June Yong Yang, Hyungyu Shin, Yoonjoo Lee, Juho Kim, Moontae Lee, Hwajung Hong

    Abstract: Patent examination remains an ongoing challenge in the NLP literature even after the advent of large language models (LLMs), as it requires an extensive yet nuanced human judgment on whether a submitted claim meets the statutory standards of novelty and non-obviousness against previously granted claims -- prior art -- in expert domains. Previous NLP studies have approached this challenge as a pred… ▽ More

    Submitted 24 October, 2025; originally announced October 2025.

  13. arXiv:2510.04478  [pdf, ps, other

    math.OC cs.CE cs.DC math.DS math.NA

    Overlapping Schwarz Scheme for Linear-Quadratic Programs in Continuous Time

    Authors: Hongli Zhao, Mihai Anitescu, Sen Na

    Abstract: We present an optimize-then-discretize framework for solving linear-quadratic optimal control problems (OCP) governed by time-inhomogeneous ordinary differential equations (ODEs). Our method employs a modified overlapping Schwarz decomposition based on the Pontryagin Minimum Principle, partitioning the temporal domain into overlapping intervals and independently solving Hamiltonian systems in cont… ▽ More

    Submitted 12 October, 2025; v1 submitted 6 October, 2025; originally announced October 2025.

    Comments: 34 pages, 2 figures

  14. arXiv:2509.01563  [pdf, ps, other

    cs.CV

    Kwai Keye-VL 1.5 Technical Report

    Authors: Biao Yang, Bin Wen, Boyang Ding, Changyi Liu, Chenglong Chu, Chengru Song, Chongling Rao, Chuan Yi, Da Li, Dunju Zang, Fan Yang, Guorui Zhou, Guowang Zhang, Han Shen, Hao Peng, Haojie Ding, Hao Wang, Haonan Fan, Hengrui Ju, Jiaming Huang, Jiangxia Cao, Jiankang Chen, Jingyun Hua, Kaibing Chen, Kaiyu Jiang , et al. (36 additional authors not shown)

    Abstract: In recent years, the development of Large Language Models (LLMs) has significantly advanced, extending their capabilities to multimodal tasks through Multimodal Large Language Models (MLLMs). However, video understanding remains a challenging area due to the dynamic and information-dense nature of videos. Existing models struggle with the trade-off between spatial resolution and temporal coverage… ▽ More

    Submitted 7 September, 2025; v1 submitted 1 September, 2025; originally announced September 2025.

    Comments: Github page: https://github.com/Kwai-Keye/Keye

  15. arXiv:2508.16112  [pdf, ps, other

    cs.AI

    IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra

    Authors: Heewoong Noh, Namkyeong Lee, Gyoung S. Na, Kibum Kim, Chanyoung Park

    Abstract: Spectral analysis provides crucial clues for the elucidation of unknown materials. Among various techniques, infrared spectroscopy (IR) plays an important role in laboratory settings due to its high accessibility and low cost. However, existing approaches often fail to reflect expert analytical processes and lack flexibility in incorporating diverse types of chemical knowledge, which is essential… ▽ More

    Submitted 22 August, 2025; originally announced August 2025.

  16. arXiv:2507.19878  [pdf, ps, other

    cs.CV cs.RO

    Efficient Self-Supervised Neuro-Analytic Visual Servoing for Real-time Quadrotor Control

    Authors: Sebastian Mocanu, Sebastian-Ion Nae, Mihai-Eugen Barbu, Marius Leordeanu

    Abstract: This work introduces a self-supervised neuro-analytical, cost efficient, model for visual-based quadrotor control in which a small 1.7M parameters student ConvNet learns automatically from an analytical teacher, an improved image-based visual servoing (IBVS) controller. Our IBVS system solves numerical instabilities by reducing the classical visual servoing equations and enabling efficient stable… ▽ More

    Submitted 26 July, 2025; originally announced July 2025.

    Comments: Accepted at the International Conference on Computer Vision Workshops 2025

  17. arXiv:2506.13472  [pdf, ps, other

    cs.CL cs.AI

    ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models

    Authors: Junho Yoon, Geom Lee, Donghyeon Jeon, Inho Kang, Seung-Hoon Na

    Abstract: Quantization has been widely studied as an effective technique for reducing the memory requirement of large language models (LLMs), potentially improving the latency time as well. Utilizing the characteristic of rotational invariance of transformer, we propose the rotation-based saliency-aware weight quantization (ROSAQ), which identifies salient channels in the projection feature space, not in th… ▽ More

    Submitted 17 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

    Comments: 10 pages, 2 figures

  18. arXiv:2505.18327  [pdf, ps, other

    stat.ML cs.LG math.NA math.OC math.ST stat.CO

    Online Statistical Inference of Constrained Stochastic Optimization via Random Scaling

    Authors: Xinchen Du, Wanrong Zhu, Wei Biao Wu, Sen Na

    Abstract: Constrained stochastic nonlinear optimization problems have attracted significant attention for their ability to model complex real-world scenarios in physics, economics, and biology. As datasets continue to grow, online inference methods have become crucial for enabling real-time decision-making without the need to store historical data. In this work, we develop an online inference procedure for… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 43 pages, 1 figure, 8 tables

  19. arXiv:2505.11738  [pdf

    cs.AI

    Automated Real-time Assessment of Intracranial Hemorrhage Detection AI Using an Ensembled Monitoring Model (EMM)

    Authors: Zhongnan Fang, Andrew Johnston, Lina Cheuy, Hye Sun Na, Magdalini Paschali, Camila Gonzalez, Bonnie A. Armstrong, Arogya Koirala, Derrick Laurel, Andrew Walker Campion, Michael Iv, Akshay S. Chaudhari, David B. Larson

    Abstract: Artificial intelligence (AI) tools for radiology are commonly unmonitored once deployed. The lack of real-time case-by-case assessments of AI prediction confidence requires users to independently distinguish between trustworthy and unreliable AI predictions, which increases cognitive burden, reduces productivity, and potentially leads to misdiagnoses. To address these challenges, we introduce Ense… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  20. arXiv:2503.19091  [pdf, ps, other

    math.OC cs.CC cs.LG math.NA stat.ML

    High Probability Complexity Bounds of Trust-Region Stochastic Sequential Quadratic Programming with Heavy-Tailed Noise

    Authors: Yuchen Fang, Javad Lavaei, Sen Na

    Abstract: In this paper, we consider nonlinear optimization problems with a stochastic objective and deterministic equality constraints. We propose a Trust-Region Stochastic Sequential Quadratic Programming (TR-SSQP) method and establish its high-probability iteration complexity bounds for identifying first- and second-order $ε$-stationary points. In our algorithm, we assume that exact objective values, gra… ▽ More

    Submitted 1 April, 2026; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: 66 pages, 7 figures

  21. arXiv:2502.11101  [pdf, other

    cs.CL cs.AI

    CacheFocus: Dynamic Cache Re-Positioning for Efficient Retrieval-Augmented Generation

    Authors: Kun-Hui Lee, Eunhwan Park, Donghoon Han, Seung-Hoon Na

    Abstract: Large Language Models (LLMs) excel across a variety of language tasks yet are constrained by limited input lengths and high computational costs. Existing approaches\textemdash such as relative positional encodings (e.g., RoPE, ALiBi) and sliding window mechanisms\textemdash partially alleviate these issues but often require additional training or suffer from performance degradation with longer inp… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: 11 pages (Work in progress)

  22. arXiv:2502.07221  [pdf, ps, other

    cs.CV

    HOMIE: Histopathology Omni-modal Embedding for Pathology Composed Retrieval

    Authors: Qifeng Zhou, Wenliang Zhong, Thao M. Dang, Hehuan Ma, Saiyang Na, Yuzhi Guo, Junzhou Huang

    Abstract: The integration of Artificial Intelligence (AI) into pathology faces a fundamental challenge: black-box predictive models lack transparency, while generative approaches risk clinical hallucination. A case-based retrieval paradigm offers a more interpretable alternative for clinical adoption. However, current SOTA models are constrained by dual-encoder architectures that cannot process the composed… ▽ More

    Submitted 21 December, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  23. arXiv:2502.07114  [pdf, other

    stat.ML cs.LG math.NA math.OC stat.CO

    Online Covariance Matrix Estimation in Sketched Newton Methods

    Authors: Wei Kuang, Mihai Anitescu, Sen Na

    Abstract: Given the ubiquity of streaming data, online algorithms have been widely used for parameter estimation, with second-order methods particularly standing out for their efficiency and robustness. In this paper, we study an online sketched Newton method that leverages a randomized sketching technique to perform an approximate Newton step in each iteration, thereby eliminating the computational bottlen… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: 52 pages, 2 figures, 7 tables

  24. arXiv:2502.05360  [pdf, ps, other

    cs.LG math.OC stat.ML

    Curse of Dimensionality in Neural Network Optimization

    Authors: Sanghoon Na, Haizhao Yang

    Abstract: This paper demonstrates that when a shallow neural network with a Lipschitz continuous activation function is trained using either empirical or population risk to approximate a target function that is $r$ times continuously differentiable on $[0,1]^d$, the population risk may not decay at a rate faster than $t^{-\frac{4r}{d-2r}}$, where $t$ denotes the time parameter of the gradient flow dynamics.… ▽ More

    Submitted 5 March, 2026; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted for publication in Information and Inference: A Journal of the IMA. 32 pages, 1 figure

  25. arXiv:2502.05305  [pdf, ps, other

    stat.ML cs.LG math.OC

    Online Covariance Estimation in Nonsmooth Stochastic Approximation

    Authors: Liwei Jiang, Abhishek Roy, Krishna Balasubramanian, Damek Davis, Dmitriy Drusvyatskiy, Sen Na

    Abstract: We consider applying stochastic approximation (SA) methods to solve nonsmooth variational inclusion problems. Existing studies have shown that the averaged iterates of SA methods exhibit asymptotic normality, with an optimal limiting covariance matrix in the local minimax sense of Hájek and Le Cam. However, no methods have been proposed to estimate this covariance matrix in a nonsmooth and potenti… ▽ More

    Submitted 11 August, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: 46 pages, 1 figure; Accepted at the 38th Annual Conference on Learning Theory (COLT 2025)

  26. arXiv:2412.02957  [pdf, ps, other

    cs.LG cs.AI

    3D Interaction Geometric Pre-training for Molecular Relational Learning

    Authors: Namkyeong Lee, Yunhak Oh, Heewoong Noh, Gyoung S. Na, Minkai Xu, Hanchen Wang, Tianfan Fu, Chanyoung Park

    Abstract: Molecular Relational Learning (MRL) is a rapidly growing field that focuses on understanding the interaction dynamics between molecules, which is crucial for applications ranging from catalyst engineering to drug discovery. Despite recent progress, earlier MRL approaches are limited to using only the 2D topological structure of molecules, as obtaining the 3D interaction geometry remains prohibitiv… ▽ More

    Submitted 30 September, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

  27. arXiv:2410.21341  [pdf, ps, other

    cs.LG cs.AI

    Retrieval-Retro: Retrieval-based Inorganic Retrosynthesis with Expert Knowledge

    Authors: Heewoong Noh, Namkyeong Lee, Gyoung S. Na, Chanyoung Park

    Abstract: While inorganic retrosynthesis planning is essential in the field of chemical science, the application of machine learning in this area has been notably less explored compared to organic retrosynthesis planning. In this paper, we propose Retrieval-Retro for inorganic retrosynthesis planning, which implicitly extracts the precursor information of reference materials that are retrieved from the know… ▽ More

    Submitted 12 October, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  28. arXiv:2410.14569  [pdf, other

    cs.CR cs.AI

    When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs

    Authors: Hanna Kim, Minkyoo Song, Seung Ho Na, Seungwon Shin, Kimin Lee

    Abstract: Recent advancements in Large Language Models (LLMs) have established them as agentic systems capable of planning and interacting with various tools. These LLM agents are often paired with web-based tools, enabling access to diverse sources and real-time information. Although these advancements offer significant benefits across various applications, they also increase the risk of malicious use, par… ▽ More

    Submitted 3 February, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: 20 pages, To appear in Usenix Security 2025

  29. arXiv:2409.15734  [pdf, other

    math.OC cs.LG math.NA stat.CO stat.ML

    Trust-Region Sequential Quadratic Programming for Stochastic Optimization with Random Models

    Authors: Yuchen Fang, Sen Na, Michael W. Mahoney, Mladen Kolar

    Abstract: In this work, we consider solving optimization problems with a stochastic objective and deterministic equality constraints. We propose a Trust-Region Sequential Quadratic Programming method to find both first- and second-order stationary points. Our method utilizes a random model to represent the objective function, which is constructed from stochastic observations of the objective and is designed… ▽ More

    Submitted 26 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 41 pages, 3 figures

  30. arXiv:2409.14119  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm

    Authors: Jaehan Kim, Minkyoo Song, Seung Ho Na, Seungwon Shin

    Abstract: Parameter-efficient fine-tuning (PEFT) has become a key training strategy for large language models. However, its reliance on fewer trainable parameters poses security risks, such as task-agnostic backdoors. Despite their severe impact on a wide range of tasks, there is no practical defense solution available that effectively counters task-agnostic backdoors within the context of PEFT. In this stu… ▽ More

    Submitted 6 October, 2024; v1 submitted 21 September, 2024; originally announced September 2024.

    Comments: Under Review

  31. arXiv:2409.10777  [pdf, other

    cs.LG math.NA

    Physics-Informed Neural Networks with Trust-Region Sequential Quadratic Programming

    Authors: Xiaoran Cheng, Sen Na

    Abstract: Physics-Informed Neural Networks (PINNs) represent a significant advancement in Scientific Machine Learning (SciML), which integrate physical domain knowledge into an empirical loss function as soft constraints and apply existing machine learning methods to train the model. However, recent research has noted that PINNs may fail to learn relatively complex Partial Differential Equations (PDEs). Thi… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 20 pages, 9 figures, 3 tables

  32. arXiv:2405.20216  [pdf, other

    cs.CV cs.AI cs.LG

    Boost Your Human Image Generation Model via Direct Preference Optimization

    Authors: Sanghyeon Na, Yonggyu Kim, Hyunjoon Lee

    Abstract: Human image generation is a key focus in image synthesis due to its broad applications, but even slight inaccuracies in anatomy, pose, or details can compromise realism. To address these challenges, we explore Direct Preference Optimization (DPO), which trains models to generate preferred (winning) images while diverging from non-preferred (losing) ones. However, conventional DPO methods use gener… ▽ More

    Submitted 9 April, 2025; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2025 as a highlight paper

  33. arXiv:2404.08131  [pdf, other

    cs.LG cs.IT stat.ML

    Frame Quantization of Neural Networks

    Authors: Wojciech Czaja, Sanghoon Na

    Abstract: We present a post-training quantization algorithm with error estimates relying on ideas originating from frame theory. Specifically, we use first-order Sigma-Delta ($ΣΔ$) quantization for finite unit-norm tight frames to quantize weight matrices and biases in a neural network. In our scenario, we derive an error bound between the original neural network and the quantized neural network in terms of… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 20 pages, 2 figures

  34. arXiv:2404.06661  [pdf, other

    cs.CV

    Efficient Denoising using Score Embedding in Score-based Diffusion Models

    Authors: Andrew S. Na, William Gao, Justin W. L. Wan

    Abstract: It is well known that training a denoising score-based diffusion models requires tens of thousands of epochs and a substantial number of image data to train the model. In this paper, we propose to increase the efficiency in training score-based diffusion models. Our method allows us to decrease the number of epochs needed to train the diffusion model. We accomplish this by solving the log-density… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  35. arXiv:2403.17833  [pdf, other

    cs.LG cs.DC

    GPFL: A Gradient Projection-Based Client Selection Framework for Efficient Federated Learning

    Authors: Shijie Na, Yuzhi Liang, Siu-Ming Yiu

    Abstract: Federated learning client selection is crucial for determining participant clients while balancing model accuracy and communication efficiency. Existing methods have limitations in handling data heterogeneity, computational burdens, and independent client treatment. To address these challenges, we propose GPFL, which measures client value by comparing local and global descent directions. We also e… ▽ More

    Submitted 26 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  36. arXiv:2401.13220  [pdf, other

    eess.IV cs.CV

    Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation

    Authors: Saiyang Na, Yuzhi Guo, Feng Jiang, Hehuan Ma, Junzhou Huang

    Abstract: In the rapidly evolving field of AI research, foundational models like BERT and GPT have significantly advanced language and vision tasks. The advent of pretrain-prompting models such as ChatGPT and Segmentation Anything Model (SAM) has further revolutionized image segmentation. However, their applications in specialized areas, particularly in nuclei segmentation within medical imaging, reveal a k… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  37. arXiv:2312.13289  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Compositional Representation of Polymorphic Crystalline Materials

    Authors: Namkyeong Lee, Heewoong Noh, Gyoung S. Na, Jimeng Sun, Tianfan Fu, Marinka Zitnik, Chanyoung Park

    Abstract: Machine learning (ML) has seen promising developments in materials science, yet its efficacy largely depends on detailed crystal structural data, which are often complex and hard to obtain, limiting their applicability in real-world material synthesis processes. An alternative, using compositional descriptors, offers a simpler approach by indicating the elemental ratios of compounds without detail… ▽ More

    Submitted 7 December, 2024; v1 submitted 17 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 AI4Science Workshop

  38. arXiv:2311.12856  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG

    Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer

    Authors: Namkyeong Lee, Heewoong Noh, Sungwon Kim, Dongmin Hyun, Gyoung S. Na, Chanyoung Park

    Abstract: The density of states (DOS) is a spectral property of crystalline materials, which provides fundamental insights into various characteristics of the materials. While previous works mainly focus on obtaining high-quality representations of crystalline materials for DOS prediction, we focus on predicting the DOS from the obtained representations by reflecting the nature of DOS: DOS determines the ge… ▽ More

    Submitted 22 November, 2023; v1 submitted 24 October, 2023; originally announced November 2023.

    Comments: NeurIPS 2023. arXiv admin note: text overlap with arXiv:2303.07000

  39. MFIM: Megapixel Facial Identity Manipulation

    Authors: Sanghyeon Na

    Abstract: Face swapping is a task that changes a facial identity of a given image to that of another person. In this work, we propose a novel face-swapping framework called Megapixel Facial Identity Manipulation (MFIM). The face-swapping model should achieve two goals. First, it should be able to generate a high-quality image. We argue that a model which is proficient in generating a megapixel image can ach… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: ECCV 2022 accepted

  40. arXiv:2305.18451  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.MN

    Shift-Robust Molecular Relational Learning with Causal Substructure

    Authors: Namkyeong Lee, Kanghoon Yoon, Gyoung S. Na, Sein Kim, Chanyoung Park

    Abstract: Recently, molecular relational learning, whose goal is to predict the interaction behavior between molecular pairs, got a surge of interest in molecular sciences due to its wide range of applications. In this work, we propose CMRL that is robust to the distributional shift in molecular relational learning by detecting the core substructure that is causally related to chemical reactions. To do so,… ▽ More

    Submitted 20 July, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: KDD 2023

  41. arXiv:2305.18379  [pdf, other

    math.OC cs.LG math.NA stat.ML

    Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching

    Authors: Ilgee Hong, Sen Na, Michael W. Mahoney, Mladen Kolar

    Abstract: We consider solving equality-constrained nonlinear, nonconvex optimization problems. This class of problems appears widely in a variety of applications in machine learning and engineering, ranging from constrained deep neural networks, to optimal control, to PDE-constrained optimization. We develop an adaptive inexact Newton method for this problem class. In each iteration, we solve the Lagrangian… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: 25 pages, 4 figures

    Journal ref: ICML 2023

  42. arXiv:2305.01520  [pdf, other

    q-bio.MN cs.LG

    Conditional Graph Information Bottleneck for Molecular Relational Learning

    Authors: Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park

    Abstract: Molecular relational learning, whose goal is to learn the interaction behavior between molecular pairs, got a surge of interest in molecular sciences due to its wide range of applications. Recently, graph neural networks have recently shown great success in molecular relational learning by modeling a molecule as a graph structure, and considering atom-level interactions between two molecules. Desp… ▽ More

    Submitted 9 July, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: ICML 2023

  43. arXiv:2303.07000  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.comp-ph

    Predicting Density of States via Multi-modal Transformer

    Authors: Namkyeong Lee, Heewoong Noh, Sungwon Kim, Dongmin Hyun, Gyoung S. Na, Chanyoung Park

    Abstract: The density of states (DOS) is a spectral property of materials, which provides fundamental insights on various characteristics of materials. In this paper, we propose a model to predict the DOS by reflecting the nature of DOS: DOS determines the general distribution of states as a function of energy. Specifically, we integrate the heterogeneous information obtained from the crystal structure and… ▽ More

    Submitted 10 April, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 Workshop on Machine Learning for Materials (ML4Materials)

  44. arXiv:2205.13687  [pdf, other

    math.OC cs.LG stat.ML

    Statistical Inference of Constrained Stochastic Optimization via Sketched Sequential Quadratic Programming

    Authors: Sen Na, Michael W. Mahoney

    Abstract: We consider online statistical inference of constrained stochastic nonlinear optimization problems. We apply the Stochastic Sequential Quadratic Programming (StoSQP) method to solve these problems, which can be regarded as applying second-order Newton's method to the Karush-Kuhn-Tucker (KKT) conditions. In each iteration, the StoSQP method computes the Newton direction by solving a quadratic progr… ▽ More

    Submitted 17 February, 2025; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 72 pages, 2 figures, 11 tables

  45. arXiv:2204.09266  [pdf, other

    math.OC cs.LG stat.ML

    Hessian Averaging in Stochastic Newton Methods Achieves Superlinear Convergence

    Authors: Sen Na, Michał Dereziński, Michael W. Mahoney

    Abstract: We consider minimizing a smooth and strongly convex objective function using a stochastic Newton method. At each iteration, the algorithm is given an oracle access to a stochastic estimate of the Hessian matrix. The oracle model includes popular algorithms such as Subsampled Newton and Newton Sketch. Despite using second-order information, these existing methods do not exhibit superlinear converge… ▽ More

    Submitted 28 November, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: 43 pages, 16 figures

  46. arXiv:2204.08922  [pdf, other

    cs.CL cs.AI cs.LG

    Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring

    Authors: Hee-Jun Jung, Doyeon Kim, Seung-Hoon Na, Kangil Kim

    Abstract: Knowledge distillation is an approach to transfer information on representations from a teacher to a student by reducing their difference. A challenge of this approach is to reduce the flexibility of the student's representations inducing inaccurate learning of the teacher's knowledge. To resolve it in transferring, we investigate distillation of structures of representations specified to three ty… ▽ More

    Submitted 27 February, 2023; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: This work has been submitted to the ELSEVIER for possible publication

  47. arXiv:2204.01466  [pdf, other

    cond-mat.dis-nn cond-mat.mtrl-sci cs.LG

    A single Long Short-Term Memory network for enhancing the prediction of path-dependent plasticity with material heterogeneity and anisotropy

    Authors: Ehsan Motevali Haghighi, SeonHong Na

    Abstract: This study presents the applicability of conventional deep recurrent neural networks (RNN) to predict path-dependent plasticity associated with material heterogeneity and anisotropy. Although the architecture of RNN possesses inductive biases toward information over time, it is still challenging to learn the path-dependent material behavior as a function of the loading path considering the change… ▽ More

    Submitted 4 April, 2022; v1 submitted 28 March, 2022; originally announced April 2022.

  48. arXiv:2202.01141  [pdf, other

    cs.RO cs.LG

    Federated Reinforcement Learning for Collective Navigation of Robotic Swarms

    Authors: Seongin Na, Tomáš Rouček, Jiří Ulrich, Jan Pikman, Tomáš Krajník, Barry Lennox, Farshad Arvin

    Abstract: The recent advancement of Deep Reinforcement Learning (DRL) contributed to robotics by allowing automatic controller design. The automatic controller design is a crucial approach for designing swarm robotic systems, which require more complex controllers than a single robot system to lead a desired collective behaviour. Although the DRL-based controller design method showed its effectiveness, the… ▽ More

    Submitted 11 September, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 10 pages, 8 figures, submitted to IEEE Transactions on Cognitive and Developmental Systems Journal (R1)

  49. arXiv:2109.11502  [pdf, other

    math.OC cs.LG math.NA stat.ML

    Inequality Constrained Stochastic Nonlinear Optimization via Active-Set Sequential Quadratic Programming

    Authors: Sen Na, Mihai Anitescu, Mladen Kolar

    Abstract: We study nonlinear optimization problems with a stochastic objective and deterministic equality and inequality constraints, which emerge in numerous applications including finance, manufacturing, power systems and, recently, deep neural networks. We propose an active-set stochastic sequential quadratic programming (StoSQP) algorithm that utilizes a differentiable exact augmented Lagrangian as the… ▽ More

    Submitted 30 January, 2023; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: 65 pages, 9 figures

  50. arXiv:2109.08022  [pdf, other

    cs.SI cs.AI cs.CY

    Meta-Path-based Fake News Detection Leveraging Multi-level Social Context Information

    Authors: Jian Cui, Kwanwoo Kim, Seung Ho Na, Seungwon Shin

    Abstract: Fake news, false or misleading information presented as news, has a significant impact on many aspects of society, such as in politics or healthcare domains. Due to the deceiving nature of fake news, applying Natural Language Processing (NLP) techniques to the news content alone is insufficient. The multi-level social context information (news publishers and engaged users in social media) and temp… ▽ More

    Submitted 16 November, 2021; v1 submitted 13 September, 2021; originally announced September 2021.