Skip to main content

Showing 1–21 of 21 results for author: Parasyris, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2512.04355  [pdf, ps, other

    cs.DC cs.AI cs.PF

    Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity

    Authors: Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren

    Abstract: Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and even hardware procurement. Yet despite rapid progress in code generation, today's Large Language Models (LLMs) are rarely tested on this kind of forward-looking reasoning. We close that gap with gpuFL… ▽ More

    Submitted 3 December, 2025; originally announced December 2025.

    Comments: 13 pages, 6 figures, MLSys 2026 Submission

  2. arXiv:2511.05626  [pdf, ps, other

    cs.SE cs.AI cs.DC

    LLMs as Packagers of HPC Software

    Authors: Caetano Melone, Daniel Nichols, Konstantinos Parasyris, Todd Gamblin, Harshitha Menon

    Abstract: High performance computing (HPC) software ecosystems are inherently heterogeneous, comprising scientific applications that depend on hundreds of external packages, each with distinct build systems, options, and dependency constraints. Tools such as Spack automate dependency resolution and environment management, but their effectiveness relies on manually written build recipes. As these ecosystems… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

  3. arXiv:2510.17158  [pdf, ps, other

    cs.DC cs.PF cs.SE

    Integrating Performance Tools in Model Reasoning for GPU Kernel Optimization

    Authors: Daniel Nichols, Konstantinos Parasyris, Charles Jekel, Abhinav Bhatele, Harshitha Menon

    Abstract: Language models are now prevalent in software engineering with many developers using them to automate tasks and accelerate their development. While language models have been tremendous at accomplishing complex software engineering tasks, there are still many areas where they fail to deliver desirable results, for instance code performance related tasks. Tasks like optimization depend on many compl… ▽ More

    Submitted 20 October, 2025; originally announced October 2025.

  4. arXiv:2507.11467  [pdf, ps, other

    cs.AI cs.SE

    Modeling Code: Is Text All You Need?

    Authors: Daniel Nichols, Konstantinos Parasyris, Harshitha Menon, Brian R. Bartoldson, Giorgis Georgakoudis, Tal Ben-Nun, Abhinav Bhatele

    Abstract: Code LLMs have become extremely popular recently for modeling source code across a variety of tasks, such as generation, translation, and summarization. However, transformer-based models are limited in their capabilities to reason through structured, analytical properties of code, such as control and data flow. Previous work has explored the modeling of these properties with structured data and gr… ▽ More

    Submitted 15 July, 2025; originally announced July 2025.

  5. arXiv:2505.08135  [pdf, ps, other

    cs.SE cs.AI cs.DC cs.PF

    Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions

    Authors: Keita Teranishi, Harshitha Menon, William F. Godoy, Prasanna Balaprakash, David Bau, Tal Ben-Nun, Abhinav Bhatele, Franz Franchetti, Michael Franusich, Todd Gamblin, Giorgis Georgakoudis, Tom Goldstein, Arjun Guha, Steven Hahn, Costin Iancu, Zheming Jin, Terry Jones, Tze Meng Low, Het Mankad, Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Daniel Nichols, Konstantinos Parasyris, Swaroop Pophale, Pedro Valero-Lara , et al. (3 additional authors not shown)

    Abstract: We discuss the challenges and propose research directions for using AI to revolutionize the development of high-performance computing (HPC) software. AI technologies, in particular large language models, have transformed every aspect of software development. For its part, HPC software is recognized as a highly specialized scientific field of its own. We discuss the challenges associated with lever… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 12 pages, 1 Figure, Accepted at "The 1st International Workshop on Foundational Large Language Models Advances for HPC" LLM4HPC to be held in conjunction with ISC High Performance 2025

    Journal ref: In: Neuwirth, S., Paul, A.K., Weinzierl, T., Carson, E.C. (eds) High Performance Computing. ISC High Performance 2025. Lecture Notes in Computer Science, vol 16091. Springer, Cham

  6. arXiv:2505.03988  [pdf, other

    cs.DC cs.AI cs.PF

    Can Large Language Models Predict Parallel Code Performance?

    Authors: Gregory Bolet, Giorgis Georgakoudis, Harshitha Menon, Konstantinos Parasyris, Niranjan Hasabnis, Hayden Estes, Kirk W. Cameron, Gal Oren

    Abstract: Accurate determination of the performance of parallel GPU code typically requires execution-time profiling on target hardware -- an increasingly prohibitive step due to limited access to high-end GPUs. This paper explores whether Large Language Models (LLMs) can offer an alternative approach for GPU performance prediction without relying on hardware. We frame the problem as a roofline classificati… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 5 pages, 4 figures, accepted to AI4Sys Workshop at HPDC 2025

  7. arXiv:2410.09191  [pdf, other

    cs.SE cs.PF cs.PL

    Testing the Unknown: A Framework for OpenMP Testing via Random Program Generation

    Authors: Ignacio Laguna, Patrick Chapman, Konstantinos Parasyris, Giorgis Georgakoudis, Cindy Rubio-González

    Abstract: We present a randomized differential testing approach to test OpenMP implementations. In contrast to previous work that manually creates dozens of verification and validation tests, our approach is able to randomly generate thousands of tests, exposing OpenMP implementations to a wide range of program behaviors. We represent the space of possible random OpenMP tests using a grammar and implement o… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  8. arXiv:2407.18352  [pdf, other

    cs.DC

    HPAC-ML: A Programming Model for Embedding ML Surrogates in Scientific Applications

    Authors: Zane Fink, Konstantinos Parasyris, Praneet Rathi, Giorgis Georgakoudis, Harshitha Menon, Peer-Timo Bremer

    Abstract: Recent advancements in Machine Learning (ML) have substantially improved its predictive and computational abilities, offering promising opportunities for surrogate modeling in scientific applications. By accurately approximating complex functions with low computational cost, ML-based surrogates can accelerate scientific applications by replacing computationally intensive components with faster mod… ▽ More

    Submitted 26 August, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 16 pages, 9 figures. Accepted at SC24

  9. arXiv:2404.09349  [pdf, other

    cs.LG cs.CR cs.CV

    Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies

    Authors: Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura

    Abstract: This paper revisits the simple, long-studied, yet still unsolved problem of making image classifiers robust to imperceptible perturbations. Taking CIFAR10 as an example, SOTA clean accuracy is about $100$%, but SOTA robustness to $\ell_{\infty}$-norm bounded perturbations barely exceeds $70$%. To understand this gap, we analyze how model size, dataset size, and synthetic data quality affect robust… ▽ More

    Submitted 10 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ICML 2024

  10. Taking GPU Programming Models to Task for Performance Portability

    Authors: Joshua H. Davis, Pranav Sivaraman, Joy Kitson, Konstantinos Parasyris, Harshitha Menon, Isaac Minn, Giorgis Georgakoudis, Abhinav Bhatele

    Abstract: Portability is critical to ensuring high productivity in developing and maintaining scientific software as the diversity in on-node hardware architectures increases. While several programming models provide portability for diverse GPU systems, they don't make any guarantees about performance portability. In this work, we explore several programming models -- CUDA, HIP, Kokkos, RAJA, OpenMP, OpenAC… ▽ More

    Submitted 4 September, 2025; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 16 pages, 5 figures

  11. arXiv:2310.02025  [pdf, other

    cs.LG

    DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training

    Authors: Aochuan Chen, Yimeng Zhang, Jinghan Jia, James Diffenderfer, Jiancheng Liu, Konstantinos Parasyris, Yihua Zhang, Zheng Zhang, Bhavya Kailkhura, Sijia Liu

    Abstract: Zeroth-order (ZO) optimization has become a popular technique for solving machine learning (ML) problems when first-order (FO) information is difficult or impossible to obtain. However, the scalability of ZO optimization remains an open problem: Its use has primarily been limited to relatively small-scale ML problems, such as sample-wise adversarial attack generation. To our best knowledge, no pri… ▽ More

    Submitted 15 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR'24. Codes are available at https://github.com/OPTML-Group/DeepZero

  12. arXiv:2309.15432  [pdf, other

    cs.PL

    ComPile: A Large IR Dataset from Production Sources

    Authors: Aiden Grossman, Ludger Paehler, Konstantinos Parasyris, Tal Ben-Nun, Jacob Hegna, William Moses, Jose M Monsalve Diaz, Mircea Trofin, Johannes Doerfert

    Abstract: Code is increasingly becoming a core data modality of modern machine learning research impacting not only the way we write code with conversational agents like OpenAI's ChatGPT, Google's Bard, or Anthropic's Claude, the way we translate code from one language into another, but also the compiler infrastructure underlying the language. While modeling approaches may vary and representations differ, t… ▽ More

    Submitted 30 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  13. arXiv:2308.16877  [pdf, other

    cs.DC

    HPAC-Offload: Accelerating HPC Applications with Portable Approximate Computing on the GPU

    Authors: Zane Fink, Konstantinos Parasyris, Giorgis Georgakoudis, Harshitha Menon

    Abstract: The end of Dennard scaling and the slowdown of Moore's law led to a shift in technology trends toward parallel architectures, particularly in HPC systems. To continue providing performance benefits, HPC should embrace Approximate Computing (AC), which trades application quality loss for improved performance. However, existing AC techniques have not been extensively applied and evaluated in state-o… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 12 pages, 12 pages. Accepted at SC23

  14. arXiv:2303.08873  [pdf, other

    cs.PL cs.DC cs.LG

    Machine Learning-Driven Adaptive OpenMP For Portable Performance on Heterogeneous Systems

    Authors: Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, David Beckingsale, Todd Gamblin, Bronis de Supinski

    Abstract: Heterogeneity has become a mainstream architecture design choice for building High Performance Computing systems. However, heterogeneity poses significant challenges for achieving performance portability of execution. Adapting a program to a new heterogeneous platform is laborious and requires developers to manually explore a vast space of execution parameters. To address those challenges, this pa… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Report number: LLNL-CONF-833682

  15. arXiv:2202.05223  [pdf, other

    cs.SE

    Reliabuild: Searching for High-Fidelity Builds Using Active Learning

    Authors: Harshitha Menon, Konstantinos Parasyris, Tom Scogland, Todd Gamblin

    Abstract: Modern software is incredibly complex. A typical application may comprise hundreds or thousands of reusable components. Automated package managers can help to maintain a consistent set of dependency versions, but ultimately the solvers in these systems rely on constraints generated by humans. At scale, small errors add up, and it becomes increasingly difficult to find high-fidelity configurations.… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

  16. arXiv:2111.13908  [pdf, other

    cs.DC

    Artificial neural networks for online error detection

    Authors: Vassilis Vassiliadis, Konstantinos Parasyris, Christos D. Antonopoulos, Spyros Lalis, Nikolaos Bellas

    Abstract: Hardware reliability is adversely affected by the downscaling of semiconductor devices and the scale-out of systems necessitated by modern applications. Apart from crashes, this unreliability often manifests as silent data corruptions (SDCs), affecting application output. Therefore, we need low-cost and low-human-effort solutions to reduce the incidence rate and the effects of SDCs on the quality… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: 11 pages, 9 figures, originally submitted to Usenix ATC 2018 but paper was not accepted for publication

    ACM Class: C.4

  17. Scrooge Attack: Undervolting ARM Processors for Profit

    Authors: Christian Göttel, Konstantinos Parasyris, Osman Unsal, Pascal Felber, Marcelo Pasin, Valerio Schiavoni

    Abstract: Latest ARM processors are approaching the computational power of x86 architectures while consuming much less energy. Consequently, supply follows demand with Amazon EC2, Equinix Metal and Microsoft Azure offering ARM-based instances, while Oracle Cloud Infrastructure is about to add such support. We expect this trend to continue, with an increasing number of cloud providers offering ARM-based clou… ▽ More

    Submitted 12 May, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: European Commission Project: LEGaTO - Low Energy Toolset for Heterogeneous Computing (EC-H2020-780681)

    Journal ref: 2021 40th International Symposium on Reliable Distributed Systems (SRDS) (2021) 187-197

  18. arXiv:2102.06894  [pdf, other

    cs.DC

    MATCH: An MPI Fault Tolerance Benchmark Suite

    Authors: Luanzheng Guo, Giorgis Georgakoudis, Konstantinos Parasyris, Ignacio Laguna, Dong Li

    Abstract: MPI has been ubiquitously deployed in flagship HPC systems aiming to accelerate distributed scientific applications running on tens of hundreds of processes and compute nodes. Maintaining the correctness and integrity of MPI application execution is critical, especially for safety-critical scientific applications. Therefore, a collection of effective MPI fault tolerance techniques have been propos… ▽ More

    Submitted 13 February, 2021; originally announced February 2021.

    Journal ref: IEEE International Symposium on Workload Characterization (IISWC 2020)

  19. arXiv:1912.01563  [pdf, other

    cs.DC

    LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing

    Authors: B. Salami, K. Parasyris, A. Cristal, O. Unsal, X. Martorell, P. Carpenter, R. De La Cruz, L. Bautista, D. Jimenez, C. Alvarez, S. Nabavi, S. Madonar, M. Pericas, P. Trancoso, M. Abduljabbar, J. Chen, P. N. Soomro, M Manivannan, M. Berge, S. Krupop, F. Klawonn, Al Mekhlafi, S. May, T. Becker, G. Gaydadjiev , et al. (20 additional authors not shown)

    Abstract: The LEGaTO project leverages task-based programming models to provide a software ecosystem for Made in-Europe heterogeneous hardware composed of CPUs, GPUs, FPGAs and dataflow engines. The aim is to attain one order of magnitude energy savings from the edge to the converged cloud/HPC, balanced with the security and resilience challenges. LEGaTO is an ongoing three-year EU H2020 project started in… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 6 pages, 9 figures

  20. arXiv:1912.00154  [pdf, other

    cs.PF cs.AR

    Hardware Versus Software Fault Injection of Modern Undervolted SRAMs

    Authors: Muhammet Abdullah Soyturk, Konstantinos Parasyris, Behzad Salami, Osman Unsal, Gulay Yalcin, Leonardo Bautista Gomez

    Abstract: To improve power efficiency, researchers are experimenting with dynamically adjusting the supply voltage of systems below the nominal operating points. However, production systems are typically not allowed to function on voltage settings that is below the reliable limit. Consequently, existing software fault tolerance studies are based on fault models, which inject faults on random fault locations… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

  21. arXiv:1412.5150  [pdf, other

    cs.PL

    A Programming Model and Runtime System for Significance-Aware Energy-Efficient Computing

    Authors: Vassilis Vassiliadis, Konstantinos Parasyris, Charalambos Chalios, Christos D. Antonopoulos, Spyros Lalis, Nikolaos Bellas, Hans Vandierendonck, Dimitrios S. Nikolopoulos

    Abstract: Reducing energy consumption is one of the key challenges in computing technology. One factor that contributes to high energy consumption is that all parts of the program are considered equally significant for the accuracy of the end-result. However, in many cases, parts of computations can be performed in an approximate way, or even dropped, without affecting the quality of the final output to a s… ▽ More

    Submitted 15 December, 2014; originally announced December 2014.