Showing 1–1 of 1 results for author: Petrovych, A

  1. arXiv:2603.26595

    cs.LG hep-ex

    PQuantML: A Tool for End-to-End Hardware-aware Model Compression

    Authors: Roope Niemi, Anastasiia Petrovych, Arghya Ranjan Das, Enrico Lupi, Chang Sun, Dimitrios Danopoulos, Marlon Joshua Helbing, Mia Liu, Sebastian Dittmeier, Michael Kagan, Vladimir Loncar, Maurizio Pierini

    Abstract: PQuantML is a new open-source, hardware-aware neural network model compression library tailored to end-to-end workflows. Motivated by the need to deploy performant models to environments with strict latency constraints, PQuantML simplifies training of compressed models by providing a unified interface to apply pruning and quantization, either jointly or individually. The library implements multipl…

    Submitted 27 March, 2026; originally announced March 2026.