A small C++ library for easy explicit compile time vectorization. WIP.
-
Updated
Feb 27, 2026 - C++
8000
A small C++ library for easy explicit compile time vectorization. WIP.
An optimized fork of hnswlib focusing on AVX-512 SIMD, cache-aware memory layout, and OpenMP threading on AMD Zen 5.
Mandelbrot set calculation using OpenMP vectorization. School project. Tested on barbora.it4i.cz, batch calculator needs a fix.
Extremely hard, multi-turn, open-source-grounded coding evaluations that reliably break every current frontier models (Claude, GPT, Grok, Gemini, Llama, etc.) on numerical stability, zero-allocation, autograd, SIMD, and long-chain correctness.
OnnxRT based Inference Optimization of Roberta model trained for Sentiment Analysis On Twitter Dataset
Implementation of bignum using avx512 extensions
KK: A novel cryptographic primitive. One permutation. Everything from scratch
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Battleships opponent and compute experiments, with AVX2 / AVX-512
Applications of High Performance Computing
Provide ultra-low latency decision support and risk analysis for prediction market making using SIMD-accelerated logit-space computations.
Calculate Sum of Absolute Difference (SAD) by AVX-512
Zero-dependency C++ cryptographic hash library (SHA-2, SHA-3, HMAC, PBKDF2) with SIMD backends and runtime CPU dispatch
Automatically select the optimal implementation at program startup, ts and dl safe options are available.
Rewrite of a personal project from back in December 2023.
A modern C++20 implementation utilizing AVX-512 and Osvik's optimized Boolean circuits to achieve massive throughput through bitslicing.
Add a description, image, and links to the avx512 topic page so that developers can more easily learn about it.
To associate your repository with the avx512 topic, visit your repo's landing page and select "manage topics."