- Kentucky
-
Joined
Mar 8, 2026
Popular repositories Loading
-
sm120-moe-bench
sm120-moe-bench PublicSM120 MoE Inference Benchmark: Qwen3.5-397B on RTX PRO 6000 Blackwell — K=64 CUTLASS kernel fix + real-world legal prompt benchmarks
Cuda 3
-
flashinfer
flashinfer PublicForked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Python 1
-
vllm
vllm PublicForked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python 1
-
cutlass
cutlass PublicForked from NVIDIA/cutlass
CUDA Templates and Python DSLs for High-Performance Linear Algebra
C++ 1
-
dflash
dflash PublicForked from z-lab/dflash
DFlash: Block Diffusion for Flash Speculative Decoding
Python
-
claude-code-prompts
claude-code-prompts PublicForked from repowise-dev/claude-code-prompts
Independently authored prompt templates for AI coding agents — system prompts, tool prompts, agent delegation, memory management, and multi-agent coordination. Informed by studying Claude Code.
If the problem persists, check the GitHub status page or contact support.