brandonmmusic-max

Brandon M. Music brandonmmusic-max

I am practicing lawyer from Kentucky who took an interest in ai systems engineering. I've been coding in some form for 30 years .

Achievements

sm120-moe-bench sm120-moe-bench Public

SM120 MoE Inference Benchmark: Qwen3.5-397B on RTX PRO 6000 Blackwell — K=64 CUTLASS kernel fix + real-world legal prompt benchmarks

Cuda 3
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 1
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 1
cutlass cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 1
dflash dflash Public

Forked from z-lab/dflash

DFlash: Block Diffusion for Flash Speculative Decoding

Python
claude-code-prompts claude-code-prompts Public

Forked from repowise-dev/claude-code-prompts

Independently authored prompt templates for AI coding agents — system prompts, tool prompts, agent delegation, memory management, and multi-agent coordination. Informed by studying Claude Code.