Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For me it was able to try out different architectures for perf improvement, then once it's settled on some good architectures, it can do lower level optimizations on them by profiling the code etc.
 help



That's great. It seems like a large body of experts are having real problem with this, so maybe you should publish something about your methods, or start a business...

I can't vouch for whether or not it can beat human experts though because I'm no CUDA expert myself. The original CUDA code were human written and I first let codex adapt it to my specific use case. Then I basically let codex generate ideas and try the ideas out itself (I think it's a bit like Karpathy's autoresearch, except I was still doing manual prompting). And that was enough to get me 20x improvement.

I suspect when people said AI wrote non performant CUDA kernels it was beginning-mid last year and it's definitely vastly improved since back then. And the agent's ability to iteratively improve really impressed me.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: