fastcv is a C++ CUDA rewrite with Pytorch bindings of the image filters in the OpenCV library.
A high performance C/C++ inference engine for GPT based architectures that runs on CPU.
Implementing GPT-2 architecture in triton.
A CUDA implementation of a simple feedforward neural network and benchmark against libraries like pytorch.
A collection of advanced machine learning topics implemented from scratch.