Projects

GitHub

fastcv

fastcv is a C++/CUDA rewrite of the image filters in the OpenCV library, with PyTorch bindings.

GitHub

InferGPT

A high-performance C/C++ inference engine for GPT-based architectures that runs on CPU.

GitHub

Triton GPT-2

An implementation of the GPT-2 architecture in Triton.

GitHub

MNIST CUDA

A CUDA implementation of a simple feedforward neural network, benchmarked against libraries like PyTorch.

GitHub

Advanced ML

A collection of advanced machine learning topics implemented from scratch.