harrisonvanderbyl/rwkv-cpp-cuda

A torchless C++ RWKV implementation using 8-bit quantization, written in CUDA/HIP/Vulkan for maximum compatibility and minimum dependencies.

Languages: C++, Other
Star and fork statistics for the harrisonvanderbyl/rwkv-cpp-cuda repository: as of 29 Apr 2024, this repository has 283 stars and 15 forks.

RWKV Cuda

This is a super simple C++/CUDA implementation of RWKV with no PyTorch/libtorch dependencies. Included is a simple example of how to use it in both C++ and Python.

Features
- Direct disk -> GPU loading (practically no RAM needed)
- Uint8 by default
- Incredibly fast
- No dependencies
- Simple to use
- Simple to build
- Optional Python binding using PyTorch tensors as wrappers
- Native tokenizer!
- Windows support!
- HIP (AMD) GPU support!
- Vulkan (all) support!
- Distributable programs! (check actions for the...
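To make the "Uint8 by default" feature concrete, here is a minimal sketch of one common way to store float weights in 8 bits: per-row min/max (affine) quantization. This is an illustration only and is not taken from the repository; the names QuantizedRow, quantize_row and dequantize are hypothetical, and the project's actual quantization scheme may differ.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical container (not the repository's type): one row of weights
// stored as uint8, plus the two floats needed to map it back.
struct QuantizedRow {
    std::vector<std::uint8_t> data;
    float scale;   // float distance covered by one uint8 step
    float offset;  // float value represented by the uint8 value 0
};

// Map each float in the row onto the 0..255 range spanned by the row's
// minimum and maximum.
QuantizedRow quantize_row(const std::vector<float>& row) {
    assert(!row.empty());
    const auto mm = std::minmax_element(row.begin(), row.end());
    const float lo = *mm.first;
    const float hi = *mm.second;
    // A constant row has hi == lo; use a dummy scale to avoid dividing by zero.
    const float scale = (hi > lo) ? (hi - lo) / 255.0f : 1.0f;

    QuantizedRow q{std::vector<std::uint8_t>(row.size()), scale, lo};
    for (std::size_t i = 0; i < row.size(); ++i) {
        float steps = std::round((row[i] - lo) / scale);
        steps = std::min(255.0f, std::max(0.0f, steps));  // guard against rounding overshoot
        q.data[i] = static_cast<std::uint8_t>(steps);
    }
    return q;
}

// Recover an approximation of the original float at index i.
float dequantize(const QuantizedRow& q, std::size_t i) {
    return static_cast<float>(q.data[i]) * q.scale + q.offset;
}
```

With this scheme each weight takes one byte instead of four, and the reconstruction error per element is at most about half of one step (scale / 2). On a GPU the dequantize step would typically be fused into the matrix-vector kernel rather than run separately, but that detail is likewise an assumption here.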