These are the star and fork statistics for the AlibabaResearch/flash-llm repository. As of 06 May 2024, this repository has 73 stars and 9 forks.
Flash-LLM is a large language model (LLM) inference acceleration library for unstructured model pruning. It mainly contains efficient GPU code based on Tensor-Core-accelerated unstructured sparse matrix multiplication, which can effectively accelerate the common matrix computations in LLMs. With Flash-LLM, pruned LLMs can be deployed onto GPUs with less memory consumption and executed more efficiently. Currently, the code has been evaluated...
repo | techs | stars | stars (weekly) | forks | forks (weekly) |
---|---|---|---|---|---|
ingonyama-zk/cuda-field-expts | Cuda | 10 | 0 | 0 | 0 |
Hamad-Anwar/Task-Sync-Pro-V2 | Dart, C++, CMake | 73 | 0 | 22 | 0 |
RoboMaster/IntelligentUAVChampionshipSimulator | Dockerfile, Shell | 54 | +1 | 14 | 0 |
gsnewmark/dotfiles | Emacs Lisp, Shell, CSS | 7 | 0 | 0 | 0 |
NOAA-EMC/NCEPLIBS-g2 | Fortran, C, CMake | 4 | 0 | 13 | 0 |
SarangKumar/IO-LearnHub | JavaScript, HTML, CSS | 26 | +7 | 9 | 0 |
OutRed/outredgames | HTML, JavaScript, CSS | 83 | 0 | 156 | 0 |
beacon-biosignals/Ray.jl | Julia, C++, Dockerfile | 8 | 0 | 0 | 0 |
jontelang/BigShotJbSnapper3Plugin | Logos, Makefile, Objective-C | 21 | +2 | 1 | 0 |
oauth-wg/oauth-sd-jwt-vc | Makefile | 9 | 0 | 9 | 0 |