AlibabaResearch/flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Languages: Cuda, Python, C++, C, Shell, Makefile
This is the stars and forks stats for the AlibabaResearch/flash-llm repository. As of May 6, 2024, this repository has 73 stars and 9 forks.

Flash-LLM
Flash-LLM is a large language model (LLM) inference acceleration library for unstructured model pruning. It consists mainly of efficient GPU kernels built on Tensor-Core-accelerated unstructured sparse matrix multiplication, which effectively speeds up the matrix computations that dominate LLM inference. With Flash-LLM, pruned LLM models can be deployed on GPUs with lower memory consumption and executed more efficiently. Currently, the code has been evaluated...
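For orientation, the core operation being accelerated is an unstructured-sparse weight matrix multiplied by a dense activation matrix (SpMM). The sketch below is a minimal, naive CUDA reference using the CSR format, chosen here purely for illustration; it is not Flash-LLM's Tensor-Core kernel, sparse encoding, or API, and the kernel/variable names are hypothetical.

```cuda
// Naive reference SpMM: C[M x N] = A_sparse[M x K] * B[K x N] (row-major).
// A is stored in CSR; one thread computes one output element.
// Illustrative only -- Flash-LLM's actual kernels use Tensor Cores and its own format.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void csr_spmm(int M, int N,
                         const int* row_ptr, const int* col_idx,
                         const float* vals, const float* B, float* C)
{
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= M || col >= N) return;

    float acc = 0.0f;
    for (int i = row_ptr[row]; i < row_ptr[row + 1]; ++i)
        acc += vals[i] * B[col_idx[i] * N + col];   // skip zero weights entirely
    C[row * N + col] = acc;
}

int main()
{
    // A (2 x 3) with unstructured sparsity, stored in CSR:
    //   [ 1 0 2 ]
    //   [ 0 3 0 ]
    const int M = 2, K = 3, N = 2;
    int   h_row_ptr[] = {0, 2, 3};
    int   h_col_idx[] = {0, 2, 1};
    float h_vals[]    = {1.f, 2.f, 3.f};
    float h_B[]       = {1.f, 2.f,    // B is K x N, row-major
                         3.f, 4.f,
                         5.f, 6.f};
    float h_C[M * N];

    int *d_row_ptr, *d_col_idx;
    float *d_vals, *d_B, *d_C;
    cudaMalloc(&d_row_ptr, sizeof(h_row_ptr));
    cudaMalloc(&d_col_idx, sizeof(h_col_idx));
    cudaMalloc(&d_vals,    sizeof(h_vals));
    cudaMalloc(&d_B,       sizeof(h_B));
    cudaMalloc(&d_C,       sizeof(h_C));
    cudaMemcpy(d_row_ptr, h_row_ptr, sizeof(h_row_ptr), cudaMemcpyHostToDevice);
    cudaMemcpy(d_col_idx, h_col_idx, sizeof(h_col_idx), cudaMemcpyHostToDevice);
    cudaMemcpy(d_vals,    h_vals,    sizeof(h_vals),    cudaMemcpyHostToDevice);
    cudaMemcpy(d_B,       h_B,       sizeof(h_B),       cudaMemcpyHostToDevice);

    dim3 block(16, 16);
    dim3 grid((N + block.x - 1) / block.x, (M + block.y - 1) / block.y);
    csr_spmm<<<grid, block>>>(M, N, d_row_ptr, d_col_idx, d_vals, d_B, d_C);
    cudaMemcpy(h_C, d_C, sizeof(h_C), cudaMemcpyDeviceToHost);

    for (int r = 0; r < M; ++r)         // expected output: 11 14 / 9 12
        printf("%6.1f %6.1f\n", h_C[r * N + 0], h_C[r * N + 1]);
    return 0;
}
```

The point of the sketch is only the data layout: storing nonzeros plus indices lets the pruned weight matrix occupy memory proportional to its nonzero count, which is why pruned models fit in less GPU memory; the library's contribution is doing this computation efficiently on Tensor Cores rather than with a naive per-element kernel.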