FMInference/FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Languages: Python, Shell. Topics: machine-learning, deep-learning, offloading, high-throughput, opt, gpt-3, large-language-models
Stars and forks stats for the FMInference/FlexGen repository: as of 8 May 2024, it has 8,582 stars and 490 forks.

FlexGen: High-throughput Generative Inference of Large Language Models with a Single GPU [paper]

FlexGen is a high-throughput generation engine for running large language models with limited GPU memory. It achieves high-throughput generation through IO-efficient offloading, compression, and large effective batch sizes.

Motivation

In recent years, large language models (LLMs) have shown great performance across a wide range of tasks. Increasingly, LLMs have been applied not only to interactive applications...
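To make the offloading idea above concrete, here is a minimal, hypothetical PyTorch sketch of layer-wise weight offloading. This is not FlexGen's actual implementation (FlexGen additionally compresses weights and the KV cache and schedules IO to overlap with compute); the function and variable names are made up for illustration:

import torch

def offloaded_forward(layers, x, device="cuda"):
    # Stream CPU-resident layers through the GPU one at a time, so peak GPU
    # memory is bounded by one layer's weights plus the batch's activations.
    for layer in layers:
        layer.to(device)    # host-to-device copy of this layer's weights
        x = layer(x)        # run the (large) batch through the layer on GPU
        layer.to("cpu")     # evict the weights to make room for the next layer
    return x

# Example: eight large linear layers, with a big batch to amortize transfers.
layers = [torch.nn.Linear(4096, 4096) for _ in range(8)]
batch = torch.randn(256, 4096, device="cuda")
out = offloaded_forward(layers, batch)

Large effective batch sizes matter in this scheme because each layer's weights cross the CPU-GPU link once per forward pass, so a bigger batch amortizes that transfer cost. This is why FlexGen targets throughput-oriented scenarios rather than latency-sensitive interactive use.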