These are the star and fork statistics for the FMInference/FlexGen repository. As of 08 May 2024, this repository has 8582 stars and 490 forks.
FlexGen: High-throughput Generative Inference of Large Language Models with a Single GPU [paper]

FlexGen is a high-throughput generation engine for running large language models with limited GPU memory. FlexGen allows high-throughput generation by IO-efficient offloading, compression, and large effective batch sizes.

Motivation

In recent years, large language models (LLMs) have shown great performance across a wide range of tasks. Increasingly, LLMs have been applied not only to interactive applications...
repo | techs | stars | stars (weekly) | forks | forks (weekly) |
---|---|---|---|---|---|
miguelgrinberg/microdot | Python, Other | 803 | 0 | 87 | 0 |
mobarski/ask-my-pdf | Python, Other | 487 | 0 | 183 | 0 |
Sjb4243/SRP | Jupyter Notebook, R, Shell | 0 | 0 | 9 | 0 |
Bywalks/DarkAngel | Ruby, Python, Shell | 509 | 0 | 66 | 0 |
chrisbra/Recover.vim | Vim Script, Python, Makefile | 240 | 0 | 25 | 0 |
surajaifly/winter-23 | Apex, JavaScript, HTML | 0 | 0 | 5 | 0 |
mortbopet/Ripes | C++, Assembly, CMake | 2.1k | +12 | 249 | 0 |
furkanagess/flutter_base_project | CMake, Dart, C++ | 42 | +1 | 3 | 0 |
iNeuronai/flask_app_a | CSS, Python, HTML | 12 | 0 | 32 | 0 |
daluobai-devops/jenkins-shared-library | Groovy, Shell | 35 | 0 | 3 | 0 |