hpcaitech/EnergonAI

Large-scale model inference.

PythonC++CudaOther
This is stars and forks stats for /hpcaitech/EnergonAI repository. As of 03 May, 2024 this repository has 624 stars and 92 forks.

Energon-AI A service framework for large-scale model inference, Energon-AI has the following characteristics: Parallelism for Large-scale Models: With tensor parallel operations, pipeline parallel wrapper, distributed checkpoint loading, and customized CUDA kernel, EnergonAI can enable efficient parallel inference for larges-scale models. Pre-built large models: There are pre-built implementation for popular models, such as OPT. It supports the cache technique for the generation task and distributed...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
lucidrains/lion-pytorchPython1.7k0450
WikiEducationFoundation/WikiEduDashboardRubyJavaScriptHaml35705170
rerun-io/rerunRustPythonC++3.3k+333146+11
matt-kimball/allocscopeRustShellDockerfile5350180
swarmlet/swarmletShellDockerfilePython8090500
schooltechx/youtubeSvelteJavaScriptC#210160
JordanMarr/Elmish.AvaloniaF#Other60050
dmiller/clojure-clr-nextF#Other90040
dgraph-io/dgraphGoShellHCL19.7k01.5k0
flashohq/flashoHTMLTypeScriptPython3340160