intelligent-machine-learning/dlrover

DLRover: An Automatic Distributed Deep Learning System

PythonGoC++CudaShellMakefileOther
This is stars and forks stats for /intelligent-machine-learning/dlrover repository. As of 10 May, 2024 this repository has 386 stars and 62 forks.

DLRover DLRover: An Automatic Distributed Deep Learning System DLRover makes the distributed training of large AI models easy, stable, fast and green. It can automatically train the Deep Learning model on the distributed cluster. It helps model developers to focus on model arichtecture, without taking care of any engineering stuff, say, hardware acceleration, distributed running, etc. Now, it provides automated operation and maintenance for deep learning training jobs on K8s/Ray. Major features as Fault-Tolerance,...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
gitcoinco/governanceSolidityPython940350
vittominacori/erc1363-payable-tokenJavaScriptSolidityShell1280510
bdloser404/FluttermuxShellVim Script31040
gmitrev/dotfilesVim ScriptShell4000
YangMr/mt-projectVueTypeScriptHTML0000
imsyy/DailyHotVueJavaScriptOther6901080
manga-raiku/raiku-appVueTypeScriptPLpgSQL5030
ianw/bottomupcsXSLTCSSJavaScript1.4k01490
ROCmSoftwarePlatform/hipBLASLtAssemblyC++Python15+124+2
AnacondaRecipes/anaconda-anon-usage-feedstockBatchfileShell0000