This is stars and forks stats for /JonasGeiping/cramming repository. As of 07 May, 2024 this repository has 1144 stars and 87 forks.
Cramming Language Model (Pretraining) This repository contains code to replicate our research described in "Cramming: Training a Language Model on a Single GPU in One Day". We experiment with language model pretraining a BERT-type model with limited compute, wondering "how bad can it really be"? You can find our paper here: https://arxiv.org/abs/2212.14034, and the abstract below: Recent trends in language modeling have focused on increasing performance through scaling, and have resulted in an environment...
Cramming Language Model (Pretraining) This repository contains code to replicate our research described in "Cramming: Training a Language Model on a Single GPU in One Day". We experiment with language model pretraining a BERT-type model with limited compute, wondering "how bad can it really be"? You can find our paper here: https://arxiv.org/abs/2212.14034, and the abstract below: Recent trends in language modeling have focused on increasing performance through scaling, and have resulted in an environment...
repo | techs | stars | weekly | forks | weekly |
---|---|---|---|---|---|
karpathy/nanoGPT | Python | 25.1k | 0 | 3.4k | 0 |
kuca-belludo/urnas | Python | 92 | 0 | 17 | 0 |
apachecn/ailearning | PythonJavaScriptCSS | 36.6k | 0 | 11.3k | 0 |
Aeternalis-Ingenium/FastAPI-Backend-Template | PythonDockerfileOther | 446 | 0 | 66 | 0 |
watchexec/cargo-watch | RustRoffShell | 2.4k | +11 | 75 | 0 |
K0p1-Git/cloudflare-ddns-updater | Shell | 870 | 0 | 271 | 0 |
AsYetUntitled/Framework | SQFC++Python | 237 | 0 | 316 | 0 |
ilaria-manco/multimodal-ml-music | TeXPython | 243 | 0 | 10 | 0 |
preservim/vim-textobj-sentence | Vim ScriptShell | 93 | 0 | 8 | 0 |
lupyuen/pinephone-nuttx | ZigCShell | 68 | 0 | 8 | 0 |