CStanKonrad/long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

PythonJupyter NotebookShell
This is stars and forks stats for /CStanKonrad/long_llama repository. As of 05 Mar, 2024 this repository has 1239 stars and 78 forks.

LongLLaMA: Focused Transformer Training for Context Scaling >_ 🎓 LongLLaMA-Code 7B Instruct 📑🗨   Learn more ⇧ { LongLLaMA-Code 7B } LongLLaMA-Instruct-3Bv1.1 LongLLaMA-3Bv1.1 TLDR | Overview | Usage | LongLLaMA performance | Authors | Citation | License | Acknowledgments FoT continued pretraining | Instruction tuning TLDR This repository...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
HennyJie/BrainGBMATLABPythonShell1240310
mate-academy/py-cinema-full-stackVuePythonCSS101770
axiom-crypto/axiom-v1-contractsYulSolidityShell27040
lem0nSec/ShellGhostCPython90701030
Lidarr/LidarrC#JavaScriptCSS3k02190
logicboard/logicboardElixirTypeScriptDockerfile41020
SWMFsoftware/SWMFFortranPerlMakefile0000
jgm/typst-hsHaskellTypstTeX24020
jvns/dns-weekendJupyter NotebookPython1540100
lyhue1991/torchkerasJupyter NotebookPython874+23125+2