jeonsworld/ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter NotebookPython
This is stars and forks stats for /jeonsworld/ViT-pytorch repository. As of 02 May, 2024 this repository has 1619 stars and 333 forks.

Vision Transformer Pytorch reimplementation of Google's repository for the ViT model that was released with the paper An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale by Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby. This paper show that Transformers applied directly to image patches and pre-trained on large datasets...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
upb-lea/reinforcement_learning_course_materialsJupyter NotebookTeXOther86801960
in-toto/attestationPythonGoMakefile153+4380
FairwindsOps/chartsMustacheShellSmarty12101370
kahst/BirdNET-AnalyzerPythonDockerfile4730950
warp-tech/warpgateRustSveltePython2.8k+2387+3
databricks/LearningSparkV2ScalaPythonJava1k06430
emptysuns/Hi_HysteriaShellBatchfilePython2.6k05950
xiaoZ-hc/redtoolShellPythonJavaScript1.3k03170
BlackArch/blackarchShellPerlPython2.5k05550
the-tech-academy/developersPythonSmalltalk50616+6