lucidrains/flash-cosine-sim-attention

Implementation of fused cosine similarity attention in the same style as Flash Attention

Languages: CUDA, Python, C++, Makefile. Topics: deep-learning, artificial-intelligence, attention-mechanisms.
Stars and forks stats for the lucidrains/flash-cosine-sim-attention repository. As of 6 May 2024, it has 184 stars and 8 forks.

Flash Cosine Similarity Attention: an implementation of fused cosine similarity attention in the same style as Flash Attention. The observation is that with l2-normalized queries and keys, you no longer need to keep track of the row maximums for numerical stability. This greatly simplifies the flash attention algorithm, assuming cosine similarity attention comes at no generalization cost. In other words, stable, fast, memory efficient, and longer...
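The idea is easiest to see in plain PyTorch. Below is a minimal, unfused sketch of cosine similarity attention, not the fused CUDA kernel this repository provides: once queries and keys are l2-normalized, the pre-softmax logits are bounded by the scale, so the softmax can be exponentiated without the running row-maximum bookkeeping that standard flash attention maintains. The function name `cosine_sim_attention` and the `scale = 10` default are illustrative assumptions, not the repo's API.

```python
import torch
import torch.nn.functional as F

def cosine_sim_attention(q, k, v, scale = 10):
    # l2-normalize queries and keys so q @ k^T is a cosine similarity in [-1, 1]
    q, k = map(lambda t: F.normalize(t, dim = -1), (q, k))

    # logits are bounded by +/- scale, so exp() cannot overflow even without
    # subtracting the per-row maximum that standard flash attention tracks
    sim = (q @ k.transpose(-2, -1)) * scale
    attn = sim.softmax(dim = -1)
    return attn @ v

# hypothetical shapes: (batch, heads, seq_len, dim_head)
q = torch.randn(1, 8, 1024, 64)
k = torch.randn(1, 8, 1024, 64)
v = torch.randn(1, 8, 1024, 64)
out = cosine_sim_attention(q, k, v)  # (1, 8, 1024, 64)
```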
| repo | techs | stars | weekly | forks | weekly |
| --- | --- | --- | --- | --- | --- |
| gskinner/flutter_animate | Dart, C++, CMake | 665 | 0 | 48 | 0 |
| vyos/vyos-documentation | Python, Dockerfile, Shell | 155 | 0 | 285 | 0 |
| Activiti/activiti-cloud-common-chart | Mustache, Smarty, Makefile | 2 | 0 | 1 | 0 |
| AndyGlx/FullControl-GCode-Designer | VBA, Python, Shell | 461 | 0 | 65 | 0 |
| ztachip/ztachip | VHDL, Verilog, SystemVerilog | 162 | 0 | 22 | 0 |
| mingyuan-zhang/MotionDiffuse | Python | 674 | 0 | 66 | 0 |
| VirtualHotBar/HotPEToolBox | Batchfile, Lua, Python | 306 | 0 | 23 | 0 |
| xinntao/Real-ESRGAN-ncnn-vulkan | C, C++, CMake | 1k | 0 | 108 | 0 |
| HavocFramework/Modules | C, Python, C++ | 127 | 0 | 35 | 0 |
| picotorrent/picotorrent | C++, C#, CMake | 2.5k | 0 | 181 | 0 |