openai/blocksparse

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda, Python, C++, CSS, C, Makefile
These are the star and fork statistics for the openai/blocksparse repository. As of 27 Apr, 2024, this repository has 961 stars and 196 forks.

Status: Active (under active development, breaking changes may occur)

Blocksparse

The blocksparse package contains TensorFlow Ops and corresponding GPU kernels for block-sparse matrix multiplication. Also included are related ops like edge bias, sparse weight norm and layer norm. To learn more, see the launch post on the OpenAI blog.

Prerequisites

First, you need at least one Nvidia GPU. For best performance, we recommend using a Pascal or Maxwell generation GPU -- this is the full list of features...
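
To make "block-sparse matrix multiplication" concrete, here is a minimal, dependency-free Python sketch of the idea. It is not the blocksparse API and says nothing about how the GPU kernels are implemented. The names `block_sparse_matmul`, `to_dense`, `dense_matmul`, `layout` and `blocks` are hypothetical and chosen only for illustration. It assumes a weight matrix split into square blocks, a 0/1 layout mask marking which blocks are nonzero, and values stored only for those blocks.

```python
def block_sparse_matmul(x, layout, blocks, block_size):
    """Return x @ W, where W is described only by its nonzero blocks.

    x          : list of rows, shape (batch, n_block_rows * block_size)
    layout     : 0/1 mask, shape (n_block_rows, n_block_cols)
    blocks     : dict mapping (block_row, block_col) to a
                 block_size x block_size list of rows (hypothetical storage)
    block_size : side length of each square block
    """
    n_block_rows = len(layout)
    n_block_cols = len(layout[0])
    assert all(len(row) == n_block_rows * block_size for row in x)

    y = [[0.0] * (n_block_cols * block_size) for _ in x]
    for i in range(n_block_rows):
        for j in range(n_block_cols):
            if not layout[i][j]:
                continue  # zero block: nothing stored, nothing computed
            w = blocks[(i, j)]
            for b, row in enumerate(x):
                for r in range(block_size):
                    xv = row[i * block_size + r]
                    for c in range(block_size):
                        y[b][j * block_size + c] += xv * w[r][c]
    return y


def to_dense(layout, blocks, block_size):
    """Expand the block-sparse description into a full dense matrix."""
    n_rows = len(layout) * block_size
    n_cols = len(layout[0]) * block_size
    dense = [[0.0] * n_cols for _ in range(n_rows)]
    for (i, j), w in blocks.items():
        for r in range(block_size):
            for c in range(block_size):
                dense[i * block_size + r][j * block_size + c] = w[r][c]
    return dense


def dense_matmul(x, w):
    """Plain dense reference product, used only to check the sparse result."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*w)]
            for row in x]


# A 4x4 weight matrix with only its two diagonal 2x2 blocks present.
layout = [[1, 0],
          [0, 1]]
blocks = {(0, 0): [[1, 2], [3, 4]],
          (1, 1): [[5, 6], [7, 8]]}
x = [[1, 1, 1, 1]]

y = block_sparse_matmul(x, layout, blocks, 2)
print(y)  # [[4.0, 6.0, 12.0, 14.0]]
```

In this toy layout half of the blocks are zero, so the loop does half the multiply-adds of the dense product while giving the same result.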
repo                              techs                             stars  weekly  forks  weekly
abo-abo/auto-yasnippet            Emacs Lisp, Makefile              240    0       15     0
purplg/hass                       Emacs Lisp, Makefile, Dockerfile  78     0       4      0
emacsorphanage/anzu               Emacs Lisp, Makefile              393    -1      28     0
GEOS-ESM/GEOSadas                 Fortran, Shell, Perl              6      0       3      0
inogs/ensdam                      Fortran, Python, Perl             0      0       0      0
GEOS-ESM/g5pert                   Fortran, C, Other                 0      0       0      0
GEOS-ESM/GEOSana_GridComp         Fortran, Perl, Python             1      0       3      0
Martin-Courtney-Sage/project_two  Java, Gherkin, Python             1      0       3      0
spf13/cast                        Go, Makefile                      3.1k   0       285    0
tenable/terrascan                 Go, Open Policy Agent, Shell      4.2k   +5      480    +3