philipturner/metal-benchmarks

Apple GPU microarchitecture

MetalSwiftC++
This is stars and forks stats for /philipturner/metal-benchmarks repository. As of 28 Mar, 2024 this repository has 236 stars and 9 forks.

Metal Benchmarks This document thoroughly explains the Apple GPU microarchitecture, focusing on its GPGPU performance. Details include latencies for each ALU assembly instruction, cache sizes, and the number of unique instruction pipelines. This document enables evidence-based reasoning about performance on the Apple GPU, helping people diagnose bottlenecks in real-world software. It also compares Apple silicon to generations of AMD and Nvidia microarchitectures, showing where it might exhibit different...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
sourcelocation/ResSet16Objective-CCSwift146020
pgRouting/GSoC-pgRoutingC++CPLpgSQL50340
pointfreeco/standupsSwift1170270
infoskirmish/hiveCC++Python1390450
pichenettes/cvpalAssemblyCC++720350
universal-ctags/ctagsCC++M46k+86220
OmriBaso/RToolZCC++Assembly2920420
cxasm/cc-compareC++5020300
cp2k/dbcsrFortranCPython1050400
j178/leetgoGoC++Rust4420240