This is stars and forks stats for /philipturner/metal-benchmarks repository. As of 28 Mar, 2024 this repository has 236 stars and 9 forks.
Metal Benchmarks This document thoroughly explains the Apple GPU microarchitecture, focusing on its GPGPU performance. Details include latencies for each ALU assembly instruction, cache sizes, and the number of unique instruction pipelines. This document enables evidence-based reasoning about performance on the Apple GPU, helping people diagnose bottlenecks in real-world software. It also compares Apple silicon to generations of AMD and Nvidia microarchitectures, showing where it might exhibit different...
Metal Benchmarks This document thoroughly explains the Apple GPU microarchitecture, focusing on its GPGPU performance. Details include latencies for each ALU assembly instruction, cache sizes, and the number of unique instruction pipelines. This document enables evidence-based reasoning about performance on the Apple GPU, helping people diagnose bottlenecks in real-world software. It also compares Apple silicon to generations of AMD and Nvidia microarchitectures, showing where it might exhibit different...
repo | techs | stars | weekly | forks | weekly |
---|---|---|---|---|---|
sourcelocation/ResSet16 | Objective-CCSwift | 146 | 0 | 2 | 0 |
pgRouting/GSoC-pgRouting | C++CPLpgSQL | 5 | 0 | 34 | 0 |
pointfreeco/standups | Swift | 117 | 0 | 27 | 0 |
infoskirmish/hive | CC++Python | 139 | 0 | 45 | 0 |
pichenettes/cvpal | AssemblyCC++ | 72 | 0 | 35 | 0 |
universal-ctags/ctags | CC++M4 | 6k | +8 | 622 | 0 |
OmriBaso/RToolZ | CC++Assembly | 292 | 0 | 42 | 0 |
cxasm/cc-compare | C++ | 502 | 0 | 30 | 0 |
cp2k/dbcsr | FortranCPython | 105 | 0 | 40 | 0 |
j178/leetgo | GoC++Rust | 442 | 0 | 24 | 0 |