![]() ![]()
Hyper-Q expands GK110 hardware work queues from 1 to 32. Atomic operations are also overhauled, speeding up the execution speed of atomic operations and adding some FP64 operations that were previously only available for FP32 data. New shuffle instructions allow for threads within a warp to share data without going back to memory, making the process much quicker than the previous load/share/store method. New shuffle Instructions Īt a low level, GK110 sees additional instructions and operations to further improve performance. ![]() With a 48KB space, the texture cache can become a read-only cache for compute workloads. Register file space has increased to 256KB compared to Fermi. The SMX also sees an increase in space for register file. The single biggest change from GK104 is that rather than 8 dedicated FP64 CUDA cores, GK110 has up to 64, giving it 8x the FP64 throughput of a GK104 SMX. With GK110, Nvidia opted to increase compute performance.
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |