- Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA
- Bhaumik Vaidya
Cache memory
On the latest GPUs, there is an L1 cache per multiprocessor and an L2 cache shared between all multiprocessors. Both global and local memory accesses use these caches. Because the L1 cache sits close to thread execution, it is very fast. As shown in the memory architecture diagram earlier, the L1 cache and shared memory share the same 64 KB, and you can configure how many of those bytes each one uses. All global memory accesses go through the L2 cache. Texture memory and constant memory have their own separate caches.
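The split between L1 cache and shared memory can be requested through the CUDA runtime. The sketch below is a minimal illustration, assuming a device on which the 64 KB split is actually configurable (on architectures with a fixed split, the call acts only as a hint); the `scaleKernel` kernel is a hypothetical example, not code from the book.

```cpp
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: global memory accesses here are served through L2
// (and through L1 where the device caches global loads in L1).
__global__ void scaleKernel(float *data, float factor, int n)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < n)
        data[idx] *= factor;
}

int main()
{
    // Ask the runtime to favor L1 cache over shared memory for this kernel.
    // On devices with a fixed L1/shared split, this preference may be ignored.
    cudaFuncSetCacheConfig(scaleKernel, cudaFuncCachePreferL1);

    const int n = 1024;
    float *d_data;
    cudaMalloc(&d_data, n * sizeof(float));
    cudaMemset(d_data, 0, n * sizeof(float));

    scaleKernel<<<(n + 255) / 256, 256>>>(d_data, 2.0f, n);
    cudaDeviceSynchronize();

    cudaFree(d_data);
    return 0;
}
```

A device-wide preference can be set the same way with `cudaDeviceSetCacheConfig`; kernels that need more shared memory would pass `cudaFuncCachePreferShared` instead.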