Implementation

One thread block per macroblock.

Reference frame is stored in texture memory. Reuse can be improved by preloading the search window into shared memory.

Current frame: the macroblock being matched can stay in shared memory for the whole search (it doesn't change). Maybe load it from texture?
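A minimal sketch of the layout above, not a tuned implementation. Assumptions not in these notes: 16x16 macroblocks, a +/-4 pixel full search with SAD as the cost, one thread per candidate displacement, and the texture object API for the reference frame. The names fullSearch and searchDemo are hypothetical, the frames are synthetic, and error checking is omitted.

```cuda
#include <algorithm>
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

#define MB 16                  // macroblock edge in pixels (assumed)
#define RANGE 4                // search range of +/-4 pixels (assumed)
#define SIDE (2 * RANGE + 1)   // candidates per axis
#define NCAND (SIDE * SIDE)    // threads per block, one per candidate

// One thread block per macroblock; each thread scores one candidate displacement.
__global__ void fullSearch(cudaTextureObject_t refTex, const unsigned char* cur,
                           int width, int2* bestMv, unsigned int* bestSad)
{
    __shared__ unsigned char curMb[MB * MB];  // current macroblock, fixed during the search
    __shared__ unsigned int sad[NCAND];       // one SAD per candidate
    const int x0 = blockIdx.x * MB, y0 = blockIdx.y * MB, tid = threadIdx.x;

    // Cooperative load of the current macroblock into shared memory.
    for (int i = tid; i < MB * MB; i += blockDim.x)
        curMb[i] = cur[(y0 + i / MB) * width + x0 + i % MB];
    __syncthreads();

    const int dx = tid % SIDE - RANGE, dy = tid / SIDE - RANGE;
    unsigned int acc = 0;
    for (int y = 0; y < MB; ++y)
        for (int x = 0; x < MB; ++x) {
            // Reference pixels come through the texture cache; clamp addressing
            // handles candidates that reach outside the frame.
            int r = tex2D<unsigned char>(refTex, x0 + x + dx + 0.5f, y0 + y + dy + 0.5f);
            acc += abs(r - (int)curMb[y * MB + x]);
        }
    sad[tid] = acc;
    __syncthreads();

    if (tid == 0) {  // serial scan for the minimum; adequate for 81 candidates
        int bestIdx = 0;
        for (int i = 1; i < NCAND; ++i)
            if (sad[i] < sad[bestIdx]) bestIdx = i;
        const int mbIdx = blockIdx.y * gridDim.x + blockIdx.x;
        bestMv[mbIdx] = make_int2(bestIdx % SIDE - RANGE, bestIdx / SIDE - RANGE);
        bestSad[mbIdx] = sad[bestIdx];
    }
}

// Hypothetical demo helper: the current frame is the reference frame shifted by
// (shiftX, shiftY); returns the vector found for the interior macroblock (1, 1).
int2 searchDemo(int shiftX, int shiftY)
{
    const int W = 64, H = 64, GW = W / MB, GH = H / MB;
    std::vector<unsigned char> ref(W * H), cur(W * H);
    unsigned int state = 12345u;
    for (int i = 0; i < W * H; ++i) {         // pseudo-random reference frame
        state = state * 1664525u + 1013904223u;
        ref[i] = (unsigned char)(state >> 16);
    }
    for (int y = 0; y < H; ++y)
        for (int x = 0; x < W; ++x) {
            const int sx = std::min(std::max(x + shiftX, 0), W - 1);
            const int sy = std::min(std::max(y + shiftY, 0), H - 1);
            cur[y * W + x] = ref[sy * W + sx];
        }

    // Reference frame: CUDA array bound to a texture object.
    cudaChannelFormatDesc fmt = cudaCreateChannelDesc<unsigned char>();
    cudaArray_t refArr;
    cudaMallocArray(&refArr, &fmt, W, H);
    cudaMemcpy2DToArray(refArr, 0, 0, ref.data(), W, W, H, cudaMemcpyHostToDevice);
    cudaResourceDesc res = {};
    res.resType = cudaResourceTypeArray;
    res.res.array.array = refArr;
    cudaTextureDesc td = {};
    td.addressMode[0] = td.addressMode[1] = cudaAddressModeClamp;
    td.filterMode = cudaFilterModePoint;
    td.readMode = cudaReadModeElementType;
    cudaTextureObject_t refTex = 0;
    cudaCreateTextureObject(&refTex, &res, &td, nullptr);

    unsigned char* dCur; int2* dMv; unsigned int* dSad;
    cudaMalloc(&dCur, W * H);
    cudaMalloc(&dMv, GW * GH * sizeof(int2));
    cudaMalloc(&dSad, GW * GH * sizeof(unsigned int));
    cudaMemcpy(dCur, cur.data(), W * H, cudaMemcpyHostToDevice);

    fullSearch<<<dim3(GW, GH), NCAND>>>(refTex, dCur, W, dMv, dSad);

    std::vector<int2> mv(GW * GH);
    cudaMemcpy(mv.data(), dMv, GW * GH * sizeof(int2), cudaMemcpyDeviceToHost);
    cudaDestroyTextureObject(refTex);
    cudaFreeArray(refArr);
    cudaFree(dCur); cudaFree(dMv); cudaFree(dSad);
    return mv[1 * GW + 1];
}

int main()
{
    const int2 mv = searchDemo(2, 1);
    std::printf("macroblock (1,1): mv = (%d, %d)\n", mv.x, mv.y);
    return (mv.x == 2 && mv.y == 1) ? 0 : 1;
}
```

Here the current macroblock is copied from global memory; fetching it through a second texture would only change the load loop. Mapping threads to candidates means every thread reads the same curMb entry at each step, which shared memory serves as a broadcast.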

http://forums.nvidia.com/index.php?showtopic=44424