Implementation
One thread block per macroblock.
Reference frame stored in texture memory. Can cut repeated texture fetches by preloading the region a block needs into shared memory.
Current frame: can store the entire macroblock in shared memory (it doesn't change during the search). Maybe load it from texture?
http://forums.nvidia.com/index.php?showtopic=44424
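
A minimal sketch of how the notes above might fit together, not a definitive implementation. Assumptions not in the notes: 16x16 macroblocks, a full search over +/-4 pixels, SAD as the matching cost, 8-bit single-channel frames whose dimensions are multiples of 16, and the texture object API for the reference frame. The names sadSearch and motionSearch are hypothetical, and error checking of the CUDA calls is omitted for brevity. Here the current macroblock is read from global memory into shared memory; reading it through a texture instead is the open question from the notes.

```cuda
#include <cassert>
#include <climits>
#include <vector>
#include <cuda_runtime.h>

constexpr int MB = 16;    // assumed macroblock size (16x16 pixels)
constexpr int RANGE = 4;  // assumed search range: +/-4 pixels in x and y

// One thread block per macroblock, one thread per macroblock pixel.
__global__ void sadSearch(const unsigned char* cur, cudaTextureObject_t refTex,
                          int width, int* bestSad, int2* bestMv)
{
    __shared__ unsigned char curMb[MB][MB];  // current macroblock, loaded once
    __shared__ int partial[MB * MB];         // per-pixel absolute differences

    const int tx = threadIdx.x, ty = threadIdx.y, tid = ty * MB + tx;
    const int x0 = blockIdx.x * MB, y0 = blockIdx.y * MB;

    curMb[ty][tx] = cur[(y0 + ty) * width + (x0 + tx)];
    __syncthreads();

    int best = INT_MAX;          // only thread 0's copy is meaningful
    int2 mv = make_int2(0, 0);
    for (int dy = -RANGE; dy <= RANGE; ++dy) {
        for (int dx = -RANGE; dx <= RANGE; ++dx) {
            // Reference pixel via the texture path; out-of-frame reads clamp.
            int r = tex2D<unsigned char>(refTex, x0 + tx + dx + 0.5f,
                                         y0 + ty + dy + 0.5f);
            partial[tid] = abs(int(curMb[ty][tx]) - r);
            __syncthreads();
            // Tree reduction of the 256 differences into partial[0].
            for (int s = MB * MB / 2; s > 0; s >>= 1) {
                if (tid < s) partial[tid] += partial[tid + s];
                __syncthreads();
            }
            if (tid == 0 && partial[0] < best) {
                best = partial[0];
                mv = make_int2(dx, dy);
            }
        }
    }
    if (tid == 0) {
        const int mbIdx = blockIdx.y * gridDim.x + blockIdx.x;
        bestSad[mbIdx] = best;
        bestMv[mbIdx] = mv;
    }
}

// Hypothetical host helper: uploads both frames, runs the search, and
// returns the best SAD and motion vector per macroblock (row-major order).
void motionSearch(const unsigned char* cur, const unsigned char* ref,
                  int w, int h, std::vector<int>& sad, std::vector<int2>& mv)
{
    assert(w % MB == 0 && h % MB == 0);
    const int nbx = w / MB, nby = h / MB, n = nbx * nby;

    // Reference frame lives in a CUDA array read through a texture object.
    cudaChannelFormatDesc fmt = cudaCreateChannelDesc<unsigned char>();
    cudaArray_t refArr;
    cudaMallocArray(&refArr, &fmt, w, h);
    cudaMemcpy2DToArray(refArr, 0, 0, ref, w, w, h, cudaMemcpyHostToDevice);

    cudaResourceDesc res = {};
    res.resType = cudaResourceTypeArray;
    res.res.array.array = refArr;
    cudaTextureDesc td = {};
    td.addressMode[0] = td.addressMode[1] = cudaAddressModeClamp;
    td.filterMode = cudaFilterModePoint;
    td.readMode = cudaReadModeElementType;
    td.normalizedCoords = 0;
    cudaTextureObject_t tex = 0;
    cudaCreateTextureObject(&tex, &res, &td, nullptr);

    unsigned char* dCur; int* dSad; int2* dMv;
    cudaMalloc(&dCur, size_t(w) * h);
    cudaMalloc(&dSad, n * sizeof(int));
    cudaMalloc(&dMv, n * sizeof(int2));
    cudaMemcpy(dCur, cur, size_t(w) * h, cudaMemcpyHostToDevice);

    sadSearch<<<dim3(nbx, nby), dim3(MB, MB)>>>(dCur, tex, w, dSad, dMv);

    sad.resize(n);
    mv.resize(n);
    cudaMemcpy(sad.data(), dSad, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaMemcpy(mv.data(), dMv, n * sizeof(int2), cudaMemcpyDeviceToHost);

    cudaDestroyTextureObject(tex);
    cudaFreeArray(refArr);
    cudaFree(dCur); cudaFree(dSad); cudaFree(dMv);
}
```

Calling motionSearch(cur, ref, w, h, sad, mv) fills sad and mv with one entry per macroblock. Each motion vector points from the macroblock's position in the current frame to its best match in the reference frame. In this sketch the search-window pixels are fetched through the texture cache on every candidate offset; staging the window in shared memory, as the notes suggest, would be the next step to try.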