Assuming 44 bytes of texels per pixel results in required bandwidth of 285GB/s at that fillrate. Assuming the texels fetched by the shader map 1:1 to pixels then caching isn't going to help in any meaningful way (since each texel is used just once).

Click to expand...