I'm mining Litecoins using an AMD Radeon HD7850 with 2G global memory, and my conf is below:
thread-concurrency=4096
lookup-gap=2
After reading the algorithms of scrypt130511.cl, I discovered lookup-gap is used for time memory tradeoff.
It consumes 512MB global memory when loopup-gap is 2, while it consumes 1GB global memory when loopup-gap is 1.
But after I change lookup-gap to 1, the hashrate dropped from 320K to 300K. Why is it slower when there's less computation?