Use more parallelism in attention block in prefill mode.#177
Merged
copybara-service[bot] merged 3 commits intogoogle:devfrom May 3, 2024
Merged
Use more parallelism in attention block in prefill mode.#177copybara-service[bot] merged 3 commits intogoogle:devfrom
copybara-service[bot] merged 3 commits intogoogle:devfrom