Personal Wiki

Home

❯

entities

❯

Eduardo Alvarez

Eduardo Alvarez

May 27, 20261 min read

Eduardo Alvarez

NVIDIA engineer. Published on the NVIDIA Developer Blog on KV cache quantisation and inference optimisation for Blackwell GPUs.

Appearances in this wiki

  • Optimizing Inference for Long Context and Large Batch Sizes with NVFP4 KV Cache — Author; introduces NVFP4 KV cache quantisation and its latency/throughput benefits on Blackwell.

Graph View

  • Eduardo Alvarez
  • Appearances in this wiki

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community