Maggie Zhang

Person. Author at NVIDIA.

Appearances in this wiki

Scaling LLMs with NVIDIA Triton and TensorRT-LLM Using Kubernetes: author of the article, published on the NVIDIA Developer Blog (October 2024).