For Hopper-based systems running Llama models, Nvidia claims Dynamo can effectively double the inference performance.