Last updated: 2024-04-03 (Wed) 01:17:52

TensorRT-LLM

A TensorRT Toolbox for Optimized Large Language Model Inference

https://github.com/NVIDIA/TensorRT-LLM/

Related