最終更新:2024-04-03 (水) 01:17:52 (163d)
TensorRT-LLM
Top / TensorRT-LLM
A TensorRT Toolbox for Optimized Large Language Model Inference
https://github.com/NVIDIA/TensorRT-LLM/