最終更新:2024-04-03 (水) 15:34:29 (352d)
Mistral 7B
Top / Mistral 7B
the most powerful language model for its size to date.
https://mistral.ai/news/announcing-mistral-7b/
概要
- Outperforms Llama 2 13B on all benchmarks
- Outperforms Llama 1 34B? on many benchmarks
- Approaches CodeLlama 7B? performance on code, while remaining good at English tasks
- Uses Grouped-query attention (GQA) for faster inference
- Uses Sliding Window Attention (SWA) to handle longer sequences at smaller cost
ChatRTX
- Mistral 7B int4 - だいたいVRAM 7GB