paperswithcode.com
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale | Papers With Code
#2 best model for Language Modelling on C4 (Perplexity metric)
Похожие материалы на paperswithcode.com