Activity - This looks amazing, if true. The paper is claiming state of the art across... - Kbin in Spanish, a regional instance for people from Costa Rica and beyond.

Lenguador, 11 months ago

This looks amazing, if true. The paper is claiming state of the art across literally every metric. Even in their ablation study the model outperforms all others.

I'm a bit suspicious that they don't extend their perplexity numbers to the 13B model or provide the hyperparameters, even though they reference that model in the text and in their scaling table.

Code will be released in a week: https://github.com/microsoft/unilm/tree/master/retnet
