Whisper Large-v3 Release (github.com)

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

The large-v3 model shows improved performance across a wide variety of languages. The plot below includes all languages where Whisper large-v3 achieves an error rate below 60% on Common Voice 15 and Fleurs, and shows a 10% to 20% reduction in errors compared to large-v2:

[Plot: large-v3 vs. large-v2 error rates on Common Voice 15 and Fleurs]
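To make the "reduction of errors" figure concrete, here is a minimal sketch of how a relative error reduction could be computed. It assumes the error rate is a word error rate (word-level edit distance divided by the number of reference words); the release may use a different metric, such as character error rate, for some languages. The function names and the sample numbers are hypothetical and are not taken from the release.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: plain whitespace tokenization, no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_error_reduction(old_rate: float, new_rate: float) -> float:
    """Fraction by which new_rate improves on old_rate (0.15 means 15% fewer errors)."""
    if old_rate <= 0:
        raise ValueError("old_rate must be positive")
    return (old_rate - new_rate) / old_rate


if __name__ == "__main__":
    # Hypothetical error rates, chosen only to illustrate the arithmetic.
    print(relative_error_reduction(0.20, 0.17))
```

Under this reading, a language whose error rate drops from 20% to 17% has a 15% relative reduction, which would fall inside the 10% to 20% range quoted above.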

Instances:

/m/opensource@kbin.social

Thread

Even_Adder

@Even_Adder@lemmy.dbzer0.com

Added: 7 months ago
Views: 50
Online: -
Ratio: 0

Magazine

This magazine is dedicated to discussions on open source software, hardware, and technology. Whether you are a developer, a tech enthusiast, or simply interested in the philosophy of open source, this is the place for you. Here you can share your knowledge, ask questions, and engage in discussions on topics such as open source programming languages, operating systems, hardware, and more. From the benefits and challenges of open source to the latest developments and trends, this category covers a wide range of topics related to open source.

Created: 1 year ago
Owner: donelias
Subscribers: 1
Online: -

Moderators

donelias
