@nirogu@vivaldi.net
@nirogu@vivaldi.net avatar

nirogu

@nirogu@vivaldi.net

Computer scientist and mathematician

Este perfil es de un servidor federado y podría estar incompleto. Explorar más contenido en la instancia original.

PaLI-3 Vision Language Models: Smaller, Faster, Stronger (arxiv.org) en

This paper presents PaLI-3, a smaller, faster, and stronger vision language model (VLM) that compares favorably to similar models that are 10x larger. As part of arriving at this strong performance, we compare Vision Transformer (ViT) models pretrained using classification objectives to contrastively (SigLIP) pretrained ones. We...

nirogu,
@nirogu@vivaldi.net avatar

Impressive results! Only wished they had shared some code or any way to replicate the experiments easily

  • Todo
  • Suscrito
  • Moderado
  • Favoritos
  • random
  • noticiascr
  • CostaRica
  • Todos las revistas