ViT CLS-Visualizer

Enter the Hugging Face model repo ID (must be public), upload an image, and visualize the cosine similarity between the CLS token and patches.

Popular Vision Transformer models to try:

  • google/vit-base-patch16-224
  • facebook/deit-base-distilled-patch16-224
  • microsoft/dit-base