Channel Vision Transformers: An Image Is Worth C x 16 x 16 Words

Type
Publication
UniReps: the First Workshop on Unifying Representations in Neural Models