How To Encode Location In The Vision Transformer? A Study On Position Embeddings