Abstract: Recent transformers-based systems are advancing image captioning applications. However, those works have been mainly applied to English-based image captioning problems. In this paper, we introduce a transformers-based Turkish-based image captioning algorithm. Our proposed algorithm uses appearance and geometry features from the input image and combines them along with the WordPiece embeddings to generate the Turkish-based caption. Our experimental results show improvement when compared to the other existing techniques including the original ORT and the show-and-tell algorithms.
Loading