Selma: A Semantic Local Code Search Platform

Published: 01 Jan 2024, Last Modified: 20 May 2025ECIR (5) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Searching for the right code snippet is cumbersome and not a trivial task. Online platforms such as Github.com or searchcode.com provide tools to search, but they are limited to publicly available and internet-hosted code. However, during the development of research prototypes or confidential tools, it is preferable to store source code locally. Consequently, the use of external code search tools becomes impractical. Here, we present Selma (Code and Videos: https://anreu.github.io/selma): a local code search platform that enables term-based and semantic retrieval of source code. Selma searches code and comments, annotates undocumented code to enable term-based search in natural language, and trains neural models for code retrieval.
Loading