Multi-granular software annotation using file-level weak labelling

Published: 2024, Last Modified: 16 Feb 2026Empir. Softw. Eng. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: One of the most time-consuming tasks for developers is the comprehension of new code bases. An effective approach to aid this process is to label source code files with meaningful annotations, which can help developers understand the content and functionality of a code base quicker. However, most existing solutions for code annotation focus on project-level classification: manually labelling individual files is time-consuming, error-prone and hard to scale.
Loading