Abstract: Large language models such as CodeBERT perform very well on tasks such as natural language code search. We show that this is most likely due to the high token overlap and similarity between the queries and the code in datasets obtained from large codebases, rather than to any deeper understanding of the syntax or semantics of the query or the code.
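The kind of surface-level lexical similarity the abstract refers to can be made concrete with a token-overlap measure such as Jaccard similarity between a query and a code snippet. The sketch below is a minimal, hypothetical illustration (the tokenizer and example pair are not from the paper): when a query shares identifiers and keywords with the code, a purely lexical score is already high, with no syntactic or semantic understanding involved.

```python
import re

def tokens(text):
    # A deliberately naive tokenizer: lowercased alphabetic runs,
    # so identifiers like sort_numbers split into "sort" and "numbers".
    return set(re.findall(r"[a-z]+", text.lower()))

def jaccard_overlap(query, code):
    # Jaccard similarity of the two token sets; 0.0 for empty inputs.
    q, c = tokens(query), tokens(code)
    return len(q & c) / len(q | c) if q | c else 0.0

# Hypothetical query/snippet pair with strong lexical overlap.
query = "sort a list of numbers in ascending order"
snippet = "def sort_numbers(numbers): return sorted(numbers)"
print(jaccard_overlap(query, snippet))
```

A model could score such pairs well by matching tokens alone, which is the confound the abstract argues inflates code-search benchmarks.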
Paper Type: long