SOSecure: The Wisdom of the Crowd for Safer AI-Generated Code

Manisha Mukherjee; Vincent Josua Hellendoorn

SOSecure: The Wisdom of the Crowd for Safer AI-Generated Code

Manisha Mukherjee, Vincent Josua Hellendoorn

Published: 28 Mar 2026, Last Modified: 13 Apr 2026AIware 2026EveryoneRevisionsCC BY 4.0

Keywords: LLM, Secure Code Generation, RAG

TL;DR: We introduce SOSecure, a retrieval augmented post generation framework that leverages Stack Overflow security discussions to detect and repair vulnerabilities in LLM generated code at inference time without retraining.

Abstract: Large Language Models (LLMs) are widely used for automated code generation, but the code they produce can contain security vulnerabilities. Their reliance on pretraining data means they may not reflect newly discovered vulnerabilities or evolving security practices. In contrast, developer communities on Stack Overflow (SO) provide a continuously updated record of security issues and their resolutions, as developers discuss and address vulnerabilities in real-world code. However, this information is not directly available to LLMs during code generation. This paper presents \textbf{SOSecure}, a post-generation security review layer that operationalizes Stack Overflow (SO) discussions as inference-time safety signals. SOSecure builds a security-focused knowledge base from SO answers and comments that explicitly identify vulnerabilities and security antipatterns. Given an LLM-generated snippet, it retrieves discussions involving similar code patterns and incorporates them as contextual guidance to revise potentially unsafe outputs. Unlike approaches that rely solely on curated vulnerability descriptions, SOSecure leverages community-authored critiques to provide targeted, framework-specific security nudges. We evaluate SOSecure on three datasets, SALLM, LLMSecEval, and LMSys. Across these datasets, SOSecure achieves fix rates of 71.7\%, 91.3\%, and 96.7\%, respectively, compared to 49.1\%, 56.5\%, and 37.5\% when prompting GPT-4 without retrieved discussions. SOSecure requires no retraining or fine-tuning and demonstrates how community knowledge can function as a lightweight inference-time safety layer for AI-generated code.

Revision Summary: Uploading camera ready

Email Sharing: We authorize the sharing of all author emails with Program Chairs.

Data Release: We authorize the release of our submission and author names to the public.

Paper Type: Full-length papers (i.e. case studies, theoretical, applied research papers). 8 pages

Reroute: true

Submission Number: 37

Loading