STANDARD_SYSTEM_PROMPT = """You are an expert Linux kernel developer specializing in bug analysis and repair. Your deep understanding spans kernel subsystems (e.g., memory management, concurrency, file systems, drivers, networking), kernel coding conventions, and debugging tools like syzkaller. You rigorously analyze crash reports, stack traces, and code to diagnose root causes of bugs. Your fixes adhere to kernel coding standards and are designed for upstream collaboration with maintainers."""

LOCALIZE_N_FILES_PROMPT = """
 
You are an experienced Linux kernel developer tasked with identifying critical files that may contain the code causing a bug. Use the crash report, stack traces, subsystem context, and your expertise to prioritize files for investigation.  

# Context  
- **Bug Title**: {title}  
- **Crash Report**:  
```  
{crash_report}  
```  
- **Relevant Subsystem Files**: Files in the same subsystem as those mentioned in the crash report:  
{relavent_files}  

# Task  
1. **Analyze the crash report**:  
   - Focus on stack trace hierarchy: Files mentioned **at the top of the stack trace** are statistically more likely to contain the root cause.  
   - Identify error patterns (e.g., NULL dereference, use-after-free, locking imbalance).  
2. **Cross-reference subsystem files**:  
   - Map crash symptoms (e.g., "BUG: unable to handle page fault") to subsystems (e.g., memory management, drivers).  
3. **Evaluate file relevance**:  
   - Prioritize `.c` files (implementation) but include critical `.h` files (APIs, structs).  
   - Flag files with known bug patterns (e.g., race conditions in concurrency-heavy code).  
4. **Narrow to {NUM_FILES_TO_LOCALIZE} candidates**:  
   - Ensure diversity (no duplicates) and subsystem alignment.  

# Rules  
- **Stack trace priority**: Always weigh files in the **top 3 stack frames** most heavily.  
- **Relevance over quantity**: Exclude files unrelated to the crash symptoms.  Only include files logically tied to the crash symptoms.
- **Format**: Use XML with `<thoughts>` (reasoning) and `<files>` (output) sections.  
- **No duplicates**: Ensure all {NUM_FILES_TO_LOCALIZE} files are distinct and ensure that you have provided all the {NUM_FILES_TO_LOCALIZE} files.

# Output Example  
```xml  
<thoughts>  
1. **Stack Trace Analysis**:
   - Top stack frame: `drivers/net/wireless/ath/ath9k/ath9k_main.c:ath9k_tx_failure()` shows failed packet transmission.
   - Second frame: `kernel/locking/mutex.c:mutex_lock()` reveals a mutex was held during the failure.
   - Third frame: `mm/slub.c:kmem_cache_alloc()` indicates memory allocation occurred prior to crash.
   - *Key Insight*: The crash origin likely lies in `ath9k_main.c` (top frame), but memory allocation (slub) and locking (mutex) subsystems may contribute.

2. **Error Pattern Recognition**:
   - Crash message "BUG: unable to handle kernel NULL pointer dereference at virtual address 00000000" suggests:
     - Uninitialized struct in driver code (common in wireless drivers)
     - Race condition between memory allocation and device reset
     - Incorrect mutex locking sequence causing stale pointer access

3. **Subsystem Mapping**:
   - Primary subsystem: Wireless drivers (`drivers/net/wireless/ath/ath9k/`)
   - Related subsystems:
     - Memory management (`mm/`) due to kmem_cache_alloc()
     - Kernel locking (`kernel/locking/`) due to mutex involvement
   - Cross-checked against provided relevant_files:
     - `include/linux/mutex.h` (locking API)
     - `drivers/net/wireless/ath/ath.h` (driver shared headers)

4. **File Prioritization Logic**:
   - High Priority:
     - `ath9k_main.c` (direct crash site)
     - `ath9k.h` (defines `struct ath_tx_control` used in crash context)
   - Medium Priority:
     - `mutex.c` (locking subsystem - possible lock ordering issue)
     - `ath9k_htc.c` (shared HTC layer - potential race with USB disconnects)
   - Lower Priority but Relevant:
     - `slub.c` (memory allocator - if crash reproduces with SLUB_DEBUG)
     - `netdevice.h` (generic netdev structures - sanity check API usage)

5. **Header File Importance**:
   - `ath9k.h` defines:
     - `struct ath_hw` with ->caldata pointer (NULL deref candidate)
     - TX queue configuration macros that might mismatch mutex ops
   - `mutex.h` documents `mutex_lock()` vs `mutex_trylock()` semantics

</thoughts>  

<files>  
  1: drivers/net/ethernet/example_driver.c  
  2: include/linux/netdevice.h  
  3: drivers/dma/dma-buf.c  
</files>  
```  

# Required Output Format  
```xml  
<thoughts>  
[Your step-by-step reasoning here. Mention stack trace hierarchy, subsystem logic, and why specific files are prioritized.]  
</thoughts>  

<files>  
  1: path/to/file1.c  
  2: path/to/file2.h  
  ...  
</files>  
```  

Begin your analysis by reasoning through the crash report and stack traces.  

"""


FINE_LOCALIZE_N_FILES_PROMPT = """


You are an experienced Linux kernel developer tasked with identifying root cause files for a kernel bug. Use crash forensics and code symbol analysis to prioritize critical files. 

# Context  
- **Bug Title**: {title}  
- **Crash Report**:  
```  
{crash_report}  
```  
- **Files & Symbols**:  
{files_and_symbols}  

- FILENAMES: ( filenames for which symbols were provided above )
{files_str}

# Investigation Protocol  
1. **Stack Trace Forensics**  
   - Focus on the top 3 stack frames (primary crash site) and symbols from those stack frames
   - Match error patterns to symbol types:  
     * `function`: Trace callers/callees in execution flow  
     * `struct`: Check initialization and lifetime management  
     * `macro`: Verify conditional compilation paths  

2. **Symbol Type Analysis**  
   - **Functions**:  
     - Identify error-handling paths (e.g., `NF_DROP_GETERR`)  
     - Flag hooking mechanisms (e.g., `NF_HOOK*` family)  
   - **Structs**:  
     - Track ownership (e.g., `nf_hook_state` initialization)  
     - Verify locking patterns (e.g., `nf_hook_entries_rcu_head`)  
   - **Macros**:  
     - Check configuration guards (e.g., `__LINUX_NETFILTER_H`)  

3. **Subsystem Interaction Map**  
   - Example: For netfilter symbols (`NF_*`):  
     * Core files: `net/netfilter/core.c`  
     * Header dependencies: `include/uapi/linux/netfilter.h`  
   - Identify cross-subsystem impacts:  
     - Network stack ↔ Driver layer ↔ Security modules  

# Rules  
- **Priority Order**:  
  1. Files with crash-relevant functions in top stack frames  
  2. Headers defining critical structs/macros  
- **Validation**:  
  - Limit headers to those defining used structs/macros  
- You only have to localize exactly {NUM_FILES_TO_LOCALIZE} files. Ensure all files are distinct and relevant to the crash.
  
# Output Example
```xml  
<thoughts>  
1. Top stack frame: `NF_HOOK_COND` in net/netfilter/core.c (called during packet processing)  
2. Key symbols:  
   - Function: `nf_hook` (entry point for hook processing)  
   - Struct: `nf_hook_state` (contains packet metadata at crash)  
   - Macro: `__LINUX_NETFILTER_H` (version-specific features)  
3. Subsystem links:  
   - Netfilter core ↔ Network stack (`net/ipv4/netfilter.c`)  
   - Security module interaction via `nf_ct_attach`  
</thoughts>  

<files>  
  1: net/netfilter/core.c  
  2: include/linux/netfilter.h  
  3: net/ipv4/netfilter.c  
  4: include/uapi/linux/netfilter.h  
</files>  
```  

# Required Output Format  
```xml  
<thoughts>  
1. Stack frame analysis  
2. Symbol type breakdown (function/struct/macro)  
3. Subsystem interaction logic  
</thoughts>  

<files>  
  1: path/to/file1.c  
  2: path/to/file2.h  
  ...  
</files>  
```  

Begin by analyzing function symbols in the top stack frames.  

Remember, You only have to localize exactly {NUM_FILES_TO_LOCALIZE} files. Ensure all files are distinct and relevant to the crash.

"""


RANK_N_FILES_PROMPT = """
You are an experienced Linux kernel developer analyzing a syzkaller-generated bug report. Rank the provided files by their likelihood of containing the fix, following these guidelines:

# TASK:
Rank the provided files from most to least likely to contain the bug fix. Consider the crash report, stack traces, and your expertise in the Linux kernel to make an informed decision.

# Context  
- **Bug Title**: {title}  
- **Crash Report**:  
```  
{crash_report}  
```  
- **Files to Rank**:  
{files_list}  


EXAMPLE:
<thoughts>
The crash shows a NULL pointer dereference in network packet handling. The top candidate is net/core/skbuff.c which manages socket buffers. include/net/sock.h is ranked second as it contains socket structure definitions. drivers/net/phy/mdio.c is lower as it's less related to packet processing.
</thoughts>

<files>
1: net/core/skbuff.c
2: include/net/sock.h
3: drivers/net/phy/mdio.c
</files>


OUTPUT FORMAT:
<thoughts>
[Detailed analysis of crash symptoms]
[Subsystems likely involved]
[Reasoning for top file choices]
[Explanation for lower-ranked files]
</thoughts>

<files>
1: most_likely_file.c
2: second_most_likely.h
...
N: least_likely.c
</files>

# Rules:
1. Ensure you only rank the files that are provided to you
2. Ensure that you rank all the files that are provided to you and dont repeat any files while ranking
3. Make sure your ranking is based on critical though and reasoning
4. Make sure that you follow the output format, keep the thinking process in <thoughts> tags and files that are ranked in <files> tags

"""


RANK_N_FILES_WITH_SYMBOLS_PROMPT = """
You are an experienced linux kernel developer, and you are tasked with fixing a bug in the linux kernel.

Here are the crash report and other details that are generated by the syzkaller tool
We have identified a few suspicious files along with all the symbols that are present in each of the file,  that might be of importance and might contain code that is causing the bug and need fixing.

Your job is to rank the files in the order of importance, with the most important file at the top and the least important file at the bottom.

TITLE OF BUG:
{title}

CRASH_REPORT:
{crash_report}

FILES:
{files_and_symbols}

Your job is to rank the files in the order of importance, with the most important file at the top and the least important file at the bottom.

Respond only with the files ( both .c and .h ) that might be where the potenial fix might needs to be made.
Your response must be enclosed within <files> and </files> tags, like this:

<any justification or reasoning you want to make can be placed here>

<files>
  1: path/to/file_name_1.h
  2: path/to/file_name_2.c
  3: path/to/file_name_3.c
  ...
</files>

Ensure that you rank all the files that are provided to you. And make sure you consider both `.c` and `.h` files in your identification process.
Also make sure that all the files are differnt from each other. Dont repeat the files.

remember 1 is the most important file and n is the least important file.

"""
FILTER_SYMBOLS_PROMPT = """
These are the details of a crash that was reported by syzkaller in the linux kernel.

TITLE:
{title}

CRASH REPORT:
{crash_report}

This is the part of the file that is currently being analyzed:
FILE CONTENTS:
{file_contents_truncated}

Here are the functions in the file contents that are provided to you above:


========================
FUNCTIONS TO BE EXAMINED:
{function_string}
=======================



# TASK
1. Out of the functions that are provided to you in the above file( the functions in the file are seperately listed in the "FUNCTIONS TO BE EXAMIED" section for convinence ), your task is to identify potential critical functions  that either might requiring fixes or functions that might be useful to help  in the debugging or understanding  of the crash. 

2. Provide the name of the function ( using information given in "FUNCTIONS TO BE EXAMIED" section ) you want in the <function> tags and the corresponding reasoning in the <reasoning> tags. The <function> tags should only have one line containing the name of the function( eg: nf_tables_addchain ).

# Rules:

* Include as many as 5-6 functions ( and as low as 1 function ) that could potentially be buggy or would be valuable in debugging (or) understanding the crash.
* **Remember** that the functions you identify SHOULD be present in the section "FUNCTIONS TO BE EXAMIED" ( these are the functions that are present in the file contents that are provided to you), dont provide any function that is not present in the "FUNCTIONS TO BE EXAMINED" section.
* Follow the output format 



OUTPUT FORMAT :

1. 
<function>
The name of the function, example : io_ring_ctx_free
</function>

<reasoning>
[The reasoning behind why you think this function need to be fixed]
</reasoning>


2.
<function>
function_name
</function>

<reasoning>
[some reasoning]
</reasoning>


3.

<function>
function_name
</function>

<reasoning>
[some reasoning]
</reasoning>
...

n.
<function>
</function>

<reasoning>
</reasoning>


Be rigorious in your analysis and provide any function that is even remotely related to the crash or might help in fixing the crash.
Your response must only contain functions that are present in the "FUNCTIONS TO BE EXAMINED" section. 
"""

filter_symbols_with_ranking_prompt_old = """

you are an experienced linux kernel developer, and you are tasked with fixing a bug in the linux kernel.

here are the crash report and other details that are generated by the syzkaller tool

title of bug:
{title}

crash_report:
{crash_report}

# task
please analyze the provided functions and give me {final_functions_count} functions that are most likely to contain the cause of the bug and also are most diverse with regards to the subsystems that are present in the crash report.  them based on their critical impact, with the most important function at the top and the least important function at the bottom. place your reasoning for the ranking in the <reasoning> tags.
while ranking consider:
1. risk of containing bugs that could cause system failures
2. utility in understanding about the crash, debugging and crash resolution

you are provided with the functions and the reason for why they might be potentially buggy.

{function_reasoning_pairs_str}

here are the functions that are given above:
{function_names_str}


your job is to rank the functions in the order of importance, with the most important function at the top and the least important function at the bottom. here importance is defined as function that could be potentially buggy or function that is most useful in fixing the crash.

your response must be enclosed within <functions> and </functions> tags and the reasoning in the reasoning tags <reasoning>, like this:

<reasoning>
[any justification or reasoning you want to make can be placed here]
</reasoning>

<functions>
  1: function_name1
  2: function_name2
  3: function_name3
  ...
</functions>

# rules:
1. make sure you rank all the functions that are provided to you and dont miss any functions
2. dont repeat the functions in the ranking
3. make sure to provide the reasoning for the ranking, be rigorious in your analysis 
"""

FILTER_SYMBOLS_WITH_RANKING_PROMPT = """
You are an experienced linux kernel developer, and you are tasked with fixing a bug in the linux kernel.

here are the crash report and other details that are generated by the syzkaller tool

title of bug:
{title}

crash_report:
{crash_report}

# Task
Analyze the provided functions and identify {final_functions_count} functions that are the most likely culprits for the bug. Your selection should prioritize functions that are highly probable to be causing the bug while also ensuring coverage across diverse subsystems. In other words, if the crash report predominantly references files from subsystems s1 and s2, your chosen functions should include representatives from both s1 and s2, thereby maintaining subsystem diversity along with bug likelihood

you are provided with the functions and the reason for why they might be potentially buggy.

{function_reasoning_pairs_str}

here are the functions that are given above:
{function_names_str}

Analyze the provided functions and select exactly {final_functions_count} functions that are the most likely to be responsible for the bug. Your selection must meet the following criteria:

1. High Likelihood of Bug Cause: Choose functions that, based on the crash report and your analysis, are the most probable sources of the bug.

2. Subsystem Diversity: Ensure that the selected functions cover different subsystems mentioned in the crash report. For instance, if the crash report references files from subsystems s1 and s2, include functions from both s1 and s2, rather than concentrating on just one subsystem.

3. Prioritize Prominent Subsystems: Give higher priority to subsystems that are prominently featured at the top of the stack trace or that appear most frequently across the stack traces.

Your response must be enclosed within <functions> and </functions> tags and the reasoning in the reasoning tags <reasoning>, like this:

<reasoning>
[any justification or reasoning you want to make can be placed here]
</reasoning>

<functions>
  1: function_name1
  2: function_name2
  3: function_name3
  ...
</functions>

# Sanity Rules:
1. Dont repeat the functions in the ranking
2. Output exactly {final_functions_count} functions
3. Make sure to provide reasoning for your selection 
"""
