from typing import Dict, List, Optional, Tuple
from pathlib import Path
from subprocess import run
from copy import deepcopy
import re
from cresearcher.utils.ctags import findSymbolInTags, createSymbolDefinition, getMarkdownShortForm
from cresearcher.utils.modelHandlers import getModelHandler
from cresearcher.utils.gitlog import getAllGlobs
from cresearcher.utils.types import SymbolDefinition, GlobalCtxAgentState, GlobalCtxAgentResult
from cresearcher.models.model_utils.tokens import get_token_len
from logging import Logger


GLOBAL_CTX_AGENT_MAIN_SYSTEM_PROMPT = lambda prompt_preamble, prompt_analysis_examples, repoName: f"""{prompt_preamble}

You will be given a list of currently open definitions (e.g. function/struct/union/macro/global constant definitions). You can choose to open more definitions by giving the file path and symbol name, or just by giving the symbol name if you are unsure about the file. You can also close the definitions that you're sure are not relevant for fixing the bug. You will also have the ability to search for arbitrary regex patterns throughout the repository code and throught the past commit messages and patches in the {repoName} repository git history.

You should reason from first principles about the possible root causes of the crash and how it can be fixed. For example, you can take a bottom-up approach, starting with the variable and function involved in the crash directly. Then you should track the dataflow of the variable and the control flow through conditional statements and function calls. You should analyse this flow and open more functions if needed, to narrow down on the root cause of the bug. You should use the `search_code` action to search for useful patterns of code throughout the repository. You should also use the `search_commits` actions to search through previous commit messages and patches for past commits that can be helpful. You can use the `search_definition` action (with just the symbol name or the file path and symbol name) to search for the definition of a relevant symbol, including functions, structs, unions, global constants and macros.

Once you have explored the repository and gathered enough context using the `search_definition`, `search_code` and `search_commits` actions, you should write a justification for why you have gathered all the context necessary to completely understand and repair this bug. If you are satisfied with your justification, you should take the `done` action to end exploration. If not, you should explore the repository code and commit history more. You must not stop until you have enough context to describe, in a specific and detailed manner, all possible root causes of the crash and how to fix it.

You should always review any responses you give, and ensure that you never make inferences or conclusions that don't directly follow from reading the {repoName} repository code and the crash report. You should always place any insights you gain from your analysis into your memory so that they are readily available. Focus on gathering all the relevant context and understanding the bug step-by-step, through careful reasoning steps.

For each step of your analysis, you must provide your response in the following format:

<thoughts>
Write your current analysis of the situation and reasoning about next steps. Be specific about what you've learned and what you need to investigate next. If you believe you have enough information to completely understand all possible root causes of the bug and how to fix it, you should write a justification here. Then you should analyse the justification and see if it holds up. If yes, you should take the `done` action to end exploration. If not, you should continue exploring.
</thoughts>

<actions>
Available actions (write one action per line with no additional formatting):

close_definition(filePath, symbolName, startLine)      Close the defintion of a specificed symbol that is not relevant to the bug (e.g. debug functions like dump_stack)
search_definition(filePath, symbolName)    Search for the definition of a specified symbol, which can be a function name, a struct name, a global constant, a union name or a macro name
search_definition(symbolName)              Search for the definition of a specified symbol (a function name, a struct name, a global constant, a union name or a macro name) when you don't know its file path. Use this to search for the definiton of symbols whose file you don't know.

search_code("regex pattern")               Search through all tracked files in the repository using git grep with regex pattern. Shows 2 context lines before and after each match, limited to 5 matches total.
                                           Examples: "function_name\\(" matches exact function calls
                                                     "struct \\w+_info" matches struct names ending in _info
search_commits("regex pattern")            Search historical commit messages and code changes in the repository using git log --grep and -G with regex pattern. Shows full commit message and complete patch for each matching commit. Limited to 5 matches total.
                                           Example: "handle->size|crypto_fun\\(|ptr->" matches commits that access handle->size, or call crypto_fun, or dereference ptr.

done                                       End exploration because you have gathered enough context to understand the bug and propose a fix. You must only take this action if you are satisfied with the justification you provided in the thoughts section. If unsure, continue exploring. You should also write your justification after this action inside <justification></justification> tags.

For done, write the action name alone on one line, then from the next line, write a specific and detailed justification inside <justification></justification> tags for why you have gathered all relevant context to understand and fix the bug. It should be formatted as follows:

done
<justification>
Specific and detailed reasoning about why the information you have gathered is enough to understand the bug and fix it. Talk about the possible root causes of the bug, how you can fix it, and how you the information is sufficient to reach these conclusions. Write at least 3-4 sentences.
</justification>

</actions>

<memory>
Record any important insights gained during this step that you want to remember for future steps. Each insight should be on a new line starting with a dash (-). These insights will be available in all future steps.
</memory>

You can, and should, combine multiple actions in a single response. Taking multiple independent actions in a single step saves time and money. The done action must be followed by the justification in <justification> tags.

Here are examples of responses in different scenarios:

{prompt_analysis_examples}

"""

GLOBAL_CTX_AGENT_SELECTION_SYSTEM_PROMPT = lambda prompt_preamble, repoName : f"""{prompt_preamble}

The context contains definitions of relevant symbols (e.g. functions, structs, unions, global constants, macros), regex search results from the {repoName} codebase, and regex search results from the historical commit messages and patches. One or more functions or other symbols may need to be edited to fix the bug.

You should select all the pieces of context that are relevant to completely understand and fix the bug. In case you are doubtful about some piece of context, you should still mark it as relevant. Your guiding principle is that if someone else is given just the crash report and the pieces of context you have marked as relevant, they should be able to understand all possible root causes of the crash, and propose fixes for the bug.

Every piece of context given to you will be in the following format:
<context id=<unique numeric id>>
Content of the context
</context>

You should write a detailed explanation of everything you have understood about the bug, why each piece of context you have selected is relevant to understanding and fixing the bug, and how different pieces of context are connected to each other. You should also write about all insights you have gained from the context analysis that will help in understanding the bug better.

You should then list all the context ids that you have selected as relevant to understanding and fixing the bug. You should provide a comma-separated list of all the context ids inside <contexts> tags, without any additional formatting.

You must provide your response in the following format:

<explanation>
Detailed reasoning about the bug, why each piece of context is relevant, and how different pieces of context are connected to each other.
</explanation>

<contexts>
Comma-separated list of all context ids that you have selected as relevant to understanding and fixing the bug, without any additional formatting.
</contexts>

"""

def generateGlobalCtxAgentSelectionPrompt(state: GlobalCtxAgentState, bugDict: Dict[str, any], buggyFunctions: List[SymbolDefinition], relevantLines: List[Tuple[str, str, Optional[str]]], repoName: str, logger: Logger) -> str:
    sections = []
    separator = "\n" + "=" * 80 + "\n"

    sections.append(f"You have gathered the following context from analysing a {repoName} crash report. Your task is to reason about which pieces of context are relevant to completely understand and fix the bug. You should select all the pieces of context that are relevant to the bug. In case you are doubtful about some piece of context, you should still mark it as relevant. Your guiding principle is that if someone else is given just the crash report and the pieces of context you have marked as relevant, they should be able to understand all possible root causes of the crash, and propose fixes for the bug.")
    sections.append(separator)

    ctxId = 1
    if buggyFunctions:
        sections.append("POTENTIALLY BUGGY FUNCTIONS:")
        for func in buggyFunctions:
            sections.append(f"<context id={ctxId}>")
            ctxId += 1
            sections.append(f"File: {func.filePath}\t\tFunction: {func.name}\t\tLines {func.start} to {func.end}")
            sections.append(f"```{getMarkdownShortForm(func.filePath)}")
            for i, l in enumerate(func.body.splitlines()):
                if list(filter(lambda f : (func.filePath == f[0]) and (str(i+func.start) == f[2]), relevantLines)):
                    sections.append(f"{i+func.start}| {l} // IMPORTANT LINE: This line is mentioned in the bug crash report. Pay attention to it while considering what pieces of context to mark as relevant from the repository code and commit history.")
                else:
                    sections.append(f"{i+func.start}| {l}")
            sections.append("```")
            sections.append("</context>")
            sections.append("")
        sections.append(separator)

    sections.append("SYMBOL DEFINITIONS:")
    ctxExhausted = False
    ctxLimit = 45000 - get_token_len(bugDict['crash_report_data'])
    numSymbolsAdded = 0
    for _, symbols in state.openDefinitions.items():
        for s in symbols:
            temp_sections = sections + [f"<context id={ctxId}>", f"File: {s.filePath}\t\tSymbol: {s.name}\t\tLines {s.start} to {s.end}", f"```{getMarkdownShortForm(s.filePath)}"]
            for i, l in enumerate(s.body.splitlines()):
                if list(filter(lambda f : (s.filePath == f[0]) and (str(i+s.start) == f[2]), relevantLines)):
                    temp_sections.append(f"{i+s.start}| {l} // IMPORTANT LINE: This line is mentioned in the bug crash report. Pay attention to it while considering what pieces of context to mark as relevant from the repository code and commit history.")
                else:
                    temp_sections.append(f"{i+s.start}| {l}")
            temp_sections += ["```", "</context>", ""]
            if get_token_len("\n".join(temp_sections)) <= ctxLimit:
                sections = temp_sections
                ctxId += 1
                numSymbolsAdded += 1
            else:
                logger.warning(f"Context Limit exceeded at:\nSymbol {s.name} in {s.filePath} at lines {s.start} to {s.end}")
                ctxExhausted = True
                break
        if ctxExhausted:
            break
    if numSymbolsAdded > 0:
        sections.append(separator)
    else:
        logger.warning("No symbol definitions added to context")
        sections = sections[:-1]

    if not ctxExhausted:
        sections.append("SEARCH QUERIES AND RESULTS:")
        queriesAdded = 0
        for query, results in state.pastQueries:
            results = [r for r in results if (r != "No matches found") and ("===Commits matching in" not in r) ]
            if not results: continue
            temp_sections = sections + [f"Query: {query}", "Results:"]
            for result in results:
                temp_sections += ["", f"<context id={ctxId}>", result, "</context>", "", "==="]
                if get_token_len("\n".join(temp_sections)) <= ctxLimit:
                    sections = temp_sections
                    ctxId += 1
                    queriesAdded += 1
                else:
                    logger.warning(f"Context Limit exceeded at:\nQuery: {query}\nResult:\n{result}")
                    ctxExhausted = True
                    break
            if ctxExhausted:
                break
        if queriesAdded > 0:
            sections.append(separator)
        else:
            logger.warning("No search queries added to context")
            sections = sections[:-1]

    sections.append("Inside <explanation> tags, provide a detailed reasoning about the bug, why each piece of context is relevant, and how different pieces of context are connected to each other. Then, list all the context ids that you have selected as relevant to understanding and fixing the bug inside <contexts> tags. If you are unsure, err on the side of including context in your relevant selection for safety. You should provide a comma-separated list of ids without any additional formatting.\n\n")

    return "\n".join(sections)

def generateGlobalCtxAgentReactPrompt(state: GlobalCtxAgentState, bugDict: Dict[str, any], buggyFunctions: List[SymbolDefinition], relevantLines: List[Tuple[str, str, Optional[str]]], maxSteps: int, repoName: str) -> str:
    if 'crash_report_data' not in bugDict:
        raise ValueError("Crash report cannot be empty")
    if 'title' not in bugDict:
        raise ValueError("Title cannot be empty")
    if not isinstance(state, GlobalCtxAgentState):
        raise ValueError("Invalid state object")
    if state.justification:
        raise ValueError("Justification should be None if generating a new prompt")

    sections = []
    separator = "\n" + "=" * 80 + "\n"

    sections.append(f"You are analyzing a {repoName} crash report. Your task is to explore the repository using available tools and gather enough context to completely understand all possible root causes of the crash and how to fix it.")
    sections.append(separator)

    # Step count
    sections.append(f"STEP {state.steps} of {maxSteps-1}")
    sections.append(separator)

    # Remove for conversational = True
    if state.steps == 0:
    # if True:
        # Bug title section
        sections.append(f"BUG TITLE: {bugDict['title']}")
        sections.append(separator)
    
        # Crash report section
        sections.append("CRASH REPORT:")
        sections.append(bugDict['crash_report_data'])
        sections.append(separator)

    # Feedback
    if state.feedback and state.feedback != "":
        sections.append("PREVIOUS ACTIONS FEEDBACK:")
        sections.append(state.feedback)
        sections.append(separator)
    
    # Prev queries section
    if state.prevQueries:
        sections.append("PREVIOUS SEARCHES AND RESULTS:")
        for query, results in state.prevQueries:
            sections.append(f"Query: {query}")
            sections.append(f"Results:")
            if not results:
                sections.append("No matches found")
            for result in results:
                sections.append("")
                sections.append(result)
                sections.append("")
                sections.append("===")
            sections.append("")
        sections.append(separator)

    # Memory section
    if state.memory:
        sections.append("MEMORY:")
        sections.extend(f"- {insight}" for insight in state.memory)
        sections.append(separator)

    # Open functions section
    if state.openDefinitions.items():
        sections.append("CURRENTLY OPEN DEFINITIONS:")
        for _, symbols in state.openDefinitions.items():
            for s in symbols:
                sections.append(f"File: {s.filePath}\t\tSymbol: {s.name}\t\tLines {s.start} to {s.end}")
                sections.append(f"```{getMarkdownShortForm(s.filePath)}")
                for i, l in enumerate(s.body.splitlines()):
                    if list(filter(lambda f : (s.filePath == f[0]) and (str(i+s.start) == f[2]), relevantLines)):
                        sections.append(f"{i+s.start}| {l} // IMPORTANT LINE: This line is mentioned in the bug crash report. Pay attention to it while considering what context to gather from the repository code and commit history.")
                    else:
                        sections.append(f"{i+s.start}| {l}")
                # sections.append("\n".join([f"{i+s.start}| {l}" for i, l in enumerate(s.body.splitlines())]))
                sections.append("```")
                sections.append("")
        sections.append(separator)

    # Potentially buggy functions section
    if buggyFunctions:
        sections.append("POTENTIALLY BUGGY FUNCTIONS:")
        for func in buggyFunctions:
            sections.append(f"File: {func.filePath}\t\tFunction: {func.name}\t\tLines {func.start} to {func.end}")
            sections.append(f"```{getMarkdownShortForm(func.filePath)}")
            for i, l in enumerate(func.body.splitlines()):
                if list(filter(lambda f : (func.filePath == f[0]) and (str(i+func.start) == f[2]), relevantLines)):
                    sections.append(f"{i+func.start}| {l} // IMPORTANT LINE: This line is mentioned in the bug crash report. Pay attention to it while considering what context to gather from the repository code and commit history.")
                else:
                    sections.append(f"{i+func.start}| {l}")
            # sections.append("\n".join([f"{i+func.start}| {l}" for i, l in enumerate(func.body.splitlines())]))
            sections.append("```")
            sections.append("")
        sections.append(separator)

    if (state.steps == maxSteps) and (not state.justification):
        sections.append("This is your last step. If you feel your context is enough, write a justification for why you have gathered all relevant context to understand the bug and propose a fix. Then take the `done` action to end exploration. If not, take all the actions you need in this step because it's your last chance to explore.")
    else:
        # Available actions reminder
        sections.append("AVAILABLE ACTIONS (write one action per line with no formatting inside <actions> tags):")
        sections.append("<actions>")
        sections.append("close_definition(filePath, symbolName, startLine)      Close a definition that is not relevant to the bug (e.g. debug functions like dump_stack)")
        sections.append("search_definition(filePath, symbolName)    Open the definition of a specified symbol, which can be a function name, a struct name, a constant, a union or a macro")
        sections.append("search_definition(symbolName)              Open the definition of a symbol when you don't know its file path. Use this to search for the definiton of symbols whose file you don't know.")

        sections.append("search_code(\"regex pattern\")               Search through all tracked files in the repository using git grep with regex pattern. Shows 2 context lines before and after each match, limited to 5 matches total.")
        sections.append("                                           Examples: \"function_name\\(\" matches exact function calls")
        sections.append("                                                      \"struct \\w+_info\" matches struct names ending in _info")
        sections.append('search_commits("regex pattern")            Search commit messages and code changes in the repository using git log --grep and -G with regex pattern. Shows full commit message and complete patch for each matching commit. Limited to 5 matches total.')
        sections.append('                                           Example: "handle->size|crypto_fun\\(|ptr->" matches commits that access handle->size, or call crypto_fun, or dereference ptr.')
        
        sections.append("done                                       End exploration because you have gathered enough context to understand the bug and propose a fix. You must only take this action if you are satisfied with the justification you provided in the thoughts section. If unsure, continue exploring. You should also write your justification after this action inside <justification></justification> tags.")
        sections.append("                                           (followed by justification in <justification> tags)")

        sections.append("</actions>")

        sections.append("You should examine the evidence and write your thoughts about the current state with respect to the overall goal in the <thoughts> tags. You should always place any insights you gain from your analysis into your memory inside <memory> tags so that they are readily available. Think of it like handing over the task to a coworker who needs to pick up from where you left off. Write everything the coworker would need.  Focus on gathering all the relevant context and understanding the bug step-by-step, through careful reasoning steps.")
    
    return "\n".join(sections)


class GlobalCtxAgent:

    def __init__(self, prompt_preamble:str, prompt_analysis_examples:str, bugDict: Dict[str, any], buggyFunctions: List[SymbolDefinition], relevantLines: List[Tuple[str, str, Optional[str]]], repoPath: Path, tagsFilePath: Path,logger: Logger, maxSteps: int = 10, repoName: str = "linux"):
        self.prompt_preamble = prompt_preamble
        self.prompt_analysis_examples = prompt_analysis_examples
        self.bugDict = bugDict
        self.buggyFunctions = buggyFunctions
        self.relevantLines = relevantLines
        self.repoPath = repoPath
        self.tagsFilePath = tagsFilePath
        self.logger = logger
        self.maxSteps = maxSteps
        self.repoName = repoName
        self.modelHandler = getModelHandler(
            modelName="gpt-4o",
            systemPrompt=GLOBAL_CTX_AGENT_MAIN_SYSTEM_PROMPT(prompt_preamble, prompt_analysis_examples, repoName),
            temperature=0.6,
            conversational=True,
            pinFirstN=1,
        )


    def _handleOpenDefinition(self, symbolName: str, filePath: Optional[str] = None) -> GlobalCtxAgentState:
        """Handle search_definition action."""
        self.logger.info(f"Opening symbol: {symbolName}{f' in {filePath}' if filePath else ''}")
        matches = findSymbolInTags(
            tagsFilePath=self.tagsFilePath,
            symbolName=symbolName,
            filePath=filePath,
            buggyFunctions=self.buggyFunctions,
            openDefinitions=self.state.openDefinitions,
            bugDict=self.bugDict,
            logger=self.logger
        )
        
        if not matches:
            self.logger.warning(f"Symbol {symbolName} not found{f' in {filePath}' if filePath else ''}")
            self.state.feedback += f"Error: Symbol {symbolName} not found{f' in {filePath}' if filePath else ''} for the search_definition action.\n"
            return self.state
            
        if len(matches) > 1:
            self.logger.info(f"Multiple matches found for {symbolName}:")
            for m in matches:
                self.logger.info(f"- {m['file']},{m['start']},{m['end']}")
                
        matches = matches[:5]
        # Create new state for modifications
        newState = deepcopy(self.state)
        
        # Try to add all matching functions
        for match in matches:
            func = createSymbolDefinition(
                filePath=match['file'],
                repoPath=self.repoPath,
                symbolName=symbolName,
                startLine=match['start'],
                endLine=match['end'],
                logger=self.logger    
            )
            
            if func is None:
                self.logger.warning(f"Failed to extract symbol from {match['file']}")
                continue
                
            if func not in newState.openDefinitions[func.filePath]:
                newState.openDefinitions[func.filePath].append(func)
            self.logger.debug(f"Added symbol from {func.filePath}")
        
        # If we couldn't add any symbols, return old state
        if newState.openDefinitions == self.state.openDefinitions:
            self.logger.warning("Failed to open any matching symbols")
            return self.state
            
        return newState

    def _handleCloseDefinition(self, symbolName: str, filePath: str, startLine: int) -> GlobalCtxAgentState:
        """Handle close_definition action."""
        self.logger.info(f"Closing symbol: {symbolName} in {filePath} at line {startLine}")
        newState = deepcopy(self.state)
        
        # If file path specified, only look there
        if filePath in newState.openDefinitions:
            before_len = len(newState.openDefinitions[filePath])
            newState.openDefinitions[filePath] = [f for f in newState.openDefinitions[filePath]
                                            if f.name != symbolName or f.start != startLine]
            if len(newState.openDefinitions[filePath]) == before_len:
                self.logger.warning(f"Symbol {symbolName} not found in {filePath}")
                return self.state
            if not newState.openDefinitions[filePath]:
                del newState.openDefinitions[filePath]
            self.logger.debug(f"Removed symbol from {filePath}")
        else:
            self.logger.warning(f"File {filePath} not found in open definitions")

        return newState

    def _handleSearchCode(self, pattern: str) -> GlobalCtxAgentState:
        """Search repository code using git grep with regex pattern."""
        self.logger.info(f"Searching code for: {pattern}")
        MAX_MATCHES = 5

        # Heirarchical search: first search in the files of the buggyFunctions, then in the files of the openDefinitions, then in the subsystems of the bugs, and then in all files
        allGlobs = getAllGlobs(
            bugDict=self.bugDict,
            buggyFunctions=self.buggyFunctions,
            openDefinitions=self.state.openDefinitions,
            logger=self.logger
        )

        searchResults: List[str] = []
        for c in allGlobs:
            try:
                self.logger.debug(f"Running command: git grep -E -l -i {pattern} {c}")
                # Use git grep with extended regex (-E) for searching
                result = run(["git", "grep", "-E", "-l", "-i", pattern] + c, 
                            cwd=self.repoPath, capture_output=True, text=True)
                self.logger.debug(f"git grep result: {result.stdout}")
            except Exception as e:
                self.logger.error(f"Error searching code: {e}")
                continue
                
            # Get matching files
            matching_files = result.stdout.strip().split('\n') if result.stdout else []
            
            if not matching_files or matching_files == ['']:
                self.logger.info("No matches found in repository")
                continue
            else:
                self.logger.debug(f"Found matches in {len(matching_files)} files")
                for file in matching_files:
                    try:
                        # -C 2 shows 2 lines of context before and after
                        self.logger.debug(f"Running command: git grep -E -n -i -p -C 2 {pattern} {file}")
                        file_result = run(["git", "grep", "-E", "-n", "-i", "-p", "-C", "2", pattern, file],
                                        cwd=self.repoPath, capture_output=True, text=True)
                        self.logger.debug(f"git grep result: {file_result.stdout}")
                    except Exception as e:
                        self.logger.error(f"Error searching code: {e}")
                        continue
                    if file_result.stdout:
                        matches = file_result.stdout.strip().split('--\n')
                        for i, m in enumerate(matches):
                            # If there is no line in m matching :(digits+):, then skip
                            if not re.search(r'\.(c|h):\d+:', m):
                                continue
                            # If there is no line in m matching =(digits+)=, then go to the previous match, and get the last line matching =(digits+)=. Add it at the beginning of m
                            if not re.search(r'\.(c|h)=\d+=', m):
                                if i > 0:
                                    prev_match = matches[i-1]
                                    for line in reversed(prev_match.splitlines()):
                                        if re.search(r'\.(c|h)=\d+=', line):
                                            m = line + "\n--\n" + m
                                            break
                            if m not in searchResults:
                                searchResults.append(m)
                            if len(searchResults) >= MAX_MATCHES:
                                break
                    if len(searchResults) >= MAX_MATCHES:
                        break
            if len(searchResults) >= MAX_MATCHES:
                break

        searchResults = searchResults[:MAX_MATCHES]

        if not searchResults:
            self.logger.info("No matches found in repository")
        
        # Create new state with search results
        newState = deepcopy(self.state)
        if (f"search_code(\"{pattern}\")", searchResults) not in newState.pastQueries:
            newState.pastQueries.append((f"search_code(\"{pattern}\")", searchResults))
        if (f"search_code(\"{pattern}\")", searchResults) not in newState.prevQueries:
            newState.prevQueries.append((f"search_code(\"{pattern}\")", searchResults))
        return newState

    def _handleSearchCommits(self, pattern: str) -> GlobalCtxAgentState:
        """Search commit messages and patches using git log with regex pattern."""
        self.logger.info(f"Searching commits for pattern: {pattern}")
        MAX_MATCHES = 5
        msgResults, codeResults = [], []

        # Hierarchical search: first search in the files of the buggyFunctions, then in the files of the openDefinitions, then in the subsystems of the bugs, and then in all files
        allGlobs = getAllGlobs(
            bugDict=self.bugDict,
            buggyFunctions=self.buggyFunctions,
            openDefinitions=self.state.openDefinitions,
            logger=self.logger
        )

        for c in allGlobs:
            # Search code changes
            try:
                self.logger.debug(f"Running command: git log -E -G {pattern} -p --max-count={MAX_MATCHES - len(msgResults) - len(codeResults)} {self.bugDict['crashes'][0]['kernel-source-commit']}" + " ".join(c))
                proc = run(["git", "log", "-E", "-G", pattern, "-p", f"--max-count={MAX_MATCHES - len(msgResults) - len(codeResults)}", self.bugDict['crashes'][0]['kernel-source-commit']] + c,
                                cwd=self.repoPath, capture_output=True, text=True, timeout=60)
                self.logger.debug(f"git log result: {proc.stdout}")
                code_result = proc.stdout
            except Exception as e:
                self.logger.error(f"Error searching commits: {e}")
                code_result = ""
            code_commits = code_result.split('\ncommit ') if code_result else []
            self.logger.info(f"Found {len(code_commits)} code change matches")
            if code_commits:
                for commit in code_commits:
                    msgEnd = commit.find('\ndiff --git')
                    if msgEnd == -1:
                        continue
                    commitMsg = commit[:msgEnd].strip()
                    patch = commit[msgEnd:].strip()
                    commitStr = f"Commit message:\n{commitMsg}\n\nPatch:\n{patch}"
                    # Truncate commitStr if too long
                    if len(commitStr.splitlines()) > 100:
                        commitStr = "\n".join(commitStr.splitlines()[:100]) + "\n... (Truncated due to length) ...\n"
                    if (commitStr not in codeResults) and (commitStr not in msgResults) and (commitStr not in [x for _,r in self.state.pastQueries for x in r]) and (commitStr):
                        codeResults.append(commitStr)
            if len(codeResults) + len(msgResults) >= MAX_MATCHES:
                break

            try:
                # Search commit messages
                self.logger.debug(f"Running command git log -E --grep {pattern} -p --max-count={MAX_MATCHES - len(msgResults) - len(codeResults)} {self.bugDict['crashes'][0]['kernel-source-commit']} {' '.join(c)}")
                proc = run(["git", "log", "-E", "--grep", pattern, "-p", f"--max-count={MAX_MATCHES - len(msgResults) - len(codeResults)}", self.bugDict['crashes'][0]['kernel-source-commit']] + c,
                                cwd=self.repoPath, capture_output=True, text=True, timeout=60)
                self.logger.debug(f"git log result: {proc.stdout}")
                msg_result = proc.stdout
            except Exception as e:
                self.logger.error(f"Error searching commits: {e}")
                msg_result = ""
            msg_commits = msg_result.split('\ncommit ') if msg_result else []
            self.logger.info(f"Found {len(msg_commits)} message matches")
            if msg_commits:
                for commit in msg_commits:
                    msgEnd = commit.find('\ndiff --git')
                    if msgEnd == -1:
                        continue
                    commitMsg = commit[:msgEnd].strip()
                    patch = commit[msgEnd:].strip()
                    msgStr = f"Commit message:\n{commitMsg}\n\nPatch:\n{patch}"
                    # Truncate msgStr if too long
                    if len(msgStr.splitlines()) > 100:
                        msgStr = "\n".join(msgStr.splitlines()[:100]) + "\n... (Truncated due to length) ...\n"
                    if (msgStr not in codeResults) and (msgStr not in msgResults) and (msgStr not in [x for _,r in self.state.pastQueries for x in r]) and (msgStr):
                        msgResults.append(msgStr)
                
            if len(msgResults) + len(codeResults) >= MAX_MATCHES:
                break
            
        searchResults: List[str] = []
        if not codeResults and not msgResults:
            self.logger.info("No matches found in repository")
        else:
            codeResults = codeResults[:MAX_MATCHES]
            msgResults = msgResults[:(MAX_MATCHES - len(codeResults))]
            if codeResults:
                codeResults = ["===Commits matching in code changes==="] + codeResults
            if msgResults:
                msgResults = ["===Commits matching in messages==="] + msgResults
                if codeResults:
                     msgResults[0] = "\n\n" + msgResults[0] # Add separator to first result so that generateUserPrompt can treat all uniformly
            searchResults = codeResults + msgResults
        # Create new state with search results
        newState = deepcopy(self.state)
        if (f"search_commits(\"{pattern}\")", searchResults) not in newState.pastQueries:
            newState.pastQueries.append((f"search_commits(\"{pattern}\")", searchResults))
        if (f"search_commits(\"{pattern}\")", searchResults) not in newState.prevQueries:
            newState.prevQueries.append((f"search_commits(\"{pattern}\")", searchResults))
        return newState

    def _processResponse(self, response: str) -> GlobalCtxAgentState:
        """Process LLM response and execute actions."""
        self.logger.debug("Processing LLM response")
        newState = deepcopy(self.state)
        
        # Extract sections using regex
        thoughtsMatch = re.search(r'<thoughts>(.*?)</thoughts>', response, re.DOTALL)
        actionsMatch = re.search(r'<actions>(.*?)</actions>', response, re.DOTALL)
        memoryMatch = re.search(r'<memory>(.*?)</memory>', response, re.DOTALL)
        
        if not actionsMatch:
            self.logger.error("Error: Response missing required actions section")
            newState.feedback += "Error: Response missing required actions section inside opening and closing <action> tags. Please try again.\n"
            return newState
            
        # Process thoughts if present (optional)
        if thoughtsMatch:
            self.logger.info(f"Thoughts: {thoughtsMatch.group(1).strip()}")
        
        # Process memory if present
        if memoryMatch:
            try:
                memory_text = memoryMatch.group(1).strip()
                memories = [m.strip('- ').strip() for m in memory_text.split('\n')]
                memories = [m for m in memories if m and (m not in self.state.memory)]
                newState.memory.extend(memories)
                self.logger.debug(f"Added {len(memories)} items to memory")
            except Exception as e:
                self.logger.error(f"Error processing memory section: {e}")
        
        # Process actions
        actions_text = actionsMatch.group(1).strip()
        actions = [line.strip() for line in actions_text.split('\n') if line.strip()]
        
        self.logger.debug(f"Processing {len(actions)} actions")
        for action in actions:
            self.state = newState
            try:
                if action.startswith('search_definition(') or action.startswith('close_definition('):
                    is_open = action.startswith('search_definition(')
                    func_name = 'search_definition(' if is_open else 'close_definition('
                    args = action[len(func_name):-1]  # Remove function name and ')'
                    
                    if not args:
                        self.logger.warning(f"Empty {func_name[:-1]} call")
                        newState.feedback += f"Error: Empty {func_name[:-1]} call. Please provide appropriate arguments.\n"
                        continue
                        
                    # Split arguments and remove quotes
                    args = [arg.strip().strip('"\'') for arg in args.split(',')]

                    if len(args) < 3 and not is_open:
                        self.logger.warning(f"Invalid number of arguments for close_definition: {action}")
                        newState.feedback += f"Error: Invalid number of arguments for close_definition: {action}. Usage: `close_definition(filePath, symbolName, startLine)`.\n"
                        continue
                    
                    if len(args) == 1:
                        newState = self._handleOpenDefinition(symbolName=args[0])
                    elif len(args) == 2:
                        newState = self._handleOpenDefinition(symbolName=args[1], filePath=args[0])
                    elif len(args) == 3:
                        newState = ( self._handleOpenDefinition(symbolName=args[1], filePath=args[0]) if is_open
                            else self._handleCloseDefinition(symbolName=args[1], filePath=args[0], startLine=int(args[2])))
                    else:
                        self.logger.warning(f"Invalid number of arguments for {func_name[:-1]}: {action}")
                        newState.feedback += f"Error: Invalid number of arguments for {func_name[:-1]}: {action}. Please provide appropriate arguments.\n"
                        continue
                        
                elif action.startswith('search_code(') or action.startswith('search_commits('):
                    is_code = action.startswith('search_code(')
                    func_name = 'search_code(' if is_code else 'search_commits('
                    query = action[len(func_name):-1]  # Remove function name and ')'
                    
                    if not query:
                        self.logger.warning(f"Empty {func_name[:-1]} query")
                        newState.feedback += f"Error: Empty {func_name[:-1]} query. Please provide a valid regex pattern.\n"
                        continue
                        
                    # Remove quotes from query
                    query = query.strip().strip('"\'')
                    
                    newState = (self._handleSearchCode(query) if is_code 
                            else self._handleSearchCommits(query))

                elif action == "done":
                    justificationMatch = re.search(r'<justification>(.*?)</justification>', response, re.DOTALL)
                    if not justificationMatch:
                        self.logger.error("Error: done action without justification")
                        newState.feedback += "Error: done action taken without justification content with opening and closing <justification> tags. Please try again.\n"
                        return newState
                    justification = justificationMatch.group(1).strip()
                    newState.justification = justification
                    self.logger.info("Justification provided and done action taken")
                        
                else:
                    self.logger.warning(f"Unknown action: {action}")
                    continue

            except Exception as e:
                self.logger.error(f"Error processing action {action}: {e}")
                continue
        
        return newState

    def _performContextSelection(self) -> GlobalCtxAgentResult:
        """Perform context selection using LLM."""
        self.logger.info("Performing context selection")
        result = GlobalCtxAgentResult(
            steps = 0,
            relevanceExplanation = "",
            relevantFunctions = [],
            relevantDefinitions = {},
            relevantQueries = [],
            promptTokens = 0,
            completionTokens = 0,
            finalReactState = deepcopy(self.state),
        ) # Save final state for debugging

        # Replace system prompt
        self.modelHandler._system_message = {
            "role": "system",
            "content": GLOBAL_CTX_AGENT_SELECTION_SYSTEM_PROMPT(self.prompt_preamble, self.repoName),
        }
        # Generate context selection prompt
        prompt = generateGlobalCtxAgentSelectionPrompt(
            state=self.state,
            bugDict=self.bugDict,
            buggyFunctions=self.buggyFunctions,
            relevantLines=self.relevantLines,
            repoName=self.repoName,
            logger=self.logger
        )
        self.logger.debug("Generated context selection prompt")
        self.logger.debug("PROMPT:\n" + "="*80 + "\n" + prompt + "\n" + "="*80)
        
        # Get response from LLM
        responses, tokenCounts = self.modelHandler.get_responses(prompt, return_token_count=True)
        self.state.promptTokens += tokenCounts['prompt_tokens']
        self.state.completionTokens += tokenCounts['completion_tokens']
        response = responses[0]
        self.logger.debug("Received response from LLM")
        self.logger.debug("RESPONSE:\n" + "="*80 + "\n" + response + "\n" + "="*80)
        
        # Process response
        explanationMatch = re.search(r'<explanation>(.*?)</explanation>', response, re.DOTALL)
        if not explanationMatch:
            self.logger.error("Error: Context selection response missing explanation")
            explanation = ""
        else:
            explanation = explanationMatch.group(1).strip()
            self.logger.info(f"Context selection explanation: {explanation}")
        result.relevanceExplanation = explanation
        
        contextsMatch = re.search(r'<contexts>(.*?)</contexts>', response, re.DOTALL)
        if not contextsMatch:
            self.logger.error("Error: Context selection response missing contexts")
            contexts = []
        else:
            contexts = contextsMatch.group(1).strip().split(",")
            contexts = [c.strip() for c in contexts if c.strip()]
            self.logger.info(f"Selected contexts: {contexts}")
        ctxIds = []
        for c in contexts:
            if c.isdigit():
                ctxIds.append(int(c))
            else:
                self.logger.error(f"Error parsing context id: {c}. Will take entire context just to be safe.")
                ctxIds = []
                break
        self.logger.info(f"Selected context ids: {ctxIds}")
        
        ctxId = 1
        for func in self.buggyFunctions:
            if (ctxId in ctxIds) or (not ctxIds):
                if func not in result.relevantFunctions:
                    result.relevantFunctions.append(func)
            ctxId += 1

        for filePath, symbols in self.state.openDefinitions.items():
            for s in symbols:
                if (ctxId in ctxIds) or (not ctxIds):
                    if filePath not in result.relevantDefinitions:
                        result.relevantDefinitions[filePath] = []
                    if s not in result.relevantDefinitions[filePath]:
                        result.relevantDefinitions[filePath].append(s)
                ctxId += 1
        
        for (query, results) in self.state.pastQueries:
            newResults = []
            for r in results:
                if ("===Commits matching in" in r) or (r == "No matches found"):
                    continue

                if (ctxId in ctxIds) or (not ctxIds):
                    newResults.append(r)
                ctxId += 1
            if newResults:
                result.relevantQueries.append((query, newResults))

        result.steps = self.state.steps + 1
        result.promptTokens = self.state.promptTokens
        result.completionTokens = self.state.completionTokens
        return result

    def run(self) -> GlobalCtxAgentResult:
        """Run the react loop to gather context and return the hypothesis, open definitions, and queries."""
        self.state = GlobalCtxAgentState()
        
        while self.state.steps <= self.maxSteps and (not self.state.justification):
            self.logger.info(f"Step {self.state.steps} of {self.maxSteps}")
            
            prompt = generateGlobalCtxAgentReactPrompt(
                state=self.state,
                bugDict=self.bugDict,
                buggyFunctions=self.buggyFunctions,
                relevantLines=self.relevantLines,
                maxSteps=self.maxSteps,
                repoName=self.repoName,
            )
            self.state.feedback = ""
            self.state.prevQueries = [] #By clearing this, model only sees each query,result once
            self.logger.debug("Generated prompt for LLM")
            self.logger.debug("PROMPT:\n" + "="*80 + "\n" + prompt + "\n" + "="*80)
            
            responses, token_counts  = self.modelHandler.get_responses(prompt, return_token_count=True)
            self.state.promptTokens += token_counts['prompt_tokens']
            self.state.completionTokens += token_counts['completion_tokens']
            response = responses[0]
            self.logger.debug("Received response from LLM")
            self.logger.debug("RESPONSE:\n" + "="*80 + "\n" + response + "\n" + "="*80)
            
            newState = self._processResponse(response)
            self.state = newState
            self.state.steps += 1

        if self.state.justification:
            self.logger.info("Final Justification:\n" + self.state.justification)
        else:
            self.logger.info("Max steps reached without justification")
        
        # Perform context selection
        result = self._performContextSelection()
        
        self.logger.info("Run completed")
        # Log final explanation
        self.logger.info("Final relevance explanation:\n" + result.relevanceExplanation)

        return result