# SimuGen: Multi-modal Agentic Framework for Constructing Block **Diagram-Based Simulation Models**

**Anonymous ACL submission** 

#### Abstract

Recent advances in Large Language Models (LLMs) have demonstrated remarkable capa-004 bilities in mathematical reasoning and code generation. However, LLMs still perform poorly in the simulation domain, especially when tasked with generating Simulink mod-800 els, which are essential in engineering and scientific research. Our preliminary experiments reveal that LLM agents struggle to produce reliable and complete Simulink simulation codes from text-only inputs, likely due to insufficient 012 Simulink-specific data during pre-training. To 014 address this gap, we introduce SimuGen, a multi-modal agentic framework designed to automatically generate accurate Simulink simulation code by leveraging both the visual 018 Simulink diagram image and domain knowledge. SimuGen coordinates several specialized agents-including an Investigator, a unit test reviewer, a code generator, an executor, a debug locator, and a report writer-supported by a domain-specific database. This collaborative, modular architecture enables interpretable and robust simulation generation for Simulink.<sup>1</sup>

#### Introduction 1

011

017

023

024

027

030

"One test result is worth one thousand expert opinions."

— Wernher von Braun, Father of Rocket Science

Simulation models are indispensable for testing complex systems across domains such as automotive, aerospace, and robotics. In industry and academia, Simulink, a tool for modeling and simulation of dynamic systems in the MATLAB environment, has emerged as a prominent block-diagram platform for simulation (MathWorks, 2025).

Recent advances in LLMs have demonstrated remarkable abilities in automating software development and code generation. Models like SWEagent (Yang et al., 2024) and OpenAI's Codex can

translate natural language into program code with increasing proficiency. These successes, however, are mostly confined to textual programming languages, leaving a gap in support for graphical simulation and modeling environments. In particular, existing LLM-based pipelines lack any native capability for Simulink model generation or manipulation. The fundamental difference between textual code and block-diagram models presents a unique challenge: LLMs trained predominantly on linear text struggle to represent and reason about the two-dimensional structure of diagrams (Abdalla and Smith, 2025). Furthermore, unlike the wealth of open-source code available for training, there is a paucity of public data for Simulink or similar graphical models (Shrestha and Csallner, 2021). This data scarcity has hindered the specialization of LLMs for simulation tasks – early attempts like SLGPT (a GPT-2 fine-tuned on just 400 Simulink models) showed promise in generating simple block diagrams, but were limited by small corpus size (Shrestha and Csallner, 2021) and could not generalize to complex systems. More recent efforts in the automotive domain fine-tuned 7B-parameter models for Simulink function generation and still had to synthesize training data via self-instruction due to the lack of real examples (Abdalla and Smith, 2025). No prior work, to our knowledge, has achieved a general LLM-driven solution for automatically constructing full Simulink models from high-level specifications. Beginners usually replicate the now-existing Simulink diagram images to learn Simulink. Inspired by this, we pose the following research question: Is it possible to automatically obtain a complete and correct simulation code from a textual simulation descriptions and the Simulink diagram images?

041

042

043

044

045

047

049

052

053

055

059

060

061

062

063

064

065

066

067

068

069

070

071

072

073

074

075

076

077

078

081

To bridge the gap between advanced LLM capabilities and Simulink-based modeling, we introduce SimuGen – a novel multimodal agentic framework for automatic construction of Simulink simulation

<sup>&</sup>lt;sup>1</sup>Code will be available at: https://anonymous.4open. science/r/SimuGen-CD71

environment. This agentic architecture ensures a modular, interpretable, and extensible workflow for universal simulation data generation. The contribution of this work can be summarized as:

models. SimuGen leverages a state-of-the-art mul-

timodal LLM as its core reasoning engine, inte-

grated with a suite of tools tailored for the Simulink

• Multimodal LLM Agent for Simulink Gen-

eration: We propose the first framework that

enables an LLM-based agent to interpret di-

agram images and generate corresponding

Simulink models. Our system, SimuGen, com-

bines visual understanding of block diagrams

with language-based reasoning and planning,

emulating how human engineers translate dia-

• Multi-Agent Workflow with Standard-

Conformance Reviewer: We design a multi-

agent system incorporating a specialized unit

test reviewer, inspired by SWE agents (Yang

et al., 2024). Unlike traditional unit tests

that check code execution, our reviewer eval-

uates whether the investigator agent's natu-

ral language-generated components align with

• Empirical Insights on Reviewer Effective-

ness: Case studies reveal that while the re-

viewer cannot directly assess the correctness

of block connections, it can verify their com-

pliance with modeling standards-thereby in-

directly improving the overall accuracy and

reliability of the generated models.

Despite Simulink's widespread use and its advan-

tages in modeling physical systems through graph-

ical interfaces, automating its usage with LLMs

introduces significant challenges. Through our

initial experiments, we observed that even ad-

vanced LLMs struggle with reliably generating

valid Simulink models from either textual or visual

inputs. These issues can be broadly categorized

into three main types of errors, each pointing to

a distinct deficiency in the model's reasoning or

**Conceptual Errors: Missing Domain** 

Challenges

2

2.1

grams into executable models.

Simulink modeling standards.

090

083

087

- 100 101
- 102 103

105

106

107 108

110

111

112 113

114 115

116 117

118 119

121 122

123 124

125

126 127

129

These errors emerge when LLMs attempt to generate a Simulink model from a textual description

implementation capabilities.

Knowledge

without a sound understanding of domain-specific principles. For instance, an LLM may fail to grasp the mathematical structure underlying a control system or misinterpret the functional role of certain blocks in a feedback loop. Such text-to-simulation failures are often rooted in the model's lack of grounding in physical reasoning or engineering theory.

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

An illustrative example is provided in Appendix A, where the LLM fails to properly measure the capacitor voltage in a Simscape-based RC circuit by directly tapping a conserving port rather than using a voltage sensor. This oversight reflects a misunderstanding of Simscape's physical signal architecture.

To mitigate such issues, we propose using diagram inputs to help the LLM infer the structure of the model, allowing it to focus on retrieving domain-aligned block functions rather than generating topology from scratch. This approach explicitly embeds high-level intent and structural logic into the input, thereby improving conceptual coherence.

#### 2.2 **Block Implementation Errors: Incorrect Block and Port Usage**

When generating MATLAB code to build models, LLMs often produce syntactically correct but semantically flawed outputs. These include referencing invalid block library paths, misconfiguring port numbers, or confusing block names-particularly for uncommon or custom components.

As illustrated in Figure 2, these errors are commonly observed when users request LLMs to construct Simulink models via MATLAB code. Examples include incorrect block paths, mismatched ports, and duplicated connections, all of which contribute to broken model logic and execution failures.

This category of errors reflects a lack of structured implementation knowledge. To mitigate this, we propose constructing a *block description* database that provides LLMs with access to accurate, structured metadata about block functions, parameter names, port configurations, and usage examples.

#### 2.3 Linkage Implementation Errors: Invalid Connections

In diagram-to-simulation workflows, LLMs frequently generate invalid wiring configurations that violate Simulink standards. Examples include connecting incompatible ports, omitting essential inter-



Figure 1: Common errors when users want LLM to generate MATLAB code for constructing Simulink model, such as incorrect block paths, port mismatches, and duplicated connections.

mediary blocks, or mislabeling ports. These errors often bypass initial syntax checks but fail during execution, making them costly to debug.

181

182

188

189

190

191

192

To address this, we developed a *Unit Test Reviewer* and *Debug Locator* module to systematically evaluate connection integrity. The Unit Test Reviewer validates that all block connections adhere to Simulink specifications, while the Debug Locator assists in diagnosing faults by linking runtime errors to specific connection descriptions or block implementations.

## 2.4 Context Window Bottlenecks in Single LLM Agents

193Recent studies have highlighted the limitations of194single LLMs in handling long-context tasks. For195instance, Hosseini and Others (2025) demonstrated196that even LLMs with extended context windows197struggle with long input sequences, leading to per-198formance degradation. Similarly, Fei et al. (2024)199proposed a semantic compression method to miti-200gate the challenges posed by long inputs, emphasiz-

ing the inherent difficulties faced by single LLMs. To address these issues, Zhang et al. (2024) introduced a multi-agent framework that enables better information aggregation and context reasoning across various LLMs. 201

202

203

204

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

Similarly, our investigation also revealed that using a single LLM to autonomously manage the entire Simulink automation pipeline—including block recognition, connection validation, code generation, and debugging-presents fundamental limitations. As the number of dialogue rounds increases and task types shift, the LLM's contextual focus tends to degrade, leading to a decline in output quality. Furthermore, we observed a recurring pattern of overconfidence, where the LLM assumes the correctness of its generated connections without critically evaluating their validity. These issues result in persistent, hard-to-detect errors and expose the fragility of a single-agent workflow. All these factors motivate us to develop a multi-agent framework to solve the simulation automation tasks.

224

226

227

236

238

239

240

241

242

243

245

247

255

259

260

## 3 Methodology

## 3.1 SimuGen Framework Overview

After identifying that current LLMs exhibit deficiencies in both conceptual understanding and implementation knowledge when generating Simulink simulations-as well as a notable decline in contextual focus as model complexity increases-we propose the SimuGen Framework to address these challenges, as shown in Figure 2. Rather than relying on a single LLM to reason about the entire modeling process, which is prone to context window limitations and loss of focus, SimuGen employs a multi-agent system in which each agent is responsible for a specialized subtask. By decomposing the workflow, the multi-agent approach ensures that each agent maintains high task-specific attention, mitigating the risk of contextual degradation that plagues monolithic LLM solutions. Concretely, SimuGen takes a Simulink diagram image as input, providing explicit structural guidance and obviating the need for the LLM to infer model topology from scratch. The six specialized agents (Investigator, Unit Test Reviewer, Block Builder, Executor, Debug Locator, and Report Writer), together with a comprehensive Simulink block database, collaborate to bridge knowledge gaps and deliver robust, accurate simulation generation.

## 3.2 Database Construction

To provide agents with accurate and contextually relevant reference knowledge, we construct a comprehensive Simulink database that contains Full Block Descriptions, Function Descriptions, and Code Templates. The Full Block Description covers 50 commonly used Simulink blocks, detailing their block types, library paths, underlying principles, connection-related parameter specifications, and port names along with their descriptions. For a thorough explanation of the database construction process and illustrative examples, please refer to Appendix D.

## 3.3 LLM Agent Settings

**Investigator.** The input to the Investigator consists of a Simulink diagram image and a block library containing 50 block types. The Investigator first identifies each block in the diagram image along with its corresponding block type. Leveraging an agentic retrieval-augmented generation (RAG) process, it then queries the database to retrieve the Full Block Descriptions for the relevant

blocks. Subsequently, the Investigator outputs the connectivity relationships between blocks in the diagram image, formatted as: BlockA (BlockA's block type) PortX <-> BlockB (BlockB's block type) PortY. The complete Investigator prompt is provided in Appendix C.

271

272

273

274

275

276

277

278

279

280

281

282

285

286

288

290

291

292

293

294

295

296

297

298

300

301

302

303

304

305

306

307

308

309

310

311

312

313

314

315

316

317

318

319

Unit Test Reviewer. The Unit Test Reviewer receives as input the set of blocks and their connections identified by the Investigator within a given Simulink diagram image, along with the corresponding Full Block Descriptions. The Unit Test Reviewer assesses whether the proposed connections adhere to Simulink standards by performing eight checks: 1. Identify the existence of block list; 2. Identify any extra blocks; 3. Formatting of block name; 4. Formatting of connection description; 5. Validate parameter settings in connections; 6. Detect duplicate connections; 7. Validate block connection types; 8. Verify complete port connections. The complete Unit Test Reviewer prompt is provided in Appendix C.

**Block Builder.** The Block Builder utilizes the Code Template, Function Description, and Full Block Description from the database to implement the simulation code for the diagram image in Simulink using matlab.engine, based on the connection descriptions provided by the Investigator. It is important to note that this work primarily focuses on the wiring aspects of simulations, therefore, the Block Builder is instructed to generate code exclusively with the add\_block and add\_line functions, and to use set\_param only when setting parameters related to the connections. The complete Block Builder prompt can be found in Appendix C.

**Executor.** The Executor is an automated component responsible for running the code generated by the Block Builder. It executes the provided code and automatically returns the results of the execution.

**Debug Locator.** The Debug Locator analyzes error messages from the Executor to pinpoint the cause and identifies the relevant 5–10 lines of code for the Block Builder to modify. In practice, we observed that most errors arise from logical flaws in the connection description that the Unit Test Reviewer fails to detect. Direct code modification in these cases would bypass the Investigator, resulting in code inconsistent with the original diagram.



Figure 2: Overview of the agent-based architecture used in the SimuGen framework.

To resolve this, the Debug Locator first checks whether the code's wiring and parameter settings match the connection description. If they match, the issue likely lies in the connection description itself, the Debug Locator then prompts the Unit Test Reviewer and Investigator to re-examine and revise it. If inconsistencies or syntax errors are found, feedback is sent directly to the Block Builder for code correction. The complete Debug Locator prompt is provided in Appendix C.

320

321

322

323

324

326

328

331

341

342

344

347

348

**Report Writer.** After the simulation has executed successfully, the Report Writer generates a comprehensive simulation report by synthesizing Full Block Descriptions, connection descriptions, and code implementation details. Specifically, the report generated by the Report Writer is structured into four sections: (1) What is the simulation about? (2) What are the main simulation steps? (3) What theoretical knowledge and mathematical modeling are involved in each step? (4) How is each step implemented in code? The complete Report Writer prompt can be found in Appendix C.

#### 3.4 Full Agentic Workflow

The overall workflow proceeds as follows:

 The Investigator receives a Simulink diagram and a block library, identifies present blocks, retrieves their Full Block Descriptions from the RAG database, and extracts connection descriptions.

2. The Unit Test Reviewer checks whether the

connection descriptions comply with the Simulink rules. If violations are found, the Investigator revises the connection descriptions based on the reviewer's feedback.

349

350

351

352

353

354

355

357

358

359

361

362

363

364

365

366

367

368

369

370

371

372

373

374

375

377

3. If valid, the Investigator passes the descriptions to the Block Builder, who generates simulation code using the matlab.engine library.

4. The executor will execute the code passed from the Block Builder.

5. For execution errors, the Debug Locator determines if the issue is due to code generation or missed logical errors, providing targeted feedback to the Block Builder or Investigator for further revision.

6. After successful execution, the Report Writer compiles the latest block descriptions, connection details, and code into a comprehensive simulation report.

#### 4 **Experiments**

#### 4.1 Experimental Setup

**Task Selections.** To comprehensively evaluate the performance of SimuGen, we selected nine Simulink simulations spanning four representative domains: General Applications (covering artificial algebraic loops, Simulink models, state events, and zero-crossing detection), Physical Modeling (bouncing ball), Electrical System Modeling (including 2-bus load flow, bipolar transistor, and RC circuit), and Automotive Modeling (wheel speed).

| Simulation Domain           | Simulation Name            | GT Blocks | GT Connections |
|-----------------------------|----------------------------|-----------|----------------|
|                             | Artificial Algebraic Loops | 5         | 5              |
| General Application         | State Event                | 5         | 6              |
|                             | Zero Crossing Detection    | 9         | 10             |
|                             | Simulink Model             | 11        | 13             |
| Physical Modeling           | Bouncing Ball              | 7         | 7              |
|                             | Bipolar Transistor         | 13        | 20             |
| Electrical Systems Modeling | RC Circuit                 | 14        | 16             |
|                             | 2 Bus Loadflow             | 14        | 13             |
| Automotive Modeling         | Wheel Speed                | 12        | 11             |

Table 1: Simulation domains and complexity of ground truth (GT) models for 9 simulation examples.

Table 1 summarizes the complexity of the ground
truth (GT) models corresponding to these simulation examples. In our study, model complexity is
quantified by the number of blocks and connections
present in each GT model.

LLM Selections. Our framework involves five agents in total. Among them, the Investigator Agent is responsible for both visual recognition and visual reasoning tasks, while the remaining agents are focused solely on reasoning. Accordingly, we selected GPT-4.1 and o4-mini as the LLM backbones for the Investigator Agent. GPT-4.1 was chosen due to its status as OpenAI's flagship model and its state-of-the-art performance in multimodal benchmarks. o4-mini, on the other hand, was selected for its support of vision-integrated chain-ofthought reasoning, a feature we hypothesize to be 394 particularly beneficial for the Investigator Agent's interaction with the Unit Test Reviewer Agent. For the other agents, all of which primarily require reasoning capabilities, we use o3-mini as the underlying LLM, striking a balance between performance and computational efficiency. 400

Accuracy of the Generation. To evaluate the accuracy of the generated simulations produced by the framework, we define the overall accuracy as the average of the block accuracy and connection accuracy, calculated as follows:

Accuracy =  $\frac{1}{2} \left( \frac{|B_{\text{match}}|}{|B_{\text{GT}}|} + \frac{|C_{\text{match}}|}{|C_{\text{GT}}|} \right)$ 

where:

407

401

402

403

404

405

406

408

409

• *B*<sub>GT</sub> is the set of blocks in the ground truth (GT) model.

Table 2: Accuracy (%) on 9 simulation examples: Investigator agent using o4-mini/GPT-4.1, others use o3-mini.

| Method                     | o4-mini | GPT-4.1 |
|----------------------------|---------|---------|
| Artificial Algebraic Loops | 100     | 100     |
| State Event                | 73.3    | 68.55   |
| Zero Crossing Detection    | 95.5    | 72.25   |
| Simulink Model             | 100     | 87.75   |
| Bouncing Ball              | 100     | 100     |
| Bipolar Transistor         | 95      | 77.5    |
| RC Circuit                 | 95.3    | 71.88   |
| 2 Bus Loadflow             | 100     | 88.7    |
| Wheel Speed                | 91.25   | 82.15   |
| Average                    | 94.5    | 83.2    |

•  $B_{\text{gen}}$  is the set of blocks in the generated model.

410

411

412

413

414

415

416

417

418

419

420

421

422

423

424

425

426

- $B_{\text{match}} = B_{\text{GT}} \cap B_{\text{gen}}$  is the set of correctly predicted blocks, i.e., blocks that exist in both the GT and generated models.
- C<sub>GT</sub> is the set of connections (e.g., edges or links) in the ground truth model.
- C<sub>gen</sub> is the set of connections in the generated model.
- $C_{\text{match}} = C_{\text{GT}} \cap C_{\text{gen}}$  is the set of correctly predicted connections, i.e., connections that exist in both the GT and generated models.

## 4.2 Main Results

From Table 2, We observe that the Investigator agent successfully completes all 9 tasks when using either o4-mini or GPT-4.1. However, when powered by o4-mini, the agent achieves a Simulink

(1)

| Simulation Example         | Cost (\$) | Run Time (s) |
|----------------------------|-----------|--------------|
| Artificial Algebraic Loops | 0.06      | 121.91       |
| State Event                | 0.05      | 105.99       |
| Zero Crossing Detection    | 0.09      | 169.57       |
| Simulink Model             | 0.13      | 232.35       |
| Bouncing Ball              | 0.22      | 297.34       |
| Bipolar Transistor         | 0.41      | 642.44       |
| RC Circuit                 | 0.16      | 358.27       |
| 2 Bus Loadflow             | 0.14      | 221.89       |
| Wheel Speed                | 0.21      | 325.95       |
| Mean                       | 0.17      | 275.41       |

Table 3: Cost and run time for 9 simulation examples: Investigator agent uses o4-mini, others use o3-mini.

simulation reproduction accuracy of 94.5%, compared to 83.2% with GPT-4.1. We hypothesize that this performance gap arises from o4-mini's enhanced visual reasoning capabilities. Specifically, the Investigator agent equipped with o4-mini tends to zoom in on the image regions highlighted by the Unit Test Reviewer in their feedback, leading to more accurate identification of the corresponding blocks and connections in the simulation diagram.

427

428

429

430

431

432

433

434

435

436

437

438

439

440

441

442

443

444

445

446

447

448

449

450

451

452

453

454

455

456

457

458 459

460

461

462

463

We further observe that regardless of which model the Investigator agent uses, the reproduction accuracy tends to be inversely correlated with the complexity of the ground truth (GT) Simulink model. In cases where the number of connections is fewer than 10, the reproduced models generally achieve an accuracy close to 100%. An exception to this trend is the *State Event* case, where the accuracy drops significantly. We hypothesize that this may be due to limited exposure of both o4-mini and GPT-4.1 to high-quality images of various Simulink blocks during pretraining, particularly those related to event-based dynamics.

In terms of costing, Table 3 summarizes the monetary cost (in USD) and execution time (in seconds) for nine simulation examples processed by our framework. In each case, the Investigator agent utilizes the o4-mini model, whereas the remaining agents use o3-mini. Results demonstrate that typical simulation tasks can be completed within approximately \$0.17 and 275 seconds on average.

#### 4.3 Alblation Study

'In our ablation study, we evaluate three variants of the system: (1) without the Unit Test Reviewer, (2) without the Debug Locator, and (3) without both components. It is important to note that in the absence of the Debug Locator, the code is executed only once, and any runtime error is treated

as a task failure. Overall, from Table 3, we observe that excluding either the Unit Test Reviewer or the Debug Locator leads to a noticeable drop in simulation reproduction accuracy. Notably, when both components are removed, the accuracy drops significantly to 51.7%. This highlights a synergistic effect between the Unit Test Reviewer and the Debug Locator, indicating that their combined contribution to SimuGen is greater than the sum of their individual parts. MEanwhile, we observe that for the Bouncing Ball, RC Circuit, and Wheel Speed simulations, none can be completed in the absence of both the Unit Test Reviewer and the Debug Locator agents. This result highlights that, although the Unit Test Reviewer and Debug Locator operate via distinct mechanisms to reflect and reproduce the Simulink modeling process, both play an essential role in ensuring successful reconstruction of Simulink models.

464

465

466

467

468

469

470

471

472

473

474

475

476

477

478

479

480

481

482

483

484

485

486

487

488

489

490

491

492

493

494

495

496

497

498

499

500

501

502

503

504

506

507

508

509

510

511

512

513

It is also worth noting that in the *Bipolar Transistor* case, SimuGen achieves an accuracy that is 33.35% higher than the variant without the Unit Test Reviewer. This suggests that although the Unit Test Reviewer cannot directly assess whether the Investigator's predicted connections are correct, it can still provide valuable feedback on the logical coherence of the connections. Such feedback encourages the Investigator to re-examine the identified blocks and connections, ultimately leading to more accurate reproductions.

At the same time, we observe that for most of the 9 tasks, simulations can still be successfully reproduced even without the Debug Locator. This indicates that as long as the blocks and connections identified by the Investigator are logically sound, generating the corresponding executable code is relatively straightforward. However, in the *Bipolar Transistor* case, simulation fails when the Debug Locator is removed. This underscores the complementary role of the Debug Locator: in scenarios where the Unit Test Agent overlooks potential issues, the Debug Locator can interact with the execution environment and analyze runtime outputs to compensate for such oversights.

#### **5** Related works

LLMs for Code Generation and Mathematical Reasoning. Large language models trained on code have demonstrated impressive capabilities in both program synthesis and multi-step reasoning. Chen et al. (2021) evaluated code-trained LLMs on

| Method                     | SimuGen | w/o Unit Test Reviewer | w/o Debug Locator | w/o All |
|----------------------------|---------|------------------------|-------------------|---------|
| Artificial Algebraic Loops | 100     | 100                    | 100               | 100     |
| State Event                | 73.3    | 73.3                   | 73.3              | 73.3    |
| Zero Crossing Detection    | 95.5    | 95.5                   | 95.5              | 95.5    |
| Simulink Model             | 100     | 100                    | 100               | 100     |
| Bouncing Ball              | 100     | 92.8                   | 100               | _       |
| Bipolar Transistor         | 95      | 61.65                  | -                 | _       |
| RC Circuit                 | 95.3    | 95.3                   | 95.3              | _       |
| 2 Bus Loadflow             | 100     | 96.15                  | 100               | 96.15   |
| Wheel Speed                | 91.25   | 91.25                  | 91.25             | _       |
| Average                    | 94.5    | 86.21                  | 83.9              | 51.7    |

Table 4: Performance (%) of SimuGen and its ablations on 9 simulation examples: Investigator agent using o4-mini and all other agents using o3-mini

518

519

522

524

525

527

528

529

531

534

514

can generate nontrivial functions with high accuracy. Li et al. (2022) introduced AlphaCode, which achieved human-comparable performance on competitive programming benchmarks by combining a powerful generative model with search techniques. Chain-of-thought prompting has further improved LLMs' reasoning by eliciting intermediate logical steps, leading to better results on quantitative problems (Wei et al., 2022). Hong et al. (2024), Jing et al. (2024), and Li et al. (2024) proposed the collaboration of multiple LLMs for complex data science applications.

a diverse set of programming tasks, showing they

**Data Scarcity in Simulation Domains.** Liu et al. (2024a,b) proposed LLMs to generate Python codes for simulation tasks. However, many fields rely on the Simulink platform for their simulations. Luitel et al. (2024) applied LLMs to slice and analyze Simulink models. However, they did not propose a comprehensive system for generating and conducting simulations.

Multimodal Agentic Frameworks. Li et al. 535 (2023) introduced the first agent systems based 536 on LLMs that facilitate cooperative interactions 537 among multiple agents. In contrast, another form of agent interaction is adversarial (Gou et al., 2023). To address multimodal tasks, Liu et al. (2025) presented HM-RAG, a hierarchical multi-541 modal retrieval-augmented generation framework 543 designed for multi-agent systems focused on per-544 ception, reasoning, and generation. Qian et al. (2024) developed ChatDev, wherein communica-545 tive agents collaboratively drive the software development lifecycle, illustrating the benefits of modu-547

lar agentic workflows.

Our SimuGen framework builds on these ideas by assigning distinct roles—Investigator, Unit Test Reviewer, Block Builder, Executor, Debug Locator, and Report Writer—to reliably generate and validate simulation code. 548

549

550

551

552

553

554

555

556

557

558

559

560

561

562

563

564

565

566

567

568

569

570

571

572

573

574

575

576

## 6 Conclusion

SimuGen advances the automated generation of Simulink models by introducing a collaborative multi-agent framework that leverages both visual and domain knowledge. Through specialized agents for diagram interpretation, validation, code generation, and debugging-including a standard-conformance unit test reviewer that assesses whether model components adhere to Simulink modeling standards—SimuGen reliably transforms Simulink diagram images into accurate, executable simulation code. Experimental results show that SimuGen achieves an average reproduction accuracy of 94.5% across diverse simulation tasks. Case studies further indicate that, while the reviewer cannot directly assess the correctness of block connections, its ability to enforce modeling standards indirectly improves the overall reliability of the generated models. These results demonstrate the effectiveness of a modular, agent-based approach with built-in standard-conformance checks for Simulink model construction.

#### Limitations

While SimuGen can nearly perfectly replicate all5779 Simulink cases used in this study, these cases578represent relatively simple scenarios. Industrial579Simulink models are typically much larger, often580

featuring over 30 blocks and connections, presenting new challenges that we plan to explore in future work. Moreover, the average generation time for SimuGen across the 9 cases is 275 seconds, which exceeds the time typically required by an experienced simulation engineer. Exploring strategies to further reduce generation time will be a valuable direction in our ongoing research.

#### 89 Ethical Statement

590 No ethical issues.

#### References

591

612

613

614

617

618

619

621

622

623

625

627

631

- Ahmed Abdalla and John Smith. 2025. Generative artificial intelligence for model-based graphical programming in the automotive industry. *SSRN Electronic Journal*.
- Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, and 1 others. 2021. Evaluating large language models trained on code. *arXiv preprint arXiv:2107.03374*.
- Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, and Wei Han. 2024. Extending context window of large language models via semantic compression. In *Findings of the Association for Computational Linguistics: ACL 2024*, pages 5169–5181.
- Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, and Weizhu Chen. 2023. Critic: Large language models can self-correct with tool-interactive critiquing. arXiv preprint arXiv:2305.11738.
- Sirui Hong, Yizhang Lin, Bang Liu, Bangbang Liu, Binhao Wu, Ceyao Zhang, Chenxing Wei, Danyang Li, Jiaqi Chen, Jiayi Zhang, and 1 others. 2024. Data interpreter: An Ilm agent for data science. *arXiv preprint arXiv:2402.18679*.
- Mohammad Hosseini and Others. 2025. Efficient solutions for an intriguing failure of llms: Long context inference. In *Proceedings of the 2025 International Conference on Computational Linguistics* (COLING).
- Liqiang Jing, Zhehui Huang, Xiaoyang Wang, Wenlin Yao, Wenhao Yu, Kaixin Ma, Hongming Zhang, Xinya Du, and Dong Yu. 2024. Dsbench: How far are data science agents to becoming data science experts? *arXiv preprint arXiv:2409.07703*.
- Guohao Li, Hasan Hammoud, Hani Itani, Dmitrii Khizbullin, and Bernard Ghanem. 2023. Camel: Communicative agents for" mind" exploration of large language model society. *Advances in Neural Information Processing Systems*, 36:51991–52008.

Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, and 1 others. 2022. Competition-level code generation with alphacode. *Science*, 378(6624):1092–1097.

632

633

634

635

636

637

638

639

640

641

642

643

644

645

646

647

648

649

650

651

652

653

654

655

656

657

658

659

660

661

662

663

664

665

666

667

668

669

670

671

672

673

674

675

676

677

678

679

680

681

682

683

- Ziming Li, Qianbo Zang, David Ma, Jiawei Guo, Tuney Zheng, Minghao Liu, Xinyao Niu, Yue Wang, Jian Yang, Jiaheng Liu, and 1 others. 2024. Autokaggle: A multi-agent framework for autonomous data science competitions. *arXiv preprint arXiv:2410.20424*.
- Pei Liu, Xin Liu, Ruoyu Yao, Junming Liu, Siyuan Meng, Ding Wang, and Jun Ma. 2025. Hm-rag: Hierarchical multi-agent multimodal retrieval augmented generation. *arXiv preprint arXiv:2504.12330*.
- Zhihan Liu, Yubo Chai, and Jianfeng Li. 2024a. Toward automated simulation research workflow through llm prompt engineering design. *Journal of Chemical Information and Modeling*, 65(1):114–124.
- Zhihan Liu, Yubo Chai, and Jianfeng Li. 2024b. Towards fully autonomous research powered by Ilms: Case study on simulations. *arXiv preprint arXiv:2408.15512*.
- Dipeeka Luitel, Shiva Nejati, and Mehrdad Sabetzadeh. 2024. Requirements-driven slicing of simulink models using llms. In 2024 IEEE 32nd International Requirements Engineering Conference Workshops (REW), pages 72–82.
- MathWorks. 2025. Simulink simulation and modelbased design.
- Chen Qian, Wei Liu, Hongzhang Liu, Nuo Chen, Yufan Dang, Jiahao Li, Cheng Yang, Weize Chen, Yusheng Su, Xin Cong, Juyuan Xu, Dahai Li, Zhiyuan Liu, and Maosong Sun. 2024. ChatDev: Communicative agents for software development. In *Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)*, pages 15174–15186, Bangkok, Thailand. Association for Computational Linguistics.
- Sohil Lal Shrestha and Christoph Csallner. 2021. Slgpt: Using transfer learning to directly generate simulink model files and find bugs in the simulink toolchain. *arXiv preprint arXiv:2105.07465*.
- Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V Le, Denny Zhou, and 1 others. 2022. Chain-of-thought prompting elicits reasoning in large language models. *Advances in neural information processing systems*, 35:24824– 24837.
- John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, and Ofir Press. 2024. Swe-agent: Agent-computer interfaces enable automated software engineering. *arXiv preprint arXiv:2405.15793*.

| Yusen Zhang, Ruoxi Sun, Yanfei Chen, Tomas Pfister,  |
|------------------------------------------------------|
| Rui Zhang, and Sercan Ö. Arik. 2024. Chain of        |
| agents: Large language models collaborating on long- |
| context tasks. arXiv preprint arXiv:2406.02818.      |

## **A** Appendix: Challenges

To illustrate common conceptual errors made by LLMs when generating Simulink models, we present an<br/>example based on a user's request to build a simple RC circuit using Simscape blocks. The generated<br/>code successfully constructs the topology using matlab.engine, but fails to include a necessary voltage690<br/>691sensor for signal measurement.693

689

694

697

698

699

700

701

702

703

## **User Query**

Could you generate a simulation RC circuit using Simscape library from Simulink? Please follow the<br/>following steps:695696

1. Provide the blocks needed and their connections (there should be a Scope to observe the result).

2. Generate the simulation code using matlab.engine.

Note: Only generate blocks and connections; there is no need to set parameters.

#### **Blocks and Library Paths**

| Block Name            | Library Path                                        |
|-----------------------|-----------------------------------------------------|
| Solver Configuration  | Simscape > Commonly Used Blocks                     |
| Electrical Reference  | Simscape > Foundation Library > Electrical Elements |
| DC Voltage Source     | Simscape > Electrical Sources > DC Voltage Source   |
| Resistor              | Simscape > Electrical Elements > Resistor           |
| Capacitor             | Simscape > Electrical Elements > Capacitor          |
| PS-Simulink Converter | Simscape > Utilities > PS-Simulink Converter        |
| Scope                 | Simulink > Sinks > Scope                            |

#### **Connection Overview**

The following is a textual representation of the circuit structure:

This design routes the voltage across the capacitor directly into the Scope via the PS-Simulink Converter. 704

#### Generated Code Using matlab.engine

```
import matlab.engine
# Start MATLAB engine
eng = matlab.engine.start_matlab()
# Create and open model
model = 'rc_circuit'
eng.new_system(model, 'Model')
eng.open_system(model)
# Add blocks
blocks = {
     'SolverConfig': 'simscape/Commonly Used Blocks/Solver Configuration',
     'Ground': 'simscape/Foundation Library/Electrical/Electrical Elements/Electrical Reference',
     'Vsrc': 'simscape/Foundation Library/Electrical/Electrical Sources/DC Voltage Source',
     'R': 'simscape/Foundation Library/Electrical/Electrical Elements/Resistor'.
     'C': 'simscape/Foundation Library/Electrical/Electrical Elements/Capacitor',
     'PS2SL': 'simscape/Utilities/PS-Simulink Converter',
     'Scope': 'simulink/Sinks/Scope'
}
for name, path in blocks.items():
     eng.add_block(path, f'{model}/{name}')
# Make connections
# Make connections
eng.add_line(model, 'Vsrc/1', 'R/1', 'autorouting','on')
eng.add_line(model, 'R/2', 'C/1', 'autorouting','on')
eng.add_line(model, 'C/2', 'Ground/1', 'autorouting','on')
eng.add_line(model, 'Ground/1', 'SolverConfig/1','autorouting','on')
eng.add_line(model, 'Vsrc/2', 'Ground/1', 'autorouting','on')
eng.add_line(model, 'C/1', 'PS2SL/1', 'autorouting','on')
eng.add_line(model, 'PS2SL/1', 'Scope/1', 'autorouting','on')
# Save model
eng.save_system(model)
# eng.sim(model) # Uncomment to simulate
```

Listing 1: LLM-generated Python code to build the RC circuit model

#### Conceptual Issue Identified

Although the generated structure is syntactically correct, it fails conceptually: Simscape does not allow direct measurement from conserving electrical ports. Instead, a **Voltage Sensor** block must be used to measure across the capacitor, with its output routed to a PS-Simulink Converter and then to the Scope.

**Summary:** The LLM omits the essential measurement interface, demonstrating a conceptual gap in how physical signal flows work in Simscape. This is a typical example of domain-specific misunderstanding, even when code generation succeeds syntactically.



Figure B.1: Case study: artificial algebraic loops.

# **State-Event Handling**



Figure B.2: Case study: state event.



Figure B.3: Case study: zero crossing detection.



Figure B.4: Case study: simulink model.

**Bouncing Ball Model** 



Figure B.5: Case study: bouncing ball.



Figure B.6: Case study: bipolar transistor.



Figure B.7: Case study: RC circuit.







Figure B.9: Case study: wheel speed.

#### C Appendix: Prompts of Agents

## # ROLE # You are an Investigator responsible for deeply analyzing a simulation explanation, its corresponding simulation diagram, and a simulation blocks list. # TASK # You will receive a detailed simulation explanation, its corresponding simulation diagram, and a simulation blocks list that includes the block types. Your tasks are to: - List all the blocks used in the simulation, \*\*INCLUDING those without annotations if they are USED \*\*. Before listing them, identify their corresponding block types from the simulation blocks list, \*\*while please note that the diagram would NEVER include a SUBSYSTEM, so DO NOT request a subsystem.\*\* The list should be formatted as: BlockA (BlockA's block type) BlockB (BlockB's block type) where Used BlockA, BlockB are just example, if the annotation of a block is already given, please just use it. \*\*BUT NEVER include the special symbol '/' in the block name, if it happen in the annotation, you need to modified it.\*\* - \*\*Important:\*\* If the simulation diagram shows multiple instances of the same block without separate annotations, you should proactively annotate them to differentiate each instance. For example, there are two sum without annotation are shown in the diagram, you should list them like: sum 1 (sum) sum 2 (sum) # INPUT # - \*\*Simulation Explanation:\*\* {simulation\_explanation} - \*\*Simulation Blocks List:\*\* {simulation\_blocks\_list} # RESPONSE # Always list all identified blocks first and then request half description of \*\*ALL\*\* listed blocks using their \*\*block type (not Path) from the blocks list\*\* at the end of your response in a JSON format as (DO NOT need to request the same unique block type): { "request\_blocks": ["block type 1", "block type 2", ...] } Do not ask any clarifying questions or confirmations. Directly provide a complete answer.

Listing 2: Prompt of the Investigator agent in round 1.

# INPUT # - \*\*Simulation Blocks Description:\*\* {simulation\_blocks\_list} # TASK # According to the \*\*block you identified in the simulation diagram\*\*, you will receive an identified block's \*\*Simulation Blocks Description\*\* that includes their block type and port description. Your tasks are to: - Clearly describe the connections and data flow among these blocks in the simulation diagram using their block types. First, determine which port labels are connected, then match them to the corresponding real port names from the \*\*Simulation Blocks Description\*\*. \*\*MUST STRICTLY Format \*\* each connection description as: BlockA (BlockA's block type) PortX (\*\*related parameter setting to match the port number if necessary\*\*)<-> BlockB (BlockB's block type) PortY (\*\*related parameter setting to match the port number if necessary\*\*) \*\*Note:\*\* 1.Left side of '<->' is output port, while right side of '<->' is input port, never flip the relationship. \*\*In most of cases, the same Input port MUST NOT be connected more than one time (with \*\*ONLY\*\* the exception of Electrical Reference and Solver Configuration).\*\* For example, if you identify there is a Constant block that is connected to 2 different Gain block, you should describe them as: . . . Constant 1 (Constant) 1 <-> Gain 1 (Gain) 1 Constant 1 (Constant) 1 <-> Gain 2 (Gain) 1 rather than, ... Gain 1 (Gain) 1 <-> Constant 1 (Constant) Gain 2 (Gain) 1 <-> Constant 1 (Constant) similarly, if you see the port LConn2 of a Current-Controlled Current Source are connected to both the port LConn 1 of a Resistor and the port LConn1 of a Voltage Sensor, you should describe these connections as: CCCS 1 (Current-Controlled Current Source) LConn2 <-> Resistor 1 (Resistor) LConn1 CCCS 1 (Current-Controlled Current Source) LConn2 <-> Voltage Sensor 1 (Voltage Sensor) LConn1 rather than, . . . Resistor 1 (Resistor) LConn1 <-> CCCS 1 (Current-Controlled Current Source) LConn2 Voltage Sensor 1 (Voltage Sensor) LConn1 <-> CCCS 1 (Current-Controlled Current Source) LConn2 2.PortX and PortY are the real internal port names, BlockA and BlockB are the block name shown in the diagram. 3.Any \*\*parameters related to the number of port\*\* need to be declared, for example: Gain 2 (Gain) 1 <-> Sum (Sum) 1 (`Inputs` = `++-`), where the port number of Sum depends on ` Inputs', so it needs to be clarified, while the port number of Gain is fixed, so no parameters need to be clarified. \*\*IMPORTANT:\*\* - \*\*ONLY\*\* use the below listed blocks, and \*\*ALL\*\* listed blocks should be used. # Response # Do not ask any clarifying questions or confirmations. Directly provide a complete answer.

Listing 3: Prompt of the Investigator agent in round 2.

| You are an **Unit Test Reviewer**. Your job is to verify and validate the Investigator's provided simulation connections descript                                                                                                                                                                                                                                                                                                                                                                            | ion against the **simulation blocks description**.                                                                                                       |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------|
| <pre># TASK # You will receive: - A **simulation blocks description** that includes the port types, and port descriptions The Investigator's provided simulation information, which consists of: 1 *The blocks used in the simulation its a blick of blocks utilized in the simulation</pre>                                                                                                                                                                                                                 |                                                                                                                                                          |
| 2. **Connections Description:** This details how the blocks are connected, with each connection formatted as:                                                                                                                                                                                                                                                                                                                                                                                                |                                                                                                                                                          |
| <pre>BlockA (BlockA's block type) PortX (**related parameter setting to match the port number if necessary**)&lt;-&gt; BlockB (BlockB's<br/>match the port number if necessary**)</pre>                                                                                                                                                                                                                                                                                                                      | <pre>s block type) PortY (**related parameter setting to</pre>                                                                                           |
| <pre>**Note:** Left side of '&lt;-&gt;' is output port, while right side of '&lt;-&gt;' is input port, never flip the relationship. PortX and<br/>visual labels; BlockA and BlockB are the block names shown in the diagram. (Block and its block type CAN BE similar</pre>                                                                                                                                                                                                                                  | ${\sf I}$ PortY are the real internal port names, not the in some cases)                                                                                 |
| Your responsibilities are to:<br>1. **Identify the exist of block list:**<br>- Verify if the identified block are listed first, such as:<br>Swing bus ]pu (Load flow Source)<br>Electrical Reference 1 (Electrical Reference)<br>Load 90 MW 30 Mvar (Wye-Connected Load)                                                                                                                                                                                                                                     |                                                                                                                                                          |
| 2. **Identify any Extra Blocks:**<br>- Verify if there are blocks mentioned in "The blocks used in the simulation" (**A specific block, not the real block name**) 1                                                                                                                                                                                                                                                                                                                                         | that do not appear in the Connections Description.                                                                                                       |
| 3. **Formatting of block name:**<br>- Make sure the block names are NEVER included the special symbol '/'.                                                                                                                                                                                                                                                                                                                                                                                                   |                                                                                                                                                          |
| 4. **Formatting of connection description**<br>- Make sure the formatting of connection is strictily formmated as:<br>BlockA (BlockA's block type) PortX (**related parameter setting to match the port number if necessary**)<br>For example:<br>Current-Controlled Current Source (-h_felb) (Current-Controlled Current Source) RConnl <-> Electrical Reference (Electrical Ref<br>Controlled Current Source (-h_felb) (Current-Controlled Current Source) RConnl '>> Electrical Reference (Electrical Ref | ference) LConn1, is not correct, where 'Current-<br>nt-Controlled Current Source) RConn1'                                                                |
| and for '(**related parameter setting to match the port number if necessary**)', do not need to show any explanation if there :                                                                                                                                                                                                                                                                                                                                                                              | is no related parameter                                                                                                                                  |
| and you need to check the block type from Simulation Blocks Description carefully, some block type might include a '()'.                                                                                                                                                                                                                                                                                                                                                                                     |                                                                                                                                                          |
| <ol> <li>**Validate Parameter Settings in Connections:**         <ul> <li>Check that the (**related parameter setting to match the port number if necessary**) for each connection is correctly provide simulation blocks description**.</li> </ul> </li> </ol>                                                                                                                                                                                                                                              | ed and matches the expected configuration from the $\star\star$                                                                                          |
| <ul> <li>6. **Detect Duplicate Connections:**</li> <li>- Check if there are any duplicate connections where the **same block's Input port (not Output port)** is connected more than on Reference and Solver Configuration). For example:</li> </ul>                                                                                                                                                                                                                                                         | once (with **ONLY** the exception of Electrical                                                                                                          |
| <pre>Discrete-Time Integrator (Discrete-Time Integrator) 1 (`ExternalReset` = `none`, `InitialConditionSource` = `internal`) &lt;-&gt; {     Sum 1 (Sum) 1 (`Inputs` = `+-`) &lt;-&gt; Gain 2 (Gain) 1     '''</pre>                                                                                                                                                                                                                                                                                         | Gain 2 (Gain) 1                                                                                                                                          |
| These two connections are duplicate since there are two lines are connected to the input port 1 of Gain2. However:                                                                                                                                                                                                                                                                                                                                                                                           |                                                                                                                                                          |
| Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source) LConn2 <-> RLoad (Resistor) LConn1<br>Current-Controlled Current source (Current-Controlled Current Source) LConn2 <-> RLoad (Resistor) LConn1                                                                                                                                                                                                                                                                                         |                                                                                                                                                          |
| <pre>is available since LConn1 of RLoad (Resistor) is not dedicated as an input/output port, and it can be reused even though it i<br/>check each port of each block if is dedicated as an input/output port from **Simulation Blocks Description**, Only po<br/>should be considered dedicated input/output ports - you do not need to make any inferences yourself. For example: - *<br/>Port:** - *Port name:** **!**)<br/>Another example is:</pre>                                                      | is placed on the input place twice. (you need to<br>orts explicitly described as Input/Output ports<br>**Input Port:** - **Port name:** **1** - **Output |
| Discrete-Time Integrator (Discrete-Time Integrator) 1 (`ExternalReset` = `none`, `InitialConditionSource` = `internal`) <-><br>Gain 2 (Gain) 1 <-> Sum 1 (Sum) 1 (`Inputs` = `+-`)                                                                                                                                                                                                                                                                                                                           | Gain 2 (Gain) 1                                                                                                                                          |
| These two connections are available, 'Gain 2 (Gain) 1' in the first connection means the input port1 of Gain 2, while 'Gain 2<br>output port1 of 'Gain2', that is not duplicate.                                                                                                                                                                                                                                                                                                                             | 2 (Gain) 1' in the second connections means the                                                                                                          |
| 7. **Validate Block Connection Types:**<br>- Ensure that no block is connected to another block that only has dedicated output port, for example:                                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                          |
| Gin 1 (Gain) 1 <-> Constant 1 (Constant) 1                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |                                                                                                                                                          |
| is not allowed, since Constant is a block only has dedicated output                                                                                                                                                                                                                                                                                                                                                                                                                                          |                                                                                                                                                          |
| - Ensure that no block that only has dedicated input port is connected to another block, for example:                                                                                                                                                                                                                                                                                                                                                                                                        |                                                                                                                                                          |
| Scope 1 (Scope) 1 <-> Gain 1 (Gain) 1                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |                                                                                                                                                          |
| is not allowed, since Scope is a block that only has dedicated input                                                                                                                                                                                                                                                                                                                                                                                                                                         |                                                                                                                                                          |
| - For ports that are not dedicated as input/output port can be reused as input and output, for example:                                                                                                                                                                                                                                                                                                                                                                                                      |                                                                                                                                                          |
| AC Voltage Source (AC Voltage Source) LConn1 <-> C1 (Capacitor) LConn1<br>C1 (Capacitor) LConn1 <-> RBias (Resistor) LConn1                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                          |
| is available since LConn1 is not dedicated as an input/output port.                                                                                                                                                                                                                                                                                                                                                                                                                                          |                                                                                                                                                          |
| <ol> <li>**Verify Complete Port Connections:**         <ul> <li>Under the premise of the already set parameter settings, check whether every input and output port of each block has a connect</li> </ul> </li> </ol>                                                                                                                                                                                                                                                                                        | ction. For example:                                                                                                                                      |
| Discrete-Time Integrator (Discrete-Time Integrator) 1 (`ExternalReset` = `none`, `InitialConditionSource` = `internal`) <-> Ga<br>Gain 2 (Gain) 1 <-> Sum 1 (Sum) 1 (`Inputs` = `+-`)                                                                                                                                                                                                                                                                                                                        | in 2 (Gain) 1                                                                                                                                            |
| In this case, the ouput port and input port of Gain 2 are both properly connected.                                                                                                                                                                                                                                                                                                                                                                                                                           |                                                                                                                                                          |
| <pre># INPUT #<br/>-**Simulation Blocks Description:**<br/>{blocks_list}</pre>                                                                                                                                                                                                                                                                                                                                                                                                                               |                                                                                                                                                          |
| <ul> <li>**Investigator's Simulation Information:**         {investigator_simulation_info}         (This includes both "The blocks used in the simulation" and the "Connections Description" as described above.)</li> </ul>                                                                                                                                                                                                                                                                                 |                                                                                                                                                          |
| <pre># RESPONSE #<br/>Provide a **brief but clear** report addressing each of the eight responsibilities above. List any inconsistencies or errors foun<br/>any clarifying questions; directly provide your complete review.</pre>                                                                                                                                                                                                                                                                           | d, and articulate your findings clearly. Do not ask                                                                                                      |
| At the end of your response, output a JSON formatted object with the key "Investigator_unit_test_pass". Set its value to True if<br>or errors were detected. For example:                                                                                                                                                                                                                                                                                                                                    | no issues were found, or False if any inconsistencies                                                                                                    |
| <pre>```json {     "Investigator_unit_test_pass": True</pre>                                                                                                                                                                                                                                                                                                                                                                                                                                                 |                                                                                                                                                          |
| 3                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                          |

# Listing 4: Prompt of the Unit Testing agent.

1003 1004 # ROLE # You are a Simulation Block Builder, responsible for generating MATLAB/Simulink code using matlab. 1005 engine. 1006 1007 1008 You will receive: 1009 - A \*\*code template\*\* outlining the required structure. 1010 - A set of \*\*functions\*\* that you are permitted to use. 1011 - A detailed \*\*blocks description\*\* that lists block types, paths, and port description. 1012 - Information provided by an Investigator Agent, which is extracted by the Investigator from a 1013 simulation diagram, including a List of Used Blocks for the specific simulation, and a Complete 1014 List of Connections for that simulation. The Complete List of Connections are formatted as: 1015 BlockA (BlockA's block type) PortX (\*\*related parameter setting to match the port number if necessary 1016 \*\*)<-> BlockB (BlockB's block type) PortY (\*\*related parameter setting to match the port number 1017 1018 if necessary\*\*) \*\*Note:\*\* PortX and PortY are the real internal port names (not the visual labels); BlockA and 1019 BlockB are the block name shown in the diagram. 1020 1021 # CODE TEMPLATES & REFERENCES # 1022 - \*\*Code Template:\*\* 1023 1024 {code\_template} 1025 - \*\*Functions:\*\* 1026 1027 {functions} 1028 1029 - \*\*Blocks Description:\*\* {blocks\_description} 1030 1031 - \*\*Information provided by an Investigator Agent:\*\* 1032 {investigator\_agent\_information} 1033 1034 1035 # TASK # 1036 1037 1038 Your objective is to generate MATLAB/Simulink code using matlab.engine to re-implement the simulation extracted by the Investigator. You are to implement the following operations: 1039 1040 - \*\*add block\*\* - \*\*add\_line\*\* 1041 - \*\*set\_param\*\* \*\*(limited strictly to parameters related to port count or connectivity)\*\* 1042 1043 \*\*Important Guidelines:\*\* 1044 - Follow the provided code template exactly. (You need to define the model\_name by yourself.) 1045 - Use only the provided functions and blocks. 1046 - Adhere strictly to the \*\*PATH, port naming and connection instructions\*\* as described in the \*\* 1047 Blocks Description:\*\*. Ensure that you use the \*\*exact port names specified (DO NOT use port 1048 1049 labels)\*\*, and that all calls to add\_line utilize these correct port names. - \*\*DO NOT\*\* set any block parameters except those that affect port count or connections. 1050 1051 1052 # RESPONSE # 1053 Please generate the \*\*complete and fully detailed\*\* MATLAB/Simulink code using `matlab.engine` based 1054 on the above instructions. \*\*ONLY\*\* generate the full Python code without omitting \*\*any\*\* parts 1055 or using \*\*any\*\* ellipsis (`...`) or placeholder symbols. \*\*D0 NOT\*\* include explanations or 1056 any content other than the Python code. 1058

Listing 5: Prompt of the Block Builder agent.

```
# ROLE #
You are a **Debug_locator**.
Your job is to diagnose and locate the source of errors within the execution code based on provided
    implementation details.
# TASK #
1. **Analyze and Understand:**
   - Read the provided implementation code information extracted by the Investigator.
   - Review the detailed **blocks description**, which lists block types, paths, and port description.
  - Examine the set of **functions** that you are permitted to use.
**Locate Error Source:**
   - Analyze the execution code and error message to determine the 5–10 lines of code that are most
       likely causing the error, as well as an additional 5-10 lines of code that are related to the
       error
   - Keep in mind that the error message might be triggered by issues originating in earlier code.
       Carefully evaluate any dependencies that could be contributing to the error.
3. **Assess Connection and Parameter Integrity to Determine Error Origin or Provide Fix
    Recommendations:**
   - First, check if the code's connection configuration exactly matches the Investigator's
       Implementation Details. This means verifying that the block types and the corresponding port
       names in the code are identical to those specified.
   - If the connection configuration is identical, then verify that the parameters related to these
       blocks and ports are set correctly according to the required number of ports specified in the
       Investigator's Implementation Details.
   - If both the connection configuration and the port-related parameters are correct, **then verify
       if the error is casued by any other reason except from connection configuration, **for
       instance the wrong setting of block's path**. If there is no other reason, this indicates that
       the error is due to a discrepancy in the Investigator's Implementation Details.** In this case, DO NOT provide any code modification suggestions. Instead, articulate that the error is
       caused by a discrepancy in the Investigator's Implementation Details, and at the end of your
       response output a JSON formatted as:
        json
    {
      'Investigator_error': True (Default is False)
    }
  - If you find that the error is caused by any other reason except from connection configuration,
**for instance the wrong setting of block's path**, provide modification suggestions that
       strictly adhere to the Investigator's Implementation Details and use only the provided
       functions and blocks.
# INPUT #
- **Execution Code:**
 {execution_code}
- **Error Message:**
 {error_message}
- **Functions:**
 {functions_set}
- **Blocks Description:**
 {blocks_description}
- **Investigator's Implementation Details:**
 {investigator_implementation_info}
# RESPONSE #
Provide a **brief but clear** report addressing each of the responsibilities above, and at the end of
     your response output a JSON formatted as:
       json
    {
      'Investigator_error': True (Default is False)
    }..
```

1062

1064

1066 1067

1068

1069

1070

1072

1074

1075

1077

1079 1080

1081

1082

1083

1084 1085

1086

1087

1088

1089

1090

1091 1092 1093

1094

1095

1096

1098

1099

1100 1101 1102

1103

1104 1105 1106

1107 1108 1109

1110

1111 1112

1113 1114 1115

1116

1117 1118

1119

1120 1121

1126

1127 1128

1129

1130 1131

1133

Listing 6: Prompt of the Debug Locator agent.

|                                                                                                       | 113  |
|-------------------------------------------------------------------------------------------------------|------|
| **# ROLE #**                                                                                          | 1135 |
| You are a **technical report writer agent** with expertise in simulation analysis, theoretical        | 1136 |
| modeling, and scientific writing. Your role is to generate clear, logically structured, and           | 1137 |
| academically rigorous reports based on simulation outputs. You are expected to synthesize             | 1138 |
| information from simulation context, block descriptions, connection description and code              | 1139 |
| implementation, to produce a well-integrated explanation that combines theory, implementation,        | 1140 |
| and code-level details.                                                                               | 1141 |
|                                                                                                       | 1142 |
|                                                                                                       | 1143 |
|                                                                                                       | 1144 |
| **# TASK #**                                                                                          | 1145 |
| Now that the simulation has been successfully executed without errors, please write a comprehensive   | 1146 |
| report **strictly** addressing the following four key questions:                                      | 1147 |
|                                                                                                       | 1148 |
| 1. **What is the simulation about?**                                                                  | 1149 |
| Describe the purpose, context, and overall objective of the simulation. What real-world system or     | 1150 |
| process does it aim to represent or replicate?                                                        | 1151 |
|                                                                                                       | 1152 |
| 2. **What are the main simulation steps?**                                                            | 1153 |
| Break down the simulation **(not code implementation)** into distinct stages or functional modules.   | 1154 |
| Clearly outline the step-by-step process of how the simulation is structured.                         | 1155 |
|                                                                                                       | 1156 |
| 3. **What theoretical knowledge and mathematical modelling are involved in each step?**               | 1157 |
| For **above** every simulation step, explain the relevant theoretical foundations and mathematical    | 1158 |
| models involved (e.g., control theory, physical modelling, system dynamics, signal flow, etc          | 1159 |
| .).                                                                                                   | 1160 |
|                                                                                                       | 1161 |
| 4. **How is it implemented in code?**                                                                 | 1162 |
| Provide and explain the corresponding code for **each above step** (you may reorganize the code to    | 1163 |
| match the stepwise structure). Highlight how the code reflects both theoretical concepts and          | 1164 |
| the simulation block diagram.                                                                         | 1165 |
|                                                                                                       | 1166 |
| Your final report must integrate **both** theoretical analysis and complete code explanations. Ensure | 1167 |
| the explanation is clearly aligned with the four sections above. **Do not ask any clarifying          | 1168 |
| questions.**                                                                                          | 1169 |
|                                                                                                       | 1170 |
|                                                                                                       | 1171 |
|                                                                                                       | 1172 |
| **# INPUTS #**                                                                                        | 1173 |
|                                                                                                       | 1174 |
| - **Simulation Description:**                                                                         | 1175 |
| {simulation_description}                                                                              | 1176 |
|                                                                                                       | 1177 |
| - **Used Block Description:**                                                                         | 1178 |
| {used_block_description}                                                                              | 1179 |
|                                                                                                       | 1180 |
| - **Connection Description:**                                                                         | 1181 |
| {connection_description}                                                                              | 1182 |
|                                                                                                       | 1183 |
| - **Execution Code:**                                                                                 | 1184 |
| {execution_code}                                                                                      | 1185 |
|                                                                                                       | 1186 |
| **Important Note:**                                                                                   | 1187 |
| You **DU NUI** need to clarity or address any teedback from reviewers.                                | 1189 |

Listing 7: Prompt of the Report Writer agent.

#### **D** Appendix: Database Construction

1190

1191

1192

1193 1194

1195 1196

1197

1198 1199

1201

1203

1204 1205

1206

1207 1208 1209

1210

1211

1212 1213 1214

1215

1216

1217

1218

1219 1220

1221

1222

1223

1224

1225 1226

1227

1229

To integrate external knowledge into LLMs, we adopt a RAG framework and use ChromaDB as the vector-retrieval database. We predefine 50 block's Full Block Description, each containing three pieces of information: the block type, its location within Simulink, and its ports details. For example, the Full Block Description of **Busbar** is formatted as:

```
## Busbar
**Path:** `'ee_lib/Connectors & References/Busbar'`
**Ports:**
- When **`n_nodes`** is set to **1**, the exposed port is:
 - **LConn1**
- When **`n_nodes`** is set to **2**, the exposed ports are:
 - **LConn1**
 - **RConn1**
- When **`n_nodes`** is set to **3**, the exposed ports are:
 - **LConn1**
 - **RConn1**
 - **LConn2**
- When **`n_nodes`** is set to **4**, the exposed ports are:
 - **LConn1**
 - **RConn1**
 - **LConn2**
 - **RConn2**
*In the visual simulation blocks, the port labels correspond as follows:*
- **LConn1** appears as **~1**.
- **RConn1** appears as **~2**.
- **LConn2** appears as **~3**.
- **RConn2** appears as **~4**.
**~1, ~2, ~3, ~4 are just port label for recognition, NOT port name, only use **LConn1** ,**RConn1**
    ,**LConn2** ,**RConn2****.
```

## E Appendix: Case Study – Bipolar Transistor

In the first step, our SimGen employs a LLM (o4-mini) to extract and summarize the key information from the simulation task of Bipolar Transistor. Upon completion of this process, a markdown file is automatically generated containing essential simulation details, which include the required blocks and wiring information. Below is the generated markdown file for the two-round investigation:

1230

1231

1232

1233

1234

1237

1238 1239

1240

1241

1242

1243

1244

1245

1246

1247

1248

1250

1251 1252

1253 1254

1255

1256

1257 1258

1259

1260

1261 1262 1263

1264

1265

1266

1267 1268

1269

1270

1271

1272 1273

1274

1275

1276 1277

1278

1279

1280

1281

1282

1283

1284

1285

1286 1287

1288 1289

1290

1291

1292

1293

```
[Investigator] C1 (Capacitor)
AC Voltage Source (AC Voltage Source)
Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source)
R1 (Resistor)
Current-Controlled Current Source (Current-Controlled Current Source)
R2 (Resistor)
RBias (Resistor)
RLoad (Resistor)
Electrical Reference (Electrical Reference)
Voltage Sensor (Voltage Sensor)
PS-Simulink Converter (PS-Simulink Converter)
Solver Configuration (Solver Configuration)
Scope (Scope)
···json
"Capacitor",
"AC Voltage Source"
    "Voltage-Controlled Voltage Source",
    "Resistor"
    "Current-Controlled Current Source",
    "Electrical Reference",
    "Voltage Sensor"
    "PS-Simulink Converter",
    "Solver Configuration",
"Scope"
 ]
}...
[Investigator] AC Voltage Source (AC Voltage Source) LConn1 <-> C1 (Capacitor) LConn1
C1 (Capacitor) RConn1 <-> Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source)
    LConn1
AC Voltage Source (AC Voltage Source) RConn1 <-> Electrical Reference (Electrical Reference) LConn1
Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source) RConn1 <-> Electrical Reference
     (Electrical Reference) LConn1
RBias (Resistor) LConn1 <-> Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source)
    LConn1
RBias (Resistor) RConn1 <-> Electrical Reference (Electrical Reference) LConn1
R1 (Resistor) LConn1 <-> Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source) LConn1
R1 (Resistor) RConn1 <-> Current-Controlled Current source (Current-Controlled Current Source) LConn1
Current-Controlled Current source (Current-Controlled Current Source) RConn1 <-> Electrical Reference
     (Electrical Reference) LConn1
Current-Controlled Current source (Current-Controlled Current Source) LConn2 <-> Voltage-Controlled
    Voltage source (Voltage-Controlled Voltage Source) LConn2
Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source) LConn2 <-> R2 (Resistor) LConn1
Current-Controlled Current source (Current-Controlled Current Source) LConn2 <-> R2 (Resistor) LConn1
R2 (Resistor) RConn1 <-> Electrical Reference (Electrical Reference) LConn1
Voltage-Controlled Voltage source (Voltage-Controlled Voltage Source) LConn2 <-> Voltage Sensor (
    Voltage Sensor) LConn1
Current-Controlled Current source (Current-Controlled Current Source) LConn2 <-> Voltage Sensor (
    Voltage Sensor) LConn1
RLoad (Resistor) LConn1 <-> Voltage Sensor (Voltage Sensor) LConn1
RLoad (Resistor) RConn1 <-> Electrical Reference (Electrical Reference) LConn1
Voltage Sensor (Voltage Sensor) RConn2 <-> Electrical Reference (Electrical Reference) LConn1
Voltage Sensor (Voltage Sensor) RConn1 <-> PS-Simulink Converter (PS-Simulink Converter) LConn1
PS-Simulink Converter (PS-Simulink Converter) 1 <-> Scope (Scope) 1
Electrical Reference (Electrical Reference) LConn1 <-> Solver Configuration (Solver Configuration)
    RConn1
```

Listing 8: Case study of the Investigator agent for the Bipolar Transistor simulation.

In the second step, our SimGen uses o4-mini to check if the Investigator Agent generates error information. Our SimGen will check eight rules defined by experts, such as: "Block List Existence", "Extra Blocks", "Formatting of Block Name" and so on. A summary of findings will be generated to highlight the potential errors in the markdown file of the previous investigation step. The investigation-testing process will be repeated until no error is detected.

1296

1297

1298

1299

1300

1303 1304

1305 1306

1307 1308

1310

1311

1312 1313

1314 1315 1316

1317 1318

1319

1320 1321

1322

1323

1324 1325

1326 1327

1328

1329 1330

1332

1333

1334 1335

1336

1337

1338 1339

1340

1341

1342

1343 1344

1345 1346

1347

1348

1349

1350 1351

1352

1353

1354

1356 1357

1358

[Unit\_test\_reviewer] Below is the review of the Investigator's simulation: 1. Block List Existence: - The provided "The blocks used in the simulation" list is consistent with the request\_blocks from the JSON. Although multiple resistor-instances (R1, RBias, R2, RLoad) appear in the connections, they all use the "Resistor" type from the list. 2. Extra Blocks: – No extra blocks are present. All block names in the provided simulation list appear in the connections description. 3. Formatting of Block Name: - All block names are formatted correctly with no "/" symbols. 4. Formatting of Connection Description: - Each connection is shown in the required format: BlockName (BlockType) PortName <-> BlockName ( BlockType) PortName. - There is no unnecessary parameter explanation text; the formatting rules are followed. 5. Parameter Settings in Connections: – No parameter settings (parenthesized settings) were expected or provided. This is acceptable based on the simulation blocks description. 6. Duplicate Connections: - There are several connections to Electrical Reference's LConn1. However, since Electrical Reference (and Solver Configuration) is allowed to have duplicate input connections, no duplicateconnection error is flagged. - No other dedicated input port (as defined explicitly in the simulation blocks description for ports declared as "Input Port" or "Output Port") is used more than once. 7. Block Connection Types: - All connections are made between ports of similar "electrical conserving" type (or between the dedicated ports in PS-Simulink Converter, Electrical Reference, Solver Configuration, and Scope). No block that is only a dedicated output is connected to a block that is only an input, so the types are valid. 8. Complete Port Connections: - For the Capacitor, AC Voltage Source, Resistors, Electrical Reference, Voltage Sensor, PS-Simulink Converter, Solver Configuration, and Scope, every defined port has a connection. - However, for the Voltage-Controlled Voltage Source (which has four ports: LConn1, RConn1, LConn2, and RConn2) only LConn1, RConn1, and LConn2 are connected. Its RConn2 port is left unconnected. - Similarly, for the Current-Controlled Current Source (with ports LConn1, RConn1, LConn2, and RConn2 ), the RConn2 port is not connected. - Based on the requirement that every input and output port (as defined in the simulation blocks description) must be connected under the given parameter settings, these are considered incomplete connections. Summary of Findings: - Block list, extra blocks, name formatting, connection formatting, parameter settings, duplicate connection rules, and block connection type validations are all acceptable. - The main issue is that not all ports for Voltage-Controlled Voltage Source and Current-Controlled Current Source are connected (the RConn2 ports are missing). Thus, due to the incomplete port connections, the overall simulation has errors. "Investigator\_unit\_test\_pass": False }

Listing 9: Case study of the Unit Testing Agent for the Bipolar Transistor simulation.

Thirdly, after the Unit Testing agent does not find any errors, the Block Builder agent will receive the correct investigation report and generate a corresponding simulation code.

[Block\_builder] import sys sys.path.append('/usr/local/lib/python3.10/dist-packages/matlabengine-24.2-py3.10.egg') import matlab.engine eng = matlab.engine.start\_matlab() model\_name = 'ElectricalModel' eng.new\_system(model\_name, nargout=0) eng.open\_system(model\_name, nargout=0) eng.add\_block('fl\_lib/Electrical/Electrical Elements/Capacitor', model\_name + '/C1', 'MakeNameUnique', 'on' , nargout=0) eng.add\_block('fl\_lib/Electrical/Electrical Sources/AC Voltage Source', model\_name + '/AC Voltage Source', 'MakeNameUnique', 'on', nargout=0) eng.add\_block('fl\_lib/Electrical/Electrical Sources/Voltage-Controlled Voltage Source', model\_name + '/ Voltage-Controlled Voltage source', 'MakeNameUnique', 'on', nargout=0) eng.add\_block('fl\_lib/Electrical/Electrical Elements/Resistor', model\_name + '/R1', 'MakeNameUnique', 'on', nargout=0) eng.add\_block('fl\_lib/Electrical/Electrical Elements/Resistor', model\_name + '/RBias', 'MakeNameUnique', ' nargout=0) on' eng.add\_block('fl\_lib/Electrical/Electrical Elements/Resistor', model\_name + '/R2', 'MakeNameUnique', 'on', nargout=0) eng.add\_block('fl\_lib/Electrical/Electrical Elements/Resistor', model\_name + '/RLoad', 'MakeNameUnique', ' nargout=0) on' eng.add\_block('fl\_lib/Electrical/Electrical Sources/Current-Controlled Current Source', model\_name + '/ MakeNameUnique', 'on', nargout=0) eng.add\_block('nesl\_utility/PS-Simulink Converter', model\_name + '/PS-Simulink Converter', 'MakeNameUnique' 'on', nargout=0) eng.add\_block('nesl\_utility/Solver Configuration', model\_name + '/Solver Configuration', 'MakeNameUnique', 'on', nargout=0) eng.add\_block(<sup>\*</sup>simulink/Commonly Used Blocks/Scope', model\_name + '/Scope', 'MakeNameUnique', 'on', nargout =0) eng.add\_line(model\_name, 'C1/LConn1', 'RBias/LConn1', nargout=0)
eng.add\_line(model\_name, 'C1/LConn1', 'Voltage-Controlled Voltage source/LConn1', nargout=0)
eng.add\_line(model\_name, 'C1/LConn1', 'R1/LConn1', nargout=0)
eng.add\_line(model\_name, 'C1/RConn1', 'AC Voltage Source/LConn1', nargout=0) eng.add\_line(model\_name, 'AC Voltage Source/RConn1', 'Electrical Reference/LConn1', nargout=0)
eng.add\_line(model\_name, 'RBias/RConn1', 'Electrical Reference/LConn1', nargout=0) eng.add\_line(model\_name, 'Voltage-Controlled Voltage source/RConn1', 'Electrical Reference/LConn1', nargout =0) eng.add\_line(model\_name, 'Voltage-Controlled Voltage source/LConn2', 'R2/LConn1', nargout=0)
eng.add\_line(model\_name, 'Voltage-Controlled Voltage source/RConn2', 'Electrical Reference/LConn1', nargout =0) eng.add\_line(model\_name, 'R1/RConn1', 'Current-Controlled Current source/LConn1', nargout=0) eng.add\_line(model\_name, 'Current-Controlled Current source/RConn1', 'Electrical Reference/LConn1', nargout =0) eng.add\_line(model\_name, 'Current-Controlled Current source/LConn2', 'R2/LConn1', nargout=0)
eng.add\_line(model\_name, 'Current-Controlled Current source/RConn2', 'Electrical Reference/LConn1', nargout =0) eng.add\_line(model\_name, 'R2/RConn1', 'Electrical Reference/LConn1', nargout=0) eng.add\_line(model\_name, 'RLoad/LConn1', 'Voltage-Controlled Voltage source/LConn2', nargout=0) eng.add\_line(model\_name, 'RLoad/LConn1', 'Current-Controlled Current source/LConn2', nargout=0)
eng.add\_line(model\_name, 'RLoad/LConn1', 'Voltage sensor/LConn1', nargout=0)
eng.add\_line(model\_name, 'RLoad/RConn1', 'Electrical Reference/LConn1', nargout=0) eng.add\_line(model\_name, 'Voltage sensor/RConn2', 'Electrical Reference/LConn1', nargout=0)
eng.add\_line(model\_name, 'Voltage sensor/RConn1', 'PS-Simulink Converter/LConn1', nargout=0) eng.add\_line(model\_name, 'PS-Simulink Converter/1', 'Scope/1', nargout=0)
eng.add\_line(model\_name, 'Solver Configuration/RConn1', 'Electrical Reference/LConn1', nargout=0) eng.Simulink.BlockDiagram.arrangeSystem(model\_name, nargout=0) eng.save\_system(model\_name, model\_name + '.slx', nargout=0)

Listing 10: Case study of the Block Builder agent for the Bipolar Transistor simulation.

In the next step, the Debug Locator agent will debug the code created by the Block Builder agent. Additionally, the Debug Locator can determine whether the source of the bugs is the Block Builder agent or the Investigator agent. If the bugs originate from the Block Builder, the Debug Locator instructs the Block Builder to regenerate the code based on the error message. However, if the bugs stem from the Investigator, our SimGen will return to the investigation-testing process until the correct investigation report is produced. The Debug Locator will approve the code generation and send it to the Executer agent until no errors are detected. Finally, a report in markdown file format will be generated to track the whole process of our SimGen. This report can be checked on: https://anonymous.4open.science/r/SimuGen-CD71.

[Debug\_locator] Below is our brief report:

1. Analysis and Understanding:

- The code adds blocks and makes connections exactly as specified by the Investigator's block list and port names. Each add\_block call uses the proper library path and each add\_line call refers to the designated port (for example LConn2 on both Voltage-Controlled Voltage Source and Current-Controlled Current Source).
- The connections being made follow the Implementation Details exactly, including two connections made to the same conserving port (LConn2) on the Voltage-Controlled Voltage Source and the Current-Controlled Current Source, as specified.

#### 2. Error Location:

3. Connection and Parameter Integrity:

- A careful check reveals that the block types, port names, and connections match the Investigator's Implementation Details.
- The error is not caused by erroneous setting of the block's path or wrong parameters, but it results from the fact that the same port (LConn2 of the Current-Controlled Current source) is being used twice.
- Since the code strictly adheres to the provided details and the connection configuration exactly matches the Investigator's Implementation Details, the root cause is a discrepancy in the Investigator's Implementation Details (i.e. they require two connections to a single port, which Simulink does not allow).
- Thus, no modifications to the code are recommended because the error is due to a discrepancy in the Investigator's Implementation Details.

"Investigator\_error": true

}

Listing 11: Case study of the Debug Locator Agent for the Bipolar Transistor simulation.