The Good, the Bad, and the Binary: An LSTM-Based Method for Section Boundary Detection in Firmware AnalysisOpen Website

Published: 01 Jan 2023, Last Modified: 05 Oct 2023IWSEC 2023Readers: Everyone
Abstract: Static analysis tools need information about the ISA and the boundaries of the code and data sections of the binary they analyze. This information is often not readily available in embedded systems firmware, often provided only in a non-standard format or as a raw memory dump. This paper proposes a novel methodology for ISA identification and code and data separation, that extends and improves the state of the art. We identify the main shortcoming of state-of-the-art approaches and add a capability to classify packed binaries’ architecture employing an entropy-based method. Then, we implement an LSTM-based model with heuristics to recognize the section boundaries inside a binary, showing that it outperforms state-of-the-art methods. Finally, we evaluate our approach on a dataset of binaries extracted from real-world firmware.
0 Replies

Loading