Publication Details
Reconstruction of Compiler's Instruction Idioms in a Retargetable Decompiler
Pokorný Fridolín, Ing.
compiler optimizations, reverse engineering, decompiler, Lissom, instruction idioms, bit twiddling hacks
Machine-code decompilation is a reverse-engineering discipline focused on reverse compilation. It performs an application recovery from binary executable files back into the high level language (HLL) representation. One of its critical tasks is to produce an accurate and well-readable code. However, this is a challenging task since the executable code may be produced by one of the modern compilers that use advanced optimizations. One type of such an optimization is usage of so-called instruction idioms. These idioms are used to produce faster or even smaller executable files. On the other hand, decompilation of instruction idioms without any advanced analysis produces almost unreadable HLL code that may confuse the user of a decompiler. In this paper, we present a method of instruction-idioms detection and reconstruction back into a readable form with the same meaning. This approach is adapted in an existing retargetable decompiler developed within the Lissom project. The implementation has been tested on several modern compilers and target architectures. According to our experimental results, the proposed solution is highly accurate on the RISC (Reduced Instruction Set Computer) processor families, but it should be further improved on the CISC (Complex Instruction Set Computer) architectures.
@inproceedings{BUT103493,
author="Jakub {Křoustek} and Fridolín {Pokorný}",
title="Reconstruction of Compiler's Instruction Idioms in a Retargetable Decompiler",
booktitle="4th Workshop on Advances in Programming Languages (WAPL'13)",
year="2013",
pages="1507--1514",
publisher="IEEE Computer Society",
address="Kraków",
isbn="978-1-4673-4471-5",
url="https://fedcsis.org/proceedings/2013/index.html"
}