Skip to content

Latest commit

 

History

History
106 lines (83 loc) · 5.19 KB

decompiling.md

File metadata and controls

106 lines (83 loc) · 5.19 KB

Decompiling

This document describes how you can start decompiling code and contribute to the project. Feel free to ask for help if you get stuck or need assistance.

Pick a source file

See the decomp tag in the issue tracker for a list of delinked source files that are ready to be decompiled. This list grows as more source files are delinked from the rest of the base ROM.

You can claim a source file by leaving a comment on its issue, so that GitHub allows us to assign you to it. This indicates that you are currently decompiling that source file.

If you want to unclaim the file, leave another comment so we can be certain that the source file is available to be claimed again. Remember to make a pull request of any progress you made on the source file, whether it is just header files or partially decompiled code.

Decompiling a source file

We use the object diffing tool objdiff to track differences between our decompiled C++ code and the base ROM's code.

  1. Download the latest release.
  2. Run configure.py <eur|usa> and ninja to generate objdiff.json in the repository root.
  3. In objdiff, set the project directory to the repository root.
  4. Select your source file in the left sidebar:
    List of objects in objdiff
  5. See the list of functions and data to decompile:
    List of symbols in objdiff

Note

If a source file is missing in objdiff, or objdiff fails to build a file, first rerun ninja to update objdiff.json. If the problem persists, feel free to ask for help.

Decompiling a function

Once you've opened a source file in objdiff, you can choose to decompile the functions in any order. We recommend starting with a small function if you're unfamiliar with decompilation. Here's an example:

Function in objdiff

As a starting point, we look at the decompiler output in Ghidra. You can request access to our shared Ghidra project in this section.

Decompiler in Ghidra

Looking at this output, we might try writing something like this:

ARM bool Actor::SetVelocity(Vec3p *vel) {
    if (mUnk_11b) {
        mVel = *vel;
        mUnk_11b = false;
        return true;
    }
    return false;
}

Now we can go back to objdiff and look at the result:

Matching function in objdiff

Success! Note that this was a simple example and that you'll sometimes get stuck on a function. In that case, try the following:

  • Decompile a different function and come back later.
  • Export to decomp.me:
    1. Press the decomp.me button in objdiff.
    2. Once you're sent to decomp.me, go to "Options" and change the preset to "Phantom Hourglass".
    3. Paste your code into the "Source code" tab.
    4. Share the link with us!

Decompiling .init functions

Note

This section will be updated as we learn more about global objects. Feel free to contribute or provide us with more information!

Functions in the .init section are static initializers. Their purpose is to call C++ constructors on global objects, and to register destructors so the global objects can be destroyed when their overlay unloads.

Static initializers are generated implicitly and do not require us to write any code ourselves. So, to generate one, you must define a global variable by using a constructor.

If the static initializer calls __register_global_object, that means the global object has a destructor. This means you'll have to declare a destructor if it doesn't exist already.

Another consequence of having a destructor is that a DestructorChain object will be added to the .bss section. This struct is 12 (0xc) bytes long and is also implicit, so we don't need to define it ourselves.

Important

An important thing to keep in mind is that a static initializer can construct multiple global objects.

Decompiling data

Note

Under construction! It's not fully clear how data is decompiled, as the compiler is strict on how it orders global variables. Feel free to contribute to this section or provide us with more information!

Other than .text and .init which contain code, there are the following sections for data:

  • .rodata: Global or static constants
  • .data: Global or static variables
  • .bss/.sbss: Global or static uninitialized variables

You can see examples of these data sections in the compilation section in build_system.md.

The Ghidra project

We use a shared Ghidra project to analyze the game and decompile functions. To gain access to the project, install Ghidra version 11.2.1 and request access from @aetias on Discord.