Code Injection (Generation V)

Author(s): Hello007, PlatinumMaster

Disclaimers

This tutorial assumes that you have some C++/ARMv5T assembly knowledge, and know how to use a compiler or build system. It is strongly recommended to check out the universal code injection guide, and brush up on C/C++/Assembly before attempting this.
- Our guide is designed to help you utilize these languages to modify game behavior. If for any reason you are uncomfortable with programming, then stop here and take some time to learn one of the languages. In this scenario, C/C++ would be the easiest, as it is the lowest level language that we have bindings for.
- It is okay to not understand how things work at first glance. If something is not explicitly spelled out for you, you should take your time to try and understand it; it will only benefit you in the long run.
This guide assumes that you are using a American Pokémon Black 2 [IREO] or Pokémon White 2 [IRDO] ROM. These are the ROM variants which are officially supported by CTRMap and PMC. It is possible to port these to other regions or games in Generation V, given the correct files are provided (as we will talk about below).
Please take your time reading this guide before jumping into code injection. It will not go anywhere.

Setting Up The Environment
Building code injection patches

Setting Up The Environment

So, you think you've got what it takes to have a crack at Generation V code injection? Then press onward!

First, we've got to set up a few prerequisites. It's a boring job but I promise it'll be smooth sailing from there on out. To start, download (and install, where applicable) all these:

Before you get into code injection, set up a CTRMap project and load it up. If everything went as planned, there should be a Code Injection section in your Extras tab.

This is where it gets interesting:

Click the Install/Update PMC button and select your PMC.rpm file. You should do this every time libRPM is updated, as the DLLs that CTRMap produces need the latest version of PMC to be recognized.
Make sure that there is a patches directory in your project's vfs/data. If there isn't one, create it. Move your NitroKernel.dll into that directory.
Export your ROM from CTRMap.
If everything went correctly, there should be a 00 byte at 0x02005050 in your game's RAM as part of PMC's initialization code.

Building code injection patches

Symbol maps

There are two quintessential things that you need in order to begin writing proper code injection modules.

A compiler with cross-compilation support for ARMv5T - you should already have this since the first chapter.
An ESDB (short for "external symbol database"), the stepping stone for interfacing and hijacking game routines.

ESDBs should be provided with the swan development headers. If for some reason you need to generate your own, you can do so utilizing the Interactive Disassembler (IDA), provided you have your own database (IDB).

Export the symbols you need from IDA using File > Produce file > Create MAP file...
Copy the contents of IDA's Segment Register table (View > Open subviews > Segment registers) into a text file.
Open the result .map file in a text editor (i.e Notepad++, Visual Studio Code), and fix the segment starting addresses to their proper values instead of 00000000. IDA's just sometimes moody like that.
Run MAP2ESDB. If you do not have it, there are two ways to acquire it.
- Use the CTRMap JAR (java -cp <path to CTRMap JAR> rpm.cli.MAP2ESDB) to launch MAP2ESDB.
- Clone and build the RPM authoring tools, then execute MAP2ESDB in your console.
Proceed with the instructions provided by the command line interface. Use your .map as --map and your segment register plaintext as --thmfile.
Once done, you should have an esdb.yml or similar all built and ready for code injection.

Programming (C/C++)

Here's where you'll finally put those headers you downloaded earlier to good use! Any function that you've included in your ESDB, you can now use from your C++ code as long as it is properly declared.

Some of the structures and functions we've researched have been compiled into the swan repo for convenience, and as long as they are in your ESDB, you can use them to their full advantage. Here are a couple of things you should keep in mind:

Functions that have C linkage should be treated as such. Declaring them inside namespaces, or even outside them as plain C++ functions, will most likely not link up with your ESDB.
Failure to link a function against the ESDB is not reported in any way, since it's indistinguishable from a regular extern function present in a DLL and linked at run-time. Always make sure you've got everything registered properly before you commit; you can do this by utilizing RPMDump.
Unless you really know what you're doing and you've got all the proper ABI functions handled, you shouldn't use the C++ standard library, RTTI or exceptions (in fact, you may want to disable them with -fno-rtti and -fno-exceptions in GCC).
Some C++ ABI features (pure virtual functions) are supported and backed by NitroKernel, but need to be included explicitly in order to prevent GCC from complaining. Including ExtLib.Include and its ABI/exl_CxxAbi.h should do the trick.
Floating point operations, albeit very slow, are supported on the DS to an extent, but should be avoided. __aeabi_#fcmp# functions aren't fully present on the target, so comparisons may produce undefined behavior.
Memory allocation through malloc and free will fail. This is because malloc by default allocates in the 0x02000000 - 0x02004000 area (as configured by the game), and said area is filled at boot for some reserved memory heaps. Instead, use the ExtLib new and delete operators (from Heap/exl_MemOperators.h) that pass through exl::heap::Allocators, or Game Freak's GFL_HeapAllocate and GFL_HeapDelete functions. What might also be handy to know is that an exl::heap::OSAllocator can be created on demand for any GFL Heap ID with exl::heap::OSAllocator::OSAllocator(HeapID heapId).
The entirety of the ExtLib.Heap library is included within your NitroKernel, so it's recommended that you use it wherever possible.
DLLs from the lib folder in the ROM root can be loaded using utility functions from NitroKernel's k::dll namespace.

Got all that? Don't worry, if you didn't, I'm fairly sure this text isn't going to go anywhere. But now, we finally get to the fun part - actually injecting stuff!

Programming (Assembly)

If you hate C/C++ for some reason, you can also use assembly. It serves the purpose well in scenarios where you need to do specific instruction tweaks, rather than a full reimplementation. The same advice applies from above, however with assembly, you will need to follow the calling conventions specified by ARM.

The ARMv5T manual is your best friend. It will not tell you how to write assembly functions, but it will teach you how each operation works.

Preparing our Code for Injection

Historically, there were many ways of injecting code into existing executables, but they all essentially get down to one thing: hooks, or branches.

Branches are, simply put, a processor directive that changes the program counter - a register containing a pointer to the currently executed instruction - to another value. Each ISA and CPU architecture has its own way of performing these, but in most cases only two things are required: a source and destination address.

In the early days of computing where programs had a fixed load address and memory space (also known as static loading), all branches were fully deterministic at compile time. This approach is still actually supported on modern CPUs through virtual memory addressing (in fact, some ancient Microsoft Windows programs rely on it to this day) and even though it's a thing of the past in desktop programming, it is still widely used in low-power embedded software, as is the case with standard Nintendo DS development.

However, since code injection is a highly volatile process with very little headroom, we've decided to borrow the more flexible approach of dynamic loading, which includes a lot of fun quirks that you can use to your own advantage.

As per the definition of branches, any program jump requires a source and destination address, but since those may not be fully known at compile-time, they are often stored into a relocation table that is used to correct the branch instructions within the program once the executable is fixed in memory.

Inasmuch as the DS has next to no memory protection, we can easily use these to perform the relocation process even outside of our program - such as the game code! All you've gotta do is adjust the relocation table accordingly, for which there are a grand total of two options:

Use RPMTool's (java -cp CTRMap.jar rpm.cli.RPMTool or build RPMAuthoringTools) --in-relocations-yml to manually specify the relocations using an YML file (not recommended).
Use the automated hook derivation process built into CTRMap.

Both options are presented in this manner, since option #1 is useful for testing relocations. While the automatic derivation might be great, it may be useful to have reassurance that hooking is done correctly.

To have the RPM converter automatically convert your function into a relocation, the name of the symbol (function name or assembly label) needs to be in one of the following formats:

<RELOCATION_TYPE>_FunctionName - hooks directly into the start of a named function (e.g. THUMB_BRANCH_BagSave_AddItem)
<RELOCATION_TYPE>_FunctionName_0xoffset - hooks into a function-relative offset (e.g. THUMB_BRANCH_LINK_BagSave_AddItem_0x2)
<RELOCATION_TYPE>_SEGMENT_0xaddress - hooks at an absolute address within a segment (e.g. FULL_COPY_ARM9_0x02008268)

I know that might be a lot to take in at first, so let's take this apart piece by piece.

First of all, there's the function names. If a function is to be overriden with another, these should match the names in the ESDB, meaning they require explicit C linkage if used in C++. The relocation will then take place at the address of the function as specified in the ESDB, optionally offset by a constant addend.
Additionally, if you don't feel like using the name database or are aiming for some memory wizardry, you can directly input the memory address of the relocation. However, as a side effect of the static loading method used on the Nintendo DS, you also have to specify a "segment" (in a similar way as the ESDB segment header does) of the address, which ensures that the relocation won't be inadvertently applied to an undesirable module, such as an overlay that shares the memory area with another.
Last, but certainly not least, there is the RELOCATION_TYPE parameter. Its value specifies what data or CPU instruction should be written to the destination address. This is especially important on the ARM architecture, as it distinguishes between two instruction sets (ARM and Thumb) which use two separate methods of encoding. As a result, you've got quite a lot of options, but be wary - they are not at all interchangeable! The specific procedures are described in detail here, but for starters, here's a quick reference that should hopefully guide you towards a correct choice:
- THUMB_BRANCH writes a one-way branch using the 16-bit Thumb instruction encoding. In most cases, this will be converted into a PUSH, BL, PUSH because of Thumb short branch restrictions, meaning you'll have to use another method for functions that use the stack for parameters (basically any with more than 4*4 bytes of arguments).
- THUMB_BRANCH_SAFESTACK does exactly that. While it takes up more space (which generally isn't a problem as the overriden function isn't going to be a little one just based off its argument count), this relocation type preserves all stack parameters.
- THUMB_BRANCH_LINK writes a simple BL/BLX using the 16-bit Thumb instruction encoding. This is useful for intercepting a function call in only one place as opposed to replacing the entire implementation.
- ARM_BRANCH and ARM_BRANCH_LINK - 32-bit ARM instructions equivalent to THUMB_BRANCH and THUMB_BRANCH_LINK respectively. Since ARM short branches cover a decent address range, there isn't a need for THUMB_BRANCH_SAFESTACK.
- FULL_COPY copies the raw contents of the function/symbol onto the destination address. Bear in mind that this will not carry over relocations (unless hard-linked into the ROM, which isn't our objective here) inside the function, so it is only useful for simple injections.

From here on, if you name your function according to these rules, the linker will magically make it so that it will override the specified code. The future is now, thanks to science!

Compiling

You can compile your code using standard GCC CLI, or using a build system like CMake. While compiling, be sure to use the following options:

-r - produce a relocatable executable.
-march=armv5t - target the ARMv5T architecture.
-mthumb (optional) - generate Thumb instructions (instead of ARM).
-Os - optimize for code size.

Additionally, if using CMake, you'll need to provide the following variables before compiler configuration to pass the compiler test:

set(CMAKE_SYSTEM_NAME             Generic)
set(CMAKE_SYSTEM_PROCESSOR            arm)
set(CMAKE_C_FLAGS   "--specs=nosys.specs")
set(CMAKE_CXX_FLAGS "--specs=nosys.specs")

If you're using dynamic linking for parts of your code, be sure to load the dependencies (and properly release them!) using k::dll::LoadLibrary (resp. k::dll::ReleaseLibrary) from NitroKernel. If you need to do this in initialization, create a DllMain function using libRPM's RPM_DLLMAIN_DECLARE and RPM_DLLMAIN_DEFINE macros, then write your loading code to run on module load/unload.

Assembling

If you do not feel like writing C or C++, or you want to do a relatively simple patch, then you are also able to use assembly! C and C++ compile into assembly, which is then assembled into an executable and linkable format (ELF) file. Assembly is just assembled into the ELF format directly. So, you can very easily use tools such as the GNU assembler.

To use assembly, just follow the same conventions above for naming functions/symbols.

Then, assemble and link your file. Assuming you are using the GNU assembler, you just need to use the following options when assembling:

-march=armv5t - target the ARMv5T architecture.
-mthumb (optional) - generate Thumb instructions (instead of ARM).
-mtune=arm946e-s (optional) - target the ARM946E-S architecture.

It can then be linked with compiled C/C++ code, by using standard linking procedures (arm-none-eabi-ld or similar).

Linking

Once you have your final ELF binary, you need to fix the linkage of the game functions (as mentioned in the general code injection guide).

Fortunately, linking with CTRMap is probably the simplest part of the entire process. In fact, it's so trivial that we'll skip over it in just one (1) step:

Click the Convert ELF to DLL button in CTRMap's Extras panel and follow the instructions.

Just in case you didn't hear me, that's:

When prompted, select the ESDB file to be used for linking (be wary that this will not be reloaded when changed on disk).
Then select the ELF file you compiled and a destination DLL file.
Copy the result DLL into your patches or lib folder.
You're done! Save your ROM, cross your fingers and start it up!

Patch priority

Should you need to change the priority of loading a DLL patch to higher than 4 (default), hold the Alt key while clicking Convert ELF to DLL and you'll be prompted to choose the priority after conversion.

Changing ESDBs mid-stream

Holding Shift while clicking Convert ELF to DLL will prompt you to re-select an ESDB even if you've selected one already.

Dynamic loading

Unlike WinAPI where you need to use GetProcAddress unless you know the exact address layout of your DLL, libRPM's dynamic linker only requires function names to match (the lookup is done using fast hash tables, so technically there are only about 4 billion possible combos, meaning collision is possible). As a result, headers are all that's needed to properly include a library.

NitroKernel's library loader forces all libraries to be stored in /lib with the .dll extension. A library named Library.dll would then be loaded with k::dll::LoadLibrary("Library").

Debug prints

If you're using DeSmuMe or a similar "nocash message" compliant emulator, you can use the _DeSmuMe DLL of NitroKernel to enable debug prints through k::Print or k::Printf.

The maximum length of a print string in NitroKernel is 1024 characters.

Multithreading

Should you really need to simulate a multi-threaded context on the Nintendo DS's single-core CPU (such as for audio processing or asynchronous rendering), you can do so using NitroKernel's kThread library. Keep in mind though that this will add a slight overhead to your code and should wherever possible be substituted with a single-threaded replacement.

RPM version support

RPMs that do not target the libRPM version present in the installed PMC module will produce undefined behavior. It is therefore recommended to recompile all modules to the latest version in case of uncertainty. If you're using CTRMap, you can upgrade all modules in the patches and lib VFS folders using a button in the Extras panel.

Troubleshooting

DeSmuME + IDA/melonDS debugger is your best friend.

Disclaimers​

Table of Contents​

Setting Up The Environment​

Building code injection patches​

Symbol maps​

Programming (C/C++)​

Programming (Assembly)​

Preparing our Code for Injection​

Compiling​

Assembling​

Linking​

Patch priority​

Changing ESDBs mid-stream​

Dynamic loading​

Debug prints​

Multithreading​

RPM version support​

Troubleshooting​