全部博文(1159)
分类: Android平台
2016-01-05 22:14:43
Imagine you have a Linux kernel image for an Android phone, but you don’t have the corresponding source, nor do you have the corresponding kernel headers. Imagine that kernel has module support (fortunately), and that you’d like to build a module for it to load. There are several good reasons why you can’t just build a new kernel from source and be done with it (e.g. the resulting kernel lacks support for important hardware, like the LCD or touchscreen). With the ever-changing Linux kernel ABI, and the lack of source and headers, you’d think you’re pretty much in a dead-end.
As a matter of fact, if you build a kernel module against different kernel headers, the module will fail to load with errors depending on how different they are. It can complain about bad signatures, bad version or other different things.
But more on that later.
The first thing is to find a kernel source for something close enough to the kernel image you have. That’s probably the trickiest part with getting a proper configuration. Start from the version number you can read from /proc/version. If, like me, you’re targeting an Android device, try Android kernels from Code Aurora, Linaro, Cyanogen or Android, whichever is closest to what is in your phone. In my case, it was msm-3.0 kernel. Note you don’t necessarily need the exact same version. A minor version difference is still likely to work. I’ve been using a 3.0.21 source, which the kernel image was 3.0.8. Don’t however try e.g. using a 3.1 kernel source when the kernel you have is 3.0.x.
If the kernel image you have is kind enough to provide a /proc/config.gz file, you can start from there, otherwise, you can try starting from the default configuration, but you need to be extra careful, then (although I won’t detail using the default configuration because I was fortunate enough that I didn’t have to, there will be some details further below as to why a proper configuration is important).
Assuming arm-eabi-gcc is in your PATH, and that you have a shell opened in the kernel source directory, you need to start by configuring the kernel and install headers and scripts:
$ mkdir build $ gunzip -c config.gz > build/.config # Or whatever you need to prepare a .config $ make silentoldconfig prepare headers_install scripts ARCH=arm CROSS_COMPILE=arm-eabi- O=build KERNELRELEASE=`adb shell uname -r`
The silentoldconfig target is likely to ask you some questions about whether you want to enable some things. You may want to opt for the default, but that may also not work properly.
You may use something different for KERNELRELEASE, but it needs to match the exact kernel version you’ll be loading the module from.
To create a dummy module, you need to create two files: a source file, and a Makefile.
Place the following content in a hello.c file, in some dedicated directory:
#include/* Needed by all modules */ #include /* Needed for KERN_INFO */ #include /* Needed for the macros */ static int __init hello_start(void) { printk(KERN_INFO "Hello world\n"); return 0; } static void __exit hello_end(void) { printk(KERN_INFO "Goodbye world\n"); } module_init(hello_start); module_exit(hello_end);
Place the following content in a Makefile under the same directory:
obj-m = hello.o
Building such a module is pretty straightforward, but at this point, it won’t work yet. Let me enter some details first.
When you normally build the above module, the kernel build system creates a hello.mod.c file, which content can create several kind of problems:
MODULE_INFO(vermagic, VERMAGIC_STRING);
VERMAGIC_STRING is derived from the UTS_RELEASE macro defined in include/generated/utsrelease.h, generated by the kernel build system. By default, its value is derived from the actual kernel version, and git repository status. This is what setting KERNELRELEASE when configuring the kernel above modified. If VERMAGIC_STRING doesn’t match the kernel version, loading the module will lead to the following kind of message in dmesg:
hello: version magic '3.0.21-perf-ge728813-00399-gd5fa0c9' should be '3.0.8-perf'
Then, there’s the module definition.
struct module __this_module __attribute__((section(".gnu.linkonce.this_module"))) = { .name = KBUILD_MODNAME, .init = init_module, #ifdef CONFIG_MODULE_UNLOAD .exit = cleanup_module, #endif .arch = MODULE_ARCH_INIT, };
In itself, this looks benign, but the struct module, defined in include/linux/module.h comes with an unpleasant surprise:
struct module { (...) #ifdef CONFIG_UNUSED_SYMBOLS (...) #endif (...) /* Startup function. */ int (*init)(void); (...) #ifdef CONFIG_GENERIC_BUG (...) #endif #ifdef CONFIG_KALLSYMS (...) #endif (...) (... plenty more ifdefs ...) #ifdef CONFIG_MODULE_UNLOAD (...) /* Destruction function. */ void (*exit)(void); (...) #endif (...) }
This means for the init pointer to be at the right place, CONFIG_UNUSED_SYMBOLS needs to be defined according to what the kernel image uses. And for the exit pointer, it’s CONFIG_GENERIC_BUG, CONFIG_KALLSYMS, CONFIG_SMP, CONFIG_TRACEPOINTS, CONFIG_JUMP_LABEL, CONFIG_TRACING, CONFIG_EVENT_TRACING, CONFIG_FTRACE_MCOUNT_RECORD and CONFIG_MODULE_UNLOAD.
Start to understand why you’re supposed to use the exact kernel headers matching your kernel?
Then, the symbol version definitions:
static const struct modversion_info ____versions[] __used __attribute__((section("__versions"))) = { { 0xsomehex, "module_layout" }, { 0xsomehex, "__aeabi_unwind_cpp_pr0" }, { 0xsomehex, "printk" }, };
These come from the Module.symvers file you get with your kernel headers. Each entry represents a symbol the module requires, and what signature it is expected to have. The first symbol, module_layout, varies depending on what struct module looks like, i.e. depending on which of the config options mentioned above are enabled. The second, __aeabi_unwind_cpp_pr0, is an ARM ABI specific function, and the last, is for our printk function calls.
The signature for each function symbol may vary depending on the kernel code for that function, and the compiler used to compile the kernel. This means that if you have a kernel you built from source, modules built for that kernel, and rebuild the kernel after modifying e.g. the printk function, even in a compatible way, the modules you built initially won’t load with the new kernel.
So, if we were to build a kernel from the hopefully close enough source code, with the hopefully close enough configuration, chances are we wouldn’t get the same signatures as the binary kernel we have, and it would complain as follows, when loading our module:
hello: disagrees about version of symbol symbol_name
Which means we need a proper Module.symvers corresponding to the binary kernel, which, at the moment, we don’t have.
Conveniently, since the kernel has to do these verifications when loading modules, it actually contains a list of the symbols it exports, and the corresponding signatures. When the kernel loads a module, it goes through all the symbols the module requires, in order to find them in its own symbol table (or other modules’ symbol table when the module uses symbols from other modules), and check the corresponding signature.
The kernel uses the following function to search in its symbol table (in kernel/module.c):
bool each_symbol_section(bool (*fn)(const struct symsearch *arr, struct module *owner, void *data), void *data) { struct module *mod; static const struct symsearch arr[] = { { __start___ksymtab, __stop___ksymtab, __start___kcrctab, NOT_GPL_ONLY, false }, { __start___ksymtab_gpl, __stop___ksymtab_gpl, __start___kcrctab_gpl, GPL_ONLY, false }, { __start___ksymtab_gpl_future, __stop___ksymtab_gpl_future, __start___kcrctab_gpl_future, WILL_BE_GPL_ONLY, false }, #ifdef CONFIG_UNUSED_SYMBOLS { __start___ksymtab_unused, __stop___ksymtab_unused, __start___kcrctab_unused, NOT_GPL_ONLY, true }, { __start___ksymtab_unused_gpl, __stop___ksymtab_unused_gpl, __start___kcrctab_unused_gpl, GPL_ONLY, true }, #endif }; if (each_symbol_in_section(arr, ARRAY_SIZE(arr), NULL, fn, data)) return true; (...)
The struct used in this function is defined in include/linux/module.h as follows:
struct symsearch { const struct kernel_symbol *start, *stop; const unsigned long *crcs; enum { NOT_GPL_ONLY, GPL_ONLY, WILL_BE_GPL_ONLY, } licence; bool unused; };
Note: this kernel code hasn’t changed significantly in the past four years.
What we have above is three (or five, when CONFIG_UNUSED_SYMBOLS is defined) entries, each of which contains the start of a symbol table, the end of that symbol table, the start of the corresponding signature table, and two flags.
The data is static and constant, which means it will appear as is in the kernel binary. By scanning the kernel for three consecutive sequences of three pointers within the kernel address space followed by two integers with the values from the definitions in each_symbol_section, we can deduce the location of the symbol and signature tables, and regenerate a Module.symvers from the kernel binary.
Unfortunately, most kernels these days are compressed (zImage), so a simple search is not possible. A compressed kernel is actually a small bootstrap binary followed by a compressed stream. It is possible to scan the kernel zImage to look for the compressed stream, and decompress it from there.
I wrote a script to do . It should work on any recent kernel, provided it is not relocatable and you know the base address where it is loaded. It takes options for the number of bits and endianness of the architecture, but defaults to values suitable for ARM. The base address, however, always needs to be provided. It can be found, on ARM kernels, in dmesg:
$ adb shell dmesg | grep "\.init" <5>[01-01 00:00:00.000] [0: swapper] .init : 0xc0008000 - 0xc0037000 ( 188 kB)
The base address in the example above is 0xc0008000.
If like me you’re interested in loading the module on an Android device, then what you have as a binary kernel is probably a complete boot image. A boot image contains other things besides the kernel, so you can’t use it directly with the script. Except if the kernel in that boot image is compressed, in which case the part of the script that looks for the compressed image will find it anyways.
If the kernel is not compressed, you can use the unbootimg program as outlined in this old post of mine to get the kernel image out of your boot image. Once you have the kernel image, the script can be invoked as follows:
$ python extract-symvers.py -B 0xc0008000 kernel-filename > Module.symvers
Symbols and signature info could also be extracted from binary modules, but I was not interested in that information so the script doesn’t handle that.
Now that we have a proper Module.symvers for the kernel we want to load our module in, we can finally build the module:
(again, assuming arm-eabi-gcc is in your PATH, and that you have a shell opened in the kernel source directory)
$ cp /path/to/Module.symvers build/ $ make M=/path/to/module/source ARCH=arm CROSS_COMPILE=arm-eabi- O=build modules
And that’s it. You can now copy the resulting hello.ko onto the device and load it.
$ adb shell # insmod hello.ko # dmesg | grep insmod <6>[mm-dd hh:mm:ss.xxx] [id: insmod]Hello world # lsmod hello 586 0 - Live 0xbf008000 (P) # rmmod hello # dmesg | grep rmmod <6>[mm-dd hh:mm:ss.xxx] [id: rmmod]Goodbye world
2012-08-06 15:11:41+0100
You can leave a response, or trackback from your own site.
2012-08-06 19:42:51+0100
Excellent post
2013-01-17 09:32:47+0100
Hi,
I wanted to thank you for this guide,
you’ve explained very well a lot of details I couldn’t find elsewhere.
I’m building a module for a device which is based on AML8726-M3, I’m currently stuck in retrieving symbols and signature info, I’ve retrieved the boot image from the device but when I use unbootimg I get this error: “kernel magic mismatch”. I’ve tried other tools like split_bootimg.pl and unpack_bootimg.pl but they fail too.
any idea on how to extract the kernel image?
also, you’ve outlined that symbols and signature can also be retrieved from a binary module, how would you do this?
thank you!
2013-01-17 11:13:06+0100
for everyone interested in AML8726-M3 it seems that the boot image is uImage and it is lzma compressed so.. let’s see how to extract the kernel from that
2013-01-25 14:31:38+0100
Hi, excellent post, thanks! I am currently dealing with the reverse issue. I am a custom kernel dev and I am trying to support an app on the app store that uses a proprietary kernel module. I have been working with the dev to try to get him to support my kernel, but it has been an nightmare. To support all the different custom kernels, the dev is doing what you have described above and has to provide a different module for each one he supports. He probes and tries to load the correct module.
So, I was wondering. Is there a way, since I have the kernel source, to reverse engineer my kernel to match the binary footprint this proprietary kernel module? My source is based on stock source and so is the base where he compiles his module. Some other custom kernels get lucky and the module loads and works. But in my kernel that his little changes from stock still doesn’t work. If I set MODVERSIONS=n, then it will load but still fails.
If you have any ideas, please let me know! I’m sure the dev would appreciate it to since he could build one module and then tell the kernel devs what to do to make the kernel compatible so he doesn’t have to do all the work to support new kernels and existing ones as they change.
Thanks again!
2013-03-15 21:43:31+0100
Hi! Thanks for great post! How to extract symbols from binary modules? (.ko files, right?)
2013-05-12 04:46:14+0100
[…] the process of building and installing one. Much of this article draws on the more general Building a Linux kernel module without the exact kernel headers and adds specifics for this particular purpose. I strongly suggest reading that article for more […]
2013-06-03 00:27:35+0100
[…] http://glandium.org/blog/?p=2664 […]
2013-10-22 05:32:20+0100
hi. excellent writeup and script, HOWEVER, when running (using correct value), I get output like:
0x00000000 ??!???????????
(0??@C????U vmlinux EXPORT_SYMBOL_GPL
0x00000000 ??!???????????
(0??@C????U vmlinux EXPORT_SYMBOL_GPL
0x00000000 ??!???????????
(0??@C????U vmlinux EXPORT_SYMBOL_GPL
0x00000000 ??!???????????
(0??@C????U vmlinux EXPORT_SYMBOL_GPL
0x00000000 ??!???????????
(0??@C????U vmlinux EXPORT_SYMBOL_GPL
0x00000000 ??!???????????
(0??@C????U vmlinux EXPORT_SYMBOL_GPL
any ideas?
2013-11-09 21:22:41+0100
Excellent article!,
I am trying to build Generic USB audio driver for my tablet. its Android ICS, “3.0.8-vimicro preemt”.The kernel src is not avail from the vendor “Penta tablets”. I have requested them but after this article i can go ahead and start working :)
2014-02-26 08:19:18+0100
This is a fantastic article, that works for many different Android tablets. But some times this doesn’t work. Mostly because different kernels could be compiled in different ways.
Some times you can get info from a kernel by using “objdump” or “readelf” (instead of looking at dmesg).
And other times you can extract the Module.symvers file from /dev/kmem by using this script:
Finaly, I’d like to show you the URL of another interesting article about inserting modules into the kernel by using /dev/kmem
Kind regards!
2014-06-25 21:11:17+0100
I run:
python extract-symvers.py -B 0xc05f8000 uImage > Module.symvers
and
python extract-symvers.py -B 0xc0008000 boot.img-kernel > Module.symvers
with two different arm kernels and I get the same output:
Traceback (most recent call last):
File “extract-symvers.py”, line 139, in
main()
File “extract-symvers.py”, line 135, in main
for s, crc, t in kernel.symbols():
File “extract-symvers.py”, line 102, in symbols
for t, s in symsearch.items():
AttributeError: ‘NoneType’ object has no attribute ‘items’
It creates an empty “Module.symvers” file
Is there anything I should do to fix it?
2014-09-10 22:35:07+0100
Thanks this seems to be exactly what I need, but when I run the make m= I get this error:
ERROR: Kernel configuration is invalid.
include/generated/autoconf.h or include/config/auto.conf are missing.
Run ‘make oldconfig && make prepare’ on kernel src to fix it.
I have tried to run make oldconfig and make prepare, but the error stays the same, any ideas?
2015-03-20 17:14:40+0100
Great post! Thumbs up for the very detailed explanations.
The base address from the dmesg output did not seem to work for me but 0xc0008000 worked fine. Following is the command
python extract-symvers.py -B 0xc0008000 zImage > Module.symvers
Best
2015-06-12 15:22:59+0100
Thank you! This is an amazing article.
The based address from .init didn’t worked with extract-symvers but the .text address worked, as Onur already commented.
Best regards!