全部博文(2759)
分类:
2012-08-30 01:52:01
原文地址:64位Linux的内核和用户地址空间 作者:ztguang
x86-64 is an extension of the . It supports vastly larger virtual and physical address spaces than are possible on x86, thereby allowing programmers to conveniently work with much larger data sets. x86-64 also provides general purpose registers and numerous other enhancements. The original specification was created by , and has been implemented by AMD, , , and others. It is fully backwards compatible with Intel 16-bit and 32-bit code.(p13-14) Because the full x86 16-bit and 32-bit instruction sets remains implemented in hardware without any intervening emulation, existing x86 executables run with no compatibility or performance penalties, although existing applications that are recoded to take advantage of new features of the processor design may see performance increases.
AMD's method of extending Intel's x86 32-bit instruction set to be a subset of its x86-64 instruction set is the same technique Intel employed to extend its .
Prior to launch, "x86-64" and "x86_64" were used to refer to the instruction set. Upon release, AMD named it AMD64 Intel initially used the names IA-32e and EM64T before finally settling on Intel 64 for their implementation. x86-64 is still used by many in the industry, while others, notably (now ) and , use x64 while the BSD family of OSs use AMD64.
The core was the first to implement the architecture; this was the first significant addition to the architecture designed by a company other than Intel. Intel was forced to follow suit and introduced a modified family which was fully software-compatible with AMD's design and specification. introduced x86-64 in their VIA Isaiah architecture, with the .
The x86-64 specification is distinct from the Intel (formerly IA-64) architecture, which is not compatible on the native instruction set level with the x86 architecture.
Contents
|
The AMD64 instruction set is implemented in AMD's , , , , , , , , , , and later processors.
History of AMD64AMD64 was created as an alternative to the radically different architecture, which was designed by Intel and . Originally announced in 1999 with a full specification in August 2000, the AMD64 architecture was positioned by AMD from the beginning as an evolutionary way to add 64-bit computing capabilities to the existing x86 architecture, as opposed to Intel's approach of creating an entirely new 64-bit architecture with IA-64.
The first AMD64-based processor, the , was released in April 2003.
Architectural featuresThe primary defining characteristic of AMD64 is the availability of 64-bit general-purpose , e.g. rax, rbx etc., 64-bit arithmetic and logical operations, and 64-bit . The designers took the opportunity to make other improvements as well. The most significant changes include:
Although virtual addresses are 64 bits wide in 64-bit mode, current implementations (and all chips known to be in the planning stages) do not allow the entire virtual address space of 264 bytes (16 EB) to be used. Most operating systems and applications will not need such a large address space for the foreseeable future (for example, Windows implementations for AMD64 are only populating 16 TB, or 44 bits' worth), so implementing such wide virtual addresses would simply increase the complexity and cost of address translation with no real benefit. AMD therefore decided that, in the first implementations of the architecture, only the least significant 48 bits of a virtual address would actually be used in address translation ( lookup).(p120) Further, bits 48 through 63 of any virtual address must be copies of bit 47 (in a manner akin to ), or the processor will raise an exception.(p131) Addresses complying with this rule are referred to as "canonical form."(p130) Canonical form addresses run from 0 through 00007FFF'FFFFFFFF, and from FFFF8000'00000000 through FFFFFFFF'FFFFFFFF, for a total of 256 TB of usable virtual address space.
This "quirk" allows an important feature for later scalability to true 64-bit addressing: many operating systems (including, but not limited to, the family) take the higher-addressed half of the address space (named kernel space) for themselves and leave the lower-addressed half (user space) for application code, user mode stacks, heaps, and other data regions. The "canonical address" design ensures that every AMD64 compliant implementation has, in effect, two memory halves: the lower half starts at 00000000'00000000 and "grows upwards" as more virtual address bits become available, while the higher half is "docked" to the top of the address space and grows downwards. Also, fixing the contents of the unused address bits prevents their use by operating system as flags, privilege markers, etc., as such use could become problematic when the architecture is extended to implement more bits of virtual addresses.
Current 48-bit implementation | 56-bit implementation | Full 64-bit implementation |
The 64-bit addressing mode ("") is a superset of (PAE); because of this, sizes may be 4 (212 bytes) or 2 (221 bytes).(p120) Long mode also supports page sizes of 1 (230 bytes).(p120) Rather than the three-level system used by systems in PAE mode, systems running in use four levels of page table: PAE's Page-Directory Pointer Table is extended from 4 entries to 512, and an additional Page-Map Level 4 (PML4) Table is added, containing 512 entries in 48-bit implementations.(p131) In implementations providing larger virtual addresses, this latter table would either grow to accommodate sufficient entries to describe the entire address range, up to a theoretical maximum of 33,554,432 entries for a 64-bit implementation, or be over ranked by a new mapping level, such as a PML5. A full mapping hierarchy of 4 KB pages for the whole 48-bit space would take a bit more than 512 GB of RAM (about 0.196% of the 256 TB virtual space).
Operating system limitsThe operating system can also limit the virtual address space. Details, where applicable, are given in the "" section.
Physical address space detailsCurrent AMD64 implementations support a physical address space of up to 248 bytes of RAM, or 256 TB,. A larger amount of installed RAM allows the operating system to keep more of the workload's pageable data and code in RAM, which can improve performance, though various workloads will have different points of diminishing returns.
The upper limit on RAM that can be used in a given x86-64 system depends on a variety of factors and can be far less than that implemented by the processor. For example, as of June 2010, there are no known for x86-64 processors that support 256 TB of RAM. The operating system may place additional limits on the amount of RAM that is usable or supported. Details on this point are given in the "" section of this article.
Operating modesOperating mode | required | Compiled-application rebuild required | Default address size | Default operand size | Register extensions | Typical width | |
---|---|---|---|---|---|---|---|
64-bit mode | OS with 64-bit support, or bootloader for 64-bit OS | Yes | 64 | 32 | Yes | 64 | |
Compatibility mode | No | 32 | 32 | No | 32 | ||
16 | 16 | 16 | |||||
Legacy mode | Legacy 16-bit or 32-bit OS; or bootloader for 16, 32, or 64-bit OS | No | 32 | 32 | No | 32 | |
16 | 16 | 16 | |||||
Legacy 16-bit or 32-bit OS | 16 | 16 | 16 | ||||
Legacy 16-bit OS; or bootloader for 16, 32, or 64 bit OS |
The architecture has two primary modes of operation:
Long modeThe architecture's intended primary mode of operation; it is a combination of the processor's native 64-bit mode and a combined 32-bit and 16-bit compatibility mode. It is used by 64-bit operating systems. Under a 64-bit operating system, 64-bit programs run under 64-bit mode, and 32-bit and 16-bit protected mode applications (that do not need to use either real mode or virtual 8086 mode in order to execute at any time) run under compatibility mode. Real-mode programs and programs that use virtual 8086 mode at any time cannot be run in long mode unless they are emulated in software.
Since the basic instruction set is the same, there is almost no performance penalty for executing protected mode x86 code. This is unlike Intel's , where differences in the underlying means that running 32-bit code must be done either in emulation of x86 (making the process slower) or with a dedicated x86 core. However, on the x86-64 platform, many x86 applications could benefit from a 64-bit , due to the additional registers in 64-bit code and guaranteed SSE2-based FPU support, which a can use for optimization. However, applications that regularly handle integers wider than 32 bits, such as cryptographic algorithms, will need a rewrite of the code handling the huge integers in order to take advantage of the 64-bit registers.
Legacy modeThe mode used by 16-bit ('protected mode' or 'real mode') and 32-bit operating systems. In this mode, the processor acts like a 32-bit x86 processor, and only 16-bit and 32-bit code can be executed. Legacy mode allows for a maximum of 32 bit virtual addressing which limits the virtual address space to 4 GB.(p14)(p24)(p118) 64-bit programs cannot be run from legacy mode.
AMD64 implementationsThe following processors implement the AMD64 architecture:
Intel 64 is Intel's implementation of x86-64. It is used in newer versions of , , and processors, the D510, N450, N550, N2600 and N2800 and in all versions of the , , , , and processors.
History of Intel 64Historically, AMD has developed and produced processors patterned after Intel's original designs, but with x86-64, roles were reversed: Intel found itself in the position of adopting the architecture which AMD had created as an extension to Intel's own x86 processor line.
Intel's project was originally Yamhill (after the in Oregon's Willamette Valley). After several years of denying its existence, Intel announced at the February 2004 that the project was indeed underway. Intel's chairman at the time, , admitted that this was one of their worst kept secrets.
Intel's name for this instruction set has changed several times. The name used at the IDF was CT (presumably for Clackamas Technology, another codename from an ); within weeks they began referring to it as IA-32e (for extensions) and in March 2004 unveiled the "official" name EM64T (Extended Memory 64 Technology). In late 2006 Intel began instead using the name Intel 64 for its implementation, paralleling AMD's use of the name AMD64.
Intel 64 implementationsThe first processor to implement Intel 64 was the multi-socket processor code-named in June 2004. In contrast, the initial Prescott chips (February 2004) did not enable this feature. Intel subsequently began selling Intel 64-enabled Pentium 4s using the E0 revision of the Prescott core, being sold on the OEM market as the Pentium 4, model F. The E0 revision also adds eXecute Disable (XD) (Intel's name for the ) to Intel 64, and has been included in then current Xeon code-named Irwindale. Intel's official launch of Intel 64 (under the name EM64T at that time) in mainstream desktop processors was the N0 Stepping Prescott-2M. All 9xx, 8xx, 6xx, 5x9, 5x6, 5x1, 3x6, and 3x1 series CPUs have Intel 64 enabled, as do the CPUs, as will future Intel CPUs for workstations or servers. Intel 64 is also present in the last members of the line.
The first Intel implementing Intel 64 is the version of the processor, which was released on 27 July 2006. None of Intel's earlier notebook CPUs (, , , ) implements Intel 64.
The following processors implement the Intel 64 architecture:
The VIA Nano (formerly VIA Isaiah) is a for . The VIA Nano was released by in 2008 after five years of development by its CPU division, . This new Isaiah 64-bit architecture was designed from scratch, unveiled on 24 January 2008, and launched on May 29, including low-voltage variants and the Nano brand name. The processor supports a number of VIA-specific x86 extensions designed to boost efficiency in low-power appliances. It is expected that the VIA Isaiah will be twice as fast in integer performance and four times as fast in performance as the previous-generation at an equivalent . Power consumption is also expected to be on par with the previous-generation VIA CPUs, with ranging from 5 W to 25 W. Being a completely new design, the Isaiah architecture was built with support for features like the x86-64 instruction set and which were unavailable on its predecessors, the line, while retaining their encryption extensions.
Differences between AMD64 and Intel 64Although nearly identical, there are some differences between the two instruction sets in the semantics of a few seldom used machine instructions (and/or situations), which are mainly used for . Compilers generally produce executables (i.e. ) that avoid any differences, at least for ordinary . This is therefore of interest mainly to developers of compilers, operating systems and similar, which must deal with individual and special system instructions.
Recent implementationsThe following operating systems and releases support the x86-64 architecture in .
BSD DragonFly BSDPreliminary infrastructure work was started in February 2004 for a x86-64 port. This development later stalled. Development started again during July 2007 and continued during 2008 and SoC 2009. The first official release to contain x86-64 support was version 2.4.
FreeBSDfirst added x86-64 support under the name "amd64" as an experimental architecture in 5.1-RELEASE in June 2003. It was included as a standard distribution architecture as of 5.2-RELEASE in January 2004. Since then, FreeBSD has designated it as a Tier 1 platform. The 6.0-RELEASE version cleaned up some quirks with running x86 executables under amd64, and most drivers work just as they do on the x86 architecture. Work is currently being done to integrate more fully the x86 (ABI), in the same manner as the Linux 32-bit ABI compatibility currently works.
NetBSDx86-64 architecture support was first committed to the source tree on 19 June 2001. As of NetBSD 2.0, released on 9 December 2004, NetBSD/amd64 is a fully integrated and supported port. 32-bit code is still supported in 64-bit mode, with a netbsd-32 kernel compatibility layer for 32-bit syscalls. The NX bit is used to provide non-executable stack and heap with per-page granularity (segment granularity being used on 32-bit x86).
OpenBSDhas supported AMD64 since OpenBSD 3.5, released on 1 May 2004. Complete in-tree implementation of AMD64 support was achieved prior to the hardware's initial release due to AMD's loaning of several machines for the project's that year. OpenBSD developers have taken to the platform because of its support for the , which allowed for an easy implementation of the feature.
The code for the AMD64 port of OpenBSD also runs on Intel 64 processors which contains cloned use of the AMD64 extensions, but since Intel left out the page table NX bit in early Intel 64 processors, there is no W^X capability on those Intel CPUs; later Intel 64 processors added the NX bit under the name "XD bit". (SMP) works on OpenBSD's AMD64 port, starting with release 3.6 on 1 November 2004.
DOSIt is possible to enter under without a DOS extender, but the user must return to real mode in order to call BIOS or DOS interrupts.
It may also be possible to enter with a similar to , but more complex since x86-64 lacks . DOS itself is not aware of that, and no benefits should be expected unless running DOS in an emulation with an adequate virtualization driver backend, for example: the mass storage interface.
Linuxwas the first operating system kernel to run the x86-64 architecture in , starting with the 2.4 version in 2001 (prior to the physical hardware's availability). Linux also provides backward compatibility for running 32-bit executables. This permits programs to be recompiled into long mode while retaining the use of 32-bit programs. Several Linux distributions currently ship with x86-64-native kernels and . Some, such as , , , and , allow users to install a set of 32-bit components and libraries when installing off a 64-bit DVD, thus allowing most existing 32-bit applications to run alongside the 64-bit OS. Other distributions, such as , and , are available in one version compiled for a 32-bit architecture and another compiled for a 64-bit architecture. Fedora and allow concurrent installation of all userland components in both 32 and 64-bit versions on a 64-bit system.
(Application Binary Interface), introduced in Linux 3.4, allows programs compiled for the x32 ABI to run in the 64-bit mode of x86-64 while only using 32-bit pointers and data fields. Though this limits the program to a virtual address space of 4 GB it also decreases the memory footprint of the program and in some cases can allow it to run faster.
64-bit Linux allows up to 128 of virtual address space for individual processes, and can address approximately 64 TB of physical memory, subject to processor and system limitations.
Mac OS XMac OS X v10.4.7 and higher versions of run 64-bit command-line tools using the POSIX and math libraries on 64-bit Intel-based machines, just as all versions of Mac OS X v10.4 and 10.5 run them on 64-bit PowerPC machines. No other libraries or frameworks work with 64-bit applications in Mac OS X v10.4. The kernel, and all kernel extensions, are 32-bit only.
supports 64-bit GUI applications using , , , and on 64-bit Intel-based machines, as well as on 64-bit machines. All non-GUI libraries and frameworks also support 64-bit applications on those platforms. The kernel, and all kernel extensions, are 32-bit only.
is the first version of Mac OS X that supports a 64-bit . However, with its first release (v10.6.0), not all 64-bit computers are currently supported. The 64-bit kernel, like the 32-bit kernel, supports 32-bit applications; both kernels also support 64-bit applications. 32-bit applications have a virtual address space limit of 4 GB under either kernel.
The 64-bit kernel does not support 32-bit , and the 32-bit kernel does not support 64-bit kernel extensions.
Mac OS X uses the format to package 32- and 64-bit versions of application and library code into a single file; the most appropriate version is automatically selected at load time. In Mac OS X 10.6, the universal binary format is also used for the kernel and for those kernel extensions that support both 32-bit and 64-bit kernels.
Solaris10 and later releases support the x86-64 architecture.
For Solaris 10, just as with the architecture, there is only one operating system image, which contains a 32-bit kernel and a 64-bit kernel; this is labeled as the "x64/x86" DVD-ROM image. The default behavior is to boot a 64-bit kernel, allowing both 64-bit and existing or new 32-bit executables to be run. A 32-bit kernel can also be manually selected, in which case only 32-bit executables will run. The isainfo command can be used to determine if a system is running a 64-bit kernel.
For Solaris 11, only the 64-bit kernel is provided. However, the 64-bit kernel supports both 32- and 64-bit executables, libraries, and system calls.
Windowsx64 editions of Microsoft Windows client and server, and x64 Edition were released in March 2005. Internally they are actually the same build (5.2.3790.1830 SP1), as they share the same source base and operating system binaries, so even system updates are released in unified packages, much in the manner as Windows 2000 Professional and Server editions for x86. , which also has many different editions, was released in January 2007. was released in July 2009. and later versions will only be available as x64 versions. Windows for x64 has the following characteristics:
Since AMD64 and Intel 64 are substantially similar, many software and hardware products use one vendor-neutral term to indicate their compatibility with both implementations. AMD's original designation for this processor architecture, "x86-64", is still sometimes used for this purpose, as is the variant "x86_64". Other companies, such as and , use the contraction "x64" in marketing material.
The term refers to the processor, and should not be confused with x86-64, as it is a completely different instruction set.
Many operating systems and products, especially those that introduced x86-64 support prior to Intel's entry into the market, use the term "AMD64" or "amd64" to refer to both AMD64 and Intel 64.
Intel licenses to AMD the right to use the original x86 architecture (upon which AMD's x86-64 is based). In 2009, AMD and Intel settled several lawsuits and cross-licensing disagreements, extending their cross-licensing agreements.