doc: format l1tf.rst

Format this file to address review comments.

Tracked-On: #1672
Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>
This commit is contained in:
Yonghua Huang 2018-11-03 02:39:35 +08:00 committed by David Kinder
parent d6247ff721
commit 2c85480c2f

View File

@ -1,7 +1,18 @@
.. _l1tf: .. _l1tf:
L1TF Overview L1 Terminal Fault Mitigation
############# ############################
Overview
********
Refer to `Intel Analysis of L1TF`_ and `Linux L1TF document`_ for details.
.. _Intel Analysis of L1TF:
https://software.intel.com/security-software-guidance/insights/deep-dive-intel-analysis-l1-terminal-fault
.. _Linux L1TF document:
https://github.com/torvalds/linux/blob/master/Documentation/admin-guide/l1tf.rst
L1 Terminal Fault is a speculative side channel which allows unprivileged L1 Terminal Fault is a speculative side channel which allows unprivileged
speculative access to data which is available in the Level 1 Data Cache speculative access to data which is available in the Level 1 Data Cache
@ -27,7 +38,7 @@ Because the resulting probed physical address is not a true translation of
the virtual address, the resulting address is not constrained by various the virtual address, the resulting address is not constrained by various
memory range checks or nested translations. Specifically: memory range checks or nested translations. Specifically:
* Intel SGX protected memory checks are not applied. * Intel® SGX protected memory checks are not applied.
* Extended Page Table (EPT) guest physical to host physical address * Extended Page Table (EPT) guest physical to host physical address
translation is not applied. translation is not applied.
* SMM protected memory checks are not applied. * SMM protected memory checks are not applied.
@ -35,22 +46,13 @@ memory range checks or nested translations. Specifically:
The following CVE entries are related to the L1TF: The following CVE entries are related to the L1TF:
============= ================= ============================== ============= ================= ==============================
CVE-2018-3615 L1 Terminal Fault SGX related aspects CVE-2018-3615 L1 Terminal Fault Intel® SGX related aspects
CVE-2018-3620 L1 Terminal Fault OS, SMM related aspects CVE-2018-3620 L1 Terminal Fault OS, SMM related aspects
CVE-2018-3646 L1 Terminal Fault Virtualization related aspects CVE-2018-3646 L1 Terminal Fault Virtualization related aspects
============= ================= ============================== ============= ================= ==============================
Please refer to `Intel Analysis of L1TF`_ and `Linux L1TF document`_ for
more details.
.. _Intel Analysis of L1TF:
https://software.intel.com/security-software-guidance/insights/deep-dive-intel-analysis-l1-terminal-fault
.. _Linux L1TF document:
https://github.com/torvalds/linux/blob/master/Documentation/admin-guide/l1tf.rst
L1TF Problem in ACRN L1TF Problem in ACRN
#################### ********************
There are mainly three attack scenarios considered in ACRN: There are mainly three attack scenarios considered in ACRN:
@ -62,48 +64,35 @@ Malicious user space is not a concern to ACRN hypervisor, because
every guest runs in VMX non-root. It is reponsibility of guest kernel every guest runs in VMX non-root. It is reponsibility of guest kernel
to protect itself from malicious user space attack. to protect itself from malicious user space attack.
SGX/SMM related attacks are mitigated by using latest ucode. There is Intel® SGX/SMM related attacks are mitigated by using latest microcode.
no additional action in ACRN hypervisor. There is no additional action in ACRN hypervisor.
Guest -> hypervisor Attack Guest -> hypervisor Attack
****************************** ==========================
ACRN always enables EPT for all guests (SOS and UOS), thus a malicious ACRN always enables EPT for all guests (SOS and UOS), thus a malicious
guest can directly control guest PTEs to construct L1TF-based attack guest can directly control guest PTEs to construct L1TF-based attack
to hypervisor. Alternatively if ACRN EPT are not sanitized with some to hypervisor. Alternatively if ACRN EPT is not sanitized with some
PTEs (with present bit cleared, or reserved bit set) pointing to valid PTEs (with present bit cleared, or reserved bit set) pointing to valid
host PFNs, a malicious guest may use those EPT PTEs to construct attack. host PFNs, a malicious guest may use those EPT PTEs to construct an attack.
A special aspect of L1TF in the context of virtualization is symmetric A special aspect of L1TF in the context of virtualization is symmetric
multi threading (SMT), e.g. Intel(R) Hyper-Threading Technology. multi threading (SMT), e.g. Intel® Hyper-Threading Technology.
Logical processors on the affected physical cores share the L1 Data Cache Logical processors on the affected physical cores share the L1 Data Cache
(L1D). This fact could make more variants of L1TF-based attack, e.g. (L1D). This fact could make more variants of L1TF-based attack, e.g.
a malicious guest running on one logical processor can attack the data which a malicious guest running on one logical processor can attack the data which
is brought into L1D by the context which runs on the sibling thread of is brought into L1D by the context which runs on the sibling thread of
the same physical core. This context can be any code in hypervisor. the same physical core. This context can be any code in hypervisor.
--secure data in ACRN hypervisor--
It is hard to decide which data in ACRN hypervisor is secret or valuable
data. The amount of valuable data from ACRN contexts cannot be declared as
non-interesting for an attacker without deep inspection of the code.
But obviously, the most import secret data in ACRN is the physical platform
seed generated from CSME and virtual seeds which are derived from that
platform seed. They are critical secrets to serve for guest keystore or
other security usage, e.g. disk encryption, secure storage.
Guest -> guest Attack Guest -> guest Attack
****************************** =====================
A malicious guest may use the same side channel as introduced in The possibility of guest -> guest attack varies on specific configuration,
last section to attack other guests. The possibility of guest->guest e.g. whether CPU partitioning is used, whether Hyper-Threading is on, etc.
attack varies on specific configuration, e.g. whether CPU partitioning
is used, whether Hyper-Threading is on, etc.
If CPU partitioning is enabled (default policy in ACRN), there is If CPU partitioning is enabled (default policy in ACRN), there is
1:1 mapping between vCPUs and pCPUs i.e. no sharing on pCPU. Then 1:1 mapping between vCPUs and pCPUs i.e. no sharing of pCPU. There
the only attack possibility is when Hyper-Threading is on, where may be an attack possibility when Hyper-Threading is on, where
logical processors of same physical core may be allocated to two logical processors of same physical core may be allocated to two
different guests. Then one guest may be able to attack the other guest different guests. Then one guest may be able to attack the other guest
on sibling thread due to shared L1D. on sibling thread due to shared L1D.
@ -113,34 +102,34 @@ same pCPU thus next VM may steal information in L1D which comes
from activity of previous VM on the same pCPU. from activity of previous VM on the same pCPU.
Normal_world -> Secure_world Attack Normal_world -> Secure_world Attack
********************************* ===================================
ACRN supports Android guest, which requires two running worlds ACRN supports Android guest, which requires two running worlds
(normal world vs. secure world). Two worlds run on the same CPU, (normal world and secure world). Two worlds run on the same CPU,
and world switch is conducted on demand. Then it is possible for and world switch is conducted on demand. It could be possible for
normal world to construct L1TF-based stack to secure world, thus normal world to construct an L1TF-based stack to secure world,
break the security model as expected by Android guest. breaking the security model as expected by Android guest.
Affected Processors Affected Processors
****************************** ===================
L1TF affects a range of Intel processors, but Intel ATOM processors L1TF affects a range of Intel processors, but Intel ATOM® processors
(including Apollo Lake) are immune to it. Currently ACRN hypervisor (including Apollo Lake) are immune to it. Currently ACRN hypervisor
supports only Apollo Lake, but other core-based platforms may be also supports only Apollo Lake. Support for other core-based platforms is
supported in the future so we still need a mitigation plan in ACRN. planned, so we still need a mitigation plan in ACRN.
Processors that have the RDCL_NO bit set to one (1) in the Processors that have the RDCL_NO bit set to one (1) in the
IA32_ARCH_CAPABILITIES MSR are not susceptible to the L1TF IA32_ARCH_CAPABILITIES MSR are not susceptible to the L1TF
speculative exectuion side channel. speculative execution side channel.
Please refer to `Intel Analysis of L1TF`_ for more details. Please refer to `Intel Analysis of L1TF`_ for more details.
L1TF Mitigation in ACRN L1TF Mitigation in ACRN
####################### ***********************
The basic assumption is to use latest ucode, which mitigates SMM Use the latest microcode, which mitigates SMM and Intel® SGX cases
and SGX cases while also providing necessary capability for VMM while also providing necessary capability for VMM to use for further
to use for further mitigation. mitigation.
ACRN will check the platform capability based on `CPUID enumeration ACRN will check the platform capability based on `CPUID enumeration
and architectural MSR`_. For L1TF affected platform (CPUID.07H.EDX.29 and architectural MSR`_. For L1TF affected platform (CPUID.07H.EDX.29
@ -150,27 +139,13 @@ must be supported.
.. _CPUID enumeration and architectural MSR: .. _CPUID enumeration and architectural MSR:
https://software.intel.com/security-software-guidance/insights/deep-dive-cpuid-enumeration-and-architectural-msrs https://software.intel.com/security-software-guidance/insights/deep-dive-cpuid-enumeration-and-architectural-msrs
Not all of below mitigations will be implemented. Even for Not all of mitigations below will be implemented in ACRN, and
implemented mitigations not all of them apply in a given ACRN not all of them apply to a specific ACRN deployment. Check the
deployment. Please always check *status* section and 'Mitigation Status'_ and 'Mitigation Recommendations'_ sections
*recommendation* section for detail guidance. for guidance.
EPT Sanitization
****************
EPT is sanitized to avoid pointing to valid host memory in PTEs
which has present bit cleared or reserved bits set.
For non-present PTEs, ACRN currently set pfn bits to ZERO, which
means page ZERO might fall into risk if containing security info.
ACRN reserves page ZERO (0~4K) from page allocator thus page ZERO
won't be used by anybody for valid usage. This sanitization logic
is always enabled on all platforms.
ACRN hypervisor doesn't set reserved bits in any EPT entry.
L1D flush on VMENTRY L1D flush on VMENTRY
************************** ====================
ACRN may optionally flush L1D at VMENTRY, which ensures no ACRN may optionally flush L1D at VMENTRY, which ensures no
sensitive information from hypervisor or previous VM revealed sensitive information from hypervisor or previous VM revealed
@ -186,8 +161,31 @@ Due to such performance reason, ACRN provides a config option
(L1D_FLUSH_VMENTRY) to enable/disable L1D flush during (L1D_FLUSH_VMENTRY) to enable/disable L1D flush during
VMENTRY. By default this option is disabled. VMENTRY. By default this option is disabled.
EPT Sanitization
================
EPT is sanitized to avoid pointing to valid host memory in PTEs
which has present bit cleared or reserved bits set.
For non-present PTEs, ACRN currently set pfn bits to ZERO, which
means page ZERO might fall into risk if containing security info.
ACRN reserves page ZERO (0~4K) from page allocator thus page ZERO
won't be used by anybody for valid usage. This sanitization logic
is always enabled on all platforms.
ACRN hypervisor doesn't set reserved bits in any EPT entry.
Put Secret Data into Uncached Memory Put Secret Data into Uncached Memory
************************************ ====================================
It is hard to decide which data in ACRN hypervisor is secret or valuable
data. The amount of valuable data from ACRN contexts cannot be declared as
non-interesting for an attacker without deep inspection of the code.
But obviously, the most import secret data in ACRN is the physical platform
seed generated from CSME and virtual seeds which are derived from that
platform seed. They are critical secrets to serve for guest keystore or
other security usage, e.g. disk encryption, secure storage.
If the critical secret data in ACRN is identified, then such If the critical secret data in ACRN is identified, then such
data can be put into un-cached memory. As the content will data can be put into un-cached memory. As the content will
@ -207,7 +205,7 @@ However if such 100% identification is not possible, user should
consider other mitigation options to protect hypervisor. consider other mitigation options to protect hypervisor.
L1D flush on World Switch L1D flush on World Switch
************************** =========================
For L1D-affected platforms, ACRN writes to aforementioned MSR For L1D-affected platforms, ACRN writes to aforementioned MSR
to flush L1D when switching from secure world to normal world. to flush L1D when switching from secure world to normal world.
@ -221,47 +219,43 @@ normal world is less privileged entity to secure world.
This mitigation is always enabled. This mitigation is always enabled.
Core-based scheduling Core-based scheduling
*********************** =====================
If Hyper-Threading is enabled, there is no easy method to mitigate If Hyper-Threading is enabled, it's important to avoid running
L1TF attack from a sibling processor on the same physical core. sensitive context (if containing security data which a given VM
has no premission to access) on the same physical core that runs
said VM. It requires scheduler enhancement to enable core-based
scheduling policy, so all threads on the same core are always
scheduled to the same VM. Also there are some further actions
required to protect hypervisor and secure world from sibling
attacks in core-based scheduler.
A basic idea is to avoid running sensitive context (if containing .. note:: There is no current plan to implement this scheduling
security data which a given VM has no premission to access) on policy. The ACRN community will evaluate the need for this based
the same physical core that runs said VM. It requires scheduler on usage requirements and hardware platform status.
enhancement to enable core-based scheduling policy, so all threads
on the same core are always scheduled to the same VM. Also there
are some further actions required to protect hypervisor and
secure world from sibling attacks in core-based scheduler.
Please note there is no commitment of implementation it so far.
ACRN community will keep evaluating this part based on usage
requirements and hardware platform status.
Mitigation Recommendations Mitigation Recommendations
########################## **************************
There is no mitigation required on Apollo Lake based platforms. There is no mitigation required on Apollo Lake based platforms.
For other affected platforms:
The majority use case for ACRN is in pre-configured environment, The majority use case for ACRN is in pre-configured environment,
where the whole software stack (from ACRN hypervisor to guest where the whole software stack (from ACRN hypervisor to guest
kernel to SOS root) is tightly controlled by solution provider kernel to SOS root) is tightly controlled by solution provider
and not allowed for run-time change after sale (guest kernel is and not allowed for run-time change after sale (guest kernel is
sort of trusted). In that case solution provider will make sure trusted). In that case solution provider will make sure that guest
that guest kernel is up-to-date including necessary page table kernel is up-to-date including necessary page table sanitization,
sanitization, thus there is no attack interface exposed within thus there is no attack interface exposed within guest. Then a
guest. Then a minimal mitigation configuration is sufficient minimal mitigation configuration is sufficient with negligible
with negligible performance impact, as explained below: performance impact, as explained below:
1) Use latest ucode 1) Use latest microcode
2) Guest kernel is up-to-date with page table sanitization 2) Guest kernel is up-to-date with page table sanitization
3) EPT sanitization (always enabled) 3) EPT sanitization (always enabled)
4) Flush L1D at world switch (Android specific, always enabled) 4) Flush L1D at world switch (Android specific, always enabled)
In case that someone wants to deploy ACRN into an open environment In case that someone wants to deploy ACRN into an open environment
where guest kernel is considered untrusted. There are more where guest kernel is considered untrusted, there are more
mitigation options required according to the specific usage mitigation options required according to the specific usage
requirements: requirements:
@ -270,16 +264,20 @@ requirements:
- Doing 5) is not feasible, or - Doing 5) is not feasible, or
- CPU sharing is enabled (in the future) - CPU sharing is enabled (in the future)
If Hyper-Threading is enabled, there is no available option If Hyper-Threading is enabled, there is no available mitigation
before core scheduling is planned. User should understand option before core scheduling is planned. User should understand
the security implication and only turn on Hyper-Threading the security implication and only turn on Hyper-Threading
when the potential risk is acceptable to their usage. when the potential risk is acceptable to their usage.
Status Mitigation Status
###### *****************
EPT sanitization: supported =========================== =============
L1D flush on VMENTRY: supported Mitigation status
L1D flush on world switch: supported =========================== =============
Uncached security data: n/a EPT sanitization supported
Core scheduling: n/a L1D flush on VMENTRY supported
L1D flush on world switch supported
Uncached security data n/a
Core scheduling n/a
=========================== =============