acrn-hypervisor

mirror of https://github.com/projectacrn/acrn-hypervisor.git synced 2025-12-21 04:12:32 +00:00

Author	SHA1	Message	Date
Victor Sun	33c864fae6	HV: modularization: rename mi_acpi_rsdp_va to acpi_rsdp_va The simply rename mi_acpi_rsdp_va in acrn_boot_info struct to acpi_rsdp_va; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	c09e55c376	HV: modularizatoin: use abi_efi_info struct in acrn boot info Previousely the efi_info structure in acrn_boot_info struct is defined as same as Linux kernel so that the native efi info from host could be passed to SOS zeropage with memcpy() api directly. Now replace it with abi_efi_info struct to make the content more generic; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	57d7225901	HV: modularization: use abi_mmap struct in acrn boot info Use more generic abi_mmap struct to replace multiboot_mmap struct in acrn_boot_info; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	45d56a8173	HV: modularization: use abi_module struct in acrn boot info Use more generic abi_module struct to replace multiboot_module struct in acrn_boot_info; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	aa5502c5a4	HV: modularization: rename mi_loader_name to loader_name The patch has below changes: 1. rename mi_loader_name in acrn_boot_info struct to loader_name; 2. remove mi_drivers_length and mi_drivers_addr which are never used; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	acff2959ff	HV: modularization: rename mi_cmdline to cmdline in acrn boot info The name of mi_cmdline in acrn_boot_info structure would cause confusion with mi_cmdline in multiboot_info structure, rename it to cmdline; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	56c52d0b95	HV: modularization: remove mi_flags from acrn boot info The mi_flags is not needed any more so remove it from acrn_boot_info struct; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	23222d0192	HV: modularization: rename multiboot.h to boot.h Given the structure in multiboot.h could be used for any boot protocol, use a more generic name "boot.h" instead; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	fd75fcdef0	HV: modularization: add boot.c to wrap multiboot module Add a wrapper API init_acrn_boot_info() so that it could be used to boot ACRN with any boot protocol; Another change is change term of multiboot1 to multiboot because there is no such term officially; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	c9841d7618	HV: modularization: name change on acrn_multiboot_info The acrn_multiboot_info structure stores acrn specific boot info and should not be limited to support multiboot protocol related structure only. This patch only do below changes: 1. change name of acrn_multiboot_info to acrn_boot_info; 2. change name of mbi to abi because of the change in 1, also the naming might bring confusion with native multiboot info; Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-06-04 16:29:33 +08:00
Victor Sun	bc828dff19	HV: modularization: move module check to sanitize multiboot info ACRN used to support deprivileged boot mode which do not need multiboot modules, while direct boot mode need multiboot modules at lease for service VM bzImage, so ACRN postponed the multiboot modules sanity check in init_vm_boot_info. Now deprivileged boot mode was totally removed, so we can do multiboot module check in sanitize_acrn_multiboot_info(). Tracked-On: #5661 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-04 16:29:33 +08:00
Yonghua Huang	31980ce345	hv: validate input for dmar_free_irte function Malicious input 'index' may trigger buffer overflow on array 'irte_alloc_bitmap[]'. This patch validate that 'index' shall be less than 'CONFIG_MAX_IR_ENTRIES' and also remove unnecessary check on 'index' in function 'ptirq_free_irte()' function with this fix. Tracked-On: #6132 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-06-04 09:32:07 +08:00
Yonghua Huang	4a624e3b42	hv: remove unnecessary ASSERT in vlapic_write vlapic_write handle 'offset' that is valid and ignore all other invalid 'offset'. so ASSERT on this 'offset' input is unnecessary. This patch removes above ASSERT to avoid potential hypervisor crash by guest malicious input when debug build is used. Tracked-On: #6131 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-06-04 09:32:07 +08:00
Tao Yuhong	847affb30a	HV: VPCI: Fix too much of pci_vdev_update_vbar_base log Should call vpci_init_vdevs() after mmio window resource has been initialized Tracked-On: #6011 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com>	2021-06-04 08:52:48 +08:00
Shuo A Liu	da288a670b	hv: nested: audit guest EPT mapping during shadow EPT entries setup generate_shadow_ept_entry() didn't verify the correctness of the requested guest EPT mapping. That might leak host memory access to L2 VM. To simplify the implementation of the guest EPT audit, hide capabilities 'map 2-Mbyte page' and 'map 1-Gbyte page' from L1 VM. In addition, minimize the attribute bits of EPT entry when create a shadow EPT entry. Also, for invalid requested mapping address, reflect the EPT_VIOLATION to L1 VM. Here, we have some TODOs: 1) Enable large page support in generate_shadow_ept_entry() 2) Evaluate if need to emulate the invalid GPA access of L2 in HV directly. 3) Minimize EPT entry attributes. Tracked-On: #5923 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	fd81937bd5	hv: nested: INVEPT emulation supports shadow EPT L1 VM changes the guest EPT and do INVEPT to invalidate the previous TLB cache of EPT entries. The shadow EPT replies on INVEPT instruction to do the update. The target shadow EPTs can be found according to the 'type' of INVEPT. Here are two types and their target shadow EPT, 1) Single-context invalidation Get the EPTP from the INVEPT descriptor. Then find the target shadow EPT. 2) Global invalidation All shadow EPTs of the L1 VM. The INVEPT emulation handler invalidate all the EPT entries of the target shadow EPTs. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	bc4dca179a	hv: nested: Introduce shadow EPT release function When a shadow EPT is not used anymore, its resources need to be released. free_sept_table() is introduced to walk the whole shadow EPT table and free the pagetable pages. Please note, the PML4E page of shadow EPT is not freed by free_sept_table() as it still be used to present a shadow EPT pointer. Tracked-On: #5923 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	8c2eea94b8	hv: nested: Introduce L2 VM EPT VIOLATION handler With shadow EPT, the hypervisor walks through guest EPT table: * If the entry is not present in guest EPT, ACRN injects EPT_VIOLATION to L1 VM and resumes to L1 VM. * If the entry is present in guest EPT, do the EPT_MISCONFIG check. Inject EPT_MISCONFIG to L1 VM if the check failed. * If the entry is present in guest EPT, do permission check. Reflect EPT_VIOLATION to L1 VM if the check failed. * If the entry is present in guest EPT but shadow EPT entry is not present, create the shadow entry and resumes to L2 VM. * If the entry is present in guest EPT but the GPA in the entry is invalid, injects EPT_VIOLATION to L1 VM and resumes L1 VM. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	5de2c69c62	hv: nested: Hide some capability bits from L1 guest * Hide 5 level EPT capability, let L1 guest stick to 4 level EPT. * Access/Dirty bits are not support currently, hide corresponding EPT capability bits. * "Mode-based execute control for EPT" is also not support well currently, hide its capability bit from MSR_IA32_VMX_PROCBASED_CTLS2. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	1b41735f48	hv: nested: Manage shadow EPTP according to guest VMCS change 'struct nept_desc' is used to associate guest EPTP with a shadow EPTP. It's created in the first reference and be freed while no reference. The life cycle seems like, While guest VMCS VMX_EPT_POINTER_FULL is changed, the 'struct nept_desc' of the new guest EPTP is referenced; the 'struct nept_desc' of the old guest EPTP is dereferenced. While guest VMCS be cleared(by VMCLEAR in L1 VM), the 'struct nept_desc' of the old guest EPTP is dereferenced. While a new guest VMCS be loaded(by VMPTRLD in L1 VM), the 'struct nept_desc' of the new guest EPTP is referenced. The 'struct nept_desc' of the old guest EPTP is dereferenced. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	9ebfc3adb1	hv: nested: Introduce shadow EPT infrastructure To shadow guest EPT, the hypervisor needs construct a shadow EPT for each guest EPT. The key to associate a shadow EPT and a guest EPT is the EPTP (EPT pointer). This patch provides following structure to do the association. struct nept_desc { /* * A shadow EPTP. * The format is same with 'EPT pointer' in VMCS. * Its PML4 address field is a HVA of the hypervisor. / uint64_t shadow_eptp; / * An guest EPTP configured by L1 VM. * The format is same with 'EPT pointer' in VMCS. * Its PML4 address field is a GPA of the L1 VM. */ uint64_t guest_eptp; uint32_t ref_count; }; Due to lack of dynamic memory allocation of the hypervisor, a array nept_bucket of type 'struct nept_desc' is introduced to store those association information. A guest EPT might be shared between different L2 vCPUs, so this patch provides several functions to handle the reference of the structure. Interface get_shadow_eptp() also is introduced. To find the shadow EPTP of a specified guest EPTP. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Shuo A Liu	4bde5031e7	hv: nested: Create a page pool for shadow EPT construction Shadow EPT uses lots of pages to construct the shadow page table. To utilize the memory more efficient, a page poll sept_page_pool is introduced. For simplicity, total platform RAM size is considered to calculate the memory needed for shadow page tables. This is not an accurate upper bound. This can satisfy typical use-cases where there is not a lot of overcommitment and sharing of memory between L2 VMs. Memory of the pool is marked as reserved from E820 table in early stage. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 16:54:45 +08:00
Zide Chen	d3f4342725	hv: nested: implement nested VM exit handler Nested VM exits happen when vCPU is in guest mode (VMCS02 is current). Initially we reflect all nested VM exits to L1 hypervisor. To prepare the environment to run L1 guest: - restore some VMCS fields to the value as what L1 hypervisor programmed. - VMCLEAR VMCS02, VMPTRLD VMCS01 and enable VMCS shadowing. - load the non-shadowing host states from VMCS12 to VMCS01 guest states. - VMRESUME to L1 guest with this modified VMCS01. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Alexander Merritt <alex.merritt@intel.com>	2021-06-03 11:46:12 +08:00
Zide Chen	6224678e20	hv: nested: update run_vcpu() function for nested case Since L2 guest vCPU mode and VPID are managed by L1 hypervisor, so we can skip these handling in run_vcpu(). And be careful that we can't cache L2 registers in struct acrn_vcpu. Tracked-On: #5923 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 11:46:12 +08:00
Zide Chen	f97e8738fc	hv: nested: support for INVEPT and INVVPID emulation invvpid and invept instructions cause VM exits unconditionally. For initial support, we pass all the instruction operands as is to the pCPU. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 11:46:12 +08:00
Zide Chen	63041de4d9	hv: nested: support for VMLAUNCH and VMRESUME emulation Implement the VMLAUNCH and VMRESUME instructions, allowing a L1 hypervisor to run nested guests. - merge VMCS control fields and VMCS guest fields to VMCS02 - clear shadow VMCS indicator on VMCS02 and load VMCS02 as current - set VMCS12 launch state to "launched" in VMLAUNCH handler Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Alex Merritt <alex.merritt@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-06-03 11:46:12 +08:00
Yonghua Huang	1ba5b64b07	hv: update RTCT ACPI table detecting Signature of RTCT ACPI table maybe "PTCT"(v1) or "RTCT"(v2). and the MAGIC number in CRL header is also changed from "PTCM" to "RTCM". This patch refine the code to detect RTCT table for both v1 and v2. Tracked-On: #6020 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-05-27 15:39:32 +08:00
Tao Yuhong	c1100a63d5	HV: vpci: refine vbar sizing For a pci BAR, its size aligned bits have fixed to 0(except the memory type bits, they have another fixed value), they are read-only. When write ~0U to BAR for sizing, (type_bits \| size_mask) is written into BAR. So do not need to distinguish between sizing vBAR and programming vBAR. When write a value to vBAR, always store (value & size_mask \| type_bit) to vfcg. pci_vdev_read_vbar() is unnecessary, because it is only need to read vcfg. Tracked-On: #6011 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Li Fei <fei1.li@intel.com>	2021-05-27 14:02:07 +08:00
Tao Yuhong	ae9b4fcb0a	HV: vpci: check if address is in VM BAR MMIO space When guest doing BAR re-programming, we should check whether the base address of the BAR is valid.This patch does this check by: 1. whether the gpa is located in the responding MMIO window 2. whether the gpa is aligned with the BAR size Tracked-On: #6011 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Li Fei <fei1.li@intel.com>	2021-05-27 14:02:07 +08:00
Tao Yuhong	8cf4ac944d	HV: vpci: Fix do not mask I/O BAR upper 16-bit Now we use pci_vdev_update_vbar_base to update vBAR base address when guest re-programming BAR. For a IO BAR, we would calculate the 32 bits base address then mask the high 16 bits. However, the mask code would never be called since the first if condition statement is always true. This patch fix it by move the unamsk code into the first if condition statement. Tracked-On: #6011 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Li Fei <fei1.li@intel.com>	2021-05-27 14:02:07 +08:00
Li Fei1	239dba7bf1	hv: vlapic: a minor refine about vlapic_x2apic_pt_icr_access In physical destination mode, the destination processor is specified by its local APIC ID. When a CPU switch xAPIC Mode to x2APIC Mode or vice versa, the local APIC ID is not changed. So a vcpu in x2APIC Mode could use physical Destination Mode to send an IPI to another vcpu in xAPIC Mode by writing ICR. This patch adds support for a vCPU A could write ICR to send IPI to another vCPU B which is in different APIC mode. Tracked-On: #5923 Signed-off-by: Li Fei1 <fei1.li@intel.com>	2021-05-27 11:07:02 +08:00
Yifan Liu	8f640c61ae	hv: restore the hypervisor 64-bit entry address for efi-stub EFI-stub is required again to support secure boot on certain platform. Tracked-On: #6078 Signed-off-by: Toshiki Nishioka <toshiki.nishioka@intel.com> Signed-off-by: Yifan Liu <yifan1.liu@intel.com>	2021-05-27 08:56:05 +08:00
Rong Liu	d51ac9d368	hv: ensure PTM root is always enabled in hw For post launch VM, ACRN supports PTM under these conditions: 1. HW implements a simple PTM hierarchy: PTM requestor device (ep) is directly connected to PTM root capable root port. Or 2. ptm requestor itself is root complex integrated ep. Currently acrn doesn't support emulation of other type of PTM hiearchy, such as if there is an intermediate PTM node (for example, switch) inbetween PTM requestor and PTM root. To avoid VM touching physical hardware, acrn hv ensures PTM is always enabled in the hardware. During hv's pci init, if root port is ptm capable, hv will enable PTM on that root port. In addition, log error (and don't enable PTM) if ptm root capability is on intermediate node other than root port. V2: - Modify commit messages to clarify the limitation of current PTM implementation. - Fix code that may fail FUSA - Remove pci_ptm_info() and put info log inside pci_enable_ptm_root(). Tracked-On: #5915 Signed-off-by: Rong Liu <rong.l.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-05-26 14:35:43 +08:00
dongshen	afdfc4f38f	hv: hypercall: 72 reserved bytes are used in struct hc_platform_info to pass physical APIC and cat IDs shift to DM So that DM can retrieve physical APIC IDs and use them to fill in the ACPI MADT table for post-launched VMs. Note: 1. DM needs to use the same logic as hypervisor to calculate vLAPIC IDs based on physical APIC IDs and CPU affinity setting 2. Using reserved0[] in struct hc_platform_info to pass physical APIC IDs means we can only support at most 116 cores. And it assumes LAPIC ID is 8bits (X2APIC mode supports 32 bits). Cat IDs shift will be used by DM RTCT V2 Tracked-On: #6020 Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Signed-off-by: dongshen <dongsheng.x.zhang@intel.com>	2021-05-26 13:24:11 +08:00
dongshen	7c8c3b2b18	hv: vcpuid: passthrough host CPUID leaf.0BH to guest VMs Using physical APIC IDs as vLAPIC IDs for pre-Launched and post-launched VMs is not sufficient to replicate the host CPU and cache topologies in guest VMs, we also need to passthrough host CPUID leaf.0BH to guest VMs, otherwise, guest VMs may see weird CPU topology. Note that in current code, ACRN has already passthroughed host cache CPUID leaf 04H to guest VMs Tracked-On: #6020 Reviewed-by: Wang, Yu1 <yu1.wang@intel.com> Signed-off-by: dongshen <dongsheng.x.zhang@intel.com>	2021-05-26 13:24:11 +08:00
dongshen	4572fd09d4	hv: vlapic: use physical APIC IDs as vLAPIC IDs for pre-launched and post-launched VMs In current code, ACRN uses physical APIC IDs as vLAPIC IDs for SOS, and vCPU ids (contiguous) as vLAPIC IDs for pre-Launched and post-Launched VMs. Using vCPU ids as vLAPIC IDs for pre-Launched and post-Launched VMs would result in wrong CPU and cache topologies showing in the guest VMs, and could adversely affect performance if the guest VM chooses to detect CPU and cache topologies and optimize its behavior accordingly. Uses physical APIC IDs as vLAPIC IDs (and related CPU/cache topology enumeration CPUIDs passthrough) will replicate the host CPU and cache topologies in pre-Launched and post-Launched VMs. Tracked-On: #6020 Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Signed-off-by: dongshen <dongsheng.x.zhang@intel.com>	2021-05-26 13:24:11 +08:00
Yang,Yu-chu	4eb36c150d	config-tools: build acrn with xslt generated pci_dev.c and board_info.h Build acrn using xslt transformed pci_dev.c and board_info.h. Tracked-On: #6024 Signed-off-by: Yang,Yu-chu <yu-chu.yang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2021-05-24 16:55:46 +08:00
Zide Chen	1ca0c7549e	hv: nested: support for VMREAD and VMWRITE emulation This patch implements the VMREAD and VMWRITE instructions. When L1 guest is running with an active VMCS12, the “VMCS shadowing” VM-execution control is always set to 1 in VMCS01. Thus the possible behavior of VMREAD or VMWRITE from L1 could be: - It causes a VM exit to L0 if the bit corresponds to the target VMCS field in the VMREAD bitmap or VMWRITE bitmap is set to 1. - It accesses the VMCS referenced by VMCS01 link pointer (VMCS02 in our case) if the above mentioned bit is set to 0. This patch handles the VMREAD and VMWRITE VM exits in this way: - on VMWRITE, it writes the desired VMCS value to the respective field in the cached VMCS12. For VMCS fields that need to be synced to VMCS02, sets the corresponding dirty flag. - on VMREAD, it reads the desired VMCS value from the cached VMCS12. Tracked-On: #5923 Signed-off-by: Alex Merritt <alex.merritt@intel.com> Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	2c17b88c95	hv: nested: support for VMCLEAR emulation This patch is to emulate VMCLEAR instruction. L1 hypervisor issues VMCLEAR on a VMCS12 whose state could be any of these: active and current, active but not current, not yet VMPTRLDed. To emulate the VMCLEAR instruction, ACRN sets the VMCS12 launch state to "clear", and if L0 already cached this VMCS12, need to sync it back to guest memory: - sync shadow fields from shadow VMCS VMCS to cache VMCS12 - copy cache VMCS12 to L1 guest memory Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	91ecb6fcc6	hv: nested: define VMCS shadow fields Enable VMCS shadowing for most of the VMCS fields, so that execution of the VMREAD or VMWRITE on these shadow VMCS fields from L1 hypervisor won't cause VM exits, but read from or write to the shadow VMCS. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Alexander Merritt <alex.merritt@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	7058ae5193	hv: nested: define software layout for VMCS12 and helper functions Software layout of VMCS12 data is a contract between L1 guest and L0 hypervisor to run a L2 guest. ACRN hypervisor caches the VMCS12 which is passed down from L1 hypervisor by the VMPTRLD instructin. At the time of VMCLEAR, ACRN syncs the cached VMCS12 back to L1 guest memory. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	51699448a2	hv: nested: support for VMPTRLD emulation This patch emulates the VMPTRLD instruction. L0 hypervisor (ACRN) caches the VMCS12 that is passed down from the VMPTRLD instruction, and merges it with VMCS01 to create VMCS02 to run the nested VM. - Currently ACRN can't cache multiple VMCS12 on one vCPU, so it needs to flushes active but not current VMCS12s to L1 guest. - ACRN creates VMCS02 to run nested VM based on VMCS12: 1) copy VMCS12 from guest memory to the per vCPU cache VMCS12 2) initialize VMCS02 revision ID and host-state area 3) load shadow fields from cache VMCS12 to VMCS02 4) enable VMCS shadowing before L1 Vm entry Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	7cbc59bde5	hv: nested: support for VMXOFF emulation This patch implements the VMXOFF instruction. By issuing VMXOFF, L1 guest Leaves VMX Operation. - cleanup VCPU nested virtualization context states in VMXOFF handler. - implement check_vmx_permission() to check permission for VMX operation for VMXOFF and other VMX instructions. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	4f2e5f237b	hv: nested: check prerequisites to enter VMX operation According to VMXON Instruction Reference, do the following checks in the virtual hardware environment: vCPU CPL, guest CR0, CR4, revision ID in VMXON region, etc. Currently ACRN doesn't support 32-bit L1 hypervisor, and injects an #UD exception if L1 hypervisor is not running in 64-bit mode. Tracked-On: #5923 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-05-24 15:30:46 +08:00
Zide Chen	55d9161138	hv: nested: support for VMXON emulation This patch emulates VMXON instruction. Basically checks some prerequisites to enable VMX operation on L1 guest (next patch), and prepares some virtual hardware environment in L0. Tracked-On: #5923 Signed-off-by: Sainath Grandhi <sainath.grandhi@intel.com> Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2021-05-24 15:30:46 +08:00
Li Fei1	309bdddd3c	hv: vlapic: extend vlapic_x2apic_pt_icr_access to support more destination mode Now guest would use `Destination Shorthand` to broadcast IPIs if there're more than one destination. However, it is not supported when the guest is in LAPIC passthru situation, and all active VCPUs are working in X2APIC mode. As a result, the guest would not work properly since this kind broadcast IPIs was ignored by ACRN. What's worse, ACRN Hypervisor would inject GP to the guest in this case. This patch extend vlapic_x2apic_pt_icr_access to support more destination modes (both `Physical` and `Logical`) and destination shorthand (`No Shorthand`, `Self`, `All Including Self` and `All Excluding Self`). Tracked-On: #5923 Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-05-24 10:27:16 +08:00
Li Fei1	153ce8ee48	hv: vlapic: wrap a function to calculate destination vcpu mask by shorthand 1. Rename vlapic_calc_dest to vlapic_calc_dest_noshort 2. Remove vlapic_calc_dest_lapic_pt, use vlapic_calc_dest_noshort instead 3. Wrap vlapic_calc_dest to calculate destination vcpu mask according shorthand Tracked-On: #5923 Signed-off-by: Zide Chen <zide.chen@intel.com> Signed-off-by: Li Fei1 <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-05-24 10:27:16 +08:00
Tao Yuhong	f782a8cc39	HV: cache: Fix page fault by flushing cache for VM trusty RAM in HV The accrss right of HV RAM can be changed to PAGE_USER (eg. trusty RAM of post-launched VM). So before using clflush(or clflushopt) to flush HV RAM cache, must allow explicit supervisor-mode data accesses to user-mode pages. Otherwise, it may trigger page fault. Tracked-On: #6020 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com>	2021-05-21 13:35:45 +08:00
Victor Sun	5df3455e8d	HV: remove SOS kernel hugepage related bootargs Hypervisor does not need to care about hugepage settings in SOS kernel, user could enable these settings in the scenario config file or GRUB menu. Tracked-On: #5815 Signed-off-by: Victor Sun <victor.sun@intel.com>	2021-05-20 13:31:56 +08:00
Victor Sun	ef737c3089	HV: refine init_vm_bootargs_info() changes: 1. The VM load order type condition is not needed, since the function is called only when create SOS VM or pre-launched VM; 2. Fixed wrong parameter of fill_seed_arg() which introduced by commit `80262f0602`. 3. More comments on why multiboot string could override the pre- configured VM bootargs and why append multiboot cmdline to SOS VM bootargs; Tracked-On: #5815 Signed-off-by: Victor Sun <victor.sun@intel.com>	2021-05-20 13:31:56 +08:00

1 2 3 4 5 ...

3096 Commits