acrn-hypervisor

mirror of https://github.com/projectacrn/acrn-hypervisor.git synced 2025-09-21 16:57:20 +00:00

Author	SHA1	Message	Date
Liang Yi	2b3620de7d	hv: mask off LA57 in cpuid Mask off support of 57-bit linear addresses and five-level paging. ICX-D has LA57 but ACRN doesn't support 5-level paging yet. Tracked-On: #6357 Signed-off-by: Liang Yi <yi.liang@intel.com> Signed-off-by: Li, Fei1 <fei1.li@intel.com>	2021-08-20 11:02:21 +08:00
Shiqing Gao	91777a83b5	config_tools: add a new entry MAX_EFI_MMAP_ENTRIES It is used to specify the maximum number of EFI memmap entries. On some platforms, like Tiger Lake, the number of EFI memmap entries becomes 268 when the BIOS settings are changed. The current value of MAX_EFI_MMAP_ENTRIES (256) defined in hypervisor is not big enough to cover such cases. As the number of EFI memmap entries depends on the platforms and the BIOS settings, this patch introduces a new entry MAX_EFI_MMAP_ENTRIES in configurations so that it can be adjusted for different cases. Tracked-On: #6442 Signed-off-by: Shiqing Gao <shiqing.gao@intel.com>	2021-08-20 09:50:39 +08:00
Shiqing Gao	651d44432c	hv: initialize the XSAVE related processor state for guest If SOS is using kernel 5.4, hypervisor got panic with #GP. Here is an example on KBL showing how the panic occurs when kernel 5.4 is used: Notes: * Physical MSR_IA32_XSS[bit 8] is 1 when physical CPU boots up. * vcpu_get_guest_msr(vcpu, MSR_IA32_XSS)[bit 8] is initialized to 0. Following thread switches would happen at run time: 1. idle thread -> vcpu thread context_switch_in happens and rstore_xsave_area is called. At this moment, vcpu->arch.xsave_enabled is false as vcpu is not launched yet and init_vmcs is not called yet (where xsave_enabled is set to true). Thus, physical MSR_IA32_XSS is not updated with the value of guest MSR_IA32_XSS. States at this point: * Physical MSR_IA32_XSS[bit 8] is 1. * vcpu_get_guest_msr(vcpu, MSR_IA32_XSS)[bit 8] is 0. 2. vcpu thread -> idle thread context_switch_out happens and save_xsave_area is called. At this moment, vcpu->arch.xsave_enabled is true. Processor state is saved to memory with XSAVES instruction. As physical MSR_IA32_XSS[bit 8] is 1, ectx->xs_area.xsave_hdr.hdr.xcomp_bv[bit 8] is set to 1 after the execution of XSAVES instruction. States at this point: * Physical MSR_IA32_XSS[bit 8] is 1. * vcpu_get_guest_msr(vcpu, MSR_IA32_XSS)[bit 8] is 0. * ectx->xs_area.xsave_hdr.hdr.xcomp_bv[bit 8] is 1. 3. idle thread -> vcpu thread context_switch_in happens and rstore_xsave_area is called. At this moment, vcpu->arch.xsave_enabled is true. Physical MSR_IA32_XSS is updated with the value of guest MSR_IA32_XSS, which is 0. States at this point: * Physical MSR_IA32_XSS[bit 8] is 0. * vcpu_get_guest_msr(vcpu, MSR_IA32_XSS)[bit 8] is 0. * ectx->xs_area.xsave_hdr.hdr.xcomp_bv[bit 8] is 1. Processor state is restored from memory with XRSTORS instruction afterwards. According to SDM Vol1 13.12 OPERATION OF XRSTORS, a #GP occurs if XCOMP_BV sets a bit in the range 62:0 that is not set in XCR0 \| IA32_XSS. So, #GP occurs once XRSTORS instruction is executed. Such issue does not happen with kernel 5.10. Because kernel 5.10 writes to MSR_IA32_XSS during initialization, while kernel 5.4 does not do such write. Once guest writes to MSR_IA32_XSS, it would be trapped to hypervisor, then, physical MSR_IA32_XSS and the value of MSR_IA32_XSS in vcpu->arch.guest_msrs are updated with the value specified by guest. So, in the point 2 above, correct processor state is saved. And #GP would not happen in the point 3. This patch initializes the XSAVE related processor state for guest. If vcpu is not launched yet, the processor state is initialized according to the initial value of vcpu_get_guest_msr(vcpu, MSR_IA32_XSS), ectx->xcr0, and ectx->xs_area. With this approach, the physical processor state is consistent with the one presented to guest. Tracked-On: #6434 Signed-off-by: Shiqing Gao <shiqing.gao@intel.com> Reviewed-by: Li Fei1 <fei1.li@intel.com>	2021-08-20 09:46:09 +08:00
Zide Chen	2e6cf2b85b	hv: nested: fix bugs in init_vmx_msrs() Currently init_vmx_msrs() emulates same value for the IA32_VMX_xxx_CTLS and IA32_VMX_TRUE_xxx_CTLS MSRs. But the value of physical MSRs could be different between the pair, and we need to adjust the emulated value accordingly. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-20 09:40:50 +08:00
Zide Chen	ad37553873	hv: nested: redundant permission check on nested_vmentry() check_vmx_permission() is called in vmresume_vmexit_handler() and vmlaunch_vmexit_handler() already. Tracked-On: #6289 Signed-off-by: Zide Chen <zide.chen@intel.com>	2021-08-20 08:14:40 +08:00
Yifan Liu	d575edf79a	hv: Change sched_event structure to resolve data race in event handling Currently the sched event handling may encounter data race problem, and as a result some vcpus might be stalled forever. One example can be wbinvd handling where more than 1 vcpus are doing wbinvd concurrently. The following is a possible execution of 3 vcpus: ------- 0 1 2 req [Note: 0] req bit0 set [Note: 1] IPI -> 0 req bit2 set IPI -> 2 VMExit req bit2 cleared wait vcpu2 descheduled VMExit req bit0 cleared wait vcpu0 descheduled signal 0 event0->set=true wake 0 signal 2 event2->set=true [Note: 3] wake 2 vcpu2 scheduled event2->set=false resume req req bit0 set IPI -> 0 req bit1 set IPI -> 1 (doesn't matter) vcpu0 scheduled [Note: 4] signal 0 event0->set=true (no wake) [Note: 2] event0->set=false (the rest doesn't matter) resume Any VMExit req bit0 cleared wait idle running (blocked forever) Notes: 0: req: vcpu_make_request(vcpu, ACRN_REQUEST_WAIT_WBINVD). 1: req bit: Bit in pending_req_bits. Bit0 stands for bit for vcpu0. 2: In function signal_event, At this time the event->waiting_thread is not NULL, so wake_thread will not execute 3: eventX: struct sched_event of vcpuX. 4: In function wait_event, the lock does not strictly cover the execution between schedule() and event->set=false, so other threads may kick in. ----- As shown in above example, before the last random VMExit, vcpu0 ended up with request bit set but event->set==false, so blocked forever. This patch proposes to change event->set from a boolean variable to an integer. The semantic is very similar to a semaphore. The wait_event will add 1 to this value, and block when this value is > 0, whereas signal_event will decrease this value by 1. It may happen that this value was decreased to a negative number but that is OK. As long as the wait_event and signal_event are paired and program order is observed (that is, wait_event always happens-before signal_event on a single vcpu), this value will eventually be 0. Tracked-On: #6405 Signed-off-by: Yifan Liu <yifan1.liu@intel.com>	2021-08-20 08:11:40 +08:00
Zhou, Wu	b394777908	HV: Add implements of 32bit and 64bit elf loader This is a simply implement for the 32bit and 64bit elf loader. The loading function first reads the image header, and finds the program entries that are marked as PT_LOAD, then loads segments from elf file to guest ram. After that, it finds the bss section in the elf section entries, and clear the ram area it points to. Limitations: 1. The e_type of the elf image must be ET_EXEC(executable). Relocatable or dynamic code is not supported. 2. The loader only copies program segments that has a p_type of PT_LOAD(loadable segment). Other segments are ignored. 3. The loader doesn’t support Sections that are relocatable (sh_type is SHT_REL or SHT_RELA) 4. The 64bit elf’s entry address must below 4G. 5. The elf is assumed to be able to put segments to valid guest memory. Tracked-On: #6323 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Zhou, Wu	c2468d2791	HV: Add elf loader sketch This patch adds a function elf_loader() to load elf image. It checks the elf header, get its 32/64 bit type, then calls the corresponding loading routines, which are empty, and will be realized later. Tracked-On: #6323 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Zhou, Wu	537f69dde9	HV: Add elf header file for elf loader Source: https://github.com/freebsd/freebsd-src/blob/main/sys/sys/elf_common.h Trimed to meet the minimal requirements for the Zephyr elf file to be loaded Also added elf file header data struct and program/section entry data structs. Tracked-On: #6323 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Reviewed-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Zhou, Wu	8100b1dd56	HV: Remove 'vm_' of vm_elf_loader and etc. In order to make better sense, vm_elf_loader, vm_bzimage_loader and vm_rawimage_loader are changed to elf_loaer, bzimage_loaer and rawimage_loader. Tracked-On: #6323 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Zhou, Wu	53f6720d13	HV: Combine the acpi loading fucntion to one place Remove the acpi loading function from elf_loader, rawimage_loaer and bzimage_loader, and call it together in vm_sw_loader. Now the vm_sw_loader's job is not just loading sw, so we rename it to prepare_os_image. Tracked-On: #6323 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Reviewed-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Zhou, Wu	e78aacbe55	HV: Correct some naming issues For the guest OS loaders, prapare_loading_xxx are not accurate for what those functions actually do. Now they are changed to load_xxx: load_rawimage, load_bzimage. And the 'bsp' expression is confusing in the comments for init_vcpu_protect_mode_regs, changed to a better way. Tracked-On: #6323 Signed-off-by: Zhou, Wu <wu.zhou@intel.com> Reviewed-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Victor Sun	3124018917	HV: vm_load: rename vboot_info.h to vboot.h vboot_info.h declares vm loader function also, so rename the file name to vboot.h; Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Victor Sun	9b632c0e4b	HV: vm_load: split vm_load.c to support diff kernel format The patch splits the vm_load.c to three parts, the loader function of bzImage kernel is moved to bzimage_loader.c, the loader function of raw image kernel is moved to rawimage_loader.c, the stub is still stayed in vm_load.c to load the corresponding kernel loader function. Each loader function could be isolated by CONFIG_GUEST_KERNEL_XXX macro which generated by config tool. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Victor Sun	2524572fb2	HV: vm_load: refine vm_sw_loader API Change if condition to switch in vm_sw_loader() so that the sw loader could be compiled conditionally. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Yang,Yu-chu	73dc610d90	config-tool: refine guest kernel types Rename KERNEL_ZEPHYR to KERNEL_RAWIMAGE. Added new type "KERNEL_ELF". Add CONFIG_GUEST_KERNEL_RAWIMAGE, CONFIG_GUEST_KERNEL_ELF and/or CONFIG_GUEST_KERNEL_BZIMAGE to config.h if it's configured. Tracked-On: #6323 Signed-off-by: Yang,Yu-chu <yu-chu.yang@intel.com> Reviewed-by: Victor Sun <victor.sun@intel.com>	2021-08-19 20:00:45 +08:00
Victor Sun	178b3e85e3	HV: vm_load: change kernel type for zephyr image Previously we only support loading raw format of zephyr image as prelaunched Zephyr VM, this would cause guest F segment overridden issue because the zephyr raw image covers memory space from 0x1000 to 0x100000 upper. To fix this issue, we should support ELF format image loading so that parse and load the multiple segments from ELF image directly. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-19 20:00:45 +08:00
Fei Li	2e7491a8ec	hv: mmiodev: a minor bug fix about refine acrn_mmiodev data structure Rename base_hpa to host_pa in acrn_mmiodev data structure. Tracked-On: #6366 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-19 12:01:35 +08:00
Liu,Junming	2c5c8754de	hv:enable GVT-d for pre-launched linux guest in logical partion mode When pass-thru GPU to pre-launched Linux guest, need to pass GPU OpRegion to the guest. Here's the detailed steps: 1. reserve a memory region in ve820 table for GPU OpRegion 2. build EPT mapping for GPU OpRegion to pass-thru OpRegion to guest 3. emulate the pci config register for OpRegion For the third step, here's detailed description: The address of OpRegion locates on PCI config space offset 0xFC, Normal Linux guest won't write this register, so we can regard this register as read-only. When guest reads this register, return the emulated value. When guest writes this register, ignore the operation. Tracked-On: #6387 Signed-off-by: Liu,Junming <junming.liu@intel.com>	2021-08-19 11:56:26 +08:00
Jian Jun Chen	dc77ef9e52	hv: ivshmem: map SHM BAR with PAT ignored ACRN does not support the variable range vMTRR. The default memory type of vMTRR is UC. With this vMTRR emulation guest VM such as Linux refuses to map the MMIO address space as WB. In order to get better performance SHM BAR of ivshmem is mapped with PAT ignored and memory type of SHM BAR is fixed to WB. Tracked-On: #6389 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-13 11:17:15 +08:00
Yang,Yu-chu	d997f4bbc1	config-tools: refine bin_gen.py and create virtual TPM2 acpi table Create virtual acpi table of tpm2 based on the raw data if the TPM2 device is presented and the passthrough tpm2 is enabled. Refine the arguments of bin_gen.py. The --board and --scenario take the path to the XMLs as the argument. The allocation.xml is needed for bin_gen.py to generate tpm2 acpi table. Refine the condition of tpm2_acpi_gen. The tpm2 device "MSFT0101" can be present in device id or compatible_id(CID). Check both attributes and child node of tpm2 device. Tracked-On: #6320 Signed-off-by: Yang,Yu-chu <yu-chu.yang@intel.com>	2021-08-11 14:45:55 +08:00
Fei Li	a705ff2dac	hv: relocate ACPI DATA address to 0x7fe00000 Relocate ACPI address to 0x7fe00000 and ACPI NVS to 0x7ff00000 correspondingly. In this case, we could include TPM event log region [0x7ffb0000, 0x80000000) into ACPI NVS. Tracked-On: #6320 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-11 14:45:55 +08:00
Fei Li	74e68e39d1	hv: tpm2: do tpm2 fixup for security vm ACRN used to prepare the vTPM2 ACPI Table for pre-launched VM at the build stage using config tools. This is OK if the TPM2 ACPI Table never changes. However, TPM2 ACPI Table may be changed in some conditions: change BIOS configuration or update BIOS. This patch do TPM2 fixup to update the vTPM2 ACPI Table and TPM2 MMIO resource configuration according to the physical TPM2 ACPI Table. Tracked-On: #6366 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-11 14:45:55 +08:00
Fei Li	f81b39225c	HV: refine acrn_mmiodev data structure 1. add a name field to indicate what the MMIO Device is. 2. add two more MMIO resource to the acrn_mmiodev data structure. Tracked-On: #6366 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-11 14:45:55 +08:00
Fei Li	20061b7c39	hv: remove xsave dependence ACRN could run without XSAVE Capability. So remove XSAVE dependence to support more (hardware or virtual) platforms. Tracked-On: #6287 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-10 16:36:15 +08:00
Fei Li	84235bf07c	hv: vtd: a minor refine about dmar_wait_completion Check whether condition is met before check whether time is out after iommu_read32. This is because iommu_read32 would cause time out on some virtual platform in spite of the current DMAR status meets the pre_condition. Tracked-On: #6371 Signed-off-by: Fei Li <fei1.li@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-08-10 16:36:15 +08:00
Tao Yuhong	171856c46b	hv: uc-lock: Fix do not trap #GP If HV enable trigger #GP for uc-lock, and is about to emulate guest uc-lock instructions, should trap guest #GP. Guest uc-lock instrucction trigger #GP, cause vmexit for #GP, HV handle this vmexit and emulate uc-lock instruction. Tracked-On: #6299 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com>	2021-08-09 15:33:12 +08:00
liu hang1	d07bd78b13	Makefile:add targz-pkg entry in Makefile User could use make targz-pkg command to generate tar package in build directory,which could help user simplify the process of installing acrn hypervisor in target board. user need to copy the tarball package to target board,and extract it to "/" directory. Tracked-On: #6355 Signed-off-by: liu hang1 <hang1.liu@intel.com> Reviewed-by: VanCutsem, Geoffroy <geoffroy.vancutsem@intel.com> Acked-by: Wang, Yu1 <yu1.wang@intel.com>	2021-08-09 11:52:27 +08:00
Kunhui-Li	578c18b962	config_tools: remove obsolete kconfig files Remove obsolete Kconfig files; Update Kconfig related README and error message. Tracked-On: #6315 Signed-off-by: Kunhui-Li <kunhuix.li@intel.com>	2021-08-09 09:25:02 +08:00
Victor Sun	4a53a23faa	HV: debug: support 64bit BAR pci uart with 32bit space Currently the HV console does not support PCI UART with 64bit BAR, but in the case that the BAR is in 64bit and the BAR space is below 4GB (i.e. the high 32bit address of the 64bit BAR is zero), HV should be able to support it. Tracked-On: #6334 Signed-off-by: Victor Sun <victor.sun@intel.com>	2021-08-04 10:10:35 +08:00
Victor Sun	2fbc4c26e6	HV: vm_load: remove kernel_load_addr in sw_kernel_info struct When guest kernel has multiple loading segments like ELF format image, just define one load address in sw_kernel_info struct is meaningless. The patch removes kernel_load_addr member in struct sw_kernel_info, the load address should be parsed in each specified format image processing. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	d1d59437ea	HV: vm_load: correct needed size of bzImage kernel The previous code did not load bzImage start from protected mode part, result in the protected mode part un-align with kernel_alignment field and then cause kernel decompression start from a later aligned address. In this case we had to enlarge the needed size of bzImage kernel to kernel_init_size plus double size of kernel_alignment. With loading issue of bzImage protected mode part fixed, the kernel needed size is corrected in this patch. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	2b5bd2e87a	HV: vm_load: load protected mode code only for bzImage When LaaG boots with bzImage module file, only protected mode code need to be loaded to guest space since the VM will boot from protected mode directly. Futhermore, per Linux boot protocol the protected mode code better to be aligned with kernel_alignment field in zeropage, otherwise kernel will take time to do "rep movs" to the aligned address. In previous code, the bzImage is loaded to the address where aligned with kernel_alignment, this would make the protected mode code unalign with kernel_alignment. If the kernel is configured with CONFIG_RELOCATABLE=n, the guest would not boot. This patch fixed this issue. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	9caff7360f	HV: vm_load: set kernel load addr in vm_load.c This patch moves get_bzimage_kernel_load_addr() from init_vm_sw_load() to vm_sw_loader() stage so will set kernel load address of bzImage type kernel in vm_bzimage_loader() in vm_load.c. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	e40a258102	HV: vm_load: set ramdisk load addr in vm_load.c This patch moves get_initrd_load_addr() API from init_vm_sw_load() to vm_sw_loader() stage. The patch assumes that the kernel image have been loaded to guest space already. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	afe24731a7	HV: vm_load: remove load_sw_modules() api In load_sw_modules() implementation, we always assuming the guest kernel module has one load address and then the whole kernel image would be loaded to guest space from its load address. This is not true when guest kernel has multiple load addresses like ELF format kernel image. This patch removes load_sw_modules() API, and the loading method of each format of kernel image could be specified in prepare_loading_xxximage() API. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	2938eca363	HV: vm_load: refine api of get_bzimage_kernel_load_addr() As the previous commit said the kernel load address should be moved from init_vm_sw_load() to vm_sw_loader() stage. This patch refines the API of get_bzimage_kernel_load_addr() in init_vm_kernel_info() for later use. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Victor Sun	33d226bf58	HV: vm_load: refine api of get_initrd_load_addr() Currently the guest kernel load address and ramdisk load address are initialized during init_vm_sw_load() stage, this is meaningless when guest kernel has multiple segments with different loading addresses. In that case, the kernel load addresses should be parsed and loaded in vm_sw_loader() stage, the ramdisk load address should be set in that stage also because it is depended on kernel load address. This patch refines the API of get_initrd_load_addr() which will set proper initrd load address of bzImage type kernel for later use. Tracked-On: #6323 Signed-off-by: Victor Sun <victor.sun@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com>	2021-08-03 13:44:51 +08:00
Fei Li	bc5c3a0bb7	hv: vpci: modify Interrupt Line Register as writable According to PCIe Spec, for a RW register bits, If the optional feature that is associated with the bits is not implemented, the bits are permitted to be hardwired to 0b. However Zephyr would use INTx Line Register as writable even this PCI device has no INTx, so emulate INTx Line Register as writable. Tracked-On: #6330 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-03 11:01:24 +08:00
Fei Li	481e9fba9d	hv: remove the constraint "MMU and EPT must both support large page or not" There're some virtual platform which doesn't meet this constraint. So remove this constraint. Tracked-On: #6329 Signed-off-by: Fei Li <fei1.li@intel.com>	2021-08-03 11:01:24 +08:00
Minggui Cao	80ae3224d9	hv: expose PMC to core partition VM for core partition VM (like RTVM), PMC is always used for performance profiling / tuning, so expose PMC capability and pass-through its MSRs to the VM. Tracked-On: #6307 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Reviewed-by: Jason Chen CJ <jason.cj.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-07-27 14:58:28 +08:00
Minggui Cao	eba8c4e78b	hv: use ARRAY_SIZE to calc local array size if one array just used in local only, and its size not used extern, use ARRAY_SIZE macro to calculate its size. Tracked-On: #6307 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2021-07-27 14:58:28 +08:00
Yifan Liu	69fef2e685	hv: debug: Add hv console callback to VM-exit event In some scenarios (e.g., nested) where lapic-pt is enabled for a vcpu running on a pcpu hosting console timer, the hv console will be inaccessible. This patch adds the console callback to every VM-exit event so that the console can still be somewhat functional under such circumstance. Since this is VM-exit driven, the VM-exit/second can be low in certain cases (e.g., idle or running stress workload). In extreme cases where the guest panics/hangs, there will be no VM-exits at all. In most cases, the shell is laggy but functional (probably enough for debugging purpose). Tracked-On: #6312 Signed-off-by: Yifan Liu <yifan1.liu@intel.com>	2021-07-22 10:08:23 +08:00
Tao Yuhong	8360c3dfe6	HV: enable #GP for UC lock For an atomic operation using bus locking, it would generate LOCK# bus signal, if it has Non-WB memory operand. This is an UC lock. It will ruin the RT behavior of the system. If MSR_IA32_CORE_CAPABILITIES[bit4] is 1, then CPU can trigger #GP for instructions which cause UC lock. This feature is controlled by MSR_TEST_CTL[bit28]. This patch enables #GP for guest UC lock. Tracked-On: #6299 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-07-21 11:25:47 +08:00
Tao Yuhong	2aba7f31db	HV: rename splitlock file name Because the emulation code is for both split-lock and uc-lock, rename splitlock.c/splitlock.h to lock_instr_emul.c/lock_instr_emul.h Tracked-On: #6299 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-07-21 11:25:47 +08:00
Tao Yuhong	7926504011	HV: rename split-lock emulation APIs Because the emulation code is for both split-lock and uc-lock, Changed these API names: vcpu_kick_splitlock_emulation() -> vcpu_kick_lock_instr_emulation() vcpu_complete_splitlock_emulation() -> vcpu_complete_lock_instr_emulation() emulate_splitlock() -> emulate_lock_instr() Tracked-On: #6299 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-07-21 11:25:47 +08:00
Tao Yuhong	bbd7b7091b	HV: re-use split-lock emulation code for uc-lock Split-lock emulation can be re-used for uc-lock. In emulate_splitlock(), it only work if this vmexit is for #AC trap and guest do not handle split-lock and HV enable #AC for splitlock. Add another condition to let emulate_splitlock() also work for #GP trap and guest do not handle uc-lock and HV enable #GP for uc-lock. Tracked-On: #6299 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2021-07-21 11:25:47 +08:00
Tao Yuhong	553d59644b	HV: Fix decode_instruction() trigger #UD for emulating UC-lock When ACRN uses decode_instruction to emulate split-lock/uc-lock instruction, It is actually a try-decode to see if it is XCHG. If the instruction is XCHG instruction, ACRN must emulate it (inject #PF if it is triggered) with peer VCPUs paused, and advance the guest IP. If the instruction is a LOCK prefixed instruction with accessing the UC memory, ACRN Halted the peer VCPUs, and advance the IP to skip the LOCK prefix, and then let the VCPU Executes one instruction by enabling IRQ Windows vm-exit. For other cases, ACRN injects the exception back to VCPU without emulating it. So change the API to decode_instruction(vcpu, bool full_decode), when full_decode is true, the API does same thing as before. When full_decode is false, the different is if decode_instruction() meet unknown instruction, will keep return = -1 and do not inject #UD. We can use this to distinguish that an #UD has been skipped, and need inject #AC/#GP back. Tracked-On: #6299 Signed-off-by: Tao Yuhong <yuhong.tao@intel.com>	2021-07-21 11:25:47 +08:00
Yonghua Huang	48908d522f	hv: minor coding style fix in list.h To add brakets for '(char *)(ptr)' in MACRO container_of(), which may be used recursively. Tracked-On: #6284 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2021-07-15 14:18:13 +08:00
Shuo A Liu	4896faaebb	hv: dm: Remove aligned attribute of common structures Common structures are used by DM, kernel, HV. Aligned attribute might caused structures size mismatch between DM/HV and kernel, as kernel uses default GCC alignment. So, make DM/HV also use the default GCC alignment. Tracked-On: #6282 Signed-off-by: Shuo A Liu <shuo.a.liu@intel.com>	2021-07-15 11:53:54 +08:00

1 2 3 4 5 ...

3275 Commits