acrn-hypervisor

mirror of https://github.com/projectacrn/acrn-hypervisor.git synced 2025-06-06 06:02:20 +00:00

Author	SHA1	Message	Date
Wu Zhou	064be1e3e6	hv: support halt in hv idle When all vCPU threads on one pCPU are put to sleep (e.g., when all guests execute HLT), hv would schedule to idle thread. Currently the idle thread executes PAUSE which does not enter any c-state and consumes a lot of power. This patch is to support HLT in the idle thread. When we switch to HLT, we have to make sure events that would wake a vCPU must also be able to wake the pCPU. Those events are either generated by local interrupt or issued by other pCPUs followed by an ipi kick. Each of them have an interrupt involved, so they are also able to wake the halted pCPU. Except when the pCPU has just scheduled to idle thread but not yet halted, interrupts could be missed. sleep-------schedule to idle------IRQ ON---HLT--(kick missed) ^ wake---kick\| This areas should be protected. This is done by a safe halt mechanism leveraging STI instruction’s delay effect (same as Linux). vCPUs with lapic_pt or hv with CONFIG_KEEP_IRQ_DISABLED=y does not allow interrupts in root mode, so they could never wake from HLT (INIT kick does not wake HLT in root mode either). They should continue using PAUSE in idle. Tracked-On: #8507 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-15 11:52:40 +08:00
Yonghua Huang	791019edc5	hv: define a MACRO to indicate maximum memory size ~0UL is widely used to specify the maximum memory size when calling e820_alloc_memory(), this patch to define a MACRO for it to avoid using this magic number. Tracked-On: #8502 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-12 13:52:48 +08:00
Qiang Zhang	a4a73b5aac	HV: emulate dummy multi-function dev in Service VM For a pdev which allocated to prelaunched VM or owned by HV, we need to check whether it is a multifuction dev at function 0. If yes we have to emulate a dummy function dev in Service VM, otherwise the sub-function devices will be lost in guest OS pci probe process. Tracked-On: #8492 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Signed-off-by: Victor Sun <victor.sun@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	bf653d277b	HV: init one dev config with service vm config param When we do init_all_dev_config() in pci.c, the pdevs added to pci dev_config will be exposed to Service VM or passthru to prelauched VM. The original code would find service VM config in every pci pdev init loop, this is unnecessary and definitely impact performance. Here we generate Service VM config pointer with config tool so that init_one_dev_config() could refer service VM config directly. Tracked-On: #8491 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Signed-off-by: Victor Sun <victor.sun@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	deccb22ea8	hv: rename is_allocated_to_prelaunched_vm to allocate_to_prelaunched_vm Rename is_allocated_to_prelaunched_vm to allocate_to_prelaunched_vm as it not only checks whether the PCI device is allocated to a Pre-launched VM but also associate it with Pre-launched VM's dev_config. Tracked-On: #8491 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	aebc16e9e5	hv: fix Service VM EPT mapping upper bound On some platforms, the last e820 entry may not be of type E820_TYPE_RAM, such as E820_TYPE_ACPI_NVS which may also be used by Service VM. So we need take all e820 entry types into account when finding the upper bound of Service VM EPT mapping. Tracked-On: #8495 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-05 11:09:46 +08:00
Muhammad Qasim Abdul Majeed	a457e65619	doc: Fix spelling and typo mistakes. Tracked-On: #8488 Signed-off-by: Muhammad Qasim Abdul Majeed <qasim.majeed20@gmail.com>	2023-09-05 09:34:21 +08:00
Jiaqing Zhao	7bfbdf04b8	doc: remove '@return None' for void functions doxygen will warn that documented return type is found for functions that does not return anything in 1.9.4 or later versions. 'None' is not a special keyword in doxyge, it will recognize it as description to the return value that does not exist in void functions. Tracked-On: #8425 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-03 14:56:29 -07:00
Wu Zhou	8af2c263db	hv: disable HFI and ITD for guests The Hardware Feedback Interface (HFI) and Intel® Thread Director (ITD) features require OS to provide a physical page address to IA32_HW_FEEDBACK_PTR. Then the hardware will update the processor information to the page address. The issue is that guest VM will program its GPA to that MSR, causing great risk of tempering memory. So HFI and ITD should be made invisible to guests, until we provide proper virtulization of those features. Tracked-On: #8463 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-01 14:57:23 +08:00
Wu Zhou	fa97e32917	hv: bugfix: skip invalid ffs64 return value ffs64() returns INVALID_BIT_INDEX (0xffffU) when it tries to deal with zero input value. This may happen In calculate_logical_dest_mask() when the guest tries to write some illegal destination IDs to MSI config registers of a pt-device. The ffs64() return value is used as per_cpu array index, and it would cause a page fault. This patch adds protection to the per_cpu array, making this function return zero on illegal value. As in logical destination's definition, a zero logical designation would point to no CPU. Fixes: `89d11d91e` Tracked-On: #8454 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-07-14 17:38:16 +08:00
Wu Zhou	89d11d91e2	hv: bugfix: fix the ptdev irq destination issue According to SDM Vol3 11.12.10, in x2APIC mode, Logical Destination has two parts: - Cluster ID (LDR[31:16]) - Logical ID (LDR[15:0]) Cluster ID is a numerical address, while Logical ID is a 16bit mask. We can only use Logical ID to address multi destinations within a Cluster. So we can't just 'or' all the Logical Destination in LDR registers to get one mask for all target pCPUs. This would get a wrong destination mask if the target Destinations are from different Clusters. For example in ADL/RPL x2APIC LDRs for core 2-5 are 0x10001 0x10100 0x20001 0x20100. If we 'or' them together, we would get a Logical Destination of 0x30101, which points to core 6 and another core. If core 6 is running a RTVM, then the irq is unable to get to core 2-5, causing the guest on core 2-5 driver fail. Guests working in xAPIC mode may use 'Flat Model' to select an arbitrary list of CPUs as its irq destination. HV may not be able to include them all when transfering to physical destinations, because the HW is working in x2APIC mode and can only use 'Cluster Model'. There would be no perfect fix for this issue. This patch is a simple fix, by just keep the first Cluster of all target Logical Destinations. Tracked-On: #8435 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-07-05 17:41:16 +08:00
Jiaqing Zhao	e92320cf56	hv: allow non-service vm to read MSR_PLATFORM_INFO (CEh) Guests bootloader may read MSR_PLATFORM_INFO to get TSC frequency for time measurement, so injecting #GP on read may crash the vm on boot. This patch emulates MSR_PLATFORM_INFO with 0, same behavior in kvm, to tell the guest it's a virtualized environment. Tracked-On: #8406 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-06-19 17:45:11 +08:00
Wu Zhou	db83648a8d	hv: hide thermal interface from guests Thermal events are delivered through lapic thermal LVT. Currently ACRN does not support delivering those interrupts to guests by virtual lapic. They need to be virtualized to provide guests some thermal management abilities. Currently we just hide thermal lvt from guests, including: 1. Thermal LVT: There is no way to hide thermal LVT from guests. But we need do something to make sure no interrupt can be actually trigered: - skip thermal LVT in vlapic_trigger_lvt() - trap-and-emulate thermal LVT in lapic-pt mode 2. As We have plan to introduce virtualization of thermal monitor in the future, we use a vm flag GUEST_FLAG_VTM which is default 0 to control the access to it. So that it can help enabling VTM in the future. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-15 20:36:44 +08:00
Wu Zhou	c5d019b836	hv: emulate cpuids and MSRs for VHWP Changes made by this patch includes: 1. Emulate HWP and pstate MSRs/CPUIDs. Those are exposed to guest when the GUEST_FLAG_VHWP is set: - CPUID[6].EAX[7,9,10]: MSR_IA32_PM_ENABLE(enabled by hv, always read 1), MSR_IA32_HWP_CAPABILITIES, MSR_IA32_HWP_REQUEST, MSR_IA32_HWP_STATUS, - CPUID[6].ECX[0]: MSR_IA32_MPERF, MSR_IA32_APERF - MSR_IA32_PERF_STATUS(read as base frequency when not owning pCPU) - MSR_IA32_PERF_CTL(ignore writes) 2. Always hide HWP interrupt and package control MSRs/CPUIDs: - CPUID[6].EAX[8]: MSR_IA32_HWP_INTERRUPT(currently ACRN is not able to deliver thermal LVT virtual interrupt to guests) - CPUID[6].EAX[11,22]: MSR_IA32_HWP_REQUEST_PKG, MSR_IA32_HWP_CTL Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Wu Zhou	2edf141047	hv: add VHWP guest flag and its helper func Currently CPU frequency control is hidden to guests, and controlled by hypervisor. While it is sufficient in most cases, some guest OS may still need CPU performance info to make multi-core scheduling decisions. This is seen on Linux kernel, which uses HWP highest performance level as CPU core's priority in multi-core scheduling (CONFIG_SCHED_MC_PRIO). Enabling this kernel feature could improve performance as single thread workloads are scheduled on the highest performance cores. This is significantly useful for guests with hybrid cores. The concept is to expose performance interface to guest who exclusively owns pCPU assigned to it. So that Linux guest can load intel_pstate driver which will then provide the kernel with each core's schedule priority. Intel_pstate driver also relies on CONFIG_ACPI_CPPC_LIB to implement this mechanic, this means we also need to provide ACPI _CPC in DM. This patch sets up a guest flag GUEST_FLAG_VHWP to indicate whether the guest can have VHWP feature. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Jiaqing Zhao	e4e25f076c	hv: sgx: refactor partition_epc() This patch refactors partition_epc() to make the code easier to understand, also fixes the maybe-uninitialized warning for gcc-13. Initializing 'vm_config' to get_vm_config(0) is okay here as scenario validator ensures CONFIG_MAX_VM_NUM to be always larger than 0. Tracked-On: #8413 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-06-06 15:22:19 +08:00
Jiaqing Zhao	770cf8c434	hv: emulate MSR_PLATFORM_INFO (17h) This patch emulates the PLATFORM_INFO MSR in hypervisor to make it only visible to Service VM, and only processor ratios (bit 15:8, 47:40 and 55:48) and sample part bit (27) are exponsed. This is intended to prevent Service VM from changing processor parameters like turbo ratio. Tracked-On: #8406 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-05-30 15:10:05 +08:00
Qiang Zhang	fcb8e9bb2d	ptirq: Fix INTx assignment for Post-launched VM When assigning a physical interrupt to a Post-launched VM, if it has been assigned to ServiceVM, we should remove that mapping first to reset ioapic pin state and rte, and build new mapping for the Post-launched VM. Tracked-On: #8370 Signed-off-by: Qiang Zhang <qiang4.zhang@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2023-04-13 12:24:57 +08:00
Zhangwei6	38f910e4fb	hv: change the version format The version info is mainly used to tell the user when and where the binary is compiled and built, this will change the hv version format. The hv follows the format: major.minor-stable/unstable-remote_branch-acrn-commit_date-commit_id-dirty DBG/REL(tag-current_commit_id) scenario@board build by author date. The '(tag-current_commit_id)' is optional, it exits only when there are tags for current commit. e.g. with tag: ACRN:\>version HV: 3.1-stable-release_3.1-2022-09-27-11:15:42-7fad37e02-dirty DBG(tag: v3.1) scenario3.1@my_desk_3.1 build by zhangwei 2022-11-16 07:02:37 without tag: ACRN:\>version HV: 3.2-unstable-master-2022-11-16-14:34:49-11f53d849-dirty DBG scenario3.1@my_desk_3.1 build by zhangwei 2022-11-16 06:49:44 Tracked-On #8303 Signed-off-by: Zhangwei6 <wei6.zhang@intel.com>	2022-11-21 09:45:26 +08:00
Yuanyuan Zhao	0a4c76357e	hv: hide mwait from guest. When CPU support MONITOR/MWAIT, OS prefer to use it enter deeper C-state. Now ACRN pass through MONITOR/MWAIT to guest. For vCPUs (ie vCPU A and vCPU B) share a pCPU, if vCPU A uses MWait to enter C state, vCPU B could run only after the time slice of vCPU A is expired. This time slice of vCPU A is gone to waste. For Local APIC pass-through (used for RTVM), the guest pay more attention to timeliness than power saving. So this patch hides MONITOR/MWAIT by: 1. Clear vCPUID.05H, vCPUID.01H:ECX.[bit 3] and MSR_IA32_MISC_ENABLE_MONITOR_ENA to tell the guest VM's vCPU does not support MONITOR/MAIT. 2. Enable MSR_IA32_MISC_ENABLE_MONITOR_ENA bit for MSR_IA32_MISC_ENABLE inject 'GP'. 3. Trap instruction 'MONITOR' and 'MWAIT' and inject 'UD'. 4. Clear vCPUID.07H:ECX.[bit 5] to hide 'UMONITOR/UMWAIT'. 5. Clear "enable user wait and pause" VM-execution control, so UMONITOR/MWAIT causes an 'UD'. Tracked-On: #8253 Signed-off-by: Yuanyuan Zhao <yuanyuan.zhao@linux.intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-11-04 18:55:52 +08:00
Chenli Wei	dcb0f05efc	hv: refine the sworld memory allocate The current code uses a predefined sworld memory array to reserve memory for trusty VMs, and assume all post launched VMs are trusty VM which is not correct. This patch statically reserved memory just for trusty VMs and save 16M memory for every non trusty VM. Tracked-On: #6690 Signed-off-by: Chenli Wei <chenli.wei@intel.com> Acked-by: Eddie Dong <eddie.dong@Intel.com>	2022-10-19 15:58:25 +08:00
Wu Zhou	fbdc2774af	hv: add ACRN CPU frequency initializer The design of ACRN CPU performance management is to let hardware do the autonomous frequency selection(or set to a fixed value), and remove guest's ability to control CPU frequency. This patch is to implement the CPU frequency initializer, which will setup CPU frequency base on the performance policy type. Two performance policy types are provided for user to choose from: - 'Performance': CPU runs at its CPU runs at its maximum frequency. Enable hardware autonomous frequency selection if HWP is presented. - 'Nominal': CPU runs at its guaranteed frequency. The policy type is passed to hypervisor through boot parameter, as either 'cpu_perf_policy=Nominal' or 'cpu_perf_policy=Performance'. The default type is 'Performance'. Both HWP and ACPI p-state are supported. HWP is the first choice, for it provides hardware autonomous frequency selection, while keeps frequency transaction time low. Two functions are added to the hypervisor to call: - init_frequency_policy(): called by BSP at start up time. It processes the boot parameters, and enables HWP if it is presented. - apply_frequency_policy(): called after init_frequency_policy(). It applies initial CPU frequency policy setting for each core. It uses a set of frequency limits data struct to quickly decide what the highest/nominal frequency is. The frequency limits are generated by config-tools. The hypervisor will not be governing CPU frequency after initial policy is applied. Cores running RTVMs are fixed to nominal/guaranteed frequency, to get more certainty in latency. This is done by setting the core's frequency limits to highest=lowest=nominal in config-tools. Tracked-On: #8168 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-10-08 11:13:21 +08:00
Conghui	4c79354798	hv: add hypercall to register asyncio Add hypercall to add/remove asyncio request info. Hv will record the info in a list, and when a new ioreq is come, hv will check if it is in the asyncio list, if yes, queue the fd to asyncio buffer. Tracked-On: #8209 Signed-off-by: Conghui <conghui.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-27 10:26:42 +08:00
Conghui	12bfa98a37	hv: support asyncio request Current IO emulation is synchronous. The user VM need to wait for the completion of the the I/O request before return. But Virtio Spec introduces introduces asynchronous IO with a new register in MMIO/PIO space named NOTIFY, to be used for FE driver to notify BE driver, ACRN hypervisor can emulate this register by sending a notification to vCPU in Service VM side. This way, FE side can resume to work without waiting for the full completion of BE side response. Tracked-On: #8209 Signed-off-by: Conghui <conghui.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-27 10:26:42 +08:00
Conghui	9cf9606e56	hv: extend sbuf hypercall Extend sbuf hypercall to support other kinds of share buffer. Tracked-On: #8209 Signed-off-by: Conghui <conghui.chen@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-27 10:26:42 +08:00
Minggui Cao	6d4ca4b3a1	hv: improve smp call to support debugging RTVM Improve SMP call to support ACRN shell to operate RTVM. before, the RTVM CPU can't be kicked off by notification IPI, so some shell commands can't support it, like rdmsr/wrmsr, memory/registers dump. So INIT will be used for RTVM, which LAPIC is pass-thru. Tracked-On: #8207 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-26 13:28:02 +08:00
Minggui Cao	bc4c773cf8	hv: add param to control INIT used to kick pCPU By default, notification IPI used to kick sharing pCPU, INIT used to kick partition pCPU. If USE_INIT_IPI flag is passed to hypervisor, only INIT will be used to kick pCPU. Tracked-On: #8207 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-26 13:28:02 +08:00
Minggui Cao	2c140addaf	hv: use kick-mode in per-cpu to control kick pCPU INIT signal has been used to kick off the partitioned pCPU, like RTVM, whose LAPIC is pass-through. notification IPI is used to kick off sharing pCPU. Add mode_to_kick_pcpu in per-cpu to control the way of kicking pCPU. Tracked-On: #8207 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-26 13:28:02 +08:00
Wu Zhou	6a430de814	hv: remove CPU frequency control from guests The design of ACRN CPU performance management is to let hardware do the autonomous frequency selection(or set to a fixed value), and remove guest's ability to control CPU frequency. This patch is to remove guest's ability to control CPU frequency by removing the guests' HWP/EIST CPUIDs and blocking the related MSR accesses. Including: - Remove CPUID.06H:EAX[7..11] (HWP) - Remove CPUID.01H:ECX[7] (EIST) - Inject #GP(0) upon accesses to MSR_IA32_PM_ENABLE, MSR_IA32_HWP_CAPABILITIES, MSR_IA32_HWP_REQUEST, MSR_IA32_HWP_STATUS, MSR_IA32_HWP_INTERRUPT, MSR_IA32_HWP_REQUEST_PKG - Emulate MSR_IA32_PERF_CTL. Value written to MSR_IA32_PERF_CTL is just stored for reading. This is like how the native environment would behavior when EIST is disabled from BIOS. - Emulate MSR_IA32_PERF_STATUS by filling it with base frequency state. This is consistent with Windows, which displays current frequency as base frequency when running in VM. - Hide the IA32_MISC_ENABLE bit 16 (EIST enable) from guests. This bit is dependent to CPUID.01H:ECX[7] according to SDM. - Remove CPID.06H:ECX[0] (hardware coordination feedback) - Inject #GP(0) upon accesses to IA32_MPERF, IA32_APERF Also DM do not need to generate _PSS/_PPC for post-launched VMs anymore. This is done by letting hypercall HC_PM_GET_CPU_STATE sub command ACRN_PMCMD_GET_PX_CNT and ACRN_PMCMD_GET_PX_DATA return (-1). Tracked-On: #8168 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-21 03:48:58 +08:00
Jian Jun Chen	1bf984890b	hv: tsc: start HPET counter before calibration HPET is used to calibrate the tsc frequency if system fails to get the accurate frequency from CPUID 0x15. But on some platforms (for example: the emulated ACRN on QEMU) HPET is not started by default, which causes the failure of calibration TSC by HPET. Tracked-On: #8113 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-09-15 03:14:01 +08:00
Wu Zhou	8c5bb8b471	hv: change 'DISABLED' settings to 'ENABLED' In order to improve DX, 'DISABLED' style configurator settings are changed to 'ENABLED' style. The meaning of the MACROs are reversed, so in the hv code we have to change '#ifndef' -> '#ifdef' or '#ifdef' -> '#ifndef'. Including: - MCE_ON_PSC_DISABLED -> MCE_ON_PSC_ENABLED - ENFORCE_TURNOFF_AC -> SPLIT_LOCK_DETECTION_ENABLED - ENFORCE_TURNOFF_GP -> UC_LOCK_DETECTION_ENABLED Tracked-On: #7661 Acked-by: Eddie Dong <eddie.dong@intel.com> Signed-off-by: Wu Zhou <wu.zhou@intel.com>	2022-08-17 09:23:33 +08:00
Minggui Cao	83164d6030	hv: shell: improve console to modify input easier 1. make memcpy_erms as a public API; add a new one memcpy_erms_backwards, which supports to copy data from tail to head. 2. improve to use right/left/home/end key to move cursor, and support delete/backspace key to modify current input command. Tracked-On: #7931 Signed-off-by: Minggui Cao <minggui.cao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-07-28 23:31:43 +08:00
Jian Jun Chen	22a302599a	hv: tlfs: fix the incorrect vLAPIC freq MSR When LAPIC timer is working in oneshot or periodic mode, OS uses initial counter register/current counter register to program a timer. Both initial counter and current counter depend on the LAPIC frequency. ACRN emulated vLAPIC timer based on the TSC. vLAPIC freq is the same as TSC freq. Tracked-On: #7876 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com>	2022-07-26 05:53:19 +08:00
Yifan Liu	4f4da08490	hv: cve hotfix: Disable RRSBA on platform using retpoline For platform that supports RRSBA (Restricted Return Stack Buffer Alternate), using retpoline may not be sufficient to guard against branch history injection or intra-mode branch target injection. RRSBA must be disabled to prevent CPUs from using alternate predictors for RETs. Quoting Intel CVE-2022-0001/CVE-2022-0002: Where software is using retpoline as a mitigation for BHI or intra-mode BTI, and the processor both enumerates RRSBA and enumerates RRSBA_DIS controls, it should disable this behavior. ... Software using retpoline as a mitigation for BHI or intra-mode BTI should use these new indirect predictor controls to disable alternate predictors for RETs. See: https://www.intel.com/content/www/us/en/developer/articles/technical/ software-security-guidance/technical-documentation/branch-history-injection.html Tracked-On: #7907 Signed-off-by: Yifan Liu <yifan1.liu@intel.com>	2022-07-22 09:38:41 +08:00
Jian Jun Chen	c88860250e	hv: tlfs: add tlfs TSC freq MSR support for WaaG TLFS defined 2 vMSRs which can be used by Windows guest to get the TSC/APIC frequencies from hypervisor. This patch adds the support of HV_X64_MSR_TSC_FREQUENCY/HV_X64_MSR_APIC_FREQUENCY vMSRS whose availability is exposed by CPUID.0x40000003:EAX[bit11] and EDX[bit8]. v1->v2: - revise commit message to highlight that the changes are for WaaG Tracked-On: #7876 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Zhao Yakui <yakui.zhao@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-07-18 16:15:29 +08:00
Jian Jun Chen	97a2919138	hv: tsc: calibrate TSC by HPET On some platforms CPUID.0x15:ECX is zero and CPUID.0x16 can only return the TSC frequency in MHZ which is not accurate. For example the TSC frequency obtained by CPUID.0x16 is 2300 MHZ and the TSC frequency calibrated by HPET is 2303.998 MHZ which is much closer to the actual TSC frequency 2304.000 MHZ. This patch adds the support of using HPET to calibrate TSC when HPET is available and CPUID.0x15:ECX is zero. v3->v4: - move calc_tsc_by_hpet into hpet_calibrate_tsc v2->v3: - remove the NULL check in hpet_init - remove ""& 0xFFFFFFFFU" in tsc_read_hpet - add comment for the counter wrap in the low 32 bits in calc_tsc_by_hpet - use a dedicated function for hpet_calibrate_tsc v1->v2: - change native_calibrate_tsc_cpuid_0x15/0x16 to native_calculate_tsc_cpuid_0x15/0x16 - move hpet_init to BSP init - encapsulate both HPET and PIT calibration to one function - revise the commit message with an example" Tracked-On: #7876 Signed-off-by: Jian Jun Chen <jian.jun.chen@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-07-17 16:48:47 +08:00
Ziheng Li	eb8bcb06b3	Update copyright year range in code headers Modified the copyright year range in code, and corrected "int32_tel" into "Intel" in two "hypervisor/include/debug/profiling.h" and "hypervisor/include/debug/profiling_internal.h". Tracked-On: #7559 Signed-off-by: Ziheng Li <ziheng.li@intel.com>	2022-07-15 11:48:35 +08:00
Yifan Liu	05460f151a	hv: Serialize WBINVD using wbinvd_lock As mentioned in previous patch, wbinvd utilizes the vcpu_make_request and signal_event call pair to stall other vcpus. Due to the fact that these two calls are not thread-safe, we need to avoid concurrent call to this API pair. This patch adds wbinvd lock to serialize wbinvd emulation. Tracked-On: #7887 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-07-14 09:05:37 +08:00
Yifan Liu	745e70fb06	hv: Change sched_event back to boolean-based implementation Commit `d575edf79a` changes the internal implementation of wait_event and signal_event to use a counter instead of a boolean value. The background was: ACRN utilizes vcpu_make_request and signal_event pair to shoot down other vcpus and let them wait for signals. vcpu_make_request eventually leads to target vcpu calling wait_event. However vcpu_make_request/signal_event pair was not thread-safe, and concurrent calls of this pair of API could lead to problems. One such example is the concurrent wbinvd emulation, where vcpus may concurrently issue vcpu_make_request/signal_event to synchronize wbinvd emulation. `d575edf` commit uses a counter in internal implementation of wait_event/signal_event to avoid data races. However by using a counter, the wait/signal pair now carries semantics of semaphores instead of events. Semaphores require caller to carefully plan their calls instead of multiply signaling any number of times to the same event, which deviates from the original "event" semantics. This patch changes the API implementation back to boolean-based, and re-resolve the issue of concurrent wbinvd in next patch. This also partially reverts commit `10963b04d1`, which was introduced because of the `d575edf`. Tracked-On: #7887 Signed-off-by: Yifan Liu <yifan1.liu@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2022-07-14 09:05:37 +08:00
Fei Li	df3390f401	hv: vtd: reset the one-shot bits for GCMD_REG If multiple control fields in GCMD_REG register need to be modified, software must serialize the modifications through multiple writes to this register. So one-shot bits (bits 30-29, 27 and 24) in gcmd should not been set. Otherwise, other control field may be written to GCMD_REG at the same time with one-shot bit (Clearing one-shot bit has no effect, software sets this field would set/update this control field used by hardware). Tracked-On: #7381 Signed-off-by: Fei Li <fei1.li@intel.com>	2022-05-20 09:30:25 +08:00
Chenli Wei	5f0588b5f8	hv: move the define of MAX_IR_ENTRIES to offline tool There is an issue of calculate 2^n roundup of CONFIG_MAX_PT_IRQ_ENTRIES, and the code style is very ugly when we use macro to fix it. So this patch move MAX_IR_ENTRIES to offline tool which could do align check and calculate it automatically. Signed-off-by: Chenli Wei <chenli.wei@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-05-20 09:08:47 +08:00
Yonghua Huang	961b5d16f4	hv: update SSRAM regions EPT memory type to WB when SSRAM regions are assigned to service VM to support virtulization of SSRAM for post-launched RTVMs, service VM need to access all SSRAM regions for management, typically, service VM does data cleanup in SSRAM region when it is reclaimed from a shutdown RTVM. This patch update memory type from UC(by default) to WB, else SSARM region will be evicted when access from guest happens. Tracked-On: #7425 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>	2022-05-10 15:45:52 +08:00
Zhou, Wu	3ba5b1522f	hv: fix post RTVM booting failure with SSRAM When booting prelaunch RTVM with SSRAM enabled, we need to delete the SSRAM region that is used by prelaunch RTVM from Service VM EPT mapping. If it is not used, or it is not fully used, the SSRAM or the rest SSRAM should be in Service VM map. But current code has a issue that it always deletes all SSRAM region from Service VM EPT, even when no SSRAM is enabled for prelaunch RTVM. This could cause the post RTVM with SSRAM boot failure, as DM checks and removes SSRAM region from Service VM EPT during post RTVM setup. Changing get_software_sram_size() to PRE_RTVM_SW_SRAM_MAX_SIZE could solve the issue, as PRE_RTVM_SW_SRAM_MAX_SIZE is the SSRAM size that prelaunch RTVM actually uses. Tracked-On: #7401 Signed-off-by: Zhou, Wu <wu.zhou@intel.com>	2022-05-06 14:41:58 +08:00
Jiang, Yanting	599894e571	Fix: write xmm registers correctly The movdqu instruction moves unaligned double quadword (128 bit) contained in XMM registers. This patch uses pointers as input parameters of the function write_xmm_0_2() to get 128-bit value from 64-bit array for each XMM register. Tracked-On: #7380 Reviewed-by: Fei Li <fei1.li@intel.com> Signed-off-by: Jiang, Yanting <yanting.jiang@intel.com>	2022-05-06 10:29:33 +08:00
Tw	e2f7b1fc51	hv: remove obsolete declarations related to RDT Since CAT support for hybrid platform is landed, let's remove some old declarations which are no longer used. Tracked-On: #6690 Signed-off-by: Tw <wei.tan@intel.com>	2022-04-26 14:27:01 +08:00
Chenli Wei	ed1c638c87	hv: refine for HPAn setting The current code only supports 2 HPA regions per VM. This patch extended ACRN to support 2+ HPA regions per VM, to use host memory better if it is scatted among multiple regions. This patch uses an array to describe the hpa region for the VM, and change the logic of ve820 to support multiple regions. This patch dependent on the config tool and GPA SSRAM change Tracked-On: #6690 Signed-off-by: Chenli Wei <chenli.wei@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-04-22 14:46:05 +08:00
Tw	e246ada6b0	hv: fix core type error when launch RTVM use atom core When CPUID executes with EAX set to 1AH, the processor returns information about hybrid capabilities. This information is percpu related, and should be obtained directly from the physical cpu. Tracked-On: #6899 Signed-off-by: Tw <wei.tan@intel.com>	2022-04-21 09:21:16 +08:00
Yonghua Huang	80292a482d	hv: remove pgentry_present field in struct pgtable Page table entry present check is page table type specific and static, e.g. just need to check bit0 of page entry for entries of MMU page table and bit2~bit0 for EPT page table case. hence no need to check it by callback function every time. This patch remove 'pgentry_present' callback field and add a new bitmask field for this page entry present check. It can get better performance especially when this check is executed frequently. Tracked-On: #7327 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Eddie Dong <eddie.dong@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com> Reviewed-by: Yu Wang <yu1.wang@intel.com>	2022-04-20 17:38:02 +08:00
Tw	deb35a0de9	hv: fix cpuid 0x2 mismatch when launch RTVM use atom core When CPUID executes with EAX set to 02H, the processor returns information about cache and TLB information. This information is percpu related, and should be obtained directly from the physical cpu. BTW, this patch is backported from v2.7 branch. Tracked-On: #6931 Signed-off-by: Tw <wei.tan@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2022-04-20 16:13:10 +08:00
Zhou, Wu	32cb5954f2	hv: refine the hard-coded GPA SSRAM area size Using the SSRAM area size extracted by config_tools, the patch changes the hard-coded GPA SSRAM area size to its actual size, so that pre-launched VMs can support large(>8MB) SSRAM area. When booting service VM, the SSRAM area has to be removed from Service VM's mem space, because they are passed-through to the pre-rt VM. The code was bugged since it was using the SSRAM area's GPA in the pre-rt VM. Changed it to GPA in Service VM. Tracked-On: #7212 Acked-by: Eddie Dong <eddie.dong@intel.com> Signed-off-by: Zhou, Wu <wu.zhou@intel.com>	2022-04-18 16:47:23 +08:00

1 2 3 4 5 ...

2314 Commits