acrn-hypervisor

mirror of https://github.com/projectacrn/acrn-hypervisor.git synced 2025-06-03 12:49:45 +00:00

Author	SHA1	Message	Date
Yi Sun	e0d03b27d0	hv: return error code for default case in hcall_vm_intr_monitor In hcall_vm_intr_monitor(), the default case for intr_hdr->cmd is a wrong case. So, it should return error code back. But it returns success code 0 in current codes. Tracked-On: #8580 Reviewed-by: Fei Li <fei1.li@intel.com> Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>	2024-04-23 15:58:36 +08:00
Yonghua Huang	7bcd9d783e	hv: refine set_fs_base() function Leave canary of stack protector untouched on pCPU if it has been initialized, instead of generating a new one. Tracked-On: #8577 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Reviewed-by: Fei Li <fei1.li@intel.com>	2024-04-23 11:00:43 +08:00
Yonghua Huang	d5d21fdc1b	hv: fix potential NULL pointer dereferrence in ivshmem.c secure coding fix. Tracked-On: #8566 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-04-10 12:20:54 +08:00
Yonghua Huang	ddfe218747	hv: fill region ID to hv-land ivshmem PCI config space 1) region ID shall be configured by user via config tool. 2) region ID is programmed to "Subsystem ID" of PCI config space. 2) "Subsystem Vendor ID" is harded coded as 0x8086 Tracked-On: #8566 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-03-28 14:34:38 +08:00
Yonghua Huang	a7a6732580	config_tools: support IVSHMEM devices region ID configuration This patch adds ivshmem region ID configuration support when user configure ACRN IVSHMEM devices via ACRN config tool, this ID provides VMs with a stable identification of multiple shared memory regions. Also add logic to generate launch script with region ID configured as below: `add_virtual_device 8 ivshmem hv:/shm_region_0,256,1` Tracked-On: #8566 Signed-off-by: Kunhui-Li <kunhuix.li@intel.com> Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-03-28 14:34:38 +08:00
Jiaqing Zhao	f6bb15c85c	hv: mmu: intiialize ppt_page_pool.bitmap in allocate_ppt_pages() ppt_page_pool.bitmap should be zero-initialized. Also fixes the wrong indention in allocate_ppt_pages(). Tracked-On: #8559 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2024-03-25 09:57:08 +08:00
Jiaqing Zhao	997bdc4843	hv: configure hv console default output in scenario file Add a new option CONSOLE_VM in scenario to set the default vm to be outputted in hv console, when it is not set, acrn console will be used (current behavior). This is intended for debugging vm boot issues. Tracked-On: #8518 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-03-25 09:52:30 +08:00
Wu Zhou	a052cda8e4	hv: console reads all input chars in one poll For uart console, some control keys are defined as byte sequences, such as: * up arrow - 0x1b/0x5b/0x41 * F8 - 0x1b/0x5b/0x31/0x39/0x7e Currently hv console only read one char per poll. When guest vuart console is active, those byte sequences may not be sent to guest vuart in good timing due to the poll interval. Thus control keys such as up/down can not be used in shell or vim. The solution is to read all input chars in one poll, so that control keys can be received by guest OS properly. Tracked-On: #8564 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-03-12 15:26:58 +08:00
Wu Zhou	925e3d95b4	hv: add max_len for sbuf_put param sbuf_put copies sbuf->ele_size of data, and puts into ring. Currently this function assumes that data size from caller is no less than sbuf->ele_size. But as sbuf->ele_size is usually setup by some sources outside of the HV (e.g., the service VM), it is not meant to be trusted. So caller should provide the max length of the data for safety reason. sbuf_put() will return UINT32_MAX if max_len of data is less than element size. Additionally, a helper function sbuf_put_many() is added for putting multiple entries. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-02-20 11:52:02 +08:00
Wu Zhou	29b3d03ac7	hv: vm_event: send event on triple fault handler In the triple fault handler, post-launched VMs are instantly turned off. Now a vm event is generated simultaneously. So that developers can capture the event and decide what to do with it. (e.g., logging and populating diagnostics, or poweroff VM) Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-02-01 17:01:31 +08:00
Wu Zhou	ab63fe1a92	hv: vm_event: send RTC change event in hv vRTC This patch adds support for HV vrtc vm_event. RTC change event is sent upon each date/time reg write. Those events will be handled in DM. DM will try to emit an RTC change event(to Libvirt) based on its strategy. Only support post-launched VMs. The DM event handler has already implemented the rtc chanage event. Those events will be processed the same way as vrtc events from DM vrtc. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-02-01 17:01:31 +08:00
Wu Zhou	262a48f346	dm: vm_event: add support for RTC change event When a guest OS performs an RTC change action, we wish this event be captured by developers, and then they can decide what to do with it. (e.g., whether to change physical RTC) There are some facts that makes RTC change event a bit complicated: - There are 7 RTC date/time regs (year, month…). They can only be updated one by one. - RTC time is not reliable before date/time update is finished. - Guests can update RTC date/time regs in any order. - Guests may update RTC date/time regs during either RTC halted or not halted. A single date/time update event is not reliable. We have to wait for the guest to finish the update process. So the DM's event handler sets up a timer, and wait for some time (1 second). If no more change happens befor the timer expires, we can conclude that the RTC change has been done. Then the rtc change event is emitted. This logic of event handler can be used to process HV vrtc time change event too. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Jian Jun Chen <jian.jun.chen@intel.com>	2024-02-01 17:01:31 +08:00
Wu Zhou	d9ccf1ccb2	dm: vm_event: create vm_event thread This patch creates a thread for vm_event delivery. The thread uses epoll to poll event notifications, then read out the msg data queued in sbuf. An event handler is called upon success receiving. Both HV and DM event sources share the same process. Also vm_event tx API for DM event source is added in this patch. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Jian Jun Chen <jian.jun.chen@intel.com>	2024-02-01 17:01:31 +08:00
Wu Zhou	581ec58fbb	hv: vm_event: create vm_event support This patch creates vm_event support in HV, including: 1. Create vm_event data type. 2. Add vm_event sbuf and its initializer. The sbuf will be allocated by DM in Service VM. Its page address will then be share to HV through hypercall. 3. Add an API to send the HV generated event. Tracked-On: #8547 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2024-02-01 17:01:31 +08:00
Muhammad Qasim Abdul Majeed	3be3b394ad	hypervisor: Fix spelling and grammar mistakes. Tracked-On: #8533 Signed-off-by: Muhammad Qasim Abdul Majeed <qasim.majeed20@gmail.com>	2023-10-24 11:10:47 +08:00
Muhammad Qasim Abdul Majeed	ce96ef6bae	hypervisor: Fix spelling and grammar mistakes. Tracked-On: #8533 Signed-off-by: Muhammad Qasim Abdul Majeed <qasim.majeed20@gmail.com>	2023-10-23 16:45:28 +08:00
Wu Zhou	bbe8e254cf	hv: support multi function ivshmem device Currently ivshmem device can only be configurated as single function device(bdf.f = 0) on bus 0. This greatly limits the number of ivshmem devices we can create. This patch is to enable multiple function bit in HEADER_TYPE config register, so that we can create many more ivshmem devices by using different function numbers on one bus:dev. The multi function device bit is to be set on ivshmem devices whose function number equls 0. PCI spec describe it as: ‘When Set, indicates that the Device may contain multiple Functions, but not necessarily.’, So if this dev is the only one on the bus:dev, it is still OK. Tracked-On: #8520 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-27 16:46:20 +08:00
Qiang Zhang	79b91b339b	hv: sched: add four parameters for BVT scheduler Per BVT (Borrowed Virtual Time) scheduler design, following per thread parameters are required to tune scheduling behaviour. - weight The time sharing of a thread on CPU. - warp Boost value of virtual time of a thread (time borrowed from future) to reduce Effective Virtual Time to prioritize the thread. - warp_limit Max warp time in one warp. - unwarp_period Min unwarp time after a warp. As of now, only weight is in use to tune virtual time ratio of VCPU threads from different VMs. Others parameters are for future extension. Tracked-On: #8500 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Qiang Zhang	04a4f31d28	config: add four per-vm bvt parameters Add four per-vm bvt parameters as the initial bvt parameter values for vCPU threads. - bvt_weight The time sharing of a thread on CPU. - bvt_warp_value Boost value of virtual time of a thread (time borrowed from future) to reduce Effective Virtual Time to prioritize the thread. - bvt_warp_limit Max warp time in one warp. - bvt_unwarp_period Min unwarp time after a warp. Tracked-On: #8500 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Qiang Zhang	6a1d91c740	hv: sched: Add sched_params struct for thread parameters Abstract out schedulers config data for vCPU threads and other hypervisor threads to sched_params structure. And it's used to initialize per thread scheduler private data. The sched_params for vCPU threads come from vm_config generated by config tools while other hypervisor threads need give them explicitly. Tracked-On: #8500 Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Qiang Zhang	c000a3f70b	hv: add clamp macro for convenience Add clamp macro to clamp a value within a range. Tracked-On: #8500 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-18 16:26:05 +08:00
Wu Zhou	9a6e940849	hv: signal_event after make_request make_request sets the request bit, and signal_event wakes the vcpu thread. If we signal_event comes first, the target vCPU has a chance to sleep again before processing the request bit. Tracked-On: #8507 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-15 11:52:40 +08:00
Wu Zhou	064be1e3e6	hv: support halt in hv idle When all vCPU threads on one pCPU are put to sleep (e.g., when all guests execute HLT), hv would schedule to idle thread. Currently the idle thread executes PAUSE which does not enter any c-state and consumes a lot of power. This patch is to support HLT in the idle thread. When we switch to HLT, we have to make sure events that would wake a vCPU must also be able to wake the pCPU. Those events are either generated by local interrupt or issued by other pCPUs followed by an ipi kick. Each of them have an interrupt involved, so they are also able to wake the halted pCPU. Except when the pCPU has just scheduled to idle thread but not yet halted, interrupts could be missed. sleep-------schedule to idle------IRQ ON---HLT--(kick missed) ^ wake---kick\| This areas should be protected. This is done by a safe halt mechanism leveraging STI instruction’s delay effect (same as Linux). vCPUs with lapic_pt or hv with CONFIG_KEEP_IRQ_DISABLED=y does not allow interrupts in root mode, so they could never wake from HLT (INIT kick does not wake HLT in root mode either). They should continue using PAUSE in idle. Tracked-On: #8507 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-15 11:52:40 +08:00
Wu Zhou	64d999e703	hv: switch to dynamic timer in bvt scheduler When bvt scheduler picks up a thread to run, it sets up a counter ‘run_countdown’ to determine how many ticks it should remain running. Then the timer will decrease run_countdown by 1 on every 1000Hz tick interrupt, until it reaches 0. The tick interrupt consumes a lot of power during idle (if we are using HLT in idle thread). This patch is to switch the 1000 HZ timer to a dynamic one, which only interrupt on run_countdown expires. Tracked-On: #8507 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-13 08:30:27 +08:00
Yonghua Huang	252ba0b047	hv: allocate ivshmem shared memory from E820 ACRN boot fails when size of ivshmem device is configured to 512MB, this patch allocates this memory from E820 table instead of reserving in hypervisor. Tracked-On: #8502 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-12 13:52:48 +08:00
Yonghua Huang	791019edc5	hv: define a MACRO to indicate maximum memory size ~0UL is widely used to specify the maximum memory size when calling e820_alloc_memory(), this patch to define a MACRO for it to avoid using this magic number. Tracked-On: #8502 Signed-off-by: Yonghua Huang <yonghua.huang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-09-12 13:52:48 +08:00
Qiang Zhang	a4a73b5aac	HV: emulate dummy multi-function dev in Service VM For a pdev which allocated to prelaunched VM or owned by HV, we need to check whether it is a multifuction dev at function 0. If yes we have to emulate a dummy function dev in Service VM, otherwise the sub-function devices will be lost in guest OS pci probe process. Tracked-On: #8492 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Signed-off-by: Victor Sun <victor.sun@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	bf653d277b	HV: init one dev config with service vm config param When we do init_all_dev_config() in pci.c, the pdevs added to pci dev_config will be exposed to Service VM or passthru to prelauched VM. The original code would find service VM config in every pci pdev init loop, this is unnecessary and definitely impact performance. Here we generate Service VM config pointer with config tool so that init_one_dev_config() could refer service VM config directly. Tracked-On: #8491 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com> Signed-off-by: Victor Sun <victor.sun@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	deccb22ea8	hv: rename is_allocated_to_prelaunched_vm to allocate_to_prelaunched_vm Rename is_allocated_to_prelaunched_vm to allocate_to_prelaunched_vm as it not only checks whether the PCI device is allocated to a Pre-launched VM but also associate it with Pre-launched VM's dev_config. Tracked-On: #8491 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-11 16:13:16 +08:00
Qiang Zhang	aebc16e9e5	hv: fix Service VM EPT mapping upper bound On some platforms, the last e820 entry may not be of type E820_TYPE_RAM, such as E820_TYPE_ACPI_NVS which may also be used by Service VM. So we need take all e820 entry types into account when finding the upper bound of Service VM EPT mapping. Tracked-On: #8495 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>	2023-09-05 11:09:46 +08:00
Muhammad Qasim Abdul Majeed	a457e65619	doc: Fix spelling and typo mistakes. Tracked-On: #8488 Signed-off-by: Muhammad Qasim Abdul Majeed <qasim.majeed20@gmail.com>	2023-09-05 09:34:21 +08:00
Zhang Chen	c6eda313f9	hypervisor/ivshmem: Add check to prevent malicious BAR0 opts The ivshmem spec define the BAR0 offset > 16 are reserved. So ACRN need ignore all operation when offset out of range. Tracked-On: #8487 Signed-off-by: Zhang Chen <chen.zhang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-16 10:12:03 +08:00
Zhang Chen	45382dca4b	hypervisor/ivshmem: Add ivshmem_dev_lock to protect deinit Add the ivshmem_dev_lock to protect races between release and allocations. Tracked-On: ##8486 Signed-off-by: Zhang Chen <chen.zhang@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-16 10:12:03 +08:00
Zhang Chen	c9c0d2167b	hypervisor/ivshmem: Fix ivshmem ivposition loss issue ACRN sets the ivposition to the VM_ID in ivshmem_server_bind_peer(). This value should be saved in the ivshmem_device until unbind. It is wrong to clear ivs_dev->mmio in the ivshmem_vbar_map(), Instead, it should clear the ivshmem_device structure in the create_ivshmem_device to ensure the same initial states after VM reboot case. Tracked-On: #8485 Signed-off-by: Zhang Chen <chen.zhang@intel.com> Reviewed-by: Junjie Mao <Junjie.mao@intel.com>	2023-08-16 10:12:03 +08:00
Jiaqing Zhao	7bfbdf04b8	doc: remove '@return None' for void functions doxygen will warn that documented return type is found for functions that does not return anything in 1.9.4 or later versions. 'None' is not a special keyword in doxyge, it will recognize it as description to the return value that does not exist in void functions. Tracked-On: #8425 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-03 14:56:29 -07:00
Wu Zhou	8af2c263db	hv: disable HFI and ITD for guests The Hardware Feedback Interface (HFI) and Intel® Thread Director (ITD) features require OS to provide a physical page address to IA32_HW_FEEDBACK_PTR. Then the hardware will update the processor information to the page address. The issue is that guest VM will program its GPA to that MSR, causing great risk of tempering memory. So HFI and ITD should be made invisible to guests, until we provide proper virtulization of those features. Tracked-On: #8463 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-01 14:57:23 +08:00
Jiaqing Zhao	3cc1127ae9	hv: fix undefined reference to nested_vmexit_handler() in vmexit.c arch/x86/guest/nested.c, where nested_vmexit_handler() is defined, is only compiled when CONFIG_NVMX_ENABLED is enabled. Define a dummy function in include/arch/x86/asm/guest/nested.h to fix the undefined reference linker error. Tracked-On: #8465 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-08-01 14:22:14 +08:00
Wu Zhou	fa97e32917	hv: bugfix: skip invalid ffs64 return value ffs64() returns INVALID_BIT_INDEX (0xffffU) when it tries to deal with zero input value. This may happen In calculate_logical_dest_mask() when the guest tries to write some illegal destination IDs to MSI config registers of a pt-device. The ffs64() return value is used as per_cpu array index, and it would cause a page fault. This patch adds protection to the per_cpu array, making this function return zero on illegal value. As in logical destination's definition, a zero logical designation would point to no CPU. Fixes: `89d11d91e` Tracked-On: #8454 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-07-14 17:38:16 +08:00
Wu Zhou	89d11d91e2	hv: bugfix: fix the ptdev irq destination issue According to SDM Vol3 11.12.10, in x2APIC mode, Logical Destination has two parts: - Cluster ID (LDR[31:16]) - Logical ID (LDR[15:0]) Cluster ID is a numerical address, while Logical ID is a 16bit mask. We can only use Logical ID to address multi destinations within a Cluster. So we can't just 'or' all the Logical Destination in LDR registers to get one mask for all target pCPUs. This would get a wrong destination mask if the target Destinations are from different Clusters. For example in ADL/RPL x2APIC LDRs for core 2-5 are 0x10001 0x10100 0x20001 0x20100. If we 'or' them together, we would get a Logical Destination of 0x30101, which points to core 6 and another core. If core 6 is running a RTVM, then the irq is unable to get to core 2-5, causing the guest on core 2-5 driver fail. Guests working in xAPIC mode may use 'Flat Model' to select an arbitrary list of CPUs as its irq destination. HV may not be able to include them all when transfering to physical destinations, because the HW is working in x2APIC mode and can only use 'Cluster Model'. There would be no perfect fix for this issue. This patch is a simple fix, by just keep the first Cluster of all target Logical Destinations. Tracked-On: #8435 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-07-05 17:41:16 +08:00
Jiaqing Zhao	e5d46dcc7d	hv: vpci: ignore PCI I/O BAR with non-zero upper 16 bits On x86 platform, the upper 16 bit of I/O BAR should be initialized to zero by BIOS. Howerever, some buggy BIOS still programs the upper 16 bits to non-zero, which causes error in check_pt_dev_pio_bars(). Since I/O BAR reprogramming by VM is currently unsupported, this patch ignores such I/O BARs when creating vpci devices to make VM boot. Tracked-On: #8373 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-26 14:40:57 +08:00
Jiaqing Zhao	e92320cf56	hv: allow non-service vm to read MSR_PLATFORM_INFO (CEh) Guests bootloader may read MSR_PLATFORM_INFO to get TSC frequency for time measurement, so injecting #GP on read may crash the vm on boot. This patch emulates MSR_PLATFORM_INFO with 0, same behavior in kvm, to tell the guest it's a virtualized environment. Tracked-On: #8406 Reviewed-by: Junjie Mao <junjie.mao@intel.com> Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-06-19 17:45:11 +08:00
Wu Zhou	db83648a8d	hv: hide thermal interface from guests Thermal events are delivered through lapic thermal LVT. Currently ACRN does not support delivering those interrupts to guests by virtual lapic. They need to be virtualized to provide guests some thermal management abilities. Currently we just hide thermal lvt from guests, including: 1. Thermal LVT: There is no way to hide thermal LVT from guests. But we need do something to make sure no interrupt can be actually trigered: - skip thermal LVT in vlapic_trigger_lvt() - trap-and-emulate thermal LVT in lapic-pt mode 2. As We have plan to introduce virtualization of thermal monitor in the future, we use a vm flag GUEST_FLAG_VTM which is default 0 to control the access to it. So that it can help enabling VTM in the future. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-15 20:36:44 +08:00
Wu Zhou	972cdeb318	hv: Write _CPC to guests' ACPI when VHWP is enabled When VHWP enabled, return 0 and px_cnt = 0 on ACRN_PMCMD_GET_PX_CNT, so that DM will write _CPC to guests' ACPI. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Wu Zhou	c5d019b836	hv: emulate cpuids and MSRs for VHWP Changes made by this patch includes: 1. Emulate HWP and pstate MSRs/CPUIDs. Those are exposed to guest when the GUEST_FLAG_VHWP is set: - CPUID[6].EAX[7,9,10]: MSR_IA32_PM_ENABLE(enabled by hv, always read 1), MSR_IA32_HWP_CAPABILITIES, MSR_IA32_HWP_REQUEST, MSR_IA32_HWP_STATUS, - CPUID[6].ECX[0]: MSR_IA32_MPERF, MSR_IA32_APERF - MSR_IA32_PERF_STATUS(read as base frequency when not owning pCPU) - MSR_IA32_PERF_CTL(ignore writes) 2. Always hide HWP interrupt and package control MSRs/CPUIDs: - CPUID[6].EAX[8]: MSR_IA32_HWP_INTERRUPT(currently ACRN is not able to deliver thermal LVT virtual interrupt to guests) - CPUID[6].EAX[11,22]: MSR_IA32_HWP_REQUEST_PKG, MSR_IA32_HWP_CTL Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Wu Zhou	2edf141047	hv: add VHWP guest flag and its helper func Currently CPU frequency control is hidden to guests, and controlled by hypervisor. While it is sufficient in most cases, some guest OS may still need CPU performance info to make multi-core scheduling decisions. This is seen on Linux kernel, which uses HWP highest performance level as CPU core's priority in multi-core scheduling (CONFIG_SCHED_MC_PRIO). Enabling this kernel feature could improve performance as single thread workloads are scheduled on the highest performance cores. This is significantly useful for guests with hybrid cores. The concept is to expose performance interface to guest who exclusively owns pCPU assigned to it. So that Linux guest can load intel_pstate driver which will then provide the kernel with each core's schedule priority. Intel_pstate driver also relies on CONFIG_ACPI_CPPC_LIB to implement this mechanic, this means we also need to provide ACPI _CPC in DM. This patch sets up a guest flag GUEST_FLAG_VHWP to indicate whether the guest can have VHWP feature. Tracked-On: #8414 Signed-off-by: Wu Zhou <wu.zhou@intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com>	2023-06-09 10:06:42 +08:00
Jiaqing Zhao	e4e25f076c	hv: sgx: refactor partition_epc() This patch refactors partition_epc() to make the code easier to understand, also fixes the maybe-uninitialized warning for gcc-13. Initializing 'vm_config' to get_vm_config(0) is okay here as scenario validator ensures CONFIG_MAX_VM_NUM to be always larger than 0. Tracked-On: #8413 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-06-06 15:22:19 +08:00
Jiaqing Zhao	770cf8c434	hv: emulate MSR_PLATFORM_INFO (17h) This patch emulates the PLATFORM_INFO MSR in hypervisor to make it only visible to Service VM, and only processor ratios (bit 15:8, 47:40 and 55:48) and sample part bit (27) are exponsed. This is intended to prevent Service VM from changing processor parameters like turbo ratio. Tracked-On: #8406 Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>	2023-05-30 15:10:05 +08:00
Qiang Zhang	fcb8e9bb2d	ptirq: Fix INTx assignment for Post-launched VM When assigning a physical interrupt to a Post-launched VM, if it has been assigned to ServiceVM, we should remove that mapping first to reset ioapic pin state and rte, and build new mapping for the Post-launched VM. Tracked-On: #8370 Signed-off-by: Qiang Zhang <qiang4.zhang@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2023-04-13 12:24:57 +08:00
Qiang Zhang	bf9341844a	ptirq: Fix ptirq hash tables ptirq_remapping_info records which physical interrupt is mapped to the virtual interrupt in a VM. As we need to knonw whether a physical sid has been mapped to a VM and whether a virtual sid in a VM has been used, there should be two hash tables to link and iterate ptirq_remapping_info: - One is used to lookup from physical sid, linking phys_link. - The other is used to lookup from virtual sid in a VM, linking virt_link Without this patch, phys_link or virt_link from different ptirq_remapping_info was linked by one hash list head if they got the same hash value, as shown in following diagram. When looking for a ptirq_remapping_info from physical sid, the original code took all hash list node as phys_link and failed to get ptirq_remapping_info linked with virt_link and later references to its members are wrong. The same problem also occurred when looking for a ptirq_remapping_info from virtual sid and vm. ---------- <- hash table \|hlist_head\| --actual ptirq_remapping_info address ---------- --------- / --used as ptirq_remapping_info \| ... \| \|phys_link\| ___/ ---------- --------- --------- / --------- \|hlist_head\| -> \|phys_link\| <-> \|virt_link\| <-> \|phys_link\| ---------- --------- --------- --------- \| ... \| \|virt_link\| \| other \| \|virt_link\| ---------- --------- \| members \| --------- \| other \| --------- \| other \| \| members \| \| members \| --------- --------- Tracked-On: #8370 Signed-off-by: Qiang Zhang <qiang4.zhang@linux.intel.com> Reviewed-by: Junjie Mao <junjie.mao@intel.com> Acked-by: Eddie Dong <eddie.dong@intel.com>	2023-04-13 12:24:57 +08:00
hangliu1	bb2118d98b	hv: support for pci uart with high mmio to enable early print output by pci uart, which has mmio address above 4G, add the early pagetable map for the MMIO range. we make an assumption that only map it with 2MB page, since platform with 39bit memory width have no 1G huge page feature and we cannot get the capacity at runtime, since it is a very early code stage. v2->v1: 1. add hva2hpa_early 2. add process when *pdpte is not present Signed-off-by: hangliu1 <hang1.liu@linux.intel.com> Reviewed-by: fei1.li <fei1.li@intel.com> Tracked-On: #6690	2022-11-21 16:50:05 +08:00

1 2 3 4 5 ...

3447 Commits