Move phys_cpu_num and pcpu_active_bitmap to common code, so that they can
only be accessed through the interfaces provided by smp.h.
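A minimal sketch of the accessor style this implies (the data stays private
to common code; names other than phys_cpu_num/pcpu_active_bitmap are
illustrative, not necessarily the actual smp.h contents):

#include <stdbool.h>
#include <stdint.h>

static uint16_t phys_cpu_num;        /* owned by common SMP code */
static uint64_t pcpu_active_bitmap;  /* one bit per physical CPU */

uint16_t get_pcpu_nums(void)
{
	return phys_cpu_num;
}

bool is_pcpu_active(uint16_t pcpu_id)
{
	return ((pcpu_active_bitmap & (1UL << pcpu_id)) != 0UL);
}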
v2->v3:
1. move ALL_CPUS_MASK/AP_MASK to common cpu.h
v1->v2:
1. preserve phys_cpu_num in x86 but implement arch_get_num_available_cpus()
as the interface for common code to access it.
2. change function name test_xx to check_xx
Tracked-On: #8801
Signed-off-by: hangliu1 <hang1.liu@intel.com>
Reviewed-by: Wang, Yu1 <yu1.wang@intel.com>
Reviewed-by: Liu, Yifan1 <yifan1.liu@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
Move x86 architecture-dependent per-cpu data into a
separate structure and embed it in per_cpu_region.
Callers can access the architecture-dependent members
using the 'arch.' prefix.
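For illustration, a hedged sketch of the resulting layout (the members shown
are examples only, not the full definitions):

#include <stdint.h>

struct per_cpu_x86 {                /* x86-only per-cpu data */
	uint64_t tsc_suspend;
	/* ... whose_iwkey, profiling_info, ... */
};

struct per_cpu_region {
	uint64_t softirq_pending;   /* arch-independent members stay here */
	/* ... */
	struct per_cpu_x86 arch;    /* embedded arch-dependent members */
};

/* callers then use e.g. per_cpu(arch, pcpu_id).tsc_suspend */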
v2->v3:
move whose_iwkey, profiling_info and tsc_suspend to x86
v1->v2:
rebased on latest repo
Tracked-On: #8801
Signed-off-by: hangliu1 <hang1.liu@intel.com>
Reviewed-by: Wang, Yu1 <yu1.wang@intel.com>
Reviewed-by: Liu, Yifan1 <yifan1.liu@intel.com>
Reviewed-by: Chen, Jian Jun<jian.jun.chen@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
Since there is no common IPI abstraction, the arch_ prefix is redundant.
This patch renames the functions as follows:
- arch_send_dest_ipi_mask -> send_dest_ipi_mask
- arch_send_single_ipi -> send_single_ipi
Tracked-On: #8799
Signed-off-by: Shiqing Gao <shiqing.gao@intel.com>
For arch-specific code, we use arch_xxx() to name the functions.
So, rename cpu_ticks/cpu_tickrate/set_hw_timeout/init_hw_timer
to follow this convention. Then, use the arch interface to set the timeout
value in update_physical_timer().
Furthermore, remove hw_timer.h and move its contents into common/
timer.h.
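For reference, a sketch of the renamed interface and how common code would
use it (prototypes here are indicative only):

#include <stdint.h>

uint64_t arch_cpu_ticks(void);               /* was cpu_ticks()      */
uint32_t arch_cpu_tickrate(void);            /* was cpu_tickrate()   */
void arch_init_hw_timer(void);               /* was init_hw_timer()  */
void arch_set_hw_timeout(uint64_t timeout);  /* was set_hw_timeout() */

/* in common timer code */
void update_physical_timer_sketch(uint64_t next_expire)
{
	/* program the hardware timer only through the arch interface */
	arch_set_hw_timeout(next_expire);
}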
Tracked-On: #8792
Signed-off-by: Yi Y Sun <yi.y.sun@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
This patch:
- abstracts the common logic from the existing x86 implementation
- moves x86-specific logic to arch/x86/notify.c
A new common/notify.{c,h} is introduced to provide a common SMP call framework
for multi-arch support in ACRN.
Arch-specific files such as arch/{x86,riscv}/notify.c are meant to provide the
corresponding implementations.
The framework provides the following common APIs:
- init_smp_call(): initialize the SMP call support during pCPU initialization
- handle_smp_call(): execute the SMP call notification handler
- smp_call_function(): trigger the SMP call request to target pCPUs
Other SW modules should invoke these common APIs to perform arch-independent
SMP operations (a short sketch of this flow follows the hook list below).
Two arch-specific hooks are abstracted:
- arch_smp_call_kick_pcpu():
- On x86, special handling is required when LAPIC is passthrough.
- On RISC-V, a plain IPI is sufficient to kick the target pCPU.
- arch_init_smp_call():
- On x86, CPU initialization reserves dedicated vectors and
registers callback handlers for purposes such as notifications
or posted interrupts.
- On RISC-V, no special handling is required at present; this
can be extended in the future if needed.
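The sketch below shows how these pieces fit together; the bodies are
placeholders rather than the actual common/notify.c code:

#include <stdint.h>

/* arch hooks, implemented in arch/x86/notify.c or arch/riscv/notify.c;
 * stubbed here only to keep the sketch self-contained */
void arch_init_smp_call(void) { /* reserve vectors, register handlers (x86) */ }
void arch_smp_call_kick_pcpu(uint16_t pcpu_id) { (void)pcpu_id; /* send IPI */ }

void init_smp_call(void)
{
	/* called during pCPU initialization */
	arch_init_smp_call();
}

void smp_call_function(uint64_t dest_mask, void (*func)(void *), void *data)
{
	uint16_t pcpu_id;

	(void)func;
	(void)data;  /* in the real code these are stored per target pCPU */
	for (pcpu_id = 0U; pcpu_id < 64U; pcpu_id++) {
		if ((dest_mask & (1UL << pcpu_id)) != 0UL) {
			arch_smp_call_kick_pcpu(pcpu_id);
		}
	}
	/* then wait until every target has run handle_smp_call() */
}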
----------
Changelog:
* Merged the following two patches into one:
[RFC PATCH v2 4/7] hv: introduce common/smp.{c,h}
[RFC PATCH v2 5/7] hv: smpcall: x86: adapt to common SMP call
Tracked-On: #8786
Signed-off-by: Shiqing Gao <shiqing.gao@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
Rename send_single_ipi() and send_dest_ipi_mask() to
arch_send_single_ipi() and arch_send_dest_ipi_mask() in x86, to make the
naming consistent with the RISC-V implementation and reflect that these
functions are arch-specific.
Tracked-On: #8786
Signed-off-by: Shiqing Gao <shiqing.gao@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
Align the prototype of send_dest_ipi_mask() on x86 with the RISC-V
definition. dest_mask is updated from uint32_t to uint64_t:
From: void send_dest_ipi_mask(uint32_t dest_mask, uint32_t vector)
To: void send_dest_ipi_mask(uint64_t dest_mask, uint32_t vector)
On RISC-V, send_dest_ipi_mask() is implemented using SBI interfaces,
where the dest_mask is defined as "unsigned long" in the SBI spec.
Tracked-On: #8786
Signed-off-by: Shiqing Gao <shiqing.gao@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
Currently ACRN supports only the x86 architecture, and this patch is the
first of a series of patches to enable ACRN on multiple architectures.
This commit makes the Makefile multi-arch ready: x86-specific content is
moved into arch/x86/Makefile.
This includes:
- Pre-launched VM ACPI binary generation
- acrn.32.out generation (the 32-bit ELF is generated for the i386
architecture)
- Customized modularization (*_MOD). Only one module is created in the
common section: COMMON_MOD
- Architecture-specific make targets and pre-build actions are moved to the
architecture-specific makefile. Introduce the following variables to
register arch targets and/or actions:
- ARCH_PRE_BUILD_TARGETS
- ARCH_ALL_TARGETS
- ARCH_INSTALL_TARGETS
Tracked-On: #8782
Signed-off-by: Yifan Liu <yifan1.liu@intel.com>
Acked-by: Wang, Yu1 <yu1.wang@intel.com>
Mark the hypervisor memory region as unusable in its own e820 table so that
it cannot be overlapped by allocations from e820_alloc_memory(). As it is
already filtered out of the hypervisor e820 table, there is no longer a need
to filter it out of the Service VM e820.
Tracked-On: #8738
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
Reviewed-by: Fei Li <fei1.li@intel.com>
The hypervisor image size is determined at link time, but currently it is
calculated and stored in a global variable during MMU initialization,
and the helper function reads from that variable. Change it to be calculated
inside the helper function to avoid inconsistency.
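A sketch of the idea, using hypothetical linker-script symbol names:

#include <stdint.h>

extern uint8_t ld_image_start[];  /* provided by the linker script */
extern uint8_t ld_image_end[];

static inline uint64_t get_hv_image_size(void)
{
	/* derived from link-time symbols on every call, so it can never be
	 * stale or depend on MMU initialization ordering */
	return (uint64_t)(ld_image_end - ld_image_start);
}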
Tracked-On: #8738
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
Reviewed-by: Fei Li <fei1.li@intel.com>
The VM-exit instruction length (VMX_EXIT_INSTR_LEN) in the VMCS is undefined
on EPT violations, except during delivery of a software interrupt,
privileged software exception, or software exception[1]. Although the CPU
is likely to set the field, it can be incorrect in certain cases, such
as cmp+jcc and test+jcc.
Since the hypervisor does not know exactly how many bytes are needed, and GVA
translation is costly, it first copies at most 15 (VIE_INST_SIZE) bytes
within the page, then decodes the instruction. If more bytes are needed
during decoding and the copied length is less than 15, it copies the
remaining bytes.
[1] 29.2.5, https://cdrdv2-public.intel.com/671200/325462-sdm-vol-1-2abcd-3abcd.pdf
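A hedged sketch of that two-step fetch; the copy/decode helpers are
placeholders, not the actual instruction-emulation code:

#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define VIE_INST_SIZE 15U

/* placeholder prototypes for the GVA copy and the decoder query */
size_t copy_from_gva_within_page(uint64_t gva, uint8_t *buf, size_t max);
bool decode_needs_more(const uint8_t *buf, size_t len, size_t *needed);

void fetch_and_decode_sketch(uint64_t rip)
{
	uint8_t inst[VIE_INST_SIZE];
	size_t copied, needed;

	/* step 1: copy at most 15 bytes, without crossing the page boundary */
	copied = copy_from_gva_within_page(rip, inst, VIE_INST_SIZE);

	/* step 2: decode; only if the instruction is longer than what was
	 * copied do we pay for another GVA translation to fetch the rest */
	if (decode_needs_more(inst, copied, &needed) &&
	    (copied < VIE_INST_SIZE)) {
		(void)copy_from_gva_within_page(rip + copied, inst + copied,
						needed - copied);
	}
}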
Tracked-On: #8756
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
The Access Size field in the ACPI GAS was not introduced until ACPI 2.0
Errata C, so it is not guaranteed to be non-zero; QEMU, for example,
programs it to 0. As it only indicates how many bytes can be
accessed at once, the register size should be determined by Bit Width
and Bit Offset. In I/O space, Bit Offset is always 0, so the size is
(Bit Width / 8).
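For illustration, the size derivation could look like this (the structure
mirrors the ACPI Generic Address Structure layout; the names are ours):

#include <stdint.h>

struct acpi_gas_sketch {
	uint8_t  space_id;     /* 0x01 = System I/O */
	uint8_t  bit_width;
	uint8_t  bit_offset;
	uint8_t  access_size;  /* may legally be 0 (pre-ACPI 2.0 Errata C) */
	uint64_t address;
};

static inline uint8_t io_reg_size(const struct acpi_gas_sketch *gas)
{
	/* in I/O space Bit Offset is always 0, so the register spans
	 * Bit Width / 8 bytes regardless of Access Size */
	return (uint8_t)(gas->bit_width >> 3U);
}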
Tracked-On: #8771
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@intel.com>
Reviewed-by: Li Fei <fei1.li@intel.com>
In the current code flow, the hyperv data in struct vm_arch is never cleared
during VM shutdown and is retained for the next VM launch. As the enable
bit of the hypercall_page MSR is not cleared, the stale hypercall page might
cause fatal errors such as a Windows VM BSOD during VM restart and memory
remapping. A hyperv page destroy function ensures the hyperv page is
destroyed during each VM shutdown, so that hyperv-related configuration such
as the hypercall page is established correctly during each VM launch.
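A minimal sketch of the shutdown-time cleanup, with a trimmed-down,
hypothetical hyperv state for illustration:

#include <stdint.h>
#include <string.h>

struct hyperv_state_sketch {
	uint64_t hypercall_page_msr;  /* bit 0 = enabled */
	uint64_t guest_os_id_msr;
};

/* called from VM shutdown: wipe the state so the enable bit (and any other
 * stale configuration) cannot leak into the next launch */
void hyperv_page_destroy_sketch(struct hyperv_state_sketch *hv)
{
	(void)memset(hv, 0, sizeof(*hv));
}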
Tracked-On: #8755
Signed-off-by: Yichong Tang <yichong.tang@intel.com>
Add reset_control to acrn_vm. Use this reset_control to simulate the
RESET_CONTROL (0xCF9) register in the hypervisor.
Tracked-On: #8724
Signed-off-by: Yuan Lu <yuan.y.lu@intel.com>
Reviewed-by: Fei Li <fei1.li@intel.com>
The Service VM may write 0x6 to port 0xcf9 to trigger a warm reset, but the
current hypervisor always performs a cold reset by writing 0xE to CF9.
The hypervisor should reboot the system in the same mode as the Service VM
specified. Specific OS features (like Linux pstore) require a warm
reset to keep data across reboots.
The behavior of hv console's reboot command (cold reset) remains
unchanged.
Tracked-On: #8539
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Per the SDM, the VPDPBUSD/VPDPBUSDS/VPDPWSSD/VPDPWSSDS instructions depend on
the CPUID feature flags 'AVX-VNNI, AVX512_VNNI, AVX512VL'. 'AVX512_VNNI' and
'AVX512VL' are already exposed to every VM.
'AVX-VNNI' is in CPUID.(EAX=07H,ECX=1):EAX.AVX-VNNI[bit 4]. This patch
exposes all of the CPUID.EAX=07H subleaf features to VMs.
Mask the corresponding bits if some features need to be disabled in the future.
Tracked-On: #8710
Reviewed-by: Fei Li <fei1.li@intel.com>
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
Remove the unreachable code branch at line 163:
if CR0.WP is enabled, a supervisor-mode write to a read-only page has
already been checked at line 109.
Merge the redundant check:
if SMAP is enabled, supervisor mode can't access user-mode addresses
when EFLAGS.AC is disabled.
Tracked-On: #8708
Signed-off-by: Haoyu Tang <haoyu.tang@intel.com>
Some hypercalls return -ENODEV, which should be set into RAX as the return
value, e.g. HC_ASSIGN_PCIDEV. So, remove the check in
vmcall_vmexit_handler() and change the return value to -EACCESS if the
hypercall is not issued from the Service VM or an allowed VM.
Tracked-On: #8598
Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>
This patch fixes the following testability issue identified by the dynamic
module test:
global variables defined at function scope cannot be referenced outside
the function, making it impossible to check the return values of these
functions.
Tracked-On: #861
Signed-off-by: Chen, Jinshi <jinshi.chen@intel.com>
A guest VM, such as Linux, may read the RESET_CONTROL (0xCF9) register
before writing to it; in this case, ACRN should not always return a
dummy value.
Tracked-On: #8688
Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
GAI Tooling Notice: These contents may have been developed with support from one
or more generative artificial intelligence solutions.
The ACRN hypervisor is decomposed into a series of components and modules. The
module design in the hypervisor is to add inline doxygen-style comments above
functions, macros, structures, etc.
This patch adds comments for some elements of the hwmgmt_page module.
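As an illustration of the comment style (the function shown is a generic
example, not necessarily one of the documented hwmgmt_page elements):

struct page;
struct page_pool;

/**
 * @brief Allocate one page from the given page pool.
 *
 * @param[inout] pool The page pool to allocate from.
 *
 * @return A pointer to the allocated page, or NULL if the pool is exhausted.
 *
 * @pre pool != NULL
 */
struct page *alloc_page(struct page_pool *pool);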
Tracked-On: #8665
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
GUEST_FLAG_STATELESS indicates that the guest is running a stateless operating
system and can be shut down forcefully without data loss. This flag
is only applicable to pre-launched VMs. For the TEE_VM, this flag will be
set implicitly.
Tracked-On: #8671
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Multiboot module memory is now already reserved from e820 in the function
`alloc_mods_memory()`, and the Service VM will not corrupt pre-launched VM
modules.
So remove the code for Service VM delayed loading.
Tracked-On: #8652
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
This patch allows users to pin the vUART timer to a specific pCPU via the ACRN
config tool. Users can configure it by setting "vUART timer pCPU ID" under
Hypervisor->Advanced Parameters.
Tracked-On: #8648
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
This patch fixes the following error:
error: variable 'sid' is used uninitialized whenever 'if' condition is true
[-Werror,-Wsometimes-uninitialized]
Tracked-On: #861
Signed-off-by: Gao, Shiqing <shiqing.gao@intel.com>
1. Enable the Service VM to power off or restart the whole platform even when an RTVM is running.
2. Allow the Service VM to stop an RTVM using the acrnctl tool with the "stop -f" option.
3. Add a 'Service VM supervisor role enabled' option in the ACRN configurator.
Tracked-On: #8618
Signed-off-by: YuanXin-Intel <xin.yuan@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Reviewed-by: Jian Jun Chen <jian.jun.chen@intel.com>
When resuming from S3, the Service VM OS will hang because the timer
interrupt on the BSP is not triggered: the hypervisor won't update the
physical timer because there are expired timers on the pCPU timer list.
Add suspend and resume ops for modules that use timers.
This patch covers only the Service VM OS. Support for User VMs will be added
in the future.
Tracked-On: #8623
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
Currently only the BSP is reset. After the Service VM OS resumes from S3,
the APs' apic_base_msr values are incorrect, with the x2APIC bit still enabled.
To avoid these incorrect states, do `reset_vm` after resume.
Tracked-On: #8623
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
After the Service VM OS resumes from S3, the BSP starts the APs asynchronously,
followed by IPIs to the APs to resume the TSC. This process takes place in
the function `host_enter_s3`. However, the APs' LAPICs are not yet ready to
accept IPIs, so the BSP fails to resume the TSC.
So enable the LAPIC earlier to make sure that the APs are ready.
Tracked-On: #8623
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
In the current implementation, if there are multiple contiguous 4k-aligned
modules, 0-sized e820 entries will be created between these regions.
And for non-4k-aligned modules, when two of them are located in one
page, the second memory range will not be reserved, as it no longer lies
within one e820 entry after the first is reserved, leaving it vulnerable.
This patch fixes it by first marking the exact memory range of the multiboot
modules as unusable, then shrinking the e820 entries to page
boundaries. If a module crosses multiple e820 entries, possibly due
to a buggy bootloader, the hypervisor panics immediately to prevent the
modules from getting corrupted.
Tracked-On: #8617
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Some CPUID leaves return invalid values on hybrid platforms because of an
error in the pointer arithmetic. Add `(void *)` before
`cpu_cpuids.leaves`.
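For context, a standalone illustration (with a hypothetical struct) of the
pitfall: adding a byte offset to a typed pointer scales by the element size,
so the result points at the wrong entry. Casting to `(void *)` (byte-granular
arithmetic as a GCC extension) or to a byte pointer fixes it:

#include <stdint.h>
#include <stdio.h>

struct leaf_sketch {
	uint32_t leaf;
	uint32_t regs[4];
};

int main(void)
{
	struct leaf_sketch leaves[2] = { { .leaf = 0x1U }, { .leaf = 0x4U } };
	uint64_t byte_offset = sizeof(struct leaf_sketch);  /* entry 1, in bytes */

	/* correct: byte-granular arithmetic via a byte pointer */
	struct leaf_sketch *p =
		(struct leaf_sketch *)((uint8_t *)leaves + byte_offset);

	/* without the cast, `leaves + byte_offset` would advance by
	 * byte_offset * sizeof(struct leaf_sketch) bytes and overrun */
	printf("leaf = 0x%x\n", p->leaf);  /* prints 0x4 */
	return 0;
}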
Leaf 0x14 is used to report Intel Processor Trace Enumeration and varies
between P-cores and E-cores on hybrid platforms. So add it to
`hybrid_leaves`.
Tracked-On: #8608
Fixes: 59a8cc4c2 ("hv: cpuid: make leaf 0x4 per-cpu in hybrid architecture")
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
In the HV, cpuid uses the lower 32 bits of the rax/rbx/rcx/rdx registers to
pass parameters, but the software does not clear the upper 32 bits of these
registers. If the guest uses 64-bit variables to pass parameters to cpuid,
the guest reads rax/rbx/rcx/rdx, not eax/ebx/ecx/edx, so the stale values of
the upper 32 bits will affect the guest.
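A minimal sketch of the fix on the emulation write-back path (the helper name
and signature are illustrative):

#include <stdint.h>

/* hardware CPUID zero-extends EAX..EDX into RAX..RDX; the emulation path
 * should do the same instead of preserving stale upper halves */
static inline void cpuid_writeback_sketch(uint64_t *rax, uint64_t *rbx,
					  uint64_t *rcx, uint64_t *rdx,
					  uint32_t eax, uint32_t ebx,
					  uint32_t ecx, uint32_t edx)
{
	*rax = (uint64_t)eax;  /* assigning a u32 clears bits 63:32 */
	*rbx = (uint64_t)ebx;
	*rcx = (uint64_t)ecx;
	*rdx = (uint64_t)edx;
}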
Tracked-On: #8605
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Signed-off-by: andi6 <andi6@xiaomi.com>
P-cores and E-cores accessing leaves 0x2U/0x14U/0x16U/0x18U/0x1A/0x1C/0x80000006U
return different information on a hybrid architecture.
So add them to the per-cpu list on hybrid architectures and directly return
the physical values.
Note: 0x14U is hidden and returns 0.
Tracked-On: #8608
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
Leaf 0x6 returns thermal and power management information. In a
hybrid architecture, P-cores and E-cores report different information.
Add leaf 0x6 to the per-cpu list on hybrid architectures and handle the
specific cpuid access.
Tracked-On: #8608
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
Leaf 0x4 returns deterministic cache parameters for each level. In a
hybrid architecture, P-cores and E-cores have different cache
information.
Add leaf 0x4 to the per-cpu list on hybrid architectures and handle the
specific cpuid access.
Tracked-On: #8608
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
CPUID returns processor identification and feature information.
Different pCPUs may return different information; that is, the info is
per-cpu.
On a hybrid architecture, the set of per-cpu leaves differs from before. So
introduce a struct percpu_cpuids to describe the per-cpu leaves. struct
percpu_cpuids will consist of two parts: generic per-cpu leaves and
hybrid-related per-cpu leaves.
This patch only adds the generic per-cpu leaves.
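A hedged sketch of what such a per-cpu leaf table could look like (field
names and sizes are illustrative, not the actual definition):

#include <stdint.h>

#define MAX_PERCPU_LEAF_NUM 16U

struct vcpuid_entry_sketch {
	uint32_t leaf;
	uint32_t subleaf;
	uint32_t eax, ebx, ecx, edx;
};

struct percpu_cpuids_sketch {
	/* generic per-cpu leaves (e.g. 0xb) plus, on hybrid parts, the
	 * hybrid-related leaves added later in this series */
	uint32_t leaf_num;
	struct vcpuid_entry_sketch leaves[MAX_PERCPU_LEAF_NUM];
};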
Tracked-On: #8608
Signed-off-by: Haiwei Li <haiwei.li@intel.com>
CPUID leaf 1f is the preferred superset of leaf 0b. Currently ACRN exposes
leaf 0b, but leaf 1f is empty, so the two leaves mismatch, while
applications follow the SDM and check leaf 1f first.
Tracked-On: #8608
Signed-off-by: Xin Zhang <xin.x.zhang@intel.com>
This patch fetches the thermal LVT interrupt and propagates
it to the VM.
At this stage we support the case where there is only one VM
governing thermal, and we pass the hardware thermal interrupt to this VM.
First, we register the handler for the thermal LVT interrupt: its
vector is THERMAL_VECTOR and the handler is thermal_irq_handler().
Then, when a thermal interrupt occurs, it sets the SOFTIRQ_THERMAL bit
of softirq_pending. This bit triggers the thermal_softirq() function,
which injects the virtual thermal interrupt into the VM.
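A hedged sketch of that flow; the softirq bit value and the virtual interrupt
injection are placeholders:

#include <stdint.h>

#define SOFTIRQ_THERMAL_SKETCH 7U  /* illustrative softirq bit */

static volatile uint64_t softirq_pending_sketch;

/* registered at init time as the handler for the thermal LVT vector */
void thermal_irq_handler_sketch(void)
{
	/* hard-irq context: only record that a thermal event is pending */
	softirq_pending_sketch |= (1UL << SOFTIRQ_THERMAL_SKETCH);
}

void thermal_softirq_sketch(void)
{
	if ((softirq_pending_sketch & (1UL << SOFTIRQ_THERMAL_SKETCH)) != 0UL) {
		softirq_pending_sketch &= ~(1UL << SOFTIRQ_THERMAL_SKETCH);
		/* inject the virtual thermal LVT interrupt into the single
		 * VM that governs thermal (lookup and vlapic call omitted) */
	}
}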
Tracked-On: #8595
Signed-off-by: Zhangwei6 <wei6.zhang@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
In this phase, we only use one VM to control thermal,
so we make the thermal MSRs readable and writable by this VM.
This VM is flagged with GUEST_FLAG_VTM and can
read/write the thermal MSRs.
VMs not flagged with GUEST_FLAG_VTM
can only read these thermal MSRs to get the current status.
Tracked-On: #8595
Signed-off-by: Zhangwei6 <wei6.zhang@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Leave the canary of the stack protector untouched on a pCPU
if it has already been initialized, instead of generating a new one.
Tracked-On: #8577
Signed-off-by: Yonghua Huang <yonghua.huang@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Reviewed-by: Fei Li <fei1.li@intel.com>
ppt_page_pool.bitmap should be zero-initialized. Also fix the wrong
indentation in allocate_ppt_pages().
Tracked-On: #8559
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Signed-off-by: Jiaqing Zhao <jiaqing.zhao@linux.intel.com>
In the triple fault handler, post-launched VMs are instantly powered
off. Now a vm event is generated at the same time, so that
developers can capture the event and decide what to do with it (e.g.,
logging and populating diagnostics, or powering off the VM).
Tracked-On: #8547
Signed-off-by: Wu Zhou <wu.zhou@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
This patch creates vm_event support in the HV, including:
1. Create the vm_event data type.
2. Add the vm_event sbuf and its initializer. The sbuf will be allocated by
the DM in the Service VM. Its page address will then be shared with the HV
through a hypercall.
3. Add an API to send HV-generated events.
Tracked-On: #8547
Signed-off-by: Wu Zhou <wu.zhou@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
Abstract out the scheduler config data for vCPU threads and other hypervisor
threads into a sched_params structure, which is used to initialize per-thread
scheduler private data. The sched_params for vCPU threads come
from the vm_config generated by the config tools, while other hypervisor
threads need to provide them explicitly.
Tracked-On: #8500
Signed-off-by: Qiang Zhang <qiang4.zhang@intel.com>
make_request sets the request bit, and signal_event wakes the vCPU
thread. If signal_event comes first, the target vCPU has a chance to
go back to sleep before processing the request bit.
Tracked-On: #8507
Signed-off-by: Wu Zhou <wu.zhou@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>
When all vCPU threads on one pCPU are put to sleep (e.g., when all
guests execute HLT), the hypervisor schedules to the idle thread. Currently
the idle thread executes PAUSE, which does not enter any C-state and consumes
a lot of power. This patch adds support for HLT in the idle thread.
When we switch to HLT, we have to make sure that events which would wake a
vCPU are also able to wake the pCPU. Those events are either
generated by a local interrupt or issued by other pCPUs followed by an
IPI kick.
Each of them has an interrupt involved, so they are also able to wake
the halted pCPU, except when the pCPU has just scheduled to the idle thread
but has not yet halted; in that window interrupts could be missed.
sleep-------schedule to idle------IRQ ON---HLT--(kick missed)
                                        ^
                             wake---kick|
This area must be protected. This is done by a safe-halt
mechanism leveraging the STI instruction's delay effect (same as Linux).
vCPUs with lapic_pt, or a hypervisor built with CONFIG_KEEP_IRQ_DISABLED=y,
do not allow interrupts in root mode, so they could never wake from HLT (an
INIT kick does not wake HLT in root mode either). They should continue using
PAUSE in idle.
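For illustration, a minimal sketch of the safe-halt idea and the PAUSE
fallback (inline assembly only, not the actual idle-thread code):

/* STI enables interrupts only after the following instruction, so no
 * interrupt can be delivered between STI and HLT: a kick that arrived while
 * interrupts were still off wakes the HLT instead of being missed. */
static inline void safe_halt_sketch(void)
{
	asm volatile ("sti; hlt" : : : "memory");
}

/* fallback for lapic_pt vCPUs or CONFIG_KEEP_IRQ_DISABLED=y, where
 * interrupts never arrive in root mode and HLT would never wake */
static inline void pause_idle_sketch(void)
{
	asm volatile ("pause" : : : "memory");
}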
Tracked-On: #8507
Signed-off-by: Wu Zhou <wu.zhou@intel.com>
Reviewed-by: Junjie Mao <junjie.mao@intel.com>