doc: pick doc updates for v2.6 release

Tracked-On: #5692
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
@@ -1,954 +0,0 @@
.. _APL_GVT-g-hld:

GVT-g High-Level Design
#######################

Introduction
************

Purpose of This Document
========================

This high-level design (HLD) document describes the usage requirements
and high-level design for Intel |reg| Graphics Virtualization Technology for
shared virtual :term:`GPU` technology (:term:`GVT-g`) on Apollo Lake-I
SoCs.

This document describes:

- The different GPU virtualization techniques
- GVT-g mediated passthrough
- High-level design
- Key components
- GVT-g new architecture differentiation

Audience
========

This document is for developers, validation teams, architects, and
maintainers of Intel |reg| GVT-g for the Apollo Lake SoCs.

The reader should have some familiarity with the basic concepts of
system virtualization and Intel processor graphics.

Reference Documents
===================

The following documents were used as references for this specification:

- Paper in USENIX ATC '14 - *Full GPU Virtualization Solution with
  Mediated Pass-Through* - https://www.usenix.org/node/183932

- Hardware Specification - PRMs -
  https://01.org/linuxgraphics/documentation/hardware-specification-prms

Background
**********

Intel GVT-g is an enabling technology in emerging graphics
virtualization scenarios. It adopts a full GPU virtualization approach
based on mediated passthrough technology to achieve good performance,
scalability, and secure isolation among Virtual Machines (VMs). A virtual
GPU (vGPU), with full GPU features, is presented to each VM so that a
native graphics driver can run directly inside a VM.

Intel GVT-g technology for Apollo Lake (APL) has been implemented in
open-source hypervisors or Virtual Machine Monitors (VMMs):

- Intel GVT-g for ACRN, also known as "AcrnGT"
- Intel GVT-g for KVM, also known as "KVMGT"
- Intel GVT-g for Xen, also known as "XenGT"

The core vGPU device model is released under the BSD/MIT dual license, so it
can be reused in other proprietary hypervisors.

Intel has a portfolio of graphics virtualization technologies
(:term:`GVT-g`, :term:`GVT-d`, and :term:`GVT-s`). GVT-d and GVT-s are
outside the scope of this document.

This HLD applies to the Apollo Lake platform only. Support of other
hardware is outside the scope of this HLD.

Targeted Usages
===============

The main targeted usage of GVT-g is in automotive applications, such as:

- An instrument cluster running in one domain
- An In-Vehicle Infotainment (IVI) solution running in another domain
- Additional domains for specific purposes, such as rear-seat
  entertainment or video camera capturing

.. figure:: images/APL_GVT-g-ive-use-case.png
   :width: 900px
   :align: center
   :name: ive-use-case

   IVE Use Case

Existing Techniques
===================

A graphics device is no different from any other I/O device with
respect to how the device I/O interface is virtualized. Therefore,
existing I/O virtualization techniques can be applied to graphics
virtualization. However, none of the existing techniques can meet the
general requirements of performance, scalability, and secure isolation
simultaneously. In this section, we review the pros and cons of each
technique in detail, enabling the audience to understand the rationale
behind the entire GVT-g effort.

Emulation
---------

A device can be emulated fully in software, including its I/O registers
and internal functional blocks. Because there is no dependency on the
underlying hardware capability, compatibility can be achieved
across platforms. However, due to the CPU emulation cost, this technique
is usually used only for legacy devices such as a keyboard, mouse, and VGA
card. Fully emulating a modern accelerator such as a GPU would involve great
complexity and extremely low performance. It may be acceptable
for use in a simulation environment, but it is definitely not suitable
for production usage.

API Forwarding
--------------

API forwarding, or a split driver model, is another widely used I/O
virtualization technology. It has been used in commercial virtualization
products such as VMware*, PCoIP*, and Microsoft* RemoteFX*.
It is a natural path when researchers study a new type of
I/O virtualization usage—for example, when GPGPU computing in a VM was
initially proposed. Intel GVT-s is based on this approach.

The architecture of API forwarding is shown in :numref:`api-forwarding`:

.. figure:: images/APL_GVT-g-api-forwarding.png
   :width: 400px
   :align: center
   :name: api-forwarding

   API Forwarding

A frontend driver is employed to forward high-level API calls (OpenGL,
DirectX, and so on) inside a VM to a backend driver in the Hypervisor
for acceleration. The backend may be using a different graphics stack,
so API translation between different graphics protocols may be required.
The backend driver allocates a physical GPU resource for each VM,
behaving like a normal graphics application in a Hypervisor. Shared
memory may be used to reduce memory copying between the host and guest
graphic stacks.

API forwarding can bring hardware acceleration capability into a VM,
with other merits such as vendor independence and high density. However, it
also suffers from the following intrinsic limitations:

- Lagging features - Every new API version must be specifically
  handled, which means slow time-to-market (TTM) to support new standards.
  For example, only DirectX9 is supported while DirectX11 is already in the
  market. Also, there is a big gap in supporting media and compute usages.

- Compatibility issues - A GPU is very complex, and consequently so are
  high-level graphics APIs. Different protocols are not 100% compatible
  on every subtly different API, so the customer can observe feature/quality
  loss for specific applications.

- Maintenance burden - Grows as the number of supported protocols and
  protocol versions increases.

- Performance overhead - Different API forwarding implementations
  exhibit quite different performance, which gives rise to a need for a
  fine-grained graphics tuning effort.

Direct Passthrough
------------------

"Direct passthrough" dedicates the GPU to a single VM, providing full
features and good performance at the cost of device sharing
capability among VMs. Only one VM at a time can use the hardware
acceleration capability of the GPU, which is a major limitation of this
technique. However, it is still a good approach for enabling graphics
virtualization usages on Intel server platforms, as an intermediate
solution. Intel GVT-d uses this mechanism.

.. figure:: images/APL_GVT-g-pass-through.png
   :width: 400px
   :align: center
   :name: gvt-pass-through

   Passthrough

SR-IOV
------

Single Root I/O Virtualization (SR-IOV) implements I/O virtualization
directly on a device. Multiple Virtual Functions (VFs) are implemented,
with each VF directly assignable to a VM.

.. _Graphic_mediation:

Mediated Passthrough
********************

Intel GVT-g achieves full GPU virtualization using a "mediated
passthrough" technique.

Concept
=======

Mediated passthrough enables a VM to access performance-critical I/O
resources (usually partitioned) directly, without intervention from the
hypervisor in most cases. Privileged operations from this VM are
trapped-and-emulated to provide secure isolation among VMs.

.. figure:: images/APL_GVT-g-mediated-pass-through.png
   :width: 400px
   :align: center
   :name: mediated-pass-through

   Mediated Passthrough

The Hypervisor must ensure that no vulnerability is exposed when
assigning performance-critical resources to each VM. When a
performance-critical resource cannot be partitioned, a scheduler must be
implemented (either in software or hardware) to enable time-based sharing
among multiple VMs. In this case, the device must allow the hypervisor
to save and restore the hardware state associated with the shared resource,
either through direct I/O register reads and writes (when there is no
software-invisible state) or through a device-specific context save and
restore mechanism (when there is software-invisible state).

Examples of performance-critical I/O resources include the following:

.. figure:: images/APL_GVT-g-perf-critical.png
   :width: 800px
   :align: center
   :name: perf-critical

   Performance-Critical I/O Resources

The key to implementing mediated passthrough for a specific device is
to define the right policy for various I/O resources.

Virtualization Policies for GPU Resources
=========================================

:numref:`graphics-arch` shows how Intel Processor Graphics works at a high level.
Software drivers write commands into a command buffer through the CPU.
The Render Engine in the GPU fetches these commands and executes them.
The Display Engine fetches pixel data from the Frame Buffer and sends
it to the external monitors for display.

.. figure:: images/APL_GVT-g-graphics-arch.png
   :width: 400px
   :align: center
   :name: graphics-arch

   Architecture of Intel Processor Graphics

This architecture abstraction applies to most modern GPUs, but may
differ in how graphics memory is implemented. Intel Processor Graphics
uses system memory as graphics memory. System memory can be mapped into
multiple virtual address spaces by GPU page tables. A 4 GB global
virtual address space called "global graphics memory", accessible from
both the GPU and CPU, is mapped through a global page table. Local
graphics memory spaces are supported in the form of multiple 4 GB local
virtual address spaces but are limited to access by the Render
Engine through local page tables. Global graphics memory is mostly used
for the Frame Buffer and also serves as the Command Buffer. Massive data
accesses are made to local graphics memory when hardware acceleration is
in progress. Other GPUs have a similar page table mechanism accompanying
the on-die memory.

The CPU programs the GPU through GPU-specific commands, shown in
:numref:`graphics-arch`, using a producer-consumer model. The graphics
driver programs GPU commands into the Command Buffer, including the primary
buffer and batch buffer, according to the high-level programming APIs
such as OpenGL* and DirectX*. Then, the GPU fetches and executes the
commands. The primary buffer (called a ring buffer) may chain other
batch buffers together; the terms "primary buffer" and "ring buffer" are
used interchangeably hereafter. The batch buffer is used to convey the
majority of the commands (up to ~98% of them) per programming model. A
register tuple (head, tail) is used to control the ring buffer. The CPU
submits the commands to the GPU by updating the tail, while the GPU
fetches commands from the head and then notifies the CPU by updating
the head after the commands have finished execution. Therefore, when
the GPU has executed all commands from the ring buffer, the head and
tail pointers are the same.
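
To make the head/tail protocol concrete, here is a minimal sketch of the
producer-consumer model in C. It is illustrative only; the register
offsets, structure, and function names are invented for this example and
are not the i915 implementation.

.. code-block:: c

   #include <stdint.h>

   #define RING_HEAD 0x04   /* hypothetical MMIO offsets, not real hardware */
   #define RING_TAIL 0x30

   struct ring {
       volatile uint32_t *regs;  /* mapped ring-control registers */
       uint32_t *buf;            /* ring-buffer memory, 'size' dwords */
       uint32_t size;            /* number of dwords, power of two */
   };

   /* CPU (producer): write commands behind the tail, then publish the tail. */
   static void ring_submit(struct ring *r, const uint32_t *cmds, uint32_t n)
   {
       uint32_t tail = r->regs[RING_TAIL / 4];

       for (uint32_t i = 0; i < n; i++)
           r->buf[(tail + i) & (r->size - 1)] = cmds[i];

       /* The GPU (consumer) starts fetching once it sees the tail move. */
       r->regs[RING_TAIL / 4] = (tail + n) & (r->size - 1);
   }

   /* All commands are executed when the GPU-updated head equals the tail. */
   static int ring_idle(struct ring *r)
   {
       return r->regs[RING_HEAD / 4] == r->regs[RING_TAIL / 4];
   }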

Having introduced the GPU architecture abstraction, it is important for
us to understand how real-world graphics applications use the GPU
hardware so that we can virtualize it in VMs efficiently. To do so, we
characterized the usage of the four critical interfaces for some
representative GPU-intensive 3D workloads (the Phoronix Test Suite):

1) the Frame Buffer,
2) the Command Buffer,
3) the GPU Page Table Entries (PTEs), which carry the GPU page tables, and
4) the I/O registers, including Memory-Mapped I/O (MMIO) registers,
   Port I/O (PIO) registers, and PCI configuration space registers
   for internal state.

:numref:`access-patterns` shows the average access frequency of running
Phoronix 3D workloads on the four interfaces.

The Frame Buffer and Command Buffer exhibit the most
performance-critical resources, as shown in :numref:`access-patterns`.
When the applications are being loaded, lots of source vertices and
pixels are written by the CPU, so the Frame Buffer accesses occur in the
range of hundreds of thousands per second. Then at run-time, the CPU
programs the GPU through the commands to render the Frame Buffer, so
the Command Buffer accesses become the largest group (also in the
hundreds of thousands per second). PTE and I/O accesses are minor in both
the load and run-time phases, ranging in the tens of thousands per second.

.. figure:: images/APL_GVT-g-access-patterns.png
   :width: 400px
   :align: center
   :name: access-patterns

   Access Patterns of Running 3D Workloads

High-Level Architecture
***********************

:numref:`gvt-arch` shows the overall architecture of GVT-g, based on the
ACRN hypervisor, with the Service VM as the privileged VM and multiple user
guests. A GVT-g device model working with the ACRN hypervisor
implements the policies of trap and passthrough. Each guest runs the
native graphics driver and can directly access performance-critical
resources: the Frame Buffer and Command Buffer, with resource
partitioning (as presented later). To protect privileged resources—that
is, the I/O registers and PTEs—corresponding accesses from the graphics
driver in user VMs are trapped and forwarded to the GVT device model in the
Service VM for emulation. The device model leverages i915 interfaces to access
the physical GPU.

In addition, the device model implements a GPU scheduler that runs
concurrently with the CPU scheduler in ACRN to share the physical GPU
timeslot among the VMs. GVT-g uses the physical GPU to directly execute
all the commands submitted from a VM, so it avoids the complexity of
emulating the Render Engine, which is the most complex part of the GPU.
In the meantime, the resource passthrough of both the Frame Buffer and
Command Buffer minimizes the hypervisor's intervention in CPU accesses,
while the GPU scheduler guarantees every VM a quantum time-slice for
direct GPU execution. With that, GVT-g can achieve near-native
performance for a VM workload.

In :numref:`gvt-arch`, the yellow GVT device model works as a client on
top of the i915 driver in the Service VM. It has a generic Mediated Passthrough
(MPT) interface, compatible with all types of hypervisors. For ACRN,
some extra development work is needed for such MPT interfaces. For
example, we need some changes in ACRN-DM to make ACRN compatible with
the MPT framework. The vGPU lifecycle is the same as the lifecycle of
the guest VM creation through ACRN-DM. They interact through sysfs,
exposed by the GVT device model.

.. figure:: images/APL_GVT-g-arch.png
   :width: 600px
   :align: center
   :name: gvt-arch

   AcrnGT High-Level Architecture

Key Techniques
**************

vGPU Device Model
=================

The vGPU device model is the main component because it constructs the
vGPU instance for each guest to satisfy every GPU request from the guest
and gives the corresponding result back to the guest.

The vGPU device model provides the basic framework to do
trap-and-emulation, including MMIO virtualization, interrupt
virtualization, and display virtualization. It also handles and
processes all the requests internally (such as command scan and shadow),
schedules them in the proper manner, and finally submits them to
the Service VM i915 driver.

.. figure:: images/APL_GVT-g-DM.png
   :width: 800px
   :align: center
   :name: GVT-DM

   GVT-g Device Model

MMIO Virtualization
-------------------

Intel Processor Graphics implements two PCI MMIO BARs:

- **GTTMMADR BAR**: Combines both the :term:`GGTT` modification range and the
  Memory-Mapped I/O range. It is 16 MB on :term:`BDW`, with 2 MB used by MMIO,
  6 MB reserved, and 8 MB allocated to the GGTT. The GGTT starts from
  :term:`GTTMMADR` + 8 MB. In this section, we focus on virtualization of
  the MMIO range, leaving discussion of GGTT virtualization for later.

- **GMADR BAR**: As the PCI aperture is used by the CPU to access tiled
  graphics memory, GVT-g partitions this aperture range among VMs for
  performance reasons.

A 2 MB virtual MMIO structure is allocated per vGPU instance.

All the virtual MMIO registers are emulated as simple in-memory
read-write; that is, the guest driver will read back the same value that was
programmed earlier. A common emulation handler (for example,
intel_gvt_emulate_read/write) is enough to handle such general
emulation requirements. However, some registers must be emulated with
specific logic—for example, registers affected by changes of other states or
requiring additional audit or translation when updating the virtual register.
Therefore, a specific emulation handler must be installed for those
special registers.
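
The split between the common handler and special-purpose handlers can be
pictured as a simple dispatch, sketched below. ``intel_gvt_emulate_read/write``
is named in the text above; the helper names and layout here are assumptions
of this sketch, not the actual GVT-g code.

.. code-block:: c

   #include <stdint.h>
   #include <string.h>

   struct vgpu;                                    /* per-vGPU instance */

   typedef int (*mmio_handler_t)(struct vgpu *v, uint32_t off,
                                 uint32_t *val, int is_write);

   extern uint8_t *vgpu_mmio(struct vgpu *v);           /* assumed: 2 MB block   */
   extern mmio_handler_t lookup_handler(uint32_t off);  /* assumed: sparse table */

   /* Dispatch a trapped 4-byte MMIO access from the guest. */
   static int emulate_mmio(struct vgpu *v, uint32_t off, uint32_t *val,
                           int is_write)
   {
       mmio_handler_t h = lookup_handler(off);

       if (h)        /* special register: side effects, audit, or translation */
           return h(v, off, val, is_write);

       /* Default: plain memory semantics; the guest reads back what it wrote. */
       if (is_write)
           memcpy(vgpu_mmio(v) + off, val, sizeof(*val));
       else
           memcpy(val, vgpu_mmio(v) + off, sizeof(*val));
       return 0;
   }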

The graphics driver may have assumptions about the initial device state,
which corresponds to the state left at the point when the BIOS transitions
to the OS. To meet the driver's expectation, we need to provide an initial
vGPU state that matches what a driver would observe on a pGPU. To do so,
the host graphics driver generates a snapshot of the physical GPU state
before the guest driver's initialization. This snapshot is used as the
initial vGPU state by the device model.

PCI Configuration Space Virtualization
--------------------------------------

The PCI configuration space also must be virtualized in the device
model. Different implementations may choose to implement the logic
within the vGPU device model or in the default system device model (for
example, ACRN-DM). GVT-g emulates the logic in the device model.

Some information is vital for the vGPU device model, including the
Guest PCI BAR, the Guest PCI MSI, and the Base of the ACPI OpRegion.

Legacy VGA Port I/O Virtualization
----------------------------------

Legacy VGA is not supported in the vGPU device model. We rely on the
default device model (for example, :term:`QEMU`) to provide legacy VGA
emulation, which means either ISA VGA emulation or
PCI VGA emulation.

Interrupt Virtualization
------------------------

The GVT device model does not touch the hardware interrupt in the new
architecture, since it is hard to combine the interrupt controlling
logic between the virtual device model and the host driver. To prevent
architectural changes in the host driver, the host GPU interrupt does
not go to the virtual device model, and the virtual device model has to
handle the GPU interrupt virtualization by itself. Virtual GPU
interrupts are categorized into three types:

- Periodic GPU interrupts, which are emulated by timers. A notable
  exception is the VBlank interrupt: because user-space compositors
  such as Wayland require a flip-done event to be synchronized with a
  VBlank, this interrupt is forwarded from the Service VM to the User VM
  when the Service VM receives it from the hardware.

- Event-based GPU interrupts, which are emulated by the emulation logic
  (for example, the AUX Channel Interrupt).

- GPU command interrupts, which are emulated by a command parser and
  workload dispatcher. The command parser marks out which GPU command
  interrupts are generated during the command execution, and the workload
  dispatcher injects those interrupts into the VM after the workload is
  finished.

.. figure:: images/APL_GVT-g-interrupt-virt.png
   :width: 400px
   :align: center
   :name: interrupt-virt

   Interrupt Virtualization

Workload Scheduler
------------------

The scheduling policy and workload scheduler are decoupled for
scalability reasons. For example, a future QoS enhancement will impact
only the scheduling policy, and any i915 interface change or hardware
submission interface change (from execlist to :term:`GuC`) will need only
workload scheduler updates.

The scheduling policy framework is the core of the vGPU workload
scheduling system. It controls all of the scheduling actions and
provides the developer with a generic framework for easy development of
scheduling policies. The scheduling policy framework controls the work
scheduling process without regard for how the workload is dispatched
or completed. All the detailed workload dispatching is hidden in the
workload scheduler, which is the actual executer of a vGPU workload.

The workload scheduler handles everything about one vGPU workload. Each
hardware ring is backed by one workload scheduler kernel thread. The
workload scheduler picks a workload from the current vGPU's workload queue
and communicates with the virtual hardware submission interface to emulate the
"schedule-in" status for the vGPU. It performs context shadow, Command
Buffer scan and shadow, and PPGTT page table pin/unpin/out-of-sync before
submitting this workload to the host i915 driver. When the vGPU workload
is completed, the workload scheduler asks the virtual hardware submission
interface to emulate the "schedule-out" status for the vGPU. The VM
graphics driver then knows that a GPU workload is finished.

.. figure:: images/APL_GVT-g-scheduling.png
   :width: 500px
   :align: center
   :name: scheduling

   GVT-g Scheduling Framework
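
The per-ring scheduler thread described above can be summarized in the
following sketch. All function names are illustrative stand-ins for the
corresponding GVT-g steps, not the actual kernel code.

.. code-block:: c

   struct workload;

   extern struct workload *pick_next_workload(int ring_id);  /* blocks if empty */
   extern void emulate_schedule_in(struct workload *w);      /* assumed helper  */
   extern int  shadow_and_submit(struct workload *w);        /* assumed helper  */
   extern void wait_for_completion(struct workload *w);      /* assumed helper  */
   extern void emulate_schedule_out(struct workload *w);     /* assumed helper  */

   /* One such loop runs per hardware ring. */
   static void workload_thread(int ring_id)
   {
       for (;;) {
           struct workload *w = pick_next_workload(ring_id);

           emulate_schedule_in(w);         /* vGPU sees "schedule-in"            */
           if (shadow_and_submit(w) == 0)  /* context shadow, command scan and    */
               wait_for_completion(w);     /* shadow, PPGTT pin, host i915 submit */
           emulate_schedule_out(w);        /* guest learns the workload finished  */
       }
   }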

Workload Submission Path
------------------------

Before Broadwell, software submitted workloads using the legacy ring buffer
mode on Intel Processor Graphics; this mode is no longer supported by the
GVT-g virtual device model. A new hardware submission interface named
"Execlist" was introduced starting with Broadwell. With the new hardware
submission interface, software can achieve better programmability and easier
context management. In Intel GVT-g, the vGPU submits the workload
through the virtual hardware submission interface. Each submitted workload
is represented as an ``intel_vgpu_workload`` data structure (a vGPU
workload), which is put on a per-vGPU and per-engine workload queue
after a few basic checks and verifications.

.. figure:: images/APL_GVT-g-workload.png
   :width: 800px
   :align: center
   :name: workload

   GVT-g Workload Submission
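
A trimmed illustration of the per-workload bookkeeping is shown below;
``intel_vgpu_workload`` is the real structure name mentioned above, but the
fields here are a simplified assumption, not the kernel definition.

.. code-block:: c

   #include <stdint.h>

   struct vgpu_workload_sketch {
       int      vgpu_id;           /* owning vGPU                          */
       int      engine_id;         /* target engine (render, blit, ...)    */
       uint64_t ctx_desc;          /* guest context descriptor from ELSP   */
       uint64_t ring_buffer_gma;   /* guest graphics address of commands   */
       uint32_t head, tail;        /* command range to scan and shadow     */
       int      shadowed;          /* set once the shadow copy is verified */
   };

   /* After the basic checks pass, the workload is queued per vGPU and per
    * engine, e.g.:  enqueue(&vgpu->queue[w->engine_id], w);  (assumed API) */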

Display Virtualization
----------------------

GVT-g reuses the i915 graphics driver in the Service VM to initialize the Display
Engine, and then manages the Display Engine to show different VM frame
buffers. When two vGPUs have the same resolution, only the frame buffer
locations are switched.

.. figure:: images/APL_GVT-g-display-virt.png
   :width: 800px
   :align: center
   :name: display-virt

   Display Virtualization

Direct Display Model
--------------------

.. figure:: images/APL_GVT-g-direct-display.png
   :width: 600px
   :align: center
   :name: direct-display

   Direct Display Model

In a typical automotive use case, there are two displays in the car,
and each one must show one domain's content, with the two domains
being the instrument cluster and the In-Vehicle Infotainment (IVI) system. As
shown in :numref:`direct-display`, this can be accomplished through the direct
display model of GVT-g, where the Service VM and User VM are each assigned all
hardware planes of two different pipes. GVT-g has a concept of display owner
on a per-hardware-plane basis. If it determines that a particular domain is the
owner of a hardware plane, then it allows the domain's MMIO register write to
flip a frame buffer to that plane to go through to the hardware. Otherwise,
such writes are blocked by GVT-g.

Indirect Display Model
----------------------

.. figure:: images/APL_GVT-g-indirect-display.png
   :width: 600px
   :align: center
   :name: indirect-display

   Indirect Display Model

For security or fast-boot reasons, the User VM may be
either not allowed to display its content directly on the hardware, or it may
boot too late to display its content in time. In such a
scenario, the responsibility of displaying content on all displays lies
with the Service VM. One of the use cases that can be realized is to display the
entire frame buffer of the User VM on a secondary display. GVT-g allows for this
model by first trapping all MMIO writes by the User VM to the hardware. A proxy
application can then capture the address in the GGTT where the User VM has written
its frame buffer and, with the help of the Hypervisor and the Service VM's i915
driver, convert the Guest Physical Addresses (GPAs) into Host
Physical Addresses (HPAs) before making a texture source or EGL image
out of the frame buffer and then either post-processing it further or
simply displaying it on a hardware plane of the secondary display.

GGTT-Based Surface Sharing
--------------------------

One of the major automotive use cases is called "surface sharing". This
use case requires that the Service VM access an individual surface or a set of
surfaces from the User VM without having to access the entire frame buffer of
the User VM. Unlike the previous two models, where the User VM did not have to do
anything to show its content and therefore a completely unmodified User VM
could continue to run, this model requires changes to the User VM.

This model can be considered an extension of the indirect display model.
Under the indirect display model, the User VM's frame buffer was temporarily
pinned by it in video memory, accessed through the global graphics
translation table. This GGTT-based surface sharing model takes this a
step further by having a compositor of the User VM temporarily pin all
application buffers into the GGTT. It then also requires the compositor to
create a metadata table with relevant surface information, such as width,
height, and GGTT offset, and flip that in lieu of the frame buffer.
In the Service VM, the proxy application knows that the GGTT offset has been
flipped, maps it, and through it can access the GGTT offset of an
application that it wants to access. It is worth mentioning that in this
model, User VM applications did not require any changes, and only the
compositor, Mesa, and the i915 driver had to be modified.
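
The metadata table flipped in lieu of a frame buffer can be as small as the
following sketch. The layout is an assumption for illustration; the actual
format is private to the modified compositor and proxy application.

.. code-block:: c

   #include <stdint.h>

   /* One entry per shared surface, written by the User VM compositor into a
    * GGTT-pinned page and read by the Service VM proxy application. */
   struct shared_surface {
       uint32_t width;         /* pixels */
       uint32_t height;        /* pixels */
       uint32_t stride;        /* bytes  */
       uint32_t format;        /* fourcc */
       uint64_t ggtt_offset;   /* where the pinned surface lives in the GGTT */
   };

   struct surface_table {
       uint32_t count;                 /* number of valid entries  */
       struct shared_surface surf[8];  /* illustrative fixed limit */
   };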

This model has a major benefit and a major limitation. The
benefit is that since it builds on top of the indirect display model,
there are no special drivers necessary for it on either the Service VM or the
User VM. Therefore, any Real-Time Operating System (RTOS) that uses
this model can simply do so without having to implement a driver, the
infrastructure for which may not be present in its operating system.
The limitation of this model is that the video memory dedicated to a User VM is
generally limited to a couple of hundred MBs. This can easily be
exhausted by a few application buffers, so the number and size of buffers
is limited. Since it is not a highly scalable model in general, Intel
recommends the Hyper DMA buffer sharing model, described next.

Hyper DMA Buffer Sharing
------------------------

.. figure:: images/APL_GVT-g-hyper-dma.png
   :width: 800px
   :align: center
   :name: hyper-dma

   Hyper DMA Buffer Design

Another approach to surface sharing is Hyper DMA Buffer sharing. This
model extends the Linux DMA buffer sharing mechanism, in which one driver is
able to share its pages with another driver within one domain.

Application buffers are backed by i915 Graphics Execution Manager
Buffer Objects (GEM BOs). As in GGTT surface
sharing, this model also requires compositor changes. The compositor of the
User VM requests i915 to export these application GEM BOs and then passes
them on to a special driver called the Hyper DMA Buf exporter, whose job
is to create a scatter-gather list of pages mapped by PDEs and PTEs and
export a Hyper DMA Buf ID back to the compositor.

The compositor then shares this Hyper DMA Buf ID with the Service VM's Hyper DMA
Buf importer driver, which maps the memory represented by this ID in
the Service VM. A proxy application in the Service VM can then provide this ID
to the Service VM i915, which can create its own GEM BO. Finally, the application
can use it as an EGL image and do any post-processing required before
either providing it to the Service VM compositor or directly flipping it on a
hardware plane in the compositor's absence.

This model is highly scalable and can be used to share up to 4 GB worth
of pages. It is also not limited to sharing graphics buffers; other
buffers, such as those for the IPU, can also be shared with it. However, it
does require that the Service VM port the Hyper DMA Buffer importer driver. Also,
the Service VM must comprehend and implement the DMA buffer sharing model.

For detailed information about this model, refer to the `Linux
HYPER_DMABUF Driver High Level Design
<https://github.com/downor/linux_hyper_dmabuf/blob/hyper_dmabuf_integration_v4/Documentation/hyper-dmabuf-sharing.txt>`_.

.. _plane_restriction:

Plane-Based Domain Ownership
----------------------------

.. figure:: images/APL_GVT-g-plane-based.png
   :width: 600px
   :align: center
   :name: plane-based

   Plane-Based Domain Ownership

Yet another mechanism for showing content of both the Service VM and User VM on the
same physical display is called plane-based domain ownership. Under this
model, both the Service VM and User VM are provided a set of hardware planes that they can
flip their contents onto. Since each domain provides its own content, there
is no need for any extra composition to be done through the Service VM. The display
controller handles alpha blending of the contents of different domains on a
single pipe. This saves on any complexity in either the Service VM or the User VM
software stack.

It is important to provide only specific planes and have them statically
assigned to different domains. To achieve this, the i915 driver of both
domains is provided a command-line parameter that specifies the exact
planes that this domain has access to. The i915 driver then enumerates
only those hardware planes and exposes them to its compositor. It is then left
to the compositor configuration to use these planes appropriately and
show the correct content on them. No other changes are necessary.

While the biggest benefit of this model is that it is extremely simple and
quick to implement, it also has some drawbacks. First, since each domain
is responsible for showing its content on the screen, there is no
control of the User VM by the Service VM. If the User VM is untrusted, this could
potentially cause some unwanted content to be displayed. Also, there is
no post-processing capability, except that provided by the display
controller (for example, scaling, rotation, and so on). So each domain
must provide finished buffers with the expectation that alpha blending
with another domain will not cause any corruption or unwanted artifacts.

Graphics Memory Virtualization
==============================

To achieve near-to-native graphics performance, GVT-g passes through the
performance-critical resources, such as the Frame Buffer and Command Buffer,
to the VM. For the global graphics memory space, GVT-g uses graphics
memory resource partitioning and an address space ballooning mechanism.
For local graphics memory spaces, GVT-g implements per-VM local graphics
memory through a render context switch because local graphics memory is
accessible only by the GPU.

Global Graphics Memory
----------------------

Graphics Memory Resource Partitioning
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

GVT-g partitions the global graphics memory among VMs. Splitting the
CPU/GPU scheduling mechanism requires that the global graphics memory of
different VMs can be accessed by the CPU and the GPU simultaneously.
Consequently, GVT-g must, at any time, present each VM with its own
resource, leading to the resource partitioning approach for global
graphics memory, as shown in :numref:`mem-part`.

.. figure:: images/APL_GVT-g-mem-part.png
   :width: 800px
   :align: center
   :name: mem-part

   Memory Partition and Ballooning

The performance impact of reduced global graphics memory resources
due to memory partitioning is very limited, according to various test
results.

Address Space Ballooning
%%%%%%%%%%%%%%%%%%%%%%%%

The address space ballooning technique is introduced to eliminate the
address translation overhead, as shown in :numref:`mem-part`. GVT-g exposes the
partitioning information to the VM graphics driver through the PVINFO
MMIO window. The graphics driver marks the other VMs' regions as
"ballooned" and reserves them as not usable in its graphics
memory allocator. Under this design, the guest view of the global graphics
memory space is exactly the same as the host view, and the
driver-programmed addresses, using guest physical addresses, can be directly
used by the hardware. Address space ballooning is different from traditional
memory ballooning techniques: memory ballooning is for memory usage
control, concerning the number of ballooned pages, while address space
ballooning is used to balloon special memory address ranges.

Another benefit of address space ballooning is that there is no address
translation overhead, as we use the guest Command Buffer for direct GPU
execution.
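
From the guest side, ballooning amounts to reading its assigned ranges from
the PVINFO window and reserving everything else. The structure layout and
helper below are placeholders for this sketch; the real layout is defined by
the i915 PVINFO page.

.. code-block:: c

   #include <stdint.h>

   #define GGM_SIZE (4ULL << 30)   /* 4 GB global graphics memory space */

   /* Placeholder for the partition info a guest reads from PVINFO;
    * assumes the mappable slice sits below the non-mappable slice. */
   struct gvt_partition_info {
       uint64_t mappable_base, mappable_size;
       uint64_t nonmappable_base, nonmappable_size;
   };

   extern void reserve_range(uint64_t base, uint64_t size);  /* allocator hook */

   /* Mark every range outside this VM's slices as ballooned (never used). */
   static void balloon_global_gm(const struct gvt_partition_info *p)
   {
       uint64_t map_end = p->mappable_base + p->mappable_size;
       uint64_t nonmap_end = p->nonmappable_base + p->nonmappable_size;

       reserve_range(0, p->mappable_base);
       reserve_range(map_end, p->nonmappable_base - map_end);
       reserve_range(nonmap_end, GGM_SIZE - nonmap_end);
   }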

Per-VM Local Graphics Memory
----------------------------

GVT-g allows each VM to use the full local graphics memory spaces of its
own, similar to the virtual address spaces on the CPU. The local
graphics memory spaces are visible only to the Render Engine in the GPU.
Therefore, any valid local graphics memory address programmed by a VM
can be used directly by the GPU. The GVT-g device model switches the
local graphics memory spaces between VMs when switching render
ownership.

GPU Page Table Virtualization
=============================

Shared Shadow GGTT
------------------

To achieve resource partitioning and address space ballooning, GVT-g
implements a shared shadow global page table for all VMs. Each VM has
its own guest global page table, which translates the graphics memory page
number to the Guest memory Page Number (GPN). The shadow global page
table instead translates the graphics memory page number to the
Host memory Page Number (HPN).

The shared shadow global page table maintains the translations for all
VMs to support concurrent accesses from the CPU and GPU.
Therefore, GVT-g implements a single, shared shadow global page table by
trapping guest PTE updates, as shown in :numref:`shared-shadow`. The
global page table, in MMIO space, has 1024K PTE entries, each pointing
to a 4 KB system memory page, so the global page table overall creates a
4 GB global graphics memory space. GVT-g audits the guest PTE values
according to the address space ballooning information before updating
the shadow PTE entries.

.. figure:: images/APL_GVT-g-shared-shadow.png
   :width: 600px
   :align: center
   :name: shared-shadow

   Shared Shadow Global Page Table
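
The trap path for a guest GGTT PTE update reduces to three steps: audit the
entry against the VM's ballooning information, translate GPN to HPN, and
write the shared shadow entry. The helpers below are assumptions of this
sketch.

.. code-block:: c

   #include <stdint.h>

   struct vm;

   #define PTE_ADDR_MASK (~0xfffULL)  /* illustrative: low bits hold flags */

   extern int      gpn_in_vm_range(struct vm *vm, uint32_t gtt_index); /* audit */
   extern uint64_t gpn_to_hpn(struct vm *vm, uint64_t gpn);
   extern void     write_shadow_pte(uint32_t gtt_index, uint64_t pte);

   /* Called when a guest write to the GGTT part of GTTMMADR is trapped. */
   static int on_guest_ggtt_write(struct vm *vm, uint32_t gtt_index,
                                  uint64_t guest_pte)
   {
       if (!gpn_in_vm_range(vm, gtt_index))
           return -1;   /* mapping outside the VM's partition: reject */

       uint64_t gpn = (guest_pte & PTE_ADDR_MASK) >> 12;
       uint64_t hpn = gpn_to_hpn(vm, gpn);

       write_shadow_pte(gtt_index, (hpn << 12) | (guest_pte & ~PTE_ADDR_MASK));
       return 0;
   }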

Per-VM Shadow PPGTT
-------------------

To support local graphics memory access passthrough, GVT-g implements
per-VM shadow local page tables. The local graphics memory is accessible
only from the Render Engine. The local page tables have two-level
paging structures, as shown in :numref:`per-vm-shadow`.

The first level, Page Directory Entries (PDEs), located in the global
page table, points to the second level, Page Table Entries (PTEs), in
system memory, so guest accesses to the PDEs are trapped and emulated
through the implementation of the shared shadow global page table.

GVT-g also write-protects a list of guest PTE pages for each VM. The
GVT-g device model synchronizes the shadow page with the guest page at
the time of a write-protection page fault, and switches the shadow local
page tables at render context switches.

.. figure:: images/APL_GVT-g-per-vm-shadow.png
   :width: 800px
   :align: center
   :name: per-vm-shadow

   Per-VM Shadow PPGTT
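
The two shadow-PPGTT triggers named above (write-protection faults and render
context switches) can be sketched as follows, again with assumed helper names.

.. code-block:: c

   #include <stdint.h>

   struct vm;

   extern void update_shadow_ppgtt(struct vm *vm, uint64_t gpa,
                                   uint64_t guest_pte);  /* assumed */
   extern void load_shadow_ppgtt(struct vm *next);       /* assumed */

   /* A VM wrote one of its write-protected guest PTE pages: mirror it. */
   static void on_wp_page_fault(struct vm *vm, uint64_t fault_gpa,
                                uint64_t new_val)
   {
       update_shadow_ppgtt(vm, fault_gpa, new_val);
   }

   /* Render ownership moves to another VM: switch its shadow tables in. */
   static void on_render_context_switch(struct vm *next)
   {
       load_shadow_ppgtt(next);
   }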

.. _GVT-g-prioritized-rendering:

Prioritized Rendering and Preemption
====================================

Different Schedulers and Their Roles
------------------------------------

.. figure:: images/APL_GVT-g-scheduling-policy.png
   :width: 800px
   :align: center
   :name: scheduling-policy

   Scheduling Policy

In the system, there are three different schedulers for the GPU:

- i915 User VM scheduler
- Mediator GVT scheduler
- i915 Service VM scheduler

Because the User VM always uses the host-based command submission (ELSP) model
and never accesses the GPU or the Graphics Micro Controller (:term:`GuC`)
directly, its scheduler cannot do any preemption by itself.
The i915 scheduler does ensure that batch buffers are
submitted in dependency order—that is, if a compositor has to wait for
an application buffer to finish before its workload can be submitted to
the GPU, then the i915 scheduler of the User VM ensures that this happens.

The User VM assumes that by submitting its batch buffers to the Execlist
Submission Port (ELSP), the GPU will start working on them. However,
the MMIO write to the ELSP is captured by the Hypervisor, which forwards
these requests to the GVT module. GVT then creates a shadow context
based on this batch buffer and submits the shadow context to the Service VM
i915 driver.

However, it is dependent on a second scheduler called the GVT
scheduler. This scheduler is time-based and uses a round-robin algorithm
to provide a specific time slot for each User VM to submit its workload when it
is considered the "render owner". The workloads of the User VMs that are not
render owners during a specific time period end up waiting in the
virtual GPU context until the GVT scheduler makes them render owners.
The GVT shadow context submits only one workload at
a time, and once the workload is finished by the GPU, it copies any
context state back to DomU and sends the appropriate interrupts before
picking up any other workloads from either this User VM or another one. This
also implies that this scheduler does not do any preemption of
workloads.
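
The time-based round-robin policy can be pictured as a periodic tick that
rotates ownership among vGPUs with pending work; the tick period and names
below are illustrative.

.. code-block:: c

   #define NR_VGPU  4
   #define TICK_MS  16   /* illustrative time slice per render owner */

   struct vgpu_slot {
       int has_pending_workload;  /* set when its workload queue is non-empty */
   };

   static struct vgpu_slot vgpus[NR_VGPU];
   static int render_owner;

   /* Called every TICK_MS: only the render owner may submit this slice. */
   static void gvt_sched_tick(void)
   {
       for (int i = 1; i <= NR_VGPU; i++) {
           int next = (render_owner + i) % NR_VGPU;

           if (vgpus[next].has_pending_workload) {
               render_owner = next;
               break;
           }
       }
   }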

Finally, there is the i915 scheduler in the Service VM. This scheduler uses the
:term:`GuC` or ELSP to do command submission of Service VM local content as well as any
content that GVT is submitting to it on behalf of the User VMs. This
scheduler uses the :term:`GuC` or ELSP to preempt workloads. The :term:`GuC` has
four different priority queues, but the Service VM i915 driver uses only two of
them: one is considered high priority and the other is normal priority, with the
:term:`GuC` rule being that any command submitted on the high-priority queue
immediately tries to preempt any workload submitted on the normal-priority
queue. For ELSP submission, the i915 submits a preempt context to preempt the
currently running context and then waits for the GPU engine to be idle.

While the identification of workloads to be preempted is decided by
customizable scheduling policies, the i915 scheduler simply submits a
preemption request to the :term:`GuC` high-priority queue once a candidate for
preemption is identified. Based on the hardware's ability to preempt (on an
Apollo Lake SoC, a 3D workload is preemptible on a 3D-primitive level with
some exceptions), the currently executing workload is saved and
preempted. The :term:`GuC` informs the driver of a preemption event with an
interrupt. After handling the interrupt, the driver submits the
high-priority workload through the normal-priority :term:`GuC` queue. As such,
the normal-priority :term:`GuC` queue is used for actual execbuf submission most
of the time, with the high-priority :term:`GuC` queue being used only for the
preemption of lower-priority workloads.
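
The two-queue usage reduces to a small pattern, sketched here with assumed
wrapper names: a preempt request goes to the high-priority queue, and the
urgent workload itself is then submitted on the normal-priority queue.

.. code-block:: c

   struct workload;

   #define GUC_QUEUE_NORMAL 0
   #define GUC_QUEUE_HIGH   1

   extern void guc_submit(int queue, struct workload *w);   /* assumed wrapper */
   extern struct workload *make_preempt_request(void);      /* assumed helper  */
   extern void wait_preempt_done_irq(void);                 /* assumed helper  */

   static void submit_with_preemption(struct workload *urgent)
   {
       /* High-priority queue: kicks the currently running context off. */
       guc_submit(GUC_QUEUE_HIGH, make_preempt_request());
       wait_preempt_done_irq();

       /* The actual execbuf path stays on the normal-priority queue. */
       guc_submit(GUC_QUEUE_NORMAL, urgent);
   }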

Scheduling policies are customizable and left to customers to change if
they are not satisfied with the built-in i915 driver policy, where all
workloads of the Service VM are considered higher priority than those of the
User VM. This policy can be enforced through a Service VM i915 kernel command-line
parameter and can replace the default in-order command submission (no
preemption) policy.

AcrnGT
******

ACRN is a flexible, lightweight reference hypervisor, built with
real-time and safety-criticality in mind, optimized to streamline
embedded development through an open-source platform.

AcrnGT is the GVT-g implementation on the ACRN hypervisor. It adapts
the MPT interface of GVT-g onto ACRN by using the kernel APIs provided
by ACRN.

:numref:`full-pic` shows the full architecture of AcrnGT with a Linux Guest
OS and an Android Guest OS.

.. figure:: images/APL_GVT-g-full-pic.png
   :width: 800px
   :align: center
   :name: full-pic

   Full Picture of AcrnGT

AcrnGT in Kernel
================

The AcrnGT module in the Service VM kernel acts as an adaptation layer to connect
GVT-g in the i915, the VHM module, and the ACRN-DM user-space
application:

- The AcrnGT module implements the MPT interface of GVT-g to provide
  services to it, including setting and unsetting trap areas, setting and
  unsetting write-protection pages, etc. (a sketch of such an interface
  table is shown after this list).

- It calls the VHM APIs provided by the ACRN VHM module in the Service VM
  kernel to eventually call into the routines provided by the ACRN
  hypervisor through hypercalls.

- It provides user-space interfaces through ``sysfs`` to the user-space
  ACRN-DM so that the DM can manage the lifecycle of the virtual GPUs.
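
The MPT interface amounts to a table of hypervisor-specific callbacks that
the AcrnGT module fills with VHM-backed implementations. The fields below are
a representative subset with assumed names, not the exact kernel interface.

.. code-block:: c

   #include <stdint.h>

   /* Representative subset of an MPT hypervisor-ops table (names assumed). */
   struct gvt_mpt_ops_sketch {
       int (*set_trap_area)(int vm_id, uint64_t start, uint64_t end, int trap);
       int (*set_wp_page)(int vm_id, uint64_t gfn);
       int (*unset_wp_page)(int vm_id, uint64_t gfn);
       int (*inject_msi)(int vm_id, uint32_t addr, uint16_t data);
       uint64_t (*gfn_to_mfn)(int vm_id, uint64_t gfn);
   };

   /* Each callback wraps a VHM API, which in turn issues a hypercall to the
    * ACRN hypervisor, e.g. (vhm_set_wp_page is an assumed VHM API name):
    *
    *     static int acrngt_set_wp_page(int vm_id, uint64_t gfn)
    *     {
    *         return vhm_set_wp_page(vm_id, gfn);
    *     }
    */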

AcrnGT in DM
============

To emulate a PCI device to a Guest, we need an AcrnGT sub-module in the
ACRN-DM. This sub-module is responsible for the following (a sketch of a
corresponding ops table follows the list):

- registering the virtual GPU device to the PCI device tree presented to
  the guest;

- registering the MMIO resources to ACRN-DM so that it can reserve
  resources in the ACPI table;

- managing the lifecycle of the virtual GPU device, such as creation,
  destruction, and resetting according to the state of the virtual
  machine.
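
Inside ACRN-DM, these responsibilities typically map onto a virtual PCI
device ops table; the shape below is illustrative rather than the exact
ACRN-DM definition.

.. code-block:: c

   #include <stdint.h>

   /* Illustrative ACRN-DM virtual PCI device ops for the vGPU (names assumed). */
   struct vgpu_vdev_ops_sketch {
       const char *name;                              /* e.g., "gvt"            */
       int  (*init)(void *ctx);                       /* create vGPU via sysfs  */
       void (*deinit)(void *ctx);                     /* destroy vGPU           */
       void (*reset)(void *ctx);                      /* follow VM state change */
       uint32_t (*cfg_read)(void *ctx, int off, int len);
       void (*cfg_write)(void *ctx, int off, int len, uint32_t val);
   };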
@@ -216,14 +216,14 @@ DM Initialization
* map[0]:0~ctx->lowmem_limit & map[2]:4G~ctx->highmem for RAM
* ctx->highmem = request_memory_size - ctx->lowmem_limit
*
*     Begin                 End              Type         Length
* 0:  0                  -  0xef000          RAM          0xEF000
* 1:  0xef000            -  0x100000         (reserved)   0x11000
* 2:  0x100000           -  lowmem           RAM          lowmem - 0x100000
* 3:  lowmem             -  bff_fffff        (reserved)   0xc00_00000-lowmem
* 4:  0xc00_00000        -  dff_fffff        PCI hole     512MB
* 5:  0xe00_00000        -  fff_fffff        (reserved)   512MB
* 6:  1_000_00000        -  highmem          RAM          highmem-4G
*
*     Begin                 Limit            Type         Length
* 0:  0                  -  0xA0000          RAM          0xA0000
* 1:  0x100000           -  lowmem part1     RAM          0x0
* 2:  SW SRAM_bot        -  SW SRAM_top      (reserved)   SOFTWARE_SRAM_MAX_SIZE
* 3:  gpu_rsvd_bot       -  gpu_rsvd_top     (reserved)   0x4004000
* 4:  lowmem part2       -  0x80000000       (reserved)   0x0
* 5:  0xE0000000         -  0x100000000      MCFG, MMIO   512MB
* 6:  HIGHRAM_START_ADDR -  mmio64 start     RAM          ctx->highmem
*/

- **VM Loop Thread**: DM kicks this VM loop thread to create I/O

@@ -15,7 +15,6 @@ documented in this section.
   UART virtualization <uart-virt-hld>
   Watchdog virtualization <watchdog-hld>
   AHCI virtualization <ahci-hld>
   GVT-g GPU Virtualization <hld-APL_GVT-g>
   System timer virtualization <system-timer-hld>
   UART emulation in hypervisor <vuart-virt-hld>
   RTC emulation in hypervisor <rtc-virt-hld>

@@ -175,6 +175,9 @@ ACRN adopts various approaches for emulating devices for the User VM:
resources (mostly data-plane related) are passed through to the User VMs and
others (mostly control-plane related) are emulated.

.. _ACRN-io-mediator:

I/O Emulation
-------------

@@ -193,6 +196,7 @@ I/O read from the User VM.
I/O (PIO/MMIO) Emulation Path

:numref:`overview-io-emu-path` shows an example I/O emulation flow path.

When a guest executes an I/O instruction (port I/O or MMIO), a VM exit
happens. The HV takes control and executes the request based on the VM exit
reason ``VMX_EXIT_REASON_IO_INSTRUCTION`` for port I/O access, for

@@ -224,8 +228,9 @@ HSM/hypercall. The HV then stores the result to the guest register
context, advances the guest IP to indicate the completion of instruction
execution, and resumes the guest.

MMIO access path is similar except for a VM exit reason of *EPT violation*.
MMIO access is usually trapped through a ``VMX_EXIT_REASON_EPT_VIOLATION`` in
the hypervisor.

DMA Emulation
-------------

@@ -328,7 +333,7 @@ power operations.
VM Manager creates the User VM based on the DM application, and does User VM state
management by interacting with the lifecycle service in ACRN service.

Refer to the VM management chapter for more details.

ACRN Service
============

@@ -1034,7 +1034,7 @@ Note that there are some security considerations in this design:
other User VM.

Keeping the Service VM system as secure as possible is a very important goal in
the system security design. Follow the recommendations in
:ref:`sos_hardening`.

SEED Derivation

@@ -1058,7 +1058,7 @@ the non-secure OS issues this power event) is about to enter S3. While
the restore state hypercall is called only by vBIOS when the User VM is ready to
resume from suspend state.

For security design consideration of handling secure world S3,
read the previous section: :ref:`uos_suspend_resume`.

Platform Security Feature Virtualization and Enablement

@@ -1,4 +0,0 @@
.. _hld-vsbl:

Virtual Slim-Bootloader High-Level Design
#########################################

@@ -116,7 +116,7 @@ any pCPU that is not included in it.
CPU Assignment Management in HV
===============================

The physical CPU assignment is predefined by ``cpu_affinity`` in
``vm config``, while post-launched VMs could be launched on pCPUs that are
a subset of it.

@@ -1084,7 +1084,7 @@ ACRN always enables I/O bitmap in *VMX_PROC_VM_EXEC_CONTROLS* and EPT
in *VMX_PROC_VM_EXEC_CONTROLS2*. Based on them,
*pio_instr_vmexit_handler* and *ept_violation_vmexit_handler* are
used for I/O and MMIO emulation for an emulated device. The emulated device
could be located in the hypervisor or in the DM in the Service VM. Refer to the "I/O
Emulation" section for more details.

For an emulated device done in the hypervisor, ACRN provides some basic

@@ -83,7 +83,7 @@ one the following 4 cases:
debug purposes, so the UART device is owned by the hypervisor and is not visible
to any VM. For now, the UART is the only PCI device that can be owned by the hypervisor.

- **Pre-launched VM**: The passthrough devices to be used in a pre-launched VM are
  predefined in the VM configuration. These passthrough devices are owned by the
  pre-launched VM after the VM is created. These devices will not be removed
  from the pre-launched VM. There could be pre-launched VM(s) in logical partition
  mode and hybrid mode.

@@ -381,7 +381,7 @@ GSI Sharing Violation Check
All the PCI devices that are sharing the same GSI should be assigned to
the same VM to avoid physical GSI sharing between multiple VMs.
In logical partition mode or hybrid mode, the PCI devices assigned to a
pre-launched VM are statically predefined. Developers should take care not to
violate the rule.
For a post-launched VM, for devices that don't support MSI, ACRN DM puts the devices
sharing the same GSI pin into a GSI

@@ -404,7 +404,7 @@ multiple PCI components with independent local time clocks within the same
system. Intel supports PTM on several of its systems and devices, such as PTM
root capabilities support on Whiskey Lake and Tiger Lake PCIe root ports, and
PTM device support on an Intel I225-V/I225-LM family Ethernet controller. For
further details on PTM, refer to the `PCIe specification
<https://pcisig.com/specifications>`_.

ACRN adds PCIe root port emulation in the hypervisor to support the PTM feature

@@ -473,7 +473,7 @@ hypervisor startup. The Device Model (DM) then checks whether the pass-through d
supports PTM requestor capabilities and whether the corresponding root port
supports PTM root capabilities, as well as some other sanity checks. If an
error is detected during these checks, the error will be reported and ACRN will
not enable PTM in the Guest VM. This doesn't prevent the user from launching the Guest
VM and passing through the device to the Guest VM. If no error is detected,
the device model will use the ``add_vdev`` hypercall to add a virtual root port (VRP),
acting as the PTM root, to the Guest VM before passing through the device to the Guest VM.

@@ -28,7 +28,7 @@ In the software modules view shown in :numref:`interrupt-sw-modules`,
the ACRN hypervisor sets up the physical interrupt in its basic
interrupt modules (e.g., IOAPIC/LAPIC/IDT). It dispatches the interrupt
in the hypervisor interrupt flow control layer to the corresponding
handlers; this could be predefined IPI notification, timer, or runtime
registered passthrough devices. The ACRN hypervisor then uses its VM
interfaces based on vPIC, vIOAPIC, and vMSI modules to inject the
necessary virtual interrupt into the specific VM, or directly deliver

@@ -246,9 +246,6 @@ ACRN hypervisor maintains a global IRQ Descriptor Table shared among the
physical CPUs, so the same vector will link to the same IRQ number for
all CPUs.

The *irq_desc[]* array's index represents the IRQ number. A *handle_irq*
will be called from *interrupt_dispatch* to commonly handle edge/level
triggered IRQs and call the registered *action_fn*.

@@ -613,7 +613,4 @@ for TTY line discipline in User VM::
   -l com2,/run/acrn/ioc_$vm_name

Porting and Adaptation to Different Platforms
*********************************************

TBD

@@ -46,19 +46,19 @@ to enforce the settings.
.. code-block:: none
   :emphasize-lines: 2,4

   <RDT>
       <RDT_ENABLED>y</RDT_ENABLED>
       <CDP_ENABLED>n</CDP_ENABLED>
       <CLOS_MASK>0xF</CLOS_MASK>

Once the cache mask is set for each individual CPU, the respective CLOS ID
needs to be set in the scenario XML file under the ``VM`` section. If the user
wants to use the CDP feature, ``CDP_ENABLED`` should be set to ``y``.

.. code-block:: none
   :emphasize-lines: 2

   <clos>
       <vcpu_clos>0</vcpu_clos>

.. note::

@@ -113,11 +113,11 @@ for non-root and root modes to enforce the settings.

.. code-block:: none
   :emphasize-lines: 2,5

   <RDT>
       <RDT_ENABLED>y</RDT_ENABLED>
       <CDP_ENABLED>n</CDP_ENABLED>
       <CLOS_MASK></CLOS_MASK>
       <MBA_DELAY>0</MBA_DELAY>

Once the cache mask is set for each individual CPU, the respective CLOS ID
needs to be set in the scenario XML file under the ``VM`` section.

@@ -125,7 +125,7 @@ needs to be set in the scenario XML file under ``VM`` section.

.. code-block:: none
   :emphasize-lines: 2

   <clos>
       <vcpu_clos>0</vcpu_clos>

.. note::

@@ -113,8 +113,8 @@ initial states, including IDT and physical PICs.

After the BSP detects that all APs are up, it will continue to enter guest
mode; similarly, after an AP completes its initialization, it will start
entering guest mode as well. When the BSP and APs enter guest mode, they will
try to launch the predefined VMs whose vBSP is associated with this physical
core; these predefined VMs are statically configured in ``vm config`` and can
be a pre-launched Safety VM or the Service VM. VM startup is explained in the
next section.
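
A simplified sketch of this per-core launch logic (the function names here
are illustrative, not the actual hypervisor symbols):

.. code-block:: c

   /* Illustrative sketch: after entering guest mode, each physical core
    * launches the statically configured VM whose virtual BSP maps to it. */
   void enter_guest_mode(uint16_t pcpu_id)
   {
           struct acrn_vm_config *cfg;

           cfg = find_vm_config_with_vbsp_on(pcpu_id);
           if (cfg != NULL) {
                   /* a pre-launched Safety VM or the Service VM */
                   launch_vm(cfg);
           }
   }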
.. _vm-startup:

@@ -32,8 +32,8 @@ VM powers off, the VM returns to a 'powered off' state again.
A VM can be paused to wait for some operation when it is running, so there is
also a 'paused' state.

:numref:`hvvm-state` illustrates the state machine of a VM state transition.
Refer to :ref:`hv-cpu-virt` for the related vCPU states.
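
The states named above can be pictured as a small enumeration; this is an
illustrative sketch, not the hypervisor's actual definition:

.. code-block:: c

   /* Hypothetical VM lifecycle states for illustration only. */
   enum vm_state {
           VM_POWERED_OFF,   /* initial state, and after power-off */
           VM_CREATED,       /* resources allocated, not yet started */
           VM_STARTED,       /* running in guest mode */
           VM_PAUSED,        /* paused, waiting for some operation */
   };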

.. figure:: images/hld-image108.png
   :align: center

@@ -49,7 +49,7 @@ Pre-Launched and Service VM

The hypervisor owns and controls the state of pre-launched VMs and the
Service VM by calling VM APIs directly, following the design of system power
management. Refer to the ACRN power management design for more details.


Post-Launched User VMs

@@ -59,5 +59,5 @@ DM takes control of post-launched User VMs' state transition after the Service V
boots, by calling VM APIs through hypercalls.

Service VM user-level services such as Life-Cycle-Service and tools such
as ``acrnd`` may work together with the DM to launch or stop a User VM.
Refer to the :ref:`acrnctl` documentation for more details.
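
For example (assuming the standard ``acrnctl`` subcommands; replace the VM
name with your own):

.. code-block:: none

   # list User VMs and their current state
   acrnctl list
   # stop a running User VM by name
   acrnctl stop <vm_name>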
@@ -49,16 +49,6 @@ Pre-Parsed DMAR Information

For specific platforms, the ACRN hypervisor uses pre-parsed DMA remapping
reporting information directly to save hypervisor bootup time.

DMA Remapping Unit for Integrated Graphics Device
=================================================

Generally, there is a dedicated remapping hardware unit for the Intel
integrated graphics device. ACRN implements GVT-g for graphics, but
GVT-g is not compatible with VT-d. The remapping hardware unit for the
graphics device is disabled on ACRN if GVT-g is enabled. If the graphics
device needs to be passed through to a VM, then the remapping hardware unit
must be enabled.

DMA Remapping
*************

(Binary image files, ranging from 14 KiB to 450 KiB each, were deleted in
this change.)

@@ -25,5 +25,4 @@ system.

   Virtio Devices <hld-virtio-devices>
   Power Management <hld-power-management>
   Tracing and Logging <hld-trace-log>
   Virtual Bootloader <hld-vsbl>
   Security <hld-security>

@@ -82,12 +82,12 @@ The device model configuration command syntax for virtio-console is::

- The ``stdio/tty/pty`` is TTY capable, which means :kbd:`TAB` and
  :kbd:`BACKSPACE` are supported, as on a regular terminal.

- When TTY is used, make sure the redirected TTY is sleeping
  (e.g., by a ``sleep 2d`` command) and will not read input from stdin before
  it is used by virtio-console to redirect guest output.

- When the virtio-console ``socket_type`` is appointed to client, make sure
  the server VM (``socket_type`` appointed to server) has started.

- Claiming multiple virtio-serial ports as consoles is supported;
  however, the guest Linux OS will use only one of them, through the

@@ -222,7 +222,7 @@ SOCKET
The virtio-console socket-type can be set as socket server or client. The
device model creates a Unix domain socket if ``socket_type`` is appointed as
server; the server VM or another User VM can then bind and listen for
communication requests. If appointed as client, make sure the socket server
is ready before launching the device model.
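
As a quick illustrative check (the socket path below is a placeholder), you
can verify that the server side has created its Unix domain socket before
launching the device model:

.. code-block:: none

   # succeeds only if the socket file exists
   test -S /path/to/file.sock && echo "socket server ready"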
1. Add a PCI slot to the device model (``acrn-dm``) command line, adjusting
   the ``</path/to/file.sock>`` to your use case in the VM1 configuration::

@@ -193,7 +193,7 @@ example, showing the flow through each layer:

.. code-block:: c

   hsm_intr_handler -->                   // HSM interrupt handler
   tasklet_schedule -->
   io_req_tasklet -->
   acrn_ioreq_distribute_request -->      // ioreq can't be processed in HSM, forward it to device DM

@@ -348,7 +348,7 @@ cases.)

.. code-block:: c

   hsm_dev_ioctl -->        // process the IOCTL and call hypercall to inject interrupt
   hcall_inject_msi -->

**ACRN Hypervisor**

@@ -426,11 +426,10 @@ our case, we use systemd to automatically create the network by default.
You can check the files with the ``50-`` prefix in the Service VM
``/usr/lib/systemd/network/`` directory:

- :acrn_raw:`50-acrn.netdev <misc/services/acrn_bridge/acrn.netdev>`
- :acrn_raw:`50-acrn.network <misc/services/acrn_bridge/acrn.network>`
- :acrn_raw:`50-tap0.netdev <misc/services/acrn_bridge/tap0.netdev>`
- :acrn_raw:`50-eth.network <misc/services/acrn_bridge/eth.network>`

When the Service VM is started, run ``ifconfig`` to show the devices created by
this systemd configuration: