US9058183B2 - Hypervisor isolation of processor cores to enable computing accelerator cores - Google Patents

Hypervisor isolation of processor cores to enable computing accelerator cores Download PDF

Info

Publication number
US9058183B2
US9058183B2 US12/648,592 US64859209A US9058183B2 US 9058183 B2 US9058183 B2 US 9058183B2 US 64859209 A US64859209 A US 64859209A US 9058183 B2 US9058183 B2 US 9058183B2
Authority
US
United States
Prior art keywords
cores
operating system
subset
application
work
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US12/648,592
Other versions
US20110161955A1 (en
Inventor
Thomas R. Woller
Patryk Kaminski
Erich Boleyn
Keith A. Lowery
Benjamin C. Serebrin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced Micro Devices Inc
Original Assignee
Advanced Micro Devices Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices Inc filed Critical Advanced Micro Devices Inc
Priority to US12/648,592 priority Critical patent/US9058183B2/en
Assigned to ADVANCED MICRO DEVICES, INC. reassignment ADVANCED MICRO DEVICES, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BOLEYN, ERICH, LOWERY, KEITH A., SEREBRIN, BENJAMIN C., KAMINSKI, PATRYK, WOLLER, THOMAS R.
Priority to EP10796238.3A priority patent/EP2519877B1/en
Priority to JP2012547104A priority patent/JP2013516021A/en
Priority to CN201080059820.8A priority patent/CN102713847B/en
Priority to KR1020127019346A priority patent/KR101668399B1/en
Priority to PCT/US2010/060193 priority patent/WO2011090596A2/en
Publication of US20110161955A1 publication Critical patent/US20110161955A1/en
Publication of US9058183B2 publication Critical patent/US9058183B2/en
Application granted granted Critical
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/4401Bootstrapping
    • G06F9/4406Loading of operating system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45545Guest-host, i.e. hypervisor is an application program itself, e.g. VirtualBox
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45579I/O management, e.g. providing access to device drivers or storage
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/509Offload

Definitions

  • the invention is related to computer systems and more particularly to multi-core computer systems.
  • an exemplary computing system 100 includes multiple processors 102 , each of which includes one or more processor cores (e.g., processor cores 104 ).
  • processors 102 are coupled to other processors 102 , memory 106 , devices 108 , and storage 110 by one or more hub integrated circuits (e.g., memory controller hub and I/O controller hub), bus (e.g., PCI bus, ISA bus, and SMBus), other suitable communication interfaces, or combinations thereof.
  • hub integrated circuits e.g., memory controller hub and I/O controller hub
  • bus e.g., PCI bus, ISA bus, and SMBus
  • An operating system e.g., Microsoft Windows, Linux, and UNIX
  • An operating system provides an interface between the hardware and a user (i.e., computing applications, e.g., applications 114 ).
  • Execution of operating system 112 may be distributed across a plurality of cores 104 .
  • a typical computing system may not be able to utilize all processor cores or utilize all processor cores efficiently.
  • an operating system may be able to access and control only a limited number of CPU cores, leaving idle other cores in the computing system.
  • a method includes executing an operating system on a first subset of cores including one or more cores of a plurality of cores of a computer system.
  • the operating system executes as a guest under control of a virtual machine monitor.
  • the method includes executing work for an application on a second subset of cores including one or more cores of the plurality of cores.
  • the first and second subsets of cores are mutually exclusive and the second subset of cores is not visible to the operating system.
  • the method includes sequestering the second subset of cores from the operating system.
  • an apparatus in at least one embodiment of the invention, includes a plurality of cores and an operating system software encoded in one or more media accessible to the plurality of cores.
  • the apparatus includes hypervisor software encoded in one or more media accessible to the plurality of cores and executable on one or more of the plurality of cores.
  • the hypervisor software is executable to control execution of the operating system software as a guest on a first set of cores including one or more cores of the plurality of cores and to execute at least some work of an application on a second set of cores including one or more cores of the plurality of cores.
  • the second set of cores is not visible to the operating system.
  • a computer program product includes one or more functional sequences executable as, or in conjunction with, a virtual machine monitor and configured to execute an operating system sequence as a guest under control of the virtual machine monitor on a first set of cores including one or more cores of a plurality of cores.
  • the computer program product includes one or more functional sequences to execute at least some work of an application on a second set of cores including one or more cores of the plurality of cores. The second set of cores is not visible to the operating system.
  • FIG. 1 illustrates a functional block diagram of an exemplary multi-core computing system.
  • FIG. 2 illustrates a functional block diagram of an exemplary virtualization system.
  • FIG. 3 illustrates a functional block diagram of an exemplary virtualization system consistent with at least one embodiment of the invention.
  • FIG. 4 illustrates a functional block diagram of an exemplary virtual machine monitor executing on the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
  • FIG. 5 illustrates exemplary information and control flows in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
  • FIG. 6 illustrates exemplary information and control flows for a work unit process flow in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
  • FIG. 7 illustrates exemplary information and control flows for work unit page fault processing in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
  • FIG. 8 illustrates exemplary information and control flows for work unit command completion in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
  • FIG. 9 illustrates information and control flows in the virtualization system of FIG. 3 configured for instant-on application usage consistent with at least one embodiment of the invention.
  • virtualization of a computing system is used to hide physical characteristics of the computing system from a user (i.e., software executing on the computing system) and instead, presents an abstract emulated computing system (i.e., a virtual machine (VM)) to the user.
  • VM virtual machine
  • Physical hardware resources of computing system 100 are exposed to one or more guests (e.g., guests 206 ) as one or more corresponding isolated, apparently independent, virtual machines (e.g., VM 204 ).
  • a virtual machine may include one or more virtual resources (e.g., VCPU, VMEMORY, and VDEVICES) that are implemented by physical resources of computing system 100 that a virtual machine monitor (VMM) (i.e., hypervisor, e.g., VMM 202 ) allocates to the virtual machine.
  • VMM virtual machine monitor
  • a “virtual machine monitor” (VMM) or “hypervisor” is software that provides the virtualization capability.
  • the VMM provides an interface between the guest software and the physical resources.
  • the VMM provides each guest the appearance of full control over a complete computer system (i.e., memory, central processing unit (CPU) and all peripheral devices).
  • a Type 1 (i.e., native) VMM is a standalone software program that executes on physical resources and provides the virtualization for one or more guests.
  • a guest operating system executes on a level above the VMM.
  • a Type 2 (i.e., hosted) VMM is integrated into or executes on an operating system, the operating system components execute directly on physical resources and are not virtualized by the VMM.
  • the VMM is considered a distinct software layer and a guest operating system may execute on a third software level above the hardware.
  • VMM 202 retains control over the physical resources.
  • a guest system e.g., an instance of an operating system (e.g., Windows, Linux, and UNIX) executes on a corresponding virtual machine and shares physical resources with other guest systems executing on other virtual machines.
  • an operating system e.g., Windows, Linux, and UNIX
  • multiple operating systems e.g., multiple instances of the same operating system or instances of different operating systems
  • VMM 202 is executed by some or all processor cores in the physical resources. An individual guest is executed by a set of processor cores included in the physical resources. The processors switch between execution of VMM 202 and execution of one or more guests 206 .
  • a “world switch” is a switch between execution of a guest and execution of a VMM.
  • a world switch may be initiated by a VMMCALL instruction or by other suitable techniques, e.g., interrupt mechanisms or predetermined instructions defined by a control block, described below. Although a particular world switch may be described herein as being initiated using a particular technique, other suitable techniques may be used.
  • a current processor core environment e.g., guest or VMM
  • VMM executes its state information and restores state information for a target core environment (e.g., VMM or guest) to which the processor core execution is switched.
  • a VMM executes a world switch when the VMM executes a guest that was scheduled for execution.
  • a world switch from executing a guest to executing a VMM is made when the VMM exercises control over physical resources, e.g., when the guest attempts to access a peripheral device, when a new page of memory is to be allocated to the guest, or when it is time for the VMM to schedule another guest, etc.
  • Virtualization techniques may be implemented using only software (which includes firmware) or by a combination of software and hardware.
  • some processors include virtualization hardware, which allows simplification of VMM code and improves system performance for full virtualization (e.g., hardware extensions for virtualization provided by AMD-V and Intel VT-x).
  • Software, as described herein, may be encoded in at least one computer readable medium selected from the set of a disk, tape, or other magnetic, optical, or electronic storage medium.
  • Virtualization techniques may be used to isolate or sequester one or more processor cores of a computing system from an operating system executing as a guest on one or more other processing cores of the computer system under control of a VMM.
  • sequestered cores may be configured as de facto accelerators. That is, sequestered cores are used by the VMM to complete work initiated from within the operating system environment. Although the host cores and the sequestered cores reside within a shared memory environment, the sequestered cores are not managed by the operating system directly.
  • the VMM is configured as a vehicle for communicating between the sequestered cores and the host cores.
  • An exemplary VMM implements a memory-based solution for propagating work requests, page faults, and completion information using a queue-based architecture implemented within a shared memory space. Computational work may be initiated within the confines of the guest operating system. A VMM then coordinates work between the operating system and the sequestered cores. Accordingly, a VMM may be used to implement general computational acceleration. A VMM and sequestered cores may be used to implement instant-on application usage. In addition, a VMM may be used to configure sequestered cores as network device accelerators.
  • the number of cores used by a guest operating system may be selectable.
  • the number of host cores may be the maximum number of cores that a particular guest operating system is able to utilize.
  • the number of cores used by the guest operating system is not limited thereto, and a system may be configured with a predetermined number of cores for an operating system that is less than a maximum number of cores.
  • exemplary computing system 400 includes VMM 402 .
  • VMM 402 emulates a decoupled architecture, i.e., VMM 402 sequesters cores to execute applications or application tasks.
  • VMM 402 sequesters cores 406 from cores 404 .
  • VMM 402 assigns host cores 404 and sequestered cores 406 separate virtual memory spaces.
  • VMM 402 assigns host cores 404 and sequestered cores 406 a shared virtual memory space. Techniques for implementing a shared virtual memory space are described in U.S.
  • VMM 402 maintains a set of control blocks, which include state and control information for execution of a guest on host cores 404 and a set of state and control information for execution of a work unit on sequestered cores 406 .
  • these control blocks are known as virtual machine control blocks (VMCBs).
  • VMCBs virtual machine control blocks
  • Each guest and de facto accelerator may be associated with a corresponding control block.
  • Exemplary control blocks may be stored in memory and/or in storage of the host hardware and include state and control information for a corresponding guest or de facto accelerator and/or state and control information for the VMM.
  • a control block includes state information corresponding to core state at a point at which a guest last exited.
  • Exemplary control blocks may be accessed by particular instructions and information may be stored in particular fields of predetermined data structures.
  • VMM 402 is configured to isolate at least one core (e.g., sequestered cores 406 ) for use as a de facto accelerator.
  • Operating system 408 e.g., Microsoft Windows
  • host cores 404 e.g., x86 cores
  • application 414 executes on operating system 408 .
  • Kernel mode driver 410 which executes on operating system 408 , exchanges information with VMM 402 to provide user application 414 indirect access to the de facto accelerators.
  • the guest operating system may utilize sequestered cores 406 using kernel mode driver 410 , e.g., using a call.
  • Communications between VMM 402 and guest operating system 408 and between VMM 402 and de facto accelerators are accomplished using queues in shared virtual memory (e.g., work queue 424 , command queue 418 , fault queue 422 , and response queue 420 ).
  • shared virtual memory e.g., work queue 424 , command queue 418 , fault queue 422 , and response queue 420 .
  • Scheduler 416 includes a thread pool across which work items are distributed to available segregated cores 406 .
  • the work units are assigned to available segregated cores using round-robin scheduling; however, other suitable scheduling algorithms (e.g., dynamic priority scheduling, etc.) may be used in other embodiments of scheduler 416 .
  • scheduler 416 is a user-mode scheduler, which allows scheduling to be performed separate from the operating system.
  • scheduler 416 is a kernel-mode scheduler, which requires modification of kernel-level portions of the operating system.
  • At least some of the functionality of scheduler 416 is performed by VMM 402 and/or at least some of the functionality of scheduler 416 is performed by kernel mode driver 410 .
  • VMM 402 maintains relevant topology and architecture information in an information or control structure that is visible to kernel mode driver 410 .
  • VMM 402 provides at least information about available de facto accelerators to kernel mode driver 410 .
  • a fault queue 422 , command queue 418 , response queue 420 , and work queue 424 are implemented in shared virtual memory space. All of those queues require operating system access (e.g., kernel mode access). In at least one embodiment of computing system 400 , the queues must be accessible from outside of the process context of a creating application. Thus, operating system 408 must provide memory translation. Only the work queue requires user-mode access. In at least one embodiment, queues, 418 , 420 , 422 , and 424 use non-locking implementations and are configured for a single reader and a single writer. Virtual machine monitor 402 enqueues to fault queue 422 and response queue 420 .
  • Kernel mode driver 410 dequeues from fault queue 422 and response queue 420 . Kernel mode driver 410 enqueues to command queue 418 and VMM 402 dequeues from command queue 418 . Application 414 enqueues to work queue 424 . Scheduler 416 , which may be implemented using VMM 402 and/or kernel mode driver 410 , dequeues from work queue 424 .
  • application 414 calls queueing application programming interface (API) 412 to initialize the queueing interfaces.
  • Queueing API 412 instantiates kernel mode driver 410 and makes documented input/output control (ioctl) calls to allocate the queues.
  • Kernel mode driver 410 receives the ioctl command and allocates queues that may be read or written by appropriate entities (e.g., VMM 402 and kernel mode driver 410 ), consistent with the description above.
  • Kernel mode driver 410 creates an internal work table that associates work queue 424 with an address space. Kernel mode driver 410 also creates a page table and allocates stacks for the de facto accelerators. Kernel mode driver 410 creates a kernel mode thread and also returns a pointer to work queue 424 for use by application 414 .
  • polling techniques are used to process the queues.
  • communications between VMM 402 and guest operating system 408 and between VMM 402 and sequestered cores 406 , configured as de facto accelerators are achieved using doorbell techniques.
  • any writer e.g., kernel mode driver 410 , queuing API 412 , or VMM 402
  • VMM 402 supports a VMM call that serves as a doorbell for a specific queue.
  • VMM 402 rings the doorbell of kernel mode driver 410 by issuing a software interrupt. Different software interrupts may be used to distinguish between different doorbell recipients.
  • application 414 may push an entry into work queue 424 via queueing API 412 and kernel mode driver 410 rings a doorbell for VMM 402 , e.g., by executing a VMMCALL, to indicate that the work queue has a new entry.
  • the VMMCALL instruction transfers control from guest operating system 408 to VMM 402 .
  • kernel mode driver 410 pushes a command into command queue 418
  • kernel mode driver 410 rings a doorbell (e.g., by executing a VMMCALL) for VMM 402 to indicate that the command queue has a new entry.
  • VMM 402 may push an entry into fault queue 422 and send a fault queue interrupt via a local Advanced Programmable Interrupt Controller (APIC) to a host core 404 .
  • APIC Advanced Programmable Interrupt Controller
  • VMM 402 can ring the doorbell of kernel mode driver 410 using software interrupts. The particular interrupt number used is stored in a field in a configuration block and maintained by kernel mode driver 410 .
  • Application 414 creates work queue 424 and registers with kernel mode driver 410 for an entry point in the work queue table.
  • Application 414 uses queuing API 412 to add work items to work queue 424 .
  • Queuing API 412 rings the doorbell of scheduler 416 .
  • kernel mode driver 410 will read work queue 424 .
  • calls to VMM 402 will explicitly include an indicator of which core should be targeted by VMM 402 .
  • scheduler 416 determines whether a de facto accelerator is available. If no de facto accelerator is available, scheduler 416 updates a status to indicate that work queue 424 is not empty. If a de facto accelerator is available, scheduler 416 reads work queue 424 .
  • Scheduler 416 selects an available de facto accelerator and makes a scheduling call to VMM 402 .
  • scheduler 416 when scheduler 416 is distinct from VMM 402 , scheduler 416 may write a command to command queue 418 and ring the doorbell of VMM 402 . Then VMM 402 sets up execution context and initializes a target sequestered core 406 configured as a de facto accelerator. VMM 402 writes to response queue 420 and scheduler 416 processes response queue 420 to maintain visibility into status (e.g., availability) of sequestered cores 406 . When scheduler 416 dequeues a work item from work queue 424 , scheduler 416 consults a list of available de facto accelerators of sequestered core 406 configured as de facto accelerators and selects a target sequestered core 406 .
  • Scheduler 416 then creates and enqueues a command queue entry that indicates the work item and the target sequestered core 406 . Then scheduler 416 rings the doorbell of VMM 402 . In order for scheduler 416 to maintain an accurate view of resource availability, scheduler 416 should be notified of work item completion.
  • a system stack is manipulated so that a return from a work item makes a VMM call to notify VMM 402 of work item completion.
  • VMM 402 boots on the cores of system 400 (e.g., host cores 404 and sequestered cores 406 ) ( 502 ).
  • VMM 402 is booted from memory (e.g., on a hard drive), separately from the Basic Input Output System.
  • Virtual machine monitor 402 then boots operating system 408 as a guest on operating system cores 404 and sequesters cores 406 from cores 402 ( 504 ). For example, when booting operating system 408 , VMM 402 informs operating system 408 of a number of cores on which to execute. Then operating system 408 will not attempt to access sequestered cores 406 .
  • Other techniques for sequestering cores 406 from operating system cores 404 include modifying the BIOS tables so that operating system 408 is aware of only a particular number of cores less than a total number of cores, with virtual machine monitor 402 controlling the environments on both sets of cores. Those BIOS tables may either be loaded automatically from read-only memory or patched in by VMM 402 .
  • VMM 402 intercepts operating system commands to configure a number of operating system cores.
  • operating system 408 loads an accelerated computing kernel mode device driver 410 ( 508 ).
  • Application 414 runs on operating system 408 ( 510 ).
  • Application 414 generates work units, which are then scheduled to execute on sequestered cores 406 ( 512 ).
  • VMM 402 Upon completion, VMM 402 notifies operating system 408 of completed work ( 514 ).
  • kernel mode driver 410 creates an internal work table, which may be used for adding work queue table entries ( 602 ).
  • Application 414 creates a work queue and registers with kernel mode driver 410 for an entry in the work queue table ( 604 ). While executing, application 414 pushes a work queue entry onto work queue 424 ( 606 ).
  • Kernel mode driver 410 notifies VMM 402 that work queue 424 has a new entry ( 608 ) using a doorbell (e.g., VMMCALL), as described above, or other suitable notification technique.
  • a doorbell e.g., VMMCALL
  • Virtual memory monitor 402 processes the doorbell on host cores 404 and sends an INIT inter-processor interrupt (IPI) to a particular sequestered core 406 .
  • Virtual machine monitor 402 processes an exit to VMM 402 on the particular sequestered core 406 ( 610 ). If the particular sequestered core 406 is idle (i.e., is not already processing a work unit), VMM 402 pulls a next work unit entry from work queue 424 ( 612 ), modifies a VMCB, and begins execution of code for processing the work unit ( 614 ). Otherwise, the particular sequestered core continues executing a previously launched work unit. In at least one embodiment of computing system 400 , if a particular sequestered core 406 is already executing a work unit, VMM 402 will not interrupt that particular sequestered core 406 with an exit to VMM 402 .
  • IPI INIT inter-processor interrupt
  • a sequestered core 406 configured as a de facto accelerator may experience a page fault (i.e., sequestered core 406 accesses a page that is mapped in address space but is not loaded into physical memory).
  • a page fault i.e., sequestered core 406 accesses a page that is mapped in address space but is not loaded into physical memory.
  • those page faults experienced by sequestered core 406 are recognized by VMM 402 and a world switch occurs to VMM 402 ( 702 ).
  • Virtual machine monitor 402 obtains page fault information from the sequestered core and creates a kernel-level page fault entry, which VMM 402 pushes onto user fault queue 422 ( 704 ).
  • Virtual machine monitor 402 issues a fault queue interrupt via a local APIC to one of host cores 404 ( 706 ).
  • Kernel mode driver 410 interrupt handler processes the interrupt and executes a fault queue deferred procedure call and reads the fault off of system fault queue 428 .
  • Kernel mode driver 410 updates the page tables associated with the user process ( 710 ) and generates a command (e.g., CMD_RESUME including a field for a target core) for resuming execution by the sequestered core 406 configured as a de facto accelerator ( 712 ).
  • a command e.g., CMD_RESUME including a field for a target core
  • Kernel mode driver 410 pushes that command into command queue 418 ( 712 ) and rings a doorbell of VMM 402 (e.g., VMMCALL) that indicates that command queue 418 has a new entry ( 714 ).
  • Virtual machine monitor 402 processes the VMMCALL on host core 404 and issues an inter-processor interrupt (i.e., INIT IPI) to a sequestered core 406 that includes queue handler 412 (i.e., de facto accelerator core 0 ), which processes command queue 418 .
  • INIT IPI inter-processor interrupt
  • de facto accelerator core 0 In response to the inter-processor interrupt, de facto accelerator core 0 reads command queue 418 and processes the command (e.g., CMD_RESUME) ( 716 ), e.g., by sending an inter-processor interrupt to an appropriate sequestered core 406 to resume processing the work unit ( 718 ).
  • Virtual machine monitor 402 then processes a VMEXIT (e.g., performs a world switch) and the sequestered core 406 resumes processing the work unit ( 720 ).
  • VMEXIT e.g., performs a world switch
  • the sequestered core 406 executes a routine that includes one or more instructions that indicate the work unit has completed execution (e.g., VMMCALL) ( 802 ). Accordingly, sequestered core 406 returns to execution of VMM 402 , and VMM 402 processes the indicator of work unit completion ( 804 ). In at least one embodiment of computing system 400 , VMM 402 determines whether it is configured to issue a notification of work unit completion ( 808 ).
  • VMM 402 will proceed to process a next work unit ( 810 ). Alternatively, VMM will issue a completion directive. In at least one embodiment, VMM 402 pushes a work unit completion entry into system fault queue 428 and VMM 402 sends a fault queue interrupt (e.g., via local APIC) to an operating system core 404 ( 812 ).
  • a fault queue interrupt e.g., via local APIC
  • Kernel mode driver 410 processes the fault queue interrupt and reads an entry from system fault queue. Kernel mode driver 410 locates the user process context associated with the fault entry and pushes the fault entry into a particular user fault queue 422 for the process context ( 814 ). A user work thread handler in kernel mode driver 410 pulls a fault entry from user fault queue 422 and completes the work unit ( 818 ).
  • sequestered cores 406 are configured for instant-on application usage, rather than as de facto accelerators.
  • VMM 402 boots on the cores of system 400 (e.g., host cores 404 and sequestered cores 406 ) ( 902 ).
  • VMM 402 may reside in the BIOS and automatically sequesters cores 406 from cores 402 ( 904 ).
  • Virtual machine monitor 402 is configured to have access to the file system and runs a user application on one or more of sequestered cores 406 ( 906 ).
  • VMM 402 boots operating system 408 as a guest on host cores 404 ( 906 ).
  • Virtual machine monitor 402 includes one or more drivers or basic input output system (i.e., BIOS interface) functions to access media containing an application that will initially run on sequestered cores 406 .
  • BIOS interface basic input output system
  • VMM 402 is described as a virtual machine monitor in general, in at least one embodiment, VMM 402 is a minimalistic implementation of a virtual machine monitor that is configured to provide the functionality described herein, and few other virtualization functions. In another embodiment, the functionality of VMM 402 described herein is incorporated into a general virtual machine monitor that provides other typical virtual machine functions.
  • virtual machine monitors may be nested, e.g., operating system 408 is a VMM machine monitor that is controlled by VMM 402 consistent with the functionality described herein.
  • use of virtualization techniques to sequester cores requires no modification to the operating system.
  • VMM 402 may coordinate with a network router device to accelerate packet inspection functions using sequestered cores 406 .

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Hardware Redundancy (AREA)
  • Debugging And Monitoring (AREA)

Abstract

Techniques for utilizing processor cores include sequestering processor cores for use independently from an operating system. In at least one embodiment of the invention, a method includes executing an operating system on a first subset of cores including one or more cores of a plurality of cores of a computer system. The operating system executes as a guest under control of a virtual machine monitor. The method includes executing work for an application on a second subset of cores including one or more cores of the plurality of cores. The first and second subsets of cores are mutually exclusive and the second subset of cores is not visible to the operating system. In at least one embodiment, the method includes sequestering the second subset of cores from the operating system.

Description

BACKGROUND
1. Field of the Invention
The invention is related to computer systems and more particularly to multi-core computer systems.
2. Description of the Related Art
In general, the number of central processing unit (CPU) cores (i.e., processor cores) and/or processors included within a computing system is increasing rapidly. Referring to FIG. 1, an exemplary computing system 100 includes multiple processors 102, each of which includes one or more processor cores (e.g., processor cores 104). Processors 102 are coupled to other processors 102, memory 106, devices 108, and storage 110 by one or more hub integrated circuits (e.g., memory controller hub and I/O controller hub), bus (e.g., PCI bus, ISA bus, and SMBus), other suitable communication interfaces, or combinations thereof. An operating system (e.g., Microsoft Windows, Linux, and UNIX) provides an interface between the hardware and a user (i.e., computing applications, e.g., applications 114). Execution of operating system 112 may be distributed across a plurality of cores 104.
Although a computing system includes multiple processor cores, a typical computing system may not be able to utilize all processor cores or utilize all processor cores efficiently. For example, an operating system may be able to access and control only a limited number of CPU cores, leaving idle other cores in the computing system.
SUMMARY OF EMBODIMENTS OF THE INVENTION
Accordingly, techniques for utilizing processor cores include sequestering processor cores for use independently from an operating system. In at least one embodiment of the invention, a method includes executing an operating system on a first subset of cores including one or more cores of a plurality of cores of a computer system. The operating system executes as a guest under control of a virtual machine monitor. The method includes executing work for an application on a second subset of cores including one or more cores of the plurality of cores. The first and second subsets of cores are mutually exclusive and the second subset of cores is not visible to the operating system. In at least one embodiment, the method includes sequestering the second subset of cores from the operating system.
In at least one embodiment of the invention, an apparatus includes a plurality of cores and an operating system software encoded in one or more media accessible to the plurality of cores. The apparatus includes hypervisor software encoded in one or more media accessible to the plurality of cores and executable on one or more of the plurality of cores. The hypervisor software is executable to control execution of the operating system software as a guest on a first set of cores including one or more cores of the plurality of cores and to execute at least some work of an application on a second set of cores including one or more cores of the plurality of cores. The second set of cores is not visible to the operating system.
In at least one embodiment of the invention, a computer program product includes one or more functional sequences executable as, or in conjunction with, a virtual machine monitor and configured to execute an operating system sequence as a guest under control of the virtual machine monitor on a first set of cores including one or more cores of a plurality of cores. The computer program product includes one or more functional sequences to execute at least some work of an application on a second set of cores including one or more cores of the plurality of cores. The second set of cores is not visible to the operating system.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention may be better understood, and its numerous objects, features, and advantages made apparent to those skilled in the art by referencing the accompanying drawings.
FIG. 1 illustrates a functional block diagram of an exemplary multi-core computing system.
FIG. 2 illustrates a functional block diagram of an exemplary virtualization system.
FIG. 3 illustrates a functional block diagram of an exemplary virtualization system consistent with at least one embodiment of the invention.
FIG. 4 illustrates a functional block diagram of an exemplary virtual machine monitor executing on the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
FIG. 5 illustrates exemplary information and control flows in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
FIG. 6 illustrates exemplary information and control flows for a work unit process flow in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
FIG. 7 illustrates exemplary information and control flows for work unit page fault processing in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
FIG. 8 illustrates exemplary information and control flows for work unit command completion in the virtualization system of FIG. 3 with sequestered processor cores configured as de facto accelerators consistent with at least one embodiment of the invention.
FIG. 9 illustrates information and control flows in the virtualization system of FIG. 3 configured for instant-on application usage consistent with at least one embodiment of the invention.
The use of the same reference symbols in different drawings indicates similar or identical items.
DETAILED DESCRIPTION
Referring to FIG. 2, virtualization of a computing system is used to hide physical characteristics of the computing system from a user (i.e., software executing on the computing system) and instead, presents an abstract emulated computing system (i.e., a virtual machine (VM)) to the user. Physical hardware resources of computing system 100 are exposed to one or more guests (e.g., guests 206) as one or more corresponding isolated, apparently independent, virtual machines (e.g., VM 204). For example, a virtual machine may include one or more virtual resources (e.g., VCPU, VMEMORY, and VDEVICES) that are implemented by physical resources of computing system 100 that a virtual machine monitor (VMM) (i.e., hypervisor, e.g., VMM 202) allocates to the virtual machine.
As referred to herein, a “virtual machine monitor” (VMM) or “hypervisor” is software that provides the virtualization capability. The VMM provides an interface between the guest software and the physical resources. Typically, the VMM provides each guest the appearance of full control over a complete computer system (i.e., memory, central processing unit (CPU) and all peripheral devices). A Type 1 (i.e., native) VMM is a standalone software program that executes on physical resources and provides the virtualization for one or more guests. A guest operating system executes on a level above the VMM. A Type 2 (i.e., hosted) VMM is integrated into or executes on an operating system, the operating system components execute directly on physical resources and are not virtualized by the VMM. The VMM is considered a distinct software layer and a guest operating system may execute on a third software level above the hardware. Although the description that follows refers to an exemplary Type 1 VMM, techniques described herein may be implemented in a Type 2 VMM.
Referring back to FIG. 2, while VM 204 has full control over the virtual resources of virtual machine 204, VMM 202 retains control over the physical resources. A guest system, e.g., an instance of an operating system (e.g., Windows, Linux, and UNIX) executes on a corresponding virtual machine and shares physical resources with other guest systems executing on other virtual machines. Thus, multiple operating systems (e.g., multiple instances of the same operating system or instances of different operating systems) can co-exist on the same computing system, but in isolation from each other.
VMM 202 is executed by some or all processor cores in the physical resources. An individual guest is executed by a set of processor cores included in the physical resources. The processors switch between execution of VMM 202 and execution of one or more guests 206. As referred to herein, a “world switch” is a switch between execution of a guest and execution of a VMM. In general, a world switch may be initiated by a VMMCALL instruction or by other suitable techniques, e.g., interrupt mechanisms or predetermined instructions defined by a control block, described below. Although a particular world switch may be described herein as being initiated using a particular technique, other suitable techniques may be used. During a world switch, a current processor core environment (e.g., guest or VMM) saves its state information and restores state information for a target core environment (e.g., VMM or guest) to which the processor core execution is switched. For example, a VMM executes a world switch when the VMM executes a guest that was scheduled for execution. Similarly, a world switch from executing a guest to executing a VMM is made when the VMM exercises control over physical resources, e.g., when the guest attempts to access a peripheral device, when a new page of memory is to be allocated to the guest, or when it is time for the VMM to schedule another guest, etc.
Virtualization techniques may be implemented using only software (which includes firmware) or by a combination of software and hardware. For example, some processors include virtualization hardware, which allows simplification of VMM code and improves system performance for full virtualization (e.g., hardware extensions for virtualization provided by AMD-V and Intel VT-x). Software, as described herein, may be encoded in at least one computer readable medium selected from the set of a disk, tape, or other magnetic, optical, or electronic storage medium.
Virtualization techniques may be used to isolate or sequester one or more processor cores of a computing system from an operating system executing as a guest on one or more other processing cores of the computer system under control of a VMM. In at least one embodiment of a virtualization system, sequestered cores may be configured as de facto accelerators. That is, sequestered cores are used by the VMM to complete work initiated from within the operating system environment. Although the host cores and the sequestered cores reside within a shared memory environment, the sequestered cores are not managed by the operating system directly. The VMM is configured as a vehicle for communicating between the sequestered cores and the host cores. An exemplary VMM implements a memory-based solution for propagating work requests, page faults, and completion information using a queue-based architecture implemented within a shared memory space. Computational work may be initiated within the confines of the guest operating system. A VMM then coordinates work between the operating system and the sequestered cores. Accordingly, a VMM may be used to implement general computational acceleration. A VMM and sequestered cores may be used to implement instant-on application usage. In addition, a VMM may be used to configure sequestered cores as network device accelerators.
The number of cores used by a guest operating system (i.e., host cores) may be selectable. For example, the number of host cores may be the maximum number of cores that a particular guest operating system is able to utilize. However, in at least one embodiment of a virtualization system, the number of cores used by the guest operating system is not limited thereto, and a system may be configured with a predetermined number of cores for an operating system that is less than a maximum number of cores.
Referring to FIG. 3, exemplary computing system 400 includes VMM 402. VMM 402 emulates a decoupled architecture, i.e., VMM 402 sequesters cores to execute applications or application tasks. In at least one embodiment, VMM 402 sequesters cores 406 from cores 404. In at least one embodiment, VMM 402 assigns host cores 404 and sequestered cores 406 separate virtual memory spaces. In at least one embodiment, VMM 402 assigns host cores 404 and sequestered cores 406 a shared virtual memory space. Techniques for implementing a shared virtual memory space are described in U.S. patent application Ser. No. 12/648,550, entitled “SYSTEMS AND METHODS IMPLEMENTING NON-SHARED PAGE TABLES FOR SHARING MEMORY RESOURCES MANAGED BY A MAIN OPERATING SYSTEM WITH ACCELERATOR DEVICES,” naming Patryk Kaminski, Thomas Woller, Keith Lowery, and Erich Boleyn, as inventors, now U.S. Pat. No. 8,719,543, issued May 6, 2014, and U.S. patent application Ser. No. 12/648,556, entitled “SYSTEMS AND METHODS IMPLEMENTING SHARED PAGE TABLES FOR SHARING MEMORY RESOURCES MANAGED BY A MAIN OPERATING SYSTEM WITH ACCELERATOR DEVICES,” naming Patryk Kaminski, Thomas Woller, Keith Lowery, and Erich Boleyn, as inventors, both filed on or about the filing date of the instant application, which applications are hereby incorporated by reference herein.
In at least one embodiment, VMM 402 maintains a set of control blocks, which include state and control information for execution of a guest on host cores 404 and a set of state and control information for execution of a work unit on sequestered cores 406. In at least one embodiment, these control blocks are known as virtual machine control blocks (VMCBs). Each guest and de facto accelerator may be associated with a corresponding control block. Exemplary control blocks may be stored in memory and/or in storage of the host hardware and include state and control information for a corresponding guest or de facto accelerator and/or state and control information for the VMM. For example, a control block includes state information corresponding to core state at a point at which a guest last exited. Exemplary control blocks may be accessed by particular instructions and information may be stored in particular fields of predetermined data structures.
In at least one embodiment of computing system 400, VMM 402 is configured to isolate at least one core (e.g., sequestered cores 406) for use as a de facto accelerator. Operating system 408 (e.g., Microsoft Windows) executes as a guest on host cores 404 (e.g., x86 cores) and application 414 executes on operating system 408. Kernel mode driver 410, which executes on operating system 408, exchanges information with VMM 402 to provide user application 414 indirect access to the de facto accelerators. The guest operating system may utilize sequestered cores 406 using kernel mode driver 410, e.g., using a call. Communications between VMM 402 and guest operating system 408 and between VMM 402 and de facto accelerators are accomplished using queues in shared virtual memory (e.g., work queue 424, command queue 418, fault queue 422, and response queue 420).
Scheduler 416 includes a thread pool across which work items are distributed to available segregated cores 406. In at least one embodiment of scheduler 416, the work units are assigned to available segregated cores using round-robin scheduling; however, other suitable scheduling algorithms (e.g., dynamic priority scheduling, etc.) may be used in other embodiments of scheduler 416. In at least one embodiment of computing system 400, scheduler 416 is a user-mode scheduler, which allows scheduling to be performed separate from the operating system. However, in at least one embodiment of computing system 400, scheduler 416 is a kernel-mode scheduler, which requires modification of kernel-level portions of the operating system. In at least one embodiment of computing system 400, at least some of the functionality of scheduler 416 is performed by VMM 402 and/or at least some of the functionality of scheduler 416 is performed by kernel mode driver 410. VMM 402 maintains relevant topology and architecture information in an information or control structure that is visible to kernel mode driver 410. VMM 402 provides at least information about available de facto accelerators to kernel mode driver 410.
In at least one embodiment of computing system 400, a fault queue 422, command queue 418, response queue 420, and work queue 424 are implemented in shared virtual memory space. All of those queues require operating system access (e.g., kernel mode access). In at least one embodiment of computing system 400, the queues must be accessible from outside of the process context of a creating application. Thus, operating system 408 must provide memory translation. Only the work queue requires user-mode access. In at least one embodiment, queues, 418, 420, 422, and 424 use non-locking implementations and are configured for a single reader and a single writer. Virtual machine monitor 402 enqueues to fault queue 422 and response queue 420. Kernel mode driver 410 dequeues from fault queue 422 and response queue 420. Kernel mode driver 410 enqueues to command queue 418 and VMM 402 dequeues from command queue 418. Application 414 enqueues to work queue 424. Scheduler 416, which may be implemented using VMM 402 and/or kernel mode driver 410, dequeues from work queue 424.
In at least one embodiment of computing system 400, application 414 calls queueing application programming interface (API) 412 to initialize the queueing interfaces. Queueing API 412 instantiates kernel mode driver 410 and makes documented input/output control (ioctl) calls to allocate the queues. Kernel mode driver 410 receives the ioctl command and allocates queues that may be read or written by appropriate entities (e.g., VMM 402 and kernel mode driver 410), consistent with the description above. Kernel mode driver 410 creates an internal work table that associates work queue 424 with an address space. Kernel mode driver 410 also creates a page table and allocates stacks for the de facto accelerators. Kernel mode driver 410 creates a kernel mode thread and also returns a pointer to work queue 424 for use by application 414.
In at least one embodiment of computing system 400, polling techniques are used to process the queues. In at least one embodiment of computing system 400, rather than using polling techniques, communications between VMM 402 and guest operating system 408 and between VMM 402 and sequestered cores 406, configured as de facto accelerators, are achieved using doorbell techniques. In general, any writer (e.g., kernel mode driver 410, queuing API 412, or VMM 402) to a queue will ring a doorbell to notify a recipient (e.g., kernel mode driver 410 or VMM 402) of available queue items. In at least one embodiment of the computing system, VMM 402 supports a VMM call that serves as a doorbell for a specific queue. Information that indicates which queue contains a new entry, and/or other suitable information, is included in the parameters of the VMM call. In addition, VMM 402 rings the doorbell of kernel mode driver 410 by issuing a software interrupt. Different software interrupts may be used to distinguish between different doorbell recipients.
For example, application 414 may push an entry into work queue 424 via queueing API 412 and kernel mode driver 410 rings a doorbell for VMM 402, e.g., by executing a VMMCALL, to indicate that the work queue has a new entry. The VMMCALL instruction transfers control from guest operating system 408 to VMM 402. Similarly, when kernel mode driver 410 pushes a command into command queue 418, kernel mode driver 410 rings a doorbell (e.g., by executing a VMMCALL) for VMM 402 to indicate that the command queue has a new entry. In yet another example, when a work unit has completed on a sequestered core 406 configured as a de facto accelerator, VMM 402 may push an entry into fault queue 422 and send a fault queue interrupt via a local Advanced Programmable Interrupt Controller (APIC) to a host core 404. VMM 402 can ring the doorbell of kernel mode driver 410 using software interrupts. The particular interrupt number used is stored in a field in a configuration block and maintained by kernel mode driver 410.
Application 414 creates work queue 424 and registers with kernel mode driver 410 for an entry point in the work queue table. Application 414 uses queuing API 412 to add work items to work queue 424. Queuing API 412 rings the doorbell of scheduler 416. In embodiments where scheduling logic resides in kernel mode driver 410, kernel mode driver 410 will read work queue 424. Accordingly, calls to VMM 402 will explicitly include an indicator of which core should be targeted by VMM 402. In response to the doorbell, scheduler 416 determines whether a de facto accelerator is available. If no de facto accelerator is available, scheduler 416 updates a status to indicate that work queue 424 is not empty. If a de facto accelerator is available, scheduler 416 reads work queue 424. Scheduler 416 selects an available de facto accelerator and makes a scheduling call to VMM 402.
In at least one embodiment of computing system 400, when scheduler 416 is distinct from VMM 402, scheduler 416 may write a command to command queue 418 and ring the doorbell of VMM 402. Then VMM 402 sets up execution context and initializes a target sequestered core 406 configured as a de facto accelerator. VMM 402 writes to response queue 420 and scheduler 416 processes response queue 420 to maintain visibility into status (e.g., availability) of sequestered cores 406. When scheduler 416 dequeues a work item from work queue 424, scheduler 416 consults a list of available de facto accelerators of sequestered core 406 configured as de facto accelerators and selects a target sequestered core 406. Scheduler 416 then creates and enqueues a command queue entry that indicates the work item and the target sequestered core 406. Then scheduler 416 rings the doorbell of VMM 402. In order for scheduler 416 to maintain an accurate view of resource availability, scheduler 416 should be notified of work item completion. In at least one embodiment of computing system 400, a system stack is manipulated so that a return from a work item makes a VMM call to notify VMM 402 of work item completion.
Referring to FIGS. 3, 4, and 5, upon a system reset, VMM 402 boots on the cores of system 400 (e.g., host cores 404 and sequestered cores 406) (502). In at least one embodiment, VMM 402 is booted from memory (e.g., on a hard drive), separately from the Basic Input Output System. Virtual machine monitor 402 then boots operating system 408 as a guest on operating system cores 404 and sequesters cores 406 from cores 402 (504). For example, when booting operating system 408, VMM 402 informs operating system 408 of a number of cores on which to execute. Then operating system 408 will not attempt to access sequestered cores 406. Other techniques for sequestering cores 406 from operating system cores 404 include modifying the BIOS tables so that operating system 408 is aware of only a particular number of cores less than a total number of cores, with virtual machine monitor 402 controlling the environments on both sets of cores. Those BIOS tables may either be loaded automatically from read-only memory or patched in by VMM 402. In another technique for sequestering cores from the operating system, VMM 402 intercepts operating system commands to configure a number of operating system cores.
After the cores are sequestered and the operating system has booted, operating system 408 loads an accelerated computing kernel mode device driver 410 (508). Application 414 runs on operating system 408 (510). Application 414 generates work units, which are then scheduled to execute on sequestered cores 406 (512). Upon completion, VMM 402 notifies operating system 408 of completed work (514).
Referring to FIGS. 3, 4, and 6, a work unit initiation process is described in additional detail. In at least one embodiment of computing system 400, kernel mode driver 410 creates an internal work table, which may be used for adding work queue table entries (602). Application 414 creates a work queue and registers with kernel mode driver 410 for an entry in the work queue table (604). While executing, application 414 pushes a work queue entry onto work queue 424 (606). Kernel mode driver 410 notifies VMM 402 that work queue 424 has a new entry (608) using a doorbell (e.g., VMMCALL), as described above, or other suitable notification technique. Virtual memory monitor 402 processes the doorbell on host cores 404 and sends an INIT inter-processor interrupt (IPI) to a particular sequestered core 406. Virtual machine monitor 402 processes an exit to VMM 402 on the particular sequestered core 406 (610). If the particular sequestered core 406 is idle (i.e., is not already processing a work unit), VMM 402 pulls a next work unit entry from work queue 424 (612), modifies a VMCB, and begins execution of code for processing the work unit (614). Otherwise, the particular sequestered core continues executing a previously launched work unit. In at least one embodiment of computing system 400, if a particular sequestered core 406 is already executing a work unit, VMM 402 will not interrupt that particular sequestered core 406 with an exit to VMM 402.
While processing a work unit, a sequestered core 406 configured as a de facto accelerator may experience a page fault (i.e., sequestered core 406 accesses a page that is mapped in address space but is not loaded into physical memory). Referring to FIGS. 3, 4, and 7, in at least one embodiment of computing system 400, those page faults experienced by sequestered core 406 are recognized by VMM 402 and a world switch occurs to VMM 402 (702). Virtual machine monitor 402 obtains page fault information from the sequestered core and creates a kernel-level page fault entry, which VMM 402 pushes onto user fault queue 422 (704). Virtual machine monitor 402 issues a fault queue interrupt via a local APIC to one of host cores 404 (706). Kernel mode driver 410 interrupt handler processes the interrupt and executes a fault queue deferred procedure call and reads the fault off of system fault queue 428. Kernel mode driver 410 updates the page tables associated with the user process (710) and generates a command (e.g., CMD_RESUME including a field for a target core) for resuming execution by the sequestered core 406 configured as a de facto accelerator (712). Kernel mode driver 410 pushes that command into command queue 418 (712) and rings a doorbell of VMM 402 (e.g., VMMCALL) that indicates that command queue 418 has a new entry (714). Virtual machine monitor 402 processes the VMMCALL on host core 404 and issues an inter-processor interrupt (i.e., INIT IPI) to a sequestered core 406 that includes queue handler 412 (i.e., de facto accelerator core 0), which processes command queue 418. In response to the inter-processor interrupt, de facto accelerator core 0 reads command queue 418 and processes the command (e.g., CMD_RESUME) (716), e.g., by sending an inter-processor interrupt to an appropriate sequestered core 406 to resume processing the work unit (718). Virtual machine monitor 402 then processes a VMEXIT (e.g., performs a world switch) and the sequestered core 406 resumes processing the work unit (720).
Referring to FIGS. 3, 4, and 8, in at least one embodiment of computing system 400, once a work unit has been processed and the sequestered core 406 executes a last instruction for the work unit, the sequestered core 406 executes a routine that includes one or more instructions that indicate the work unit has completed execution (e.g., VMMCALL) (802). Accordingly, sequestered core 406 returns to execution of VMM 402, and VMM 402 processes the indicator of work unit completion (804). In at least one embodiment of computing system 400, VMM 402 determines whether it is configured to issue a notification of work unit completion (808). If VMM is not configured to issue a notification, VMM 402 will proceed to process a next work unit (810). Alternatively, VMM will issue a completion directive. In at least one embodiment, VMM 402 pushes a work unit completion entry into system fault queue 428 and VMM 402 sends a fault queue interrupt (e.g., via local APIC) to an operating system core 404 (812).
Kernel mode driver 410 processes the fault queue interrupt and reads an entry from system fault queue. Kernel mode driver 410 locates the user process context associated with the fault entry and pushes the fault entry into a particular user fault queue 422 for the process context (814). A user work thread handler in kernel mode driver 410 pulls a fault entry from user fault queue 422 and completes the work unit (818).
Referring to FIG. 9, in at least one embodiment of computing system 400, sequestered cores 406 are configured for instant-on application usage, rather than as de facto accelerators. Upon a system reset, VMM 402 boots on the cores of system 400 (e.g., host cores 404 and sequestered cores 406) (902). For example, VMM 402 may reside in the BIOS and automatically sequesters cores 406 from cores 402 (904). Virtual machine monitor 402 is configured to have access to the file system and runs a user application on one or more of sequestered cores 406 (906). Meanwhile, VMM 402 boots operating system 408 as a guest on host cores 404 (906). Virtual machine monitor 402 includes one or more drivers or basic input output system (i.e., BIOS interface) functions to access media containing an application that will initially run on sequestered cores 406.
Although VMM 402 is described as a virtual machine monitor in general, in at least one embodiment, VMM 402 is a minimalistic implementation of a virtual machine monitor that is configured to provide the functionality described herein, and few other virtualization functions. In another embodiment, the functionality of VMM 402 described herein is incorporated into a general virtual machine monitor that provides other typical virtual machine functions. In at least one embodiment of computing system 400, virtual machine monitors may be nested, e.g., operating system 408 is a VMM machine monitor that is controlled by VMM 402 consistent with the functionality described herein. In at least one embodiment of computing system 400, use of virtualization techniques to sequester cores requires no modification to the operating system.
The description of the invention set forth herein is illustrative, and is not intended to limit the scope of the invention as set forth in the following claims. For example, while the invention has been described in an embodiment in which sequestered cores are configured as de facto accelerators for an application execution on a guest operating system under control of a VMM, one of skill in the art will appreciate that the teachings herein can be utilized for instant-on applications, network device acceleration, and general computational acceleration. For example, VMM 402 may coordinate with a network router device to accelerate packet inspection functions using sequestered cores 406. In addition, although the invention has been described in a computing system in general, embodiments of the teachings described herein may be included in servers, desktop systems (e.g., personal computers), embedded applications (e.g., mobile communications devices) and other suitable applications. Variations and modifications of the embodiments disclosed herein may be made based on the description set forth herein, without departing from the scope and spirit of the invention as set forth in the following claims.

Claims (21)

What is claimed is:
1. A method comprising:
executing a virtual machine monitor on a computer system including a plurality of cores;
selecting one or more cores of the plurality of cores as a first subset of cores;
executing an operating system on the first subset of cores, wherein the operating system executes as a guest under control of the virtual machine monitor and an application executes using the operating system;
sequestering from the operating system, as one or more computing accelerators, a second subset of cores of the plurality of cores, the first and second subsets of cores being mutually exclusive, wherein the virtual machine monitor provides an interface to the first subset of cores and the second subset of cores;
scheduling work for the application to be completed by the second subset of cores;
causing a core of the second subset of cores to exit to the virtual machine monitor in response to an inter-processor interrupt caused by a core of the first subset of cores; and
executing the work for the application on the second subset of cores, the second subset of cores not being visible to the operating system, and the application indirectly accessing the second subset of cores using the virtual machine monitor.
2. The method of claim 1, wherein the sequestering includes dedicating a portion of shared system memory to the second subset of cores.
3. The method of claim 1, further comprising:
providing information regarding available computing accelerators to a driver executing on the operating system.
4. The method of claim 1, further comprising:
scheduling work to be completed by a target accelerator of the one or more computing accelerators.
5. The method of claim 1, wherein the executing comprises:
requesting work to be completed for the benefit of the application by the computing accelerators;
indicating to the operating system completed work; and
handling page faults generated by a work item.
6. The method of claim 1, further comprising:
accessing, by the second subset of cores, an application program interface provided by the operating system.
7. The method of claim 6, wherein the application program interface is one of an operating system memory allocation routine and an exception handling routine.
8. The method of claim 1, wherein the virtual machine monitor executes on the first subset of cores and the second subset of cores.
9. The method of claim 1, wherein the first subset includes fewer than a maximum number of cores the operating system is able to utilize.
10. The method of claim 1, wherein the sequestering comprises modifying BIOS tables to sequester the second set of cores.
11. The method of claim 1, wherein the selecting comprises intercepting an operating system command to configure a number of cores in the first subset of cores.
12. An apparatus comprising:
a plurality of cores;
operating system software encoded in one or more media accessible to the plurality of cores;
application software encoded in one or more media accessible to the plurality of cores;
hypervisor software encoded in one or more media accessible to the plurality of cores and executable on one or more of the plurality of cores, wherein the hypervisor software is executable to select a first set of cores and a second set of cores from the plurality of cores and executable to control execution of the operating system software as a guest on the first set of cores, the operating system software being configured to execute the application software, and wherein at least some work of the application software is executed on the second set of cores, wherein the second set of cores is not visible to the operating system and the first and second sets of cores are mutually exclusive, wherein the hypervisor software provides an interface to the first set of cores and the second set of cores, and is executable to cause a core of the second set of cores to exit to the hypervisor in response to an inter-processor interrupt caused by a core of the first set of cores; and
a computing driver encoded in one or more media accessible to the first set of cores, wherein the computing driver executes on the operating system and interacts with the hypervisor to schedule work of the application to the second set of cores.
13. The apparatus of claim 12, wherein the hypervisor software includes code executable on the plurality of cores to isolate the second set of cores from the operating system.
14. The apparatus of claim 12, further comprising: a shared system memory shared by the plurality of cores.
15. The apparatus of claim 12,
wherein the application software and the computing driver are configured to generate at least one queue for communicating between the application and the second set of cores.
16. The apparatus of claim 15, wherein the at least one queue includes:
a command queue configured to communicate from a computing driver to the hypervisor;
a response queue configured to communicate from the hypervisor to the computing driver;
a fault queue configured to communicate from the hypervisor to the computing driver; and
a work queue configured to communicate from a computing application program interface to the hypervisor.
17. The apparatus of claim 12, further comprising:
a computing application program interface encoded in one or more media accessible to the first set of cores.
18. The apparatus of claim 12, further comprising:
application software encoded in one or more media accessible to the plurality of cores executable on the operating system, wherein the hypervisor software includes code executable to configure the second set of cores as computing accelerators and code to execute work for the application software on the second set of cores.
19. A computer program product comprising:
a non-transitory computer-storage medium configured to store one or more functional sequences executable as, or in conjunction with, a virtual machine monitor and configured to select one or more cores of a plurality of cores of a computer system as a first set of cores, configured to execute an operating system sequence as a guest under control of the virtual machine monitor on the first set of cores, configured to sequester as one or more computing accelerators, a second set of cores of the plurality of cores, the first and second sets of cores being mutually exclusive, configured to execute an application sequence on the operating system, configured to schedule at least some work of the application sequence to the second set of cores, and configured to execute the at least some work of the application sequence on the second set of cores, wherein the second set of cores is not visible to the operating system, and configured to cause a core of the second set of cores to exit to the virtual machine monitor in response to an inter-processor interrupt caused by a core of the first set of cores, wherein the virtual machine monitor is configured to provide an interface to the first set of cores and the second set of cores.
20. The computer program product of claim 19, wherein the one or more functional sequences configure the second set of cores as computing accelerators for executing application code work on the second set of cores.
21. The computer program product of claim 19, wherein the computer program product is encoded in at least one computer readable medium selected from the set of a disk, tape, or other non-transitory storage medium.
US12/648,592 2009-12-29 2009-12-29 Hypervisor isolation of processor cores to enable computing accelerator cores Active 2032-11-10 US9058183B2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
US12/648,592 US9058183B2 (en) 2009-12-29 2009-12-29 Hypervisor isolation of processor cores to enable computing accelerator cores
KR1020127019346A KR101668399B1 (en) 2009-12-29 2010-12-14 Hypervisor isolation of processor cores
JP2012547104A JP2013516021A (en) 2009-12-29 2010-12-14 Hypervisor separation of processor core
CN201080059820.8A CN102713847B (en) 2009-12-29 2010-12-14 The supervisory process isolation of processor cores
EP10796238.3A EP2519877B1 (en) 2009-12-29 2010-12-14 Hypervisor-based isolation of processor cores
PCT/US2010/060193 WO2011090596A2 (en) 2009-12-29 2010-12-14 Hypervisor isolation of processor cores

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US12/648,592 US9058183B2 (en) 2009-12-29 2009-12-29 Hypervisor isolation of processor cores to enable computing accelerator cores

Publications (2)

Publication Number Publication Date
US20110161955A1 US20110161955A1 (en) 2011-06-30
US9058183B2 true US9058183B2 (en) 2015-06-16

Family

ID=44189079

Family Applications (1)

Application Number Title Priority Date Filing Date
US12/648,592 Active 2032-11-10 US9058183B2 (en) 2009-12-29 2009-12-29 Hypervisor isolation of processor cores to enable computing accelerator cores

Country Status (6)

Country Link
US (1) US9058183B2 (en)
EP (1) EP2519877B1 (en)
JP (1) JP2013516021A (en)
KR (1) KR101668399B1 (en)
CN (1) CN102713847B (en)
WO (1) WO2011090596A2 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9298504B1 (en) * 2012-06-11 2016-03-29 Amazon Technologies, Inc. Systems, devices, and techniques for preempting and reassigning tasks within a multiprocessor system
US9996357B2 (en) 2015-10-30 2018-06-12 International Business Machines Corporation Resolving page faults out of context for shared contexts
US10416897B2 (en) 2017-03-27 2019-09-17 SK Hynix Inc. Memory system with latency distribution optimization and an operating method thereof
US10534653B2 (en) * 2017-04-18 2020-01-14 Electronics And Telecommunications Research Institute Hypervisor-based virtual machine isolation apparatus and method
US11429419B2 (en) * 2018-08-03 2022-08-30 Nvidia Corporation Secure access of virtual machine memory suitable for AI assisted automotive applications
US11500668B2 (en) 2020-10-15 2022-11-15 Red Hat, Inc. Page fault support for virtual machine network accelerators
US12141597B2 (en) 2020-11-30 2024-11-12 Red Hat, Inc. Efficient out of order request completion

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8484653B2 (en) 2010-07-28 2013-07-09 Red Hat Israel, Ltd. Mechanism for delayed hardware upgrades in virtualization systems
US8418177B2 (en) 2010-10-01 2013-04-09 Microsoft Corporation Virtual machine and/or multi-level scheduling support on systems with asymmetric processor cores
US9529615B2 (en) * 2010-11-24 2016-12-27 International Business Machines Corporation Virtual device emulation via hypervisor shared memory
US9189283B2 (en) * 2011-03-03 2015-11-17 Hewlett-Packard Development Company, L.P. Task launching on hardware resource for client
US9645823B2 (en) 2011-03-03 2017-05-09 Hewlett-Packard Development Company, L.P. Hardware controller to choose selected hardware entity and to execute instructions in relation to selected hardware entity
US8966625B1 (en) 2011-05-24 2015-02-24 Palo Alto Networks, Inc. Identification of malware sites using unknown URL sites and newly registered DNS addresses
US8555388B1 (en) 2011-05-24 2013-10-08 Palo Alto Networks, Inc. Heuristic botnet detection
US9898316B1 (en) * 2011-09-30 2018-02-20 EMC IP Holding Company LLC Extended fractional symmetric multi-processing capabilities to guest operating systems
EP2798473A4 (en) * 2011-12-28 2015-08-05 Intel Corp Systems, methods and computer program products for bootstrapping a type 1 virtual machine monitor after operating system launch
US8789047B2 (en) 2012-01-26 2014-07-22 Empire Technology Development Llc Allowing world switches between virtual machines via hypervisor world switch security setting
CN104272296A (en) * 2012-04-30 2015-01-07 惠普发展公司,有限责任合伙企业 Processor providing multiple system images
US9104870B1 (en) 2012-09-28 2015-08-11 Palo Alto Networks, Inc. Detecting malware
US9215239B1 (en) 2012-09-28 2015-12-15 Palo Alto Networks, Inc. Malware detection based on traffic analysis
US9448829B2 (en) 2012-12-28 2016-09-20 Intel Corporation Hetergeneous processor apparatus and method
US9361416B2 (en) * 2013-01-30 2016-06-07 Empire Technology Development Llc Dynamic reconfiguration of programmable hardware
US9390462B2 (en) * 2013-03-15 2016-07-12 Intel Corporation Memory mapping for a graphics processing unit
JP6040101B2 (en) * 2013-05-31 2016-12-07 株式会社日立製作所 Storage device control method, storage device, and information processing device
WO2014209286A1 (en) * 2013-06-25 2014-12-31 Empire Technology Development, Llc Reconfiguration with virtual machine switching
US10019575B1 (en) 2013-07-30 2018-07-10 Palo Alto Networks, Inc. Evaluating malware in a virtual machine using copy-on-write
US9613210B1 (en) 2013-07-30 2017-04-04 Palo Alto Networks, Inc. Evaluating malware in a virtual machine using dynamic patching
US9811665B1 (en) 2013-07-30 2017-11-07 Palo Alto Networks, Inc. Static and dynamic security analysis of apps for mobile devices
US9852000B2 (en) * 2013-08-27 2017-12-26 Empire Technology Development Llc Consolidating operations associated with a plurality of host devices
CN104714843B (en) * 2013-12-17 2018-06-15 华为技术有限公司 More kernel operating system instances support the method and device of multiprocessor
US10514942B2 (en) 2014-02-24 2019-12-24 Red Hat Israel, Ltd. Using linker scripts for loading system configuration tables
US9766916B2 (en) * 2014-05-05 2017-09-19 International Business Machines Corporation Implementing coherent accelerator function isolation for virtualization
KR101595064B1 (en) * 2014-06-20 2016-02-18 고려대학교 산학협력단 System and method of sharing device on trustzone virtual environment
US9489516B1 (en) 2014-07-14 2016-11-08 Palo Alto Networks, Inc. Detection of malware using an instrumented virtual machine environment
US20160019555A1 (en) * 2014-07-15 2016-01-21 Boles Thomas Automated system for rating employee screening practices and corporate management
US9542554B1 (en) 2014-12-18 2017-01-10 Palo Alto Networks, Inc. Deduplicating malware
US9805193B1 (en) * 2014-12-18 2017-10-31 Palo Alto Networks, Inc. Collecting algorithmically generated domains
US9639395B2 (en) * 2015-04-16 2017-05-02 Google Inc. Byte application migration
KR102309798B1 (en) * 2015-04-16 2021-10-06 삼성전자주식회사 SR-IOV based non volatile memory controller and method for dynamically allocating resources to queues by the non volatile memory controller
US9747122B2 (en) 2015-04-16 2017-08-29 Google Inc. Virtual machine systems
US10846117B1 (en) * 2015-12-10 2020-11-24 Fireeye, Inc. Technique for establishing secure communication between host and guest processes of a virtualization architecture
US10108446B1 (en) 2015-12-11 2018-10-23 Fireeye, Inc. Late load technique for deploying a virtualization layer underneath a running operating system
US10437623B2 (en) * 2015-12-24 2019-10-08 Intel IP Corporation Fast switching between virtual machines without interrupt virtualization for high-performance, secure trusted-execution environment
CN105700826A (en) * 2015-12-31 2016-06-22 华为技术有限公司 Virtualization method and device
GB2549773B (en) * 2016-04-28 2018-05-16 Metaswitch Networks Ltd Configuring host devices
CN108228333A (en) * 2016-12-14 2018-06-29 中国航空工业集团公司西安航空计算技术研究所 A kind of internuclear resource isolation method of multiple nucleus system
US10437616B2 (en) * 2016-12-31 2019-10-08 Intel Corporation Method, apparatus, system for optimized work submission to an accelerator work queue
CN110291502B (en) 2017-11-15 2022-01-11 华为技术有限公司 Method, device and acceleration system for scheduling acceleration resources
GB2571922B (en) * 2018-03-05 2020-03-25 Advanced Risc Mach Ltd External exception handling
AU2019252434B2 (en) * 2018-04-11 2024-03-28 Cornell University Method and system for improving software container performance and isolation
CN108664273A (en) * 2018-05-09 2018-10-16 歌尔股份有限公司 A kind of method and apparatus solving multiple softward interview hardware resource conflicts
US10956573B2 (en) 2018-06-29 2021-03-23 Palo Alto Networks, Inc. Dynamic analysis techniques for applications
US11010474B2 (en) 2018-06-29 2021-05-18 Palo Alto Networks, Inc. Dynamic analysis techniques for applications
WO2020155005A1 (en) * 2019-01-31 2020-08-06 Intel Corporation Shared memory mechanism to support fast transport of sq/cq pair communication between ssd device driver in virtualization environment and physical ssd
US11196765B2 (en) 2019-09-13 2021-12-07 Palo Alto Networks, Inc. Simulating user interactions for malware analysis
US11755785B2 (en) 2020-08-03 2023-09-12 Nxp Usa, Inc. System and method of limiting access of processors to hardware resources
CN112181626A (en) * 2020-10-16 2021-01-05 华东计算技术研究所(中国电子科技集团公司第三十二研究所) System, method and medium for scheduling CPU (Central processing Unit) without Android operating system
US20230033583A1 (en) * 2021-07-30 2023-02-02 Advanced Micro Devices, Inc. Primary input-output queue serving host and guest operating systems concurrently

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030229794A1 (en) 2002-06-07 2003-12-11 Sutton James A. System and method for protection against untrusted system management code by redirecting a system management interrupt and creating a virtual machine container
US20060150183A1 (en) 2004-12-30 2006-07-06 Chinya Gautham N Mechanism to emulate user-level multithreading on an OS-sequestered sequencer
US20080271014A1 (en) 2007-04-26 2008-10-30 Serebrin Benjamin C Lightweight World Switch
US20090007104A1 (en) 2007-06-29 2009-01-01 Zimmer Vincent J Partitioned scheme for trusted platform module support
US20090037936A1 (en) 2007-07-31 2009-02-05 Serebrin Benjamin C Placing Virtual Machine Monitor (VMM) Code in Guest Context to Speed Memory Mapped Input/Output Virtualization
US20090055693A1 (en) 2007-08-08 2009-02-26 Dmitriy Budko Monitoring Execution of Guest Code in a Virtual Machine
US20090187697A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Execute-Only Memory and Mechanism Enabling Execution From Execute-Only Memory for Minivisor
US7809895B2 (en) * 2007-03-09 2010-10-05 Oracle America, Inc. Low overhead access to shared on-chip hardware accelerator with memory-based interfaces
US20100325644A1 (en) * 2009-06-18 2010-12-23 Van Der Linden Robertus Johannes Methods and systems for importing a device driver into a guest computing environment
US20110010721A1 (en) * 2009-07-13 2011-01-13 Vishakha Gupta Managing Virtualized Accelerators Using Admission Control, Load Balancing and Scheduling
US8510859B2 (en) 2006-09-26 2013-08-13 Intel Corporation Methods and arrangements to launch trusted, co-existing environments
US8713574B2 (en) * 2006-06-05 2014-04-29 International Business Machines Corporation Soft co-processors to provide a software service function off-load architecture in a multi-core processing environment

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7484091B2 (en) * 2004-04-29 2009-01-27 International Business Machines Corporation Method and system for providing a trusted platform module in a hypervisor environment
US8914618B2 (en) * 2005-12-29 2014-12-16 Intel Corporation Instruction set architecture-based inter-sequencer communications with a heterogeneous resource
JP4775744B2 (en) * 2007-10-19 2011-09-21 インテル・コーポレーション Method and program for launching a reliable coexistence environment

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030229794A1 (en) 2002-06-07 2003-12-11 Sutton James A. System and method for protection against untrusted system management code by redirecting a system management interrupt and creating a virtual machine container
US20060150183A1 (en) 2004-12-30 2006-07-06 Chinya Gautham N Mechanism to emulate user-level multithreading on an OS-sequestered sequencer
US8713574B2 (en) * 2006-06-05 2014-04-29 International Business Machines Corporation Soft co-processors to provide a software service function off-load architecture in a multi-core processing environment
US8510859B2 (en) 2006-09-26 2013-08-13 Intel Corporation Methods and arrangements to launch trusted, co-existing environments
US7809895B2 (en) * 2007-03-09 2010-10-05 Oracle America, Inc. Low overhead access to shared on-chip hardware accelerator with memory-based interfaces
US20080271014A1 (en) 2007-04-26 2008-10-30 Serebrin Benjamin C Lightweight World Switch
US20090007104A1 (en) 2007-06-29 2009-01-01 Zimmer Vincent J Partitioned scheme for trusted platform module support
US20090037936A1 (en) 2007-07-31 2009-02-05 Serebrin Benjamin C Placing Virtual Machine Monitor (VMM) Code in Guest Context to Speed Memory Mapped Input/Output Virtualization
US20090055693A1 (en) 2007-08-08 2009-02-26 Dmitriy Budko Monitoring Execution of Guest Code in a Virtual Machine
US20090187902A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Caching Binary Translations for Virtual Machine Guest
US20090187698A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Minivisor Entry Point in Virtual Machine Monitor Address Space
US20090187904A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Redirection Table for Virtual Machine Guest
US20090187729A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Separate Page Table Base Address for Minivisor
US20090187726A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Alternate Address Space to Permit Virtual Machine Monitor Access to Guest Virtual Address Space
US20090187697A1 (en) 2008-01-22 2009-07-23 Serebrin Benjamin C Execute-Only Memory and Mechanism Enabling Execution From Execute-Only Memory for Minivisor
US20100325644A1 (en) * 2009-06-18 2010-12-23 Van Der Linden Robertus Johannes Methods and systems for importing a device driver into a guest computing environment
US20110010721A1 (en) * 2009-07-13 2011-01-13 Vishakha Gupta Managing Virtualized Accelerators Using Admission Control, Load Balancing and Scheduling

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
AMD, "AMD Virtualization (AMD-V(TM)) Technology," data sheet, downloaded Aug. 26, 2009, URL: <https://www.amd.com/us/products/technologies/virtualization/Pages/amd-v.aspx, 2 pages.
AMD, "AMD Virtualization (AMD-V™) Technology," data sheet, downloaded Aug. 26, 2009, URL: <https://www.amd.com/us/products/technologies/virtualization/Pages/amd-v.aspx, 2 pages.
AMD, "AMD-V(TM) Nested Paging" White Paper, Revision 1.0, Jul. 2008, 19 pages.
AMD, "AMD-V™ Nested Paging" White Paper, Revision 1.0, Jul. 2008, 19 pages.
AMD, "The Future is Fusion: The Industry-Changing Impact of Accelerated Computing," White Paper, 2008, 10 pages.
Govil, Kinshuk, et al., "Cellular Disco: Resource Management Using Virtual Clusters on Shared-Memory Multiprocessors," 17th ACM Symposium on Operating Systems Principles (SOSP'99), Operating Systems Review 33(5):154-169, Dec. 1999.
International Search Report and Written Opinion mailed Aug. 22, 2011 in PCT App. No. PCT/US2010/060193, 11 pages.
Jeffery, Casey M., and Figueiredo, Renato J. O., "Towards Byzantine Fault Tolerance in Many-core Computing Platforms," 13th IEEE International Symposium on Pacific Rim Dependable Computing, PRDC 2007, Dec. 17-19, 2007, pp. 256-259.

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9298504B1 (en) * 2012-06-11 2016-03-29 Amazon Technologies, Inc. Systems, devices, and techniques for preempting and reassigning tasks within a multiprocessor system
US9996357B2 (en) 2015-10-30 2018-06-12 International Business Machines Corporation Resolving page faults out of context for shared contexts
US10416897B2 (en) 2017-03-27 2019-09-17 SK Hynix Inc. Memory system with latency distribution optimization and an operating method thereof
US10534653B2 (en) * 2017-04-18 2020-01-14 Electronics And Telecommunications Research Institute Hypervisor-based virtual machine isolation apparatus and method
US11429419B2 (en) * 2018-08-03 2022-08-30 Nvidia Corporation Secure access of virtual machine memory suitable for AI assisted automotive applications
US11500668B2 (en) 2020-10-15 2022-11-15 Red Hat, Inc. Page fault support for virtual machine network accelerators
US12141597B2 (en) 2020-11-30 2024-11-12 Red Hat, Inc. Efficient out of order request completion

Also Published As

Publication number Publication date
CN102713847A (en) 2012-10-03
CN102713847B (en) 2016-03-16
EP2519877B1 (en) 2020-04-08
KR101668399B1 (en) 2016-10-21
US20110161955A1 (en) 2011-06-30
JP2013516021A (en) 2013-05-09
KR20120111734A (en) 2012-10-10
WO2011090596A2 (en) 2011-07-28
WO2011090596A3 (en) 2011-10-20
EP2519877A2 (en) 2012-11-07

Similar Documents

Publication Publication Date Title
US9058183B2 (en) Hypervisor isolation of processor cores to enable computing accelerator cores
JP6646114B2 (en) Dynamic virtual machine sizing
US10691363B2 (en) Virtual machine trigger
Suzuki et al. {GPUvm}: Why Not Virtualizing {GPUs} at the Hypervisor?
TWI722071B (en) Interrupts between virtual machines
JP5042848B2 (en) System and method for depriving components of virtual machine monitor
AU2008302393B2 (en) Reducing the latency of virtual interrupt delivery in virtual machines
EP3039540B1 (en) Virtual machine monitor configured to support latency sensitive virtual machines
US8539499B1 (en) Symmetric multiprocessing with virtual CPU and VSMP technology
US10983847B2 (en) Dynamically loadable unikernel binaries
US10445126B2 (en) Preloading enhanced application startup
US10620963B2 (en) Providing fallback drivers for IO devices in a computing system
CN106339257B (en) Method and system for making client operating system light weight and virtualization operating system
US11169837B2 (en) Fast thread execution transition
US10956193B2 (en) Hypervisor virtual processor execution with extra-hypervisor scheduling
US20150186180A1 (en) Systems and methods for affinity dispatching based on network input/output requests
US20210124601A1 (en) Implementing high-performance virtual machines for bare metal simulation
US10678909B2 (en) Securely supporting a global view of system memory in a multi-processor system

Legal Events

Date Code Title Description
AS Assignment

Owner name: ADVANCED MICRO DEVICES, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WOLLER, THOMAS R.;KAMINSKI, PATRYK;BOLEYN, ERICH;AND OTHERS;SIGNING DATES FROM 20091216 TO 20091228;REEL/FRAME:023852/0477

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8