US20050240931A1 - System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller - Google Patents

System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller Download PDF

Info

Publication number
US20050240931A1
US20050240931A1 US11/150,951 US15095105A US2005240931A1 US 20050240931 A1 US20050240931 A1 US 20050240931A1 US 15095105 A US15095105 A US 15095105A US 2005240931 A1 US2005240931 A1 US 2005240931A1
Authority
US
United States
Prior art keywords
execution
tasks
computer
peer
computers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/150,951
Inventor
Sivaprasad Padisetty
Shankar Manian
Hari Narayan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Priority to US11/150,951 priority Critical patent/US20050240931A1/en
Publication of US20050240931A1 publication Critical patent/US20050240931A1/en
Assigned to MICROSOFT TECHNOLOGY LICENSING, LLC reassignment MICROSOFT TECHNOLOGY LICENSING, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MICROSOFT CORPORATION
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5038Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Definitions

  • This invention relates generally to the controlled execution of tasks over a computer network, and more particularly to a computer framework for cooperative automated execution of distributed tasks by a set of computers without a centralized controller.
  • Test computers are chosen to run a test case that includes a sequence of tasks to be performed by different ones of the test computers in an interactive fashion.
  • execution of the task sequence of the test run is controlled by a centralized test controller.
  • the test control goes through the sequence of tasks one by one, instructs a corresponding test computer to carry out one task, receives the result of the task, and instructs another test computer to carry out the next task in the sequence based on the outcome of the previous task.
  • each test computer is required to form a communication link with the centralized test controller to receive instructions and report task execution status and results. This requirement excludes many computers that do not have the capability of forming a direct network connection with the controller from being used for distributed computer testing.
  • the present invention provides a framework for automated dynamic execution of distributed tasks by a set of computers without the need to use a centralized controller to coordinate the execution of the tasks.
  • this framework is especially advantageous for the application of testing interactive computer operations, it can also be used to carry out other types of distributed tasks.
  • the execution of a sequence of tasks is coordinated through the cooperation of the peer computers selected to perform the tasks.
  • Each computer has an execution agent for handling the cooperative execution of the tasks.
  • Task execution instructions that identify the sequence of tasks to be executed and the corresponding computers that are to perform the individual tasks are given to a first computer in the set of selected computers.
  • the first computer passes the task execution information on to the other peer computers involved in the distributed execution so that each computer knows which tasks it is to perform in relation to the tasks of the other computers.
  • the execution agents of the computers then communicate with each other for status update and synchronize to execute the sequence of tasks.
  • FIG. 1 is a block diagram generally illustrating an exemplary computer system on which an embodiment of an execution agent for cooperative execution of distributed tasks in accordance with the invention may be implemented;
  • FIG. 2 is a schematic diagram showing an embodiment of a system that includes a set of computers for cooperative distributed execution of tasks of a test run;
  • FIG. 3 is a flow diagram summarizing the operation of execution agents on selected computers in an embodiment of the invention for automated execution of distributed tasks without a centralized controller;
  • FIG. 4 is a schematic diagram showing the structure of execution agents on the computers in the system of FIG. 2 for carrying out the coordinated execution of distributed tasks by the computers.
  • program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • program modules may be practiced with other computer system configurations, including hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, and the like.
  • the invention may be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network.
  • program modules may be located in both local and remote memory storage devices.
  • FIG. 1 a general purpose computing device is shown in the form of a conventional personal computer 20 , including a processing unit 21 , a system memory 22 , and a system bus 23 that couples various system components including the system memory to the processing unit 21 .
  • the system bus 23 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures.
  • the system memory includes read only memory (ROM) 24 and random access memory (RAM) 25 .
  • ROM 24 read only memory
  • RAM random access memory
  • a basic input/output system (BIOS) 26 containing the basic routines that help to transfer information between elements within the personal computer 20 , such as during start-up, is stored in ROM 24 .
  • the personal computer 20 further includes a hard disk drive 27 for reading from and writing to a hard disk 60 , a magnetic disk drive 28 for reading from or writing to a removable magnetic disk 29 , and an optical disk drive 30 for reading from or writing to a removable optical disk 31 such as a CD ROM or other optical media.
  • the hard disk drive 27 , magnetic disk drive 28 , and optical disk drive 30 are connected to the system bus 23 by a hard disk drive interface 32 , a magnetic disk drive interface 33 , and an optical disk drive interface 34 , respectively.
  • the drives and their associated computer-readable media provide nonvolatile storage of computer readable instructions, data structures, program modules and other data for the personal computer 20 .
  • exemplary environment described herein employs a hard disk 60 , a removable magnetic disk 29 , and a removable optical disk 31 , it will be appreciated by those skilled in the art that other types of computer readable media which can store data that is accessible by a computer, such as magnetic cassettes, flash memory cards, digital video disks, Bernoulli cartridges, random access memories, read only memories, storage area networks, and the like may also be used in the exemplary operating environment.
  • a number of program modules may be stored on the hard disk 60 , magnetic disk 29 , optical disk 31 , ROM 24 or RAM 25 , including an operating system 35 , one or more applications programs 36 , other program modules 37 , and program data 38 .
  • a user may enter commands and information into the personal computer 20 through input devices such as a keyboard 40 and a pointing device 42 .
  • Other input devices may include a microphone, joystick, game pad, satellite dish, scanner, or the like.
  • These and other input devices are often connected to the processing unit 21 through a serial port interface 46 that is coupled to the system bus, but may be connected by other interfaces, such as a parallel port, game port or a universal serial bus (USB) or a network interface card.
  • a monitor 47 or other type of display device is also connected to the system bus 23 via an interface, such as a video adapter 48 .
  • personal computers typically include other peripheral output devices, not shown, such as speakers and printers.
  • the personal computer 20 may operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 49 .
  • the remote computer 49 may be another personal computer, a server, a router, a network PC, a peer device or other common network node, and typically includes many or all of the elements described above relative to the personal computer 20 , although only a memory storage device 50 has been illustrated in FIG. 1 .
  • the logical connections depicted in FIG. 1 include a local area network (LAN) 51 and a wide area network (WAN) 52 .
  • LAN local area network
  • WAN wide area network
  • the personal computer 20 When used in a LAN networking environment, the personal computer 20 is connected to the local network 51 through a network interface or adapter 53 . When used in a WAN networking environment, the personal computer 20 typically includes a modem 54 or other means for establishing communications over the WAN 52 .
  • the modem 54 which may be internal or external, is connected to the system bus 23 via the serial port interface 46 .
  • program modules depicted relative to the personal computer 20 may be stored in the remote memory storage device. It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers may be used.
  • the present invention is directed to a new framework 70 for automated execution of distributed tasks by a selected set of computers.
  • the sequence of distributed tasks is executed by the corresponding computers in an orderly fashion through the cooperative interactions among the computers.
  • FIG. 2 shows only three computers 80 , 82 and 86 in the machine set 88 for executing a given sequence of distributed tasks. It will be appreciated, however, that the number of computers required for carrying out a sequence of tasks depends on the particular sequence.
  • the distributed task execution framework of the invention is used for the application of automated computer testing, and the following description will refer to the context of executing tasks involved in a computer test run. It will be appreciated, however, that the cooperative execution of distributed tasks of the invention is not limited to testing applications, and can be used equally well for executing other types of distributed tasks.
  • each of the computers 80 , 82 , 86 participating in the task execution is provided with an execution agent (EA) 90 , 92 or 96 .
  • the execution agent 90 is responsible for launching the assigned tasks on the test machine 80 on which it resides, and for communicating with the execution agents 92 , 96 on the other test machines 82 , 86 to coordinate the execution of the distributed tasks of a test run.
  • the execution agent is implemented as part of a dynamic link library (.dll) for computer testing. The functionality of the execution agents will be described in greater detail below.
  • a “task” is the smallest unit of execution by a machine.
  • a “job” is a list of tasks, and a “run” is a list of jobs.
  • a set of machines is reserved for a run. All machines participate in the execution of each job in the run even if they do not have any tasks of the job or even if that job's logical-to-physical machine mapping does not include that particular physical machine.
  • An “Exe” task is a command line (anything that can be created as a process) to be executed.
  • a “Copyfile” task copies a list of files from a source to a destination.
  • a “RunJob” task has a pointer to another job. In this context, the job being pointed to is called a “subjob,” which will be explained in greater detail below.
  • the execution of the RunJob task is basically the execution of the subjob. When the execution agent encounters this task, it finds the associated subjob and executes it, and the task's pass/fail status is determined by the success/failure or the subjob.
  • the common properties for all types of tasks include the user name under which the task is run, the password for the user name, the domain for the user credentials, and a FailureAction property that specifies the action to be taken by the execution agent if the task fails.
  • Each task also has a TaskCategory property, which tells the execution agent what kind of task it is. This property in combination with the FailureAction property determines the course of action to be taken upon the failure of the task.
  • one of the common properties of the task is the “Logical Machine” to which the task is assigned.
  • the MachineName which is the name of the physical machine that is to actually execute the task, is “found out” by using the LogicalMachine and the LogicalMachine To Physical Machine Mapping.
  • the SetupJob tasks are precursors to the tasks that actually run the tests and may include any task that is needed to set up the machine, which can vary from copying source binaries to deploying an operating system.
  • the Regular tasks are the tasks that actually run the tests.
  • the Cleanup tasks are involved in any cleanup operation that needs to be done regardless of the success/failure of the Setup and Regular tasks.
  • the CleanupPass tasks are involved in any cleanup operation that needs to be done upon the non-failure of the Setup and Regular tasks.
  • the CleanupFailure tasks are those that are involved in any cleanup operation to be done upon the failure of the job.
  • the execution instructions may identify the assignment of those tasks with respect to a set of logical machines, and provide a mapping between the logic machine set to the physical machines selected to carry out the tasks. For instance, the instructions may state that task number 5 is to be performed by logical machine A, which is mapped to physical machines X and Y. In that case, further splitting of the tasks with respect to the physical machines may be required. In one embodiment, if the logical machine a task is supposed to run is mapped into two or more physical machines, the task is split into as many tasks to have one task per physical machine. The splitting is achieved by making a copy of the task and changing the MachineName property of the task to the name of the physical machine that will run the task.
  • the task is a RunJob task that refers to a subjob with mapping
  • the task is not split, and the MachineName of the task is changes to the list of machine names specified by the subjob's mapping.
  • the RunJob task does not specify the logical-to-physical machine mapping for the subjob, then the subjob is also split with one subjob per task. The task dependencies are adjusted accordingly after the splitting of the tasks
  • the execution instructions may be loaded onto the computer 80 using a transportable medium, such as a CD-ROM.
  • the execution instructions identify the tasks and the order in which they are to be executed, and which test computer is to perform which task. In other words, the execution instructions provide information for each test computer involved in the test run to know what its tasks are in relation to the tasks of the other computers.
  • the execution instructions are provided in the form of an XML file or document.
  • the XML file allows easy viewing and editing and provides a standard format for the execution instructions to be coded.
  • the Push Daemon 108 of the test management server 100 sends a Run XML file 120 that contains the execution instructions for a test run to the execution agent 90 on the computer 80 in the set of computers selected for executing the test run (step 122 ).
  • the execution agent 90 of the first computer 80 receives the Run XML document, it first checks to see if it is already running any other test run (step 126 ). If so, it rejects the message and exits the run process. If it is not running any other test, it calls a EaCreateRunTimeXML API function to transform the Run XML (step 128 ).
  • a test machine may be taken off the network.
  • the execution agent is programmed to support such a scenario and will resume communications with the other machines when its machine comes back online. Jobs can also span over reboots.
  • the execution agent stores its state information on the file system of the local machine and will recover its state when the machine reboots and resume execution.
  • the execution agent also supports the installation of an operating system on the local machine as part of the test job. If the machine has multiple partitions, it also supports fresh installations of the operating system with formatting and deploying an image in the machine. In that case, the execution agent will continue the execution of the test run tasks after the new operating system boots up on the machine.
  • the execution agent is preferably built such that any kind of failure is either recovered or reported for follow-up. If one of the test machines for a job is hung or out of reach for some reason, the execution agents on the other machines detect it and cancel the current execution and report a failure. It also takes care of hooking into the debugger for both user mode and kernel mode breaks on the machines.
  • the execution agent 90 has several major components.
  • the Run object handles a test run. If the run is a multi-machine operation, and the machine on which the execution agent resides is the first machine to be called with the Run XML, the Run object 150 processes the Run XML to split it and produce a series of smaller XML files to work with, and sends the concatenation of all the processed XML files to the other machines involved in the test run. On the other hand, if the local machine is not the first machine as in the case of the computer 82 and is instead receiving the concatenated XML, the Run object 152 splits the concatenated XML and stores the files.
  • the Job object 154 instantiates Task objects 160 for each task of the run that is to be performed by the local machine and initializes them.
  • the Task objects 160 are grouped according to the execution phase they belong to.
  • the Job object 154 manages the dependency between the tasks, which involves executing only those tasks that do not have any dependency (i.e., pending the completion of another task) at that given time, and at the completion of a task removing the dependencies from other tasks that have dependency on this task.
  • a job is a multi-machined entity, and there is on Job object in each test machine for each job. All the Job objects on the test machines talk to each other to coordinate the task execution and dependencies.
  • Each Job object sends updates to the peer machines and the database at the beginning and completion of the associated job.
  • the EaCreateRunTimeXML object 158 is called by the Run object 150 to process the input Run XML file and splits the file into various smaller XML files as described above. It also handles the mapping from logical machines to physical machines for the tasks and subjobs.

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

A system and method is provided for cooperative execution of distributed tasks by networked computers without the use of a centralized controller to coordinate the task execution. Each computer has an execution agent that cooperates with the execution agents of the other computers to carry out the execution of a given sequence of tasks. The execution instructions for the task sequence are provided to a first computer in the selected set of computers for task execution. The first computer processes the instructions and forwards them to its peer computers so that each of them knows the tasks it is to perform in connection with the tasks of the other computers. The computers then executes the tasks assigned to them and use peer-to-peer communications to provide status update to their peer computers to synchronize and coordinate the task execution.

Description

    TECHNICAL FIELD
  • This invention relates generally to the controlled execution of tasks over a computer network, and more particularly to a computer framework for cooperative automated execution of distributed tasks by a set of computers without a centralized controller.
  • BACKGROUND OF THE INVENTION
  • Computer software and hardware operations, especially those that involve complex interactions between different computers such as those supporting various networking protocols and distributed transactions, often have to be extensively tested to ensure that they function properly. Typically, to test a given type of computer interactive operation, a set of test computers are chosen to run a test case that includes a sequence of tasks to be performed by different ones of the test computers in an interactive fashion. In one conventional test framework, the execution of the task sequence of the test run is controlled by a centralized test controller. The test control goes through the sequence of tasks one by one, instructs a corresponding test computer to carry out one task, receives the result of the task, and instructs another test computer to carry out the next task in the sequence based on the outcome of the previous task.
  • The use of a centralized controller to control the automated execution of distributed computer tasks, however, places a significant constraint on the availability of test computers. To enable the centralized control, each test computer is required to form a communication link with the centralized test controller to receive instructions and report task execution status and results. This requirement excludes many computers that do not have the capability of forming a direct network connection with the controller from being used for distributed computer testing.
  • SUMMARY OF THE INVENTION
  • In view of the foregoing, the present invention provides a framework for automated dynamic execution of distributed tasks by a set of computers without the need to use a centralized controller to coordinate the execution of the tasks. Although this framework is especially advantageous for the application of testing interactive computer operations, it can also be used to carry out other types of distributed tasks. In accordance with the invention, the execution of a sequence of tasks is coordinated through the cooperation of the peer computers selected to perform the tasks. Each computer has an execution agent for handling the cooperative execution of the tasks. Task execution instructions that identify the sequence of tasks to be executed and the corresponding computers that are to perform the individual tasks are given to a first computer in the set of selected computers. The first computer passes the task execution information on to the other peer computers involved in the distributed execution so that each computer knows which tasks it is to perform in relation to the tasks of the other computers. The execution agents of the computers then communicate with each other for status update and synchronize to execute the sequence of tasks.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram generally illustrating an exemplary computer system on which an embodiment of an execution agent for cooperative execution of distributed tasks in accordance with the invention may be implemented;
  • FIG. 2 is a schematic diagram showing an embodiment of a system that includes a set of computers for cooperative distributed execution of tasks of a test run;
  • FIG. 3 is a flow diagram summarizing the operation of execution agents on selected computers in an embodiment of the invention for automated execution of distributed tasks without a centralized controller; and
  • FIG. 4 is a schematic diagram showing the structure of execution agents on the computers in the system of FIG. 2 for carrying out the coordinated execution of distributed tasks by the computers.
  • DETAILED DESCRIPTION OF THE INVENTION
  • Turning to the drawings, wherein like reference numerals refer to like elements, the invention is illustrated as being implemented in a suitable computing environment. Although not required, the invention will be described in the general context of computer-executable instructions, such as program modules, being executed by a personal computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Moreover, those skilled in the art will appreciate that the invention may be practiced with other computer system configurations, including hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, and the like. The invention may be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
  • The following description begins with a description of a general-purpose computing device that may implement components of a framework of the invention for coordinated execution of distributed tasks by a selected set of computing devices. The framework of the invention for cooperative distributed task execution without using a centralized controller will be described in greater detail with reference to FIGS. 2-4. Turning now to FIG. 1, a general purpose computing device is shown in the form of a conventional personal computer 20, including a processing unit 21, a system memory 22, and a system bus 23 that couples various system components including the system memory to the processing unit 21. The system bus 23 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures. The system memory includes read only memory (ROM) 24 and random access memory (RAM) 25. A basic input/output system (BIOS) 26, containing the basic routines that help to transfer information between elements within the personal computer 20, such as during start-up, is stored in ROM 24. The personal computer 20 further includes a hard disk drive 27 for reading from and writing to a hard disk 60, a magnetic disk drive 28 for reading from or writing to a removable magnetic disk 29, and an optical disk drive 30 for reading from or writing to a removable optical disk 31 such as a CD ROM or other optical media.
  • The hard disk drive 27, magnetic disk drive 28, and optical disk drive 30 are connected to the system bus 23 by a hard disk drive interface 32, a magnetic disk drive interface 33, and an optical disk drive interface 34, respectively. The drives and their associated computer-readable media provide nonvolatile storage of computer readable instructions, data structures, program modules and other data for the personal computer 20. Although the exemplary environment described herein employs a hard disk 60, a removable magnetic disk 29, and a removable optical disk 31, it will be appreciated by those skilled in the art that other types of computer readable media which can store data that is accessible by a computer, such as magnetic cassettes, flash memory cards, digital video disks, Bernoulli cartridges, random access memories, read only memories, storage area networks, and the like may also be used in the exemplary operating environment.
  • A number of program modules may be stored on the hard disk 60, magnetic disk 29, optical disk 31, ROM 24 or RAM 25, including an operating system 35, one or more applications programs 36, other program modules 37, and program data 38. A user may enter commands and information into the personal computer 20 through input devices such as a keyboard 40 and a pointing device 42. Other input devices (not shown) may include a microphone, joystick, game pad, satellite dish, scanner, or the like. These and other input devices are often connected to the processing unit 21 through a serial port interface 46 that is coupled to the system bus, but may be connected by other interfaces, such as a parallel port, game port or a universal serial bus (USB) or a network interface card. A monitor 47 or other type of display device is also connected to the system bus 23 via an interface, such as a video adapter 48. In addition to the monitor, personal computers typically include other peripheral output devices, not shown, such as speakers and printers.
  • The personal computer 20 may operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 49. The remote computer 49 may be another personal computer, a server, a router, a network PC, a peer device or other common network node, and typically includes many or all of the elements described above relative to the personal computer 20, although only a memory storage device 50 has been illustrated in FIG. 1. The logical connections depicted in FIG. 1 include a local area network (LAN) 51 and a wide area network (WAN) 52. Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets and the Internet.
  • When used in a LAN networking environment, the personal computer 20 is connected to the local network 51 through a network interface or adapter 53. When used in a WAN networking environment, the personal computer 20 typically includes a modem 54 or other means for establishing communications over the WAN 52. The modem 54, which may be internal or external, is connected to the system bus 23 via the serial port interface 46. In a networked environment, program modules depicted relative to the personal computer 20, or portions thereof, may be stored in the remote memory storage device. It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers may be used.
  • In the description that follows, the invention will be described with reference to acts and symbolic representations of operations that are performed by one or more computers, unless indicated otherwise. As such, it will be understood that such acts and operations, which are at times referred to as being computer-executed, include the manipulation by the processing unit of the computer of electrical signals representing data in a structured form. This manipulation transforms the data or maintains it at locations in the memory system of the computer, which reconfigures or otherwise alters the operation of the computer in a manner well understood by those skilled in the art. The data structures where data is maintained are physical locations of the memory that have particular properties defined by the format of the data. However, while the invention is being described in the foregoing context, it is not meant to be limiting as those of skill in the art will appreciate that various ones of the acts and operations described hereinafter may also be implemented in hardware.
  • Referring now to FIG. 2, the present invention is directed to a new framework 70 for automated execution of distributed tasks by a selected set of computers. In accordance with the invention, the sequence of distributed tasks is executed by the corresponding computers in an orderly fashion through the cooperative interactions among the computers. As a result, there is no need to use a centralized controller to coordinate and control the progress of the execution of the tasks. For simplicity of illustration, FIG. 2 shows only three computers 80, 82 and 86 in the machine set 88 for executing a given sequence of distributed tasks. It will be appreciated, however, that the number of computers required for carrying out a sequence of tasks depends on the particular sequence.
  • In a preferred embodiment as described below with reference to FIG. 2, the distributed task execution framework of the invention is used for the application of automated computer testing, and the following description will refer to the context of executing tasks involved in a computer test run. It will be appreciated, however, that the cooperative execution of distributed tasks of the invention is not limited to testing applications, and can be used equally well for executing other types of distributed tasks.
  • To carry out the distributed tasks without the help of a centralized controller, each of the computers 80, 82, 86 participating in the task execution is provided with an execution agent (EA) 90, 92 or 96. The execution agent 90 is responsible for launching the assigned tasks on the test machine 80 on which it resides, and for communicating with the execution agents 92, 96 on the other test machines 82, 86 to coordinate the execution of the distributed tasks of a test run. In one embodiment, the execution agent is implemented as part of a dynamic link library (.dll) for computer testing. The functionality of the execution agents will be described in greater detail below.
  • To facilitate the description of a preferred embodiment for computer testing operations, some basic concepts regarding automated execution by selected machines as used in the embodiment are first defined. Generally, a “task” is the smallest unit of execution by a machine. A “job” is a list of tasks, and a “run” is a list of jobs. A set of machines is reserved for a run. All machines participate in the execution of each job in the run even if they do not have any tasks of the job or even if that job's logical-to-physical machine mapping does not include that particular physical machine.
  • As implemented in the embodiment, there are three different types of tasks. An “Exe” task is a command line (anything that can be created as a process) to be executed. A “Copyfile” task copies a list of files from a source to a destination. A “RunJob” task has a pointer to another job. In this context, the job being pointed to is called a “subjob,” which will be explained in greater detail below. The execution of the RunJob task is basically the execution of the subjob. When the execution agent encounters this task, it finds the associated subjob and executes it, and the task's pass/fail status is determined by the success/failure or the subjob. The common properties for all types of tasks include the user name under which the task is run, the password for the user name, the domain for the user credentials, and a FailureAction property that specifies the action to be taken by the execution agent if the task fails. Each task also has a TaskCategory property, which tells the execution agent what kind of task it is. This property in combination with the FailureAction property determines the course of action to be taken upon the failure of the task. Also, one of the common properties of the task is the “Logical Machine” to which the task is assigned. The MachineName, which is the name of the physical machine that is to actually execute the task, is “found out” by using the LogicalMachine and the LogicalMachine To Physical Machine Mapping.
  • In one implementation, there are five different categories of tasks: SetupJob, Regular, Cleanup, CleanupPass, and CleanupFail. The SetupJob tasks are precursors to the tasks that actually run the tests and may include any task that is needed to set up the machine, which can vary from copying source binaries to deploying an operating system. The Regular tasks are the tasks that actually run the tests. The Cleanup tasks are involved in any cleanup operation that needs to be done regardless of the success/failure of the Setup and Regular tasks. The CleanupPass tasks are involved in any cleanup operation that needs to be done upon the non-failure of the Setup and Regular tasks. In contrast, the CleanupFailure tasks are those that are involved in any cleanup operation to be done upon the failure of the job. Each task in any of the five categories can be of any type (Exe, Copy, or RunJob). As mentioned above, a “job” is a list of tasks. The tasks in a job may belong to different categories. Within the job the tasks are executed in the following order: Setup, Regular, Cleanup, and then CleanupPass or CleanupFail, depending on the status of the job at the end of the regular tasks.
  • In accordance with a feature of a preferred embodiment, a task may be the execution of a “subjob.” A subjob is basically the same as a job except for a few differences. A subjob has a “parent” task (a RunJob task) that refers to it, and the subjob is executed as part of running the parent task within some other job, which is referred to as the ParentJob in this context. When the list of tasks of a given test run is given to the test machines, a subjob may be referred to without specifying on which physical machines the subjob is to run. In that case, the subjob will be executed on the machine on which the parent task is supposed to run. On the other hand, if a mapping to a physical machine is specified for a subjob, the subjob will be executed on that machine.
  • Generally, when a sequence of tasks is given to a set of test computers, the execution instructions may identify the assignment of those tasks with respect to a set of logical machines, and provide a mapping between the logic machine set to the physical machines selected to carry out the tasks. For instance, the instructions may state that task number 5 is to be performed by logical machine A, which is mapped to physical machines X and Y. In that case, further splitting of the tasks with respect to the physical machines may be required. In one embodiment, if the logical machine a task is supposed to run is mapped into two or more physical machines, the task is split into as many tasks to have one task per physical machine. The splitting is achieved by making a copy of the task and changing the MachineName property of the task to the name of the physical machine that will run the task. If the task is a RunJob task that refers to a subjob with mapping, the task is not split, and the MachineName of the task is changes to the list of machine names specified by the subjob's mapping. If the RunJob task does not specify the logical-to-physical machine mapping for the subjob, then the subjob is also split with one subjob per task. The task dependencies are adjusted accordingly after the splitting of the tasks
  • To use the set 88 of test machines to execute the sequence of tasks for a test run, task execution instructions are provided to one of the test machines. In the illustration of FIG. 2, the computer 80 is used for that purpose. The task execution instructions are generated by a test management server 100, which includes a test case database 102 and scheduler 106. The scheduler 106 takes a test run and finds a set 88 of available test machines for running the test case. The test management server 100 further includes a push daemon 108 for delivering the task execution instructions to the group of test computers. If a network connection 118 is formed between the push daemon 108 and the computer 80, the push daemon may send the execution instructions to the computer 80 via the network connection. Alternatively if there is no network connection available between the test management server 100 and any of the test computers 80, 82 and 86, the execution instructions may be loaded onto the computer 80 using a transportable medium, such as a CD-ROM. The execution instructions identify the tasks and the order in which they are to be executed, and which test computer is to perform which task. In other words, the execution instructions provide information for each test computer involved in the test run to know what its tasks are in relation to the tasks of the other computers. In accordance with a feature of a preferred embodiment, the execution instructions are provided in the form of an XML file or document. The XML file allows easy viewing and editing and provides a standard format for the execution instructions to be coded.
  • Referring now to FIGS. 2 and 3, to initiate a test run, the Push Daemon 108 of the test management server 100 sends a Run XML file 120 that contains the execution instructions for a test run to the execution agent 90 on the computer 80 in the set of computers selected for executing the test run (step 122). When the execution agent 90 of the first computer 80 receives the Run XML document, it first checks to see if it is already running any other test run (step 126). If so, it rejects the message and exits the run process. If it is not running any other test, it calls a EaCreateRunTimeXML API function to transform the Run XML (step 128). This transformation involves splitting the tasks specified in the Run XML document, adjusting the dependencies of the tasks, and adjusting the subjobs, if any. As part of this reformatting operation, the execution agent splits the Run XML into several smaller XML files, including one XML file for each job of the test run and a global JobList XML file that contains a list of all jogs of the test run. It also forms a concatenated version of all of these XML files that is called a “PeerRun XML.” The execution agent on the first machine then identify the other machines involved in the run by reading the list of the test machines from the Global JobList XML, and sends the PeerRun XML file to each of the other machines (step 130). When the execution agent on a peer machine 82 receives the PeerRun XML 84, it recognizes the document to be from the execution agent of the first machine. If the peer machine is already involved in another test run (step 132), its execution agent 92 returns an error message to the first machine 80 (step 136). Otherwise the execution agent of the peer machine splits the concatenated XML files and store them in the memory (step 138). Once every machine involved in the test run receives the execution instructions in the XML files, each test machine starts running the test jobs one by one sequentially according to the test XML files (step 140) and sends the results of the test jobs to the test result database 72 In executing the tasks of the test run, the execution agent of each test machine communicate with all the execution agents of the other test machines by sending messages 104 that provide status updates to everyone (step 144). They also synchronize with each other at the beginning of every run or job to make sure that all the machines start at the same time. If the machines do not have direct network connections to the test result database 72 of the test manager server, the test results may be sent to a local database 74. Once all the jobs of the test run are completed, the test results collected by the local database 74 may be sent to the database 72 of the test management server 100 for diagnoses and reporting.
  • In a preferred embodiment, the execution agent has several important high-level features. First, the execution agent can execute a job involving multiple peer test machines. It accomplishes this by using peer-to-peer messaging, whereby the execution agents running all the test machines synchronize with each other and receive the up-to-date status of what is happening on all the other test machines.
  • During the course of executing a job, a test machine may be taken off the network. The execution agent is programmed to support such a scenario and will resume communications with the other machines when its machine comes back online. Jobs can also span over reboots. To handle that scenario, the execution agent stores its state information on the file system of the local machine and will recover its state when the machine reboots and resume execution. The execution agent also supports the installation of an operating system on the local machine as part of the test job. If the machine has multiple partitions, it also supports fresh installations of the operating system with formatting and deploying an image in the machine. In that case, the execution agent will continue the execution of the test run tasks after the new operating system boots up on the machine.
  • To ensure the reliability of the test results, the execution agent is preferably built such that any kind of failure is either recovered or reported for follow-up. If one of the test machines for a job is hung or out of reach for some reason, the execution agents on the other machines detect it and cancel the current execution and report a failure. It also takes care of hooking into the debugger for both user mode and kernel mode breaks on the machines.
  • The execution agent also provides functionality to collect test and system information after the execution of the job. Different information gathering commands can be executed based on the outcome of the job. The job can request for minimal logs to be gathered if the job passes and for detailed logs to be captured if the job fails. Failure actions are supported by the execution agent. If any task in the job fails, the job can request for many different kinds of actions to be taken. The job can be canceled, and a set of failure commands can be executed to collect information about the system and the job for later diagnosis. The job can also be defined to recover the machine back to a good state if the failure leaves the test machine in a bad state. In this case, the machines are recovered and returned to the available test machine pool so that new test jobs can be assigned to this machine.
  • Tests can be defined in great detail. Apart from giving the actual command line to be executed for an executable task, the user can also specify the credentials of which user account to run the task under. The user can also specify which computer is to be used as a monitor for showing the process status or to create a new monitor session for the task. The execution agent also supports the ability for any execution to be canceled at any point of time. The cancel request is processed and passed on to all the peer machines, and the cleanup processes are executed. The machines are then returned back to the free machine pool for further job assignments. The execution agent is incorporated with error logging to the database for any kind of failures so that all information is available from the user interface for the user to diagnose the issues.
  • Turning now to FIG. 4, in one implementation, the execution agent 90 has several major components. The Run object handles a test run. If the run is a multi-machine operation, and the machine on which the execution agent resides is the first machine to be called with the Run XML, the Run object 150 processes the Run XML to split it and produce a series of smaller XML files to work with, and sends the concatenation of all the processed XML files to the other machines involved in the test run. On the other hand, if the local machine is not the first machine as in the case of the computer 82 and is instead receiving the concatenated XML, the Run object 152 splits the concatenated XML and stores the files. The Run object 150 or 152 then instantiates Job objects 154, one for each job present in the test run, and initialize them. The Job objects 154 are grouped according to the execution phase they belong to (e.g., Setup, Regular, and Cleanup). The Run object of each computer initializes all common data by providing accessors for Job and Task objects to access them, and executes the jobs in a sequential order according to the execution instructions in the XML files. During the test run, the Run objects synchronizes the execution of each job in all the machines such that at any point of time all the machines involved run the same job. The Run object of each computer also initializes and starts an EA HB object 154 for maintaining a “heartbeat” with all the peer machines. The run is canceled upon failure of heartbeat (i.e., the peer machine being unresponsive) for three consecutive times. The Run object is also responsible for sending updates to the peer machines and the result database at the beginning and completion of the run.
  • The Job object 154 instantiates Task objects 160 for each task of the run that is to be performed by the local machine and initializes them. The Task objects 160 are grouped according to the execution phase they belong to. The Job object 154 manages the dependency between the tasks, which involves executing only those tasks that do not have any dependency (i.e., pending the completion of another task) at that given time, and at the completion of a task removing the dependencies from other tasks that have dependency on this task. Generally, a job is a multi-machined entity, and there is on Job object in each test machine for each job. All the Job objects on the test machines talk to each other to coordinate the task execution and dependencies. Each Job object sends updates to the peer machines and the database at the beginning and completion of the associated job.
  • The Task object 160 is responsible for executing an associated task in a specified user context and terminal session. It has the ability to cancel or kill the task, if needed. The Task object 158 sends updates to the peer machines and the database at the beginning and completion of the task.
  • The MessageDelivery object 162 is responsible for delivery of messages to the peer machines. It gets the routing information from the Run XML and constructs the headers to the peer machines and the database. To that end, it has to exposed functions: SendMessageToPeers and SendMessageToSource. It also maintains a list of the peer machines for reference.
  • The EaCreateRunTimeXML object 158 is called by the Run object 150 to process the input Run XML file and splits the file into various smaller XML files as described above. It also handles the mapping from logical machines to physical machines for the tasks and subjobs.
  • The EA SDK objects are a group of APIs that are used to interface with the execution agent. The EaCreate function creates an instance of the execution agent. It takes the Run XML as the input and returns a handle for the execution agent. It is signaled on completion of the run. The EaWaitForMultipleObjects is used to wait on the execution agent handles. The EaSignal function is used to signal the execution agent and to cancel the execution agent. The EaQueryStatus function, when invoked, returns the status of the run at that point of time in an XML format. The EaClose function closes the handle for the execution agent.
  • In view of the many possible embodiments to which the principles of this invention may be applied, it should be recognized that the embodiments described herein with respect to the drawing figures are meant to be illustrative only and should not be taken as limiting the scope of the invention. Therefore, the invention as described herein contemplates all such embodiments as may come within the scope of the following claims and equivalents thereof.

Claims (18)

1-21. (canceled)
22. A computer-readable medium having computer-executable instructions for coordinated execution of distributed tasks comprising:
receiving by a peer computer in a group of at least one peer computers a set of execution instructions for the peer computer, the set of execution instructions identifying a sequence of tasks to be performed by the peer computer;
executing by the peer computer the tasks identified by the set of execution instructions in connection with execution of any tasks assigned to other peer computers in the group of at least one peer computers;
transmitting by the peer computer to any other peer computers in the group of at least one peer computer communication messages containing task execution status to synchronize and coordinate the execution of the tasks.
23. The computer-readable medium of claim 22 wherein the sequence of tasks to be performed constitutes a test run of interactive computer operations.
24. The computer-readable medium of claim 22 wherein the instructions include a job for executing a predefined set of tasks.
25. The computer-readable medium of claim 22 further comprising execution instructions provided in an input XML document.
26. The computer-readable medium of claim 22 having further computer-executable instructions for the peer computer to report results of execution of the tasks to a database.
27. A method of performing coordinated execution of distributed tasks by a group of peer computers, comprising:
receiving by a peer computer in a group of at least one peer computer, a set of execution instructions for the peer computer, the set of execution instructions identifying a sequence of tasks to be performed by the peer computer;
executing by the peer computer the tasks identified by the set of execution instructions in connection with execution of any tasks assigned to other peer computers in the group of at least one peer computers;
transmitting by the peer computer to any other peer computers in the group of at least one peer computer communication messages containing task execution status to synchronize and coordinate the execution of the tasks.
28. The method of claim 27 wherein the sequence of tasks to be performed constitutes a test run of interactive computer operations.
29. The method of claim 27 wherein the execution instructions include a job that executes a predefined set of tasks.
30. The method of claim 27 wherein the execution instructions are provided to the peer computer in an input XML document.
31. The method of claim 27 further comprising reporting results of execution of the tasks by the group of at least one peer computer to a database.
32. A computing device for performing automated execution of distributed tasks, the computing device comprising,
a peer computer, the peer computer connected by a communication link to at least one other peer computer;
at least one execution agent residing on the peer computer, the execution agent being programmed for receiving a set of execution instructions for the peer computer, the set of execution instructions identifying a sequence of tasks to be performed;
the execution agent capable of executing tasks assigned to the peer computer and the peer computer transmitting to the other peer computers communication messages containing task execution status to synchronize and coordinate the execution of the sequence of tasks.
33. The computing device of claim 32 wherein the sequence of tasks to be performed constitutes a test run of interactive computer operations.
34. The computing device of claim 32 wherein the execution instructions include a job for executing a predefined set of tasks.
35. The computing device of claim 32 wherein the execution instructions are in an input XML document.
36. The computing device of claim 35 wherein the execution agent of the peer computer is further programmed to process the input XML document to derive the execution instructions for sending to other peer computers.
37. The computing device of claim 36 wherein the execution agent of the peer computer is further programmed to format the execution instructions as a second XML document.
38. The computing device of claim 32 further including a test result database, and wherein the execution agent of the peer computer is further programmed for reporting results of the execution of tasks to the test result database.
US11/150,951 2003-11-24 2005-06-12 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller Abandoned US20050240931A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/150,951 US20050240931A1 (en) 2003-11-24 2005-06-12 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/720,893 US8104043B2 (en) 2003-11-24 2003-11-24 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller
US11/150,951 US20050240931A1 (en) 2003-11-24 2005-06-12 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/720,893 Continuation US8104043B2 (en) 2003-11-24 2003-11-24 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller

Publications (1)

Publication Number Publication Date
US20050240931A1 true US20050240931A1 (en) 2005-10-27

Family

ID=34591672

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/720,893 Expired - Fee Related US8104043B2 (en) 2003-11-24 2003-11-24 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller
US11/150,951 Abandoned US20050240931A1 (en) 2003-11-24 2005-06-12 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/720,893 Expired - Fee Related US8104043B2 (en) 2003-11-24 2003-11-24 System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller

Country Status (1)

Country Link
US (2) US8104043B2 (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070294512A1 (en) * 2006-06-20 2007-12-20 Crutchfield William Y Systems and methods for dynamically choosing a processing element for a compute kernel
US20070294681A1 (en) * 2006-06-20 2007-12-20 Tuck Nathan D Systems and methods for profiling an application running on a parallel-processing computer system
US20070294680A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Systems and methods for compiling an application for a parallel-processing computer system
US20070294671A1 (en) * 2006-06-20 2007-12-20 Demetriou Christopher G Systems and methods for debugging an application running on a parallel-processing computer system
US20070294665A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Runtime system for executing an application in a parallel-processing computer system
US20070294666A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Systems and methods for determining compute kernels for an application in a parallel-processing computer system
US20070294696A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Multi-thread runtime system
US20070294682A1 (en) * 2006-06-20 2007-12-20 Demetriou Christopher G Systems and methods for caching compute kernels for an application running on a parallel-processing computer system
US20070294663A1 (en) * 2006-06-20 2007-12-20 Mcguire Morgan S Application program interface of a parallel-processing computer system that supports multiple programming languages
US20080005547A1 (en) * 2006-06-20 2008-01-03 Papakipos Matthew N Systems and methods for generating reference results using a parallel-processing computer system
US20080244589A1 (en) * 2007-03-29 2008-10-02 Microsoft Corporation Task manager
US20090055686A1 (en) * 2007-08-23 2009-02-26 International Business Machines Corporation Server side logic unit testing
US20110092350A1 (en) * 2009-10-21 2011-04-21 Wilkinson Richard W Systems and methods for folding
US8650280B2 (en) * 2011-12-28 2014-02-11 Sybase, Inc. Monitoring distributed task execution using a chained text messaging system
US9081747B1 (en) 2012-03-06 2015-07-14 Big Bang Llc Computer program deployment to one or more target devices
US11444464B1 (en) * 2016-03-25 2022-09-13 Goal Zero Llc Portable hybrid generator

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20030019596A (en) * 2001-05-23 2003-03-06 소니 가부시끼 가이샤 Broadcast program display method, broadcast program display apparatus, and broadcast receiver
TWI221240B (en) * 2003-11-14 2004-09-21 Via Tech Inc Workflow defining system and workflow managing system
DE102004011201B4 (en) * 2004-03-04 2006-10-12 Siemens Ag Method for managing and monitoring the operation of multiple distributed in at least one communication network integrated hardware and / or software systems and system for performing the method
US8250230B2 (en) * 2004-09-30 2012-08-21 Microsoft Corporation Optimizing communication using scalable peer groups
US20070133520A1 (en) * 2005-12-12 2007-06-14 Microsoft Corporation Dynamically adapting peer groups
US7613703B2 (en) * 2004-09-30 2009-11-03 Microsoft Corporation Organizing resources into collections to facilitate more efficient and reliable resource access
US8549180B2 (en) * 2004-10-22 2013-10-01 Microsoft Corporation Optimizing access to federation infrastructure-based resources
DE502004007302D1 (en) 2004-11-26 2008-07-10 Bae Ro Gmbh & Co Kg sterilizing lamp
US7328199B2 (en) * 2005-10-07 2008-02-05 Microsoft Corporation Componentized slot-filling architecture
US20070101328A1 (en) * 2005-10-31 2007-05-03 Microsoft Corporation Sequencing a single task sequence across multiple operating environments
US20070101342A1 (en) * 2005-10-31 2007-05-03 Microsoft Corporation Automated device driver management
US7822699B2 (en) * 2005-11-30 2010-10-26 Microsoft Corporation Adaptive semantic reasoning engine
US7606700B2 (en) * 2005-11-09 2009-10-20 Microsoft Corporation Adaptive task framework
US20070106496A1 (en) * 2005-11-09 2007-05-10 Microsoft Corporation Adaptive task framework
US7831585B2 (en) * 2005-12-05 2010-11-09 Microsoft Corporation Employment of task framework for advertising
US7933914B2 (en) * 2005-12-05 2011-04-26 Microsoft Corporation Automatic task creation and execution using browser helper objects
US20070130134A1 (en) * 2005-12-05 2007-06-07 Microsoft Corporation Natural-language enabling arbitrary web forms
US20070203869A1 (en) * 2006-02-28 2007-08-30 Microsoft Corporation Adaptive semantic platform architecture
US7996783B2 (en) 2006-03-02 2011-08-09 Microsoft Corporation Widget searching utilizing task framework
US20070257354A1 (en) * 2006-03-31 2007-11-08 Searete Llc, A Limited Liability Corporation Of The State Of Delaware Code installation decisions for improving aggregate functionality
US20110246642A1 (en) * 2006-03-31 2011-10-06 Cohen Alexander J Aggregating network activity using software provenance data
US20080080393A1 (en) * 2006-09-29 2008-04-03 Microsoft Corporation Multiple peer groups for efficient scalable computing
US7881316B2 (en) * 2006-09-29 2011-02-01 Microsoft Corporation Multiple peer groups for efficient scalable computing
US20080080530A1 (en) * 2006-09-29 2008-04-03 Microsoft Corporation Multiple peer groups for efficient scalable computing
US20080080529A1 (en) * 2006-09-29 2008-04-03 Microsoft Corporation Multiple peer groups for efficient scalable computing
US20090070121A1 (en) * 2007-09-11 2009-03-12 Jean-Baptiste Leonelli System, Method And Graphical User Interface For Workflow Generation, Deployment And/Or Execution
US10997531B2 (en) * 2007-09-11 2021-05-04 Ciambella Ltd. System, method and graphical user interface for workflow generation, deployment and/or execution
US9395771B1 (en) * 2007-10-26 2016-07-19 Pce, Inc. Plenum pressure control system
US8656392B2 (en) * 2009-06-10 2014-02-18 The Boeing Company Consensus based distributed task execution
US9075617B2 (en) * 2009-06-19 2015-07-07 Lindsay Ian Smith External agent interface
JP2011053995A (en) * 2009-09-03 2011-03-17 Hitachi Ltd Data processing control method and computer system
US20120174068A1 (en) * 2010-12-30 2012-07-05 Sap Ag Testing Software Code
US8909696B1 (en) * 2011-11-02 2014-12-09 Google Inc. Redundant data requests with redundant response cancellation
GB2511347A (en) 2013-02-28 2014-09-03 Ibm Resource management with conditioned policies
US9075856B2 (en) * 2013-03-14 2015-07-07 Symantec Corporation Systems and methods for distributing replication tasks within computing clusters
JP6494608B2 (en) 2013-06-18 2019-04-03 チャンベッラ・リミテッド Method and apparatus for code virtualization and remote process call generation
EP3019956A4 (en) 2013-07-12 2017-03-15 Ciambella Ltd. Method and apparatus for firmware virtualization
US9465658B1 (en) * 2014-03-27 2016-10-11 Amazon Technologies, Inc. Task distribution over a heterogeneous environment through task and consumer categories
CN112910857B (en) * 2014-09-15 2023-07-11 佩里梅特雷克斯公司 Method for verifying security
US10067490B2 (en) 2015-05-08 2018-09-04 Ciambella Ltd. Method and apparatus for modifying behavior of code for a controller-based device
SG11201708743UA (en) 2015-05-08 2017-11-29 Ciambella Ltd Method and apparatus for automatic software development for a group of controller-based devices
US9729633B2 (en) 2015-05-29 2017-08-08 Microsoft Technology Licensing, Llc Distributed data processing system
EP3394743B1 (en) 2015-12-21 2023-07-12 Ciambella Ltd. Method and apparatus for creating and managing controller based remote solutions
US9961012B2 (en) * 2015-12-21 2018-05-01 Microsoft Technology Licensing, Llc Per-stage assignment of pipelines agents
US11087249B2 (en) 2016-05-24 2021-08-10 Ciambella Ltd. Method and apparatus for triggering execution of a workflow over a network
US10798780B2 (en) 2016-08-22 2020-10-06 Ciambella Ltd. Method and apparatus for creating and managing controller based remote solutions
CN110419024A (en) 2017-03-14 2019-11-05 西安姆贝拉有限公司 Method and apparatus for automatically generating and merging code in a development environment
US10848388B1 (en) * 2019-07-12 2020-11-24 Deloitte Development Llc Distributed computing framework
US11392414B2 (en) 2019-10-04 2022-07-19 Target Brands, Inc. Cooperation-based node management protocol

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969092A (en) * 1988-09-30 1990-11-06 Ibm Corp. Method for scheduling execution of distributed application programs at preset times in an SNA LU 6.2 network environment
US5634011A (en) * 1992-06-18 1997-05-27 International Business Machines Corporation Distributed management communications network
US6157960A (en) * 1997-05-07 2000-12-05 International Business Machines Corporation Technique for programmatically creating distributed object programs
US6167431A (en) * 1997-07-26 2000-12-26 International Business Machines Corp. Server probe method and apparatus for a distributed data processing system
US6549932B1 (en) * 1998-06-03 2003-04-15 International Business Machines Corporation System, method and computer program product for discovery in a distributed computing environment
US6581104B1 (en) * 1996-10-01 2003-06-17 International Business Machines Corporation Load balancing in a distributed computer enterprise environment
US6934755B1 (en) * 2000-06-02 2005-08-23 Sun Microsystems, Inc. System and method for migrating processes on a network
US7159217B2 (en) * 2001-12-20 2007-01-02 Cadence Design Systems, Inc. Mechanism for managing parallel execution of processes in a distributed computing environment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2772304B2 (en) * 1992-04-10 1998-07-02 富士通株式会社 Load balancing method for parallel processing
JP3696901B2 (en) * 1994-07-19 2005-09-21 キヤノン株式会社 Load balancing method

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969092A (en) * 1988-09-30 1990-11-06 Ibm Corp. Method for scheduling execution of distributed application programs at preset times in an SNA LU 6.2 network environment
US5634011A (en) * 1992-06-18 1997-05-27 International Business Machines Corporation Distributed management communications network
US6581104B1 (en) * 1996-10-01 2003-06-17 International Business Machines Corporation Load balancing in a distributed computer enterprise environment
US6157960A (en) * 1997-05-07 2000-12-05 International Business Machines Corporation Technique for programmatically creating distributed object programs
US6167431A (en) * 1997-07-26 2000-12-26 International Business Machines Corp. Server probe method and apparatus for a distributed data processing system
US6549932B1 (en) * 1998-06-03 2003-04-15 International Business Machines Corporation System, method and computer program product for discovery in a distributed computing environment
US6934755B1 (en) * 2000-06-02 2005-08-23 Sun Microsystems, Inc. System and method for migrating processes on a network
US7159217B2 (en) * 2001-12-20 2007-01-02 Cadence Design Systems, Inc. Mechanism for managing parallel execution of processes in a distributed computing environment

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8108844B2 (en) * 2006-06-20 2012-01-31 Google Inc. Systems and methods for dynamically choosing a processing element for a compute kernel
US20070294671A1 (en) * 2006-06-20 2007-12-20 Demetriou Christopher G Systems and methods for debugging an application running on a parallel-processing computer system
US20070294680A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Systems and methods for compiling an application for a parallel-processing computer system
US8136102B2 (en) * 2006-06-20 2012-03-13 Google Inc. Systems and methods for compiling an application for a parallel-processing computer system
US20070294665A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Runtime system for executing an application in a parallel-processing computer system
US8136104B2 (en) * 2006-06-20 2012-03-13 Google Inc. Systems and methods for determining compute kernels for an application in a parallel-processing computer system
US20070294696A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Multi-thread runtime system
US20070294682A1 (en) * 2006-06-20 2007-12-20 Demetriou Christopher G Systems and methods for caching compute kernels for an application running on a parallel-processing computer system
US20070294663A1 (en) * 2006-06-20 2007-12-20 Mcguire Morgan S Application program interface of a parallel-processing computer system that supports multiple programming languages
US20080005547A1 (en) * 2006-06-20 2008-01-03 Papakipos Matthew N Systems and methods for generating reference results using a parallel-processing computer system
US20070294512A1 (en) * 2006-06-20 2007-12-20 Crutchfield William Y Systems and methods for dynamically choosing a processing element for a compute kernel
US8972943B2 (en) 2006-06-20 2015-03-03 Google Inc. Systems and methods for generating reference results using parallel-processing computer system
US8745603B2 (en) 2006-06-20 2014-06-03 Google Inc. Application program interface of a parallel-processing computer system that supports multiple programming languages
US20110010715A1 (en) * 2006-06-20 2011-01-13 Papakipos Matthew N Multi-Thread Runtime System
US8584106B2 (en) 2006-06-20 2013-11-12 Google Inc. Systems and methods for compiling an application for a parallel-processing computer system
US8024708B2 (en) * 2006-06-20 2011-09-20 Google Inc. Systems and methods for debugging an application running on a parallel-processing computer system
US8458680B2 (en) 2006-06-20 2013-06-04 Google Inc. Systems and methods for dynamically choosing a processing element for a compute kernel
US20070294681A1 (en) * 2006-06-20 2007-12-20 Tuck Nathan D Systems and methods for profiling an application running on a parallel-processing computer system
US20070294666A1 (en) * 2006-06-20 2007-12-20 Papakipos Matthew N Systems and methods for determining compute kernels for an application in a parallel-processing computer system
US8146066B2 (en) * 2006-06-20 2012-03-27 Google Inc. Systems and methods for caching compute kernels for an application running on a parallel-processing computer system
US8261270B2 (en) * 2006-06-20 2012-09-04 Google Inc. Systems and methods for generating reference results using a parallel-processing computer system
US8375368B2 (en) 2006-06-20 2013-02-12 Google Inc. Systems and methods for profiling an application running on a parallel-processing computer system
US8381202B2 (en) 2006-06-20 2013-02-19 Google Inc. Runtime system for executing an application in a parallel-processing computer system
US8418179B2 (en) 2006-06-20 2013-04-09 Google Inc. Multi-thread runtime system
US8429617B2 (en) 2006-06-20 2013-04-23 Google Inc. Systems and methods for debugging an application running on a parallel-processing computer system
US8443349B2 (en) 2006-06-20 2013-05-14 Google Inc. Systems and methods for determining compute kernels for an application in a parallel-processing computer system
US8443348B2 (en) 2006-06-20 2013-05-14 Google Inc. Application program interface of a parallel-processing computer system that supports multiple programming languages
US8448156B2 (en) 2006-06-20 2013-05-21 Googe Inc. Systems and methods for caching compute kernels for an application running on a parallel-processing computer system
US20080244589A1 (en) * 2007-03-29 2008-10-02 Microsoft Corporation Task manager
US7865779B2 (en) * 2007-08-23 2011-01-04 International Business Machines Corporation Server side logic unit testing
US20090055686A1 (en) * 2007-08-23 2009-02-26 International Business Machines Corporation Server side logic unit testing
US20110092350A1 (en) * 2009-10-21 2011-04-21 Wilkinson Richard W Systems and methods for folding
US8650280B2 (en) * 2011-12-28 2014-02-11 Sybase, Inc. Monitoring distributed task execution using a chained text messaging system
US9081747B1 (en) 2012-03-06 2015-07-14 Big Bang Llc Computer program deployment to one or more target devices
US11444464B1 (en) * 2016-03-25 2022-09-13 Goal Zero Llc Portable hybrid generator

Also Published As

Publication number Publication date
US8104043B2 (en) 2012-01-24
US20050114854A1 (en) 2005-05-26

Similar Documents

Publication Publication Date Title
US8104043B2 (en) System and method for dynamic cooperative distributed execution of computer tasks without a centralized controller
US6983400B2 (en) Distributed test harness model
JP5258019B2 (en) A predictive method for managing, logging, or replaying non-deterministic operations within the scope of application process execution
CN100435101C (en) Apparatus and method for maintaining resource integrity in a software environment
US20210311858A1 (en) System and method for providing a test manager for use with a mainframe rehosting platform
TW412707B (en) System, method and computer program product for discovery in a distributed computing environment
US7165189B1 (en) Distributed test framework for clustered systems
US6708324B1 (en) Extensible automated testing software
US7937455B2 (en) Methods and systems for modifying nodes in a cluster environment
US7089561B2 (en) Methods and systems for creating and communicating with computer processes
US7207034B2 (en) Undo infrastructure
US7130881B2 (en) Remote execution model for distributed application launch and control
US20040205179A1 (en) Integrating design, deployment, and management phases for systems
US20030051186A1 (en) Methods to restore tests execution after unexpected crashes for use in a distributed test framework
US8086900B2 (en) System, method and computer program product for testing a boot image
US10698767B1 (en) Decentralized management of multi-service workflows
US20030120776A1 (en) System controller for use in a distributed processing framework system and methods for implementing the same
CN113312153B (en) Cluster deployment method and device, electronic equipment and storage medium
US11151020B1 (en) Method and system for managing deployment of software application components in a continuous development pipeline
CN112035062B (en) Migration method of local storage of cloud computing, computer equipment and storage medium
US7313786B2 (en) Grid-enabled ANT compatible with both stand-alone and grid-based computing systems
US7376676B2 (en) Method, system, and program for autonomic copy services solutions
US11720348B2 (en) Computing node allocation based on build process specifications in continuous integration environments
US20080115109A1 (en) Enhanced Hover Help For Software Debuggers
JP3090641B2 (en) Parallel data processing system and control method thereof

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:034766/0001

Effective date: 20141014