Batch processing in workflow schedules activity instances in multiple workflow cases of the same workflow type to run as a group. It can optimize business processes execution dynamically. To achieve this goal, it is necessary to define a dataflow operation language to group and ungroup the data in multiple cases of a workflow. Though our previous work has preliminarily investigated the model and its implementation, there is still lack of a formally defined model. In this paper, we first propose a method that is based on a partial vector space to model the dataflow in multiple workflow cases. Based on this model, the data operation primitives for batch processing are specified and defined formally. Since most WfMSs (Workflow Management Systems) use RDBMS (relational database management system) to store their data currently, an SQL (Structured Query Language)-like implementation language, namely DBOL (Data Batch Operation Language), is proposed. Evaluation experiments have also been done to show its performance.
Concurrency and Computation: Practice and Experience: selected papers from the Fourth International Workshop on Workﬂow Management (ICWM2009), Geneva, Switzerland, 04 May 2009, Vol. 23, no. 16 (Nov 2011), pp. 1936-1950
Copyright © 2011 John Wiley & Sons, Ltd.