Enhancing throughput for streaming applications running on cluster systems
Data de publicació2013
MetadadesMostra el registre d'unitat complet
The exploitation of throughput in a parallel application that processes an input data stream is a difficult challenge. For typical coarse-grain applications, where the computation time of tasks is greater than their communication time, the maximum achievable throughput is determined by the maximum task computation time. Thus, the improvement in throughput above this maximum would eventually require the modification of the source code of the tasks. In this work, we address the improvement of throughput by proposing two task replication methodologies that have the target throughput to be achieved as an input parameter. They proceed by generating a new task graph structure that permits the target throughput to be achieved. The first replication mechanism, named DPRM (Data Parallel Replication Mechanism), exploits the inner task data parallelism. The second mechanism, named TCRM (Task Copy Replication Mechanism), creates new execution paths inside the application task graph structure that allows more than one instance of data to be processed concurrently. We evaluate the effectiveness of these mechanisms with three real applications executed in a cluster system: the MPEG2 video compressor, the IVUS (Intra-Vascular Ultra-Sound) medical image application and the BASIZ (Bright and SAtured Images Zone) video processing application. In all these cases, the obtained throughput was greater after applying the proposed replication mechanism than what the application could provide with the original implementation.
És part deJournal of Parallel and Distributed Computing, 2013, vol. 73, núm. 8, p. 1092-1105
Projectes de recerca europeus
Mostrant elements relacionats per títol, autor i matèria.
Guirado Fernández, Fernando; Ripoll, A.; Roig Mateu, Concepció; Hernàndez, A.; Luque, Emilio (Springer Verlag, 2006)There is a large range of image processing applications that act on an input sequence of image frames that are continuously received. Throughput is a key performance measure to be optimized when executing them. In this ...
Yuan, X.; Roig Mateu, Concepció; Ripoll, A.; Senar, M.A.; Guirado Fernández, Fernando; Luque, Emilio (Springer Verlag, 2002)The mapping of parallel applications constitutes a difficult problem for which very few practical tools are available. AMEEDA has been developed in order to overcome the lack of a general-purpose mapping tool. The ...
Guirado Fernández, Fernando; Ripoll, A.; Roig Mateu, Concepció; Luque, Emilio (Springer Verlag, 2004)Pipeline applications simultaneously execute different instances from an input data set. Performance parameters for such applications are latency (the time taken to process an individual data set) and throughput (the ...