Abstract
This paper discusses an OpenCL version of a volumetric JPEG 2000 codec that runs on GPUs, multi-core processors, or a combination of both. Because the performance-critical part consists of a fine-grained algorithm (the discrete wavelet transform) and a coarse-grained algorithm (Tier-1), the best performance is obtained with a hybrid execution in which the discrete wavelet transform runs on a GPU and Tier-1 runs on a multi-core processor. Combining an Intel i7 multi-core processor with a modest NVIDIA Quadro K620 GPU yields speedups greater than 10 compared with the original sequential code. The paper discusses the performance bottlenecks that arise on GPUs when parallelizing algorithms that are coarse-grained by nature, along with the optimizations that are possible. A performance analysis reveals the inefficiencies and explains the deviations from the GPU peak performance.
Original language | English |
---|---|
Pages (from-to) | 229-245 |
Number of pages | 17 |
Journal | International Journal of High Performance Computing Applications |
Volume | 31 |
Issue number | 3 |
Early online date | 10 May 2016 |
DOIs | |
Status | Published - 1 May 2017 |
Fingerprint
Dive into the research topics of 'Heterogeneous acceleration of volumetric JPEG 2000 using OpenCL'. Together they form a unique fingerprint.
Projects
- 2 Finished
- EU465: Sparse signal coding for interference-based imaging modalities (INTERFERE)
  1 Jun 2014 → 31 May 2019
  Project: Fundamental
- SRP11: SRP (Focus area): Processing of large-scale multi-dimensional, multi-spectral, multi-sensorial and distributed data (M³D²)
  Schelkens, P., Deligiannis, N., Jansen, B., Kuijk, M., Munteanu, A., Sahli, H., Steenhaut, K., Stiens, J., Cornelis, J. P. & Vounckx, R.
  1 Nov 2012 → 31 Dec 2023
  Project: Fundamental