Selected papers



  • Configurable XOR hash functions for banked scratchpad memories in GPUs.
    Gert-Jan van den Braak, Juan Gómez-Luna, José María González-Linares, Henk Corporaal, and Nicolás Guil,
    IEEE Transactions on Computers. Volume 65, Issue 7, July 2016, 2045-2058
    DOI: 10.1109/TC.2015.2479595

  • In-place matrix transposition on GPUs.
    Juan Gómez-Luna, I-Jui Sung, Li-Wen Chang, José María González-Linares, Nicolás Guil, and Wen-Mei W. Hwu,
    IEEE Transactions on Parallel and Distributed Systems. Volume 27, Issue 3, March 2016, 776-788
    DOI: 10.1109/TPDS.2015.2412549

  • In-place data sliding algorithms for many-core architectures.
    Juan Gómez-Luna, Li-Wen Chang, I-Jui Sung, Nicolás Guil, and Wen-Mei W. Hwu,
    44th International Conference on Parallel Processing (ICPP), 210-219 (2015)
    DOI: 10.1109/ICPP.2015.30

  • In-place transposition of rectangular matrices on accelerators.
    I-Jui Sung, Juan Gómez-Luna, José María González-Linares, Nicolás Guil, and Wen-Mei W. Hwu,
    19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 207-218 (2014)
    DOI: 10.1145/2555243.2555266

  • Simulation and architecture improvements of atomic operations on GPU scratchpad memory.
    Gert-Jan van den Braak, Juan Gómez-Luna, Henk Corporaal, José María González-Linares, and Nicolás Guil,
    IEEE 31st International Conference on Computer Design (ICCD), 357-362 (2013)
    DOI: 10.1109/ICCD.2013.6657065

  • Performance modeling of atomic additions on GPU scratchpad memory.
    Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, and Nicolás Guil,
    IEEE Transactions on Parallel and Distributed Systems. Volume 24, Issue 11, November 2013, 2273-2282
    DOI: 10.1109/TPDS.2012.319

  • An optimized approach to histogram computation on GPU.
    Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, and Nicolás Guil,
    Machine Vision and Applications. Volume 24, Issue 5, July 2013, 899-908
    DOI: 10.1007/s00138-012-0443-3

  • Performance models for asynchronous data transfers on consumer Graphics Processing Units.
    Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, and Nicolás Guil,
    Journal of Parallel and Distributed Computing. Volume 72, Issue 9, September 2012, 1117-1126
    DOI: 10.1016/j.jpdc.2011.07.011

  • Egomotion compensation and moving objects detection algorithm on GPU.
    Juan Gómez-Luna, Holger Endt, Walter Stechele, José María González-Linares, José Ignacio Benavides, and Nicolás Guil,
    ParCo 2011, Advances in Parallel Computing 22, 183-190 (2012)
    DOI: 10.3233/978-1-61499-041-3-183

  • Load balancing versus occupancy maximization: the generalized Hough transform as a case study.
    Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, Emilio L. Zapata, and Nicolás Guil,
    International Journal of High Performance Computing Applications. May 2011; 25 (2), 205-222
    DOI: 10.1177/1094342010383998

  • Parallelization of a video segmentation algorithm on CUDA-enabled Graphics Processing Units.
    Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, and Nicolás Guil,
    Euro-Par 2009, LNCS 5704, 924-935 (2009)
    DOI: 10.1007/978-3-642-03869-3_85



  • Other documents



  • Programming Graphics Processing Units for video processing.
    Juan Gómez-Luna,
    Seminar. Munich, July 2012
    Download PDF

  • Programming issues for video analysis on Graphics Processing Units.
    Juan Gómez-Luna,
    PhD thesis. February 2012
    Download PDF