Publications of year 2016
Thesis
- Marc Sergent. Scalability of a task-based runtime system for dense linear algebra applications. PhD thesis, Université de Bordeaux, December 2016.
Articles in journal, book chapters
- Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of modern architectures. Parallel Computing, 83:73-92, November 2016. Note: This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops. [doi:10.1016/j.parco.2018.10.007]
Conference articles
-
Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, and Suraj Kumar. Are Static Schedules so Bad ? A Case Study on Cholesky Factorization. In Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium, IPDPS’16, Chicago, IL, USA, May 2016. IEEE. [doi:10.1109/IPDPS.2016.90]
-
Nolwenn Balin, Guillaume Sylvand, and Jérôme Robert. Fast methods applied to BEM solvers for acoustic propagation problems. In 22nd AIAA/CEAS Aeroacoustics Conference, pages 2712, 2016. [doi:10.2514/6.2016-2712]
-
Olivier Beaumont, Terry Cojean, Lionel Eyraud-Dubois, Abdou Guermouche, and Suraj Kumar. Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources. In International Conference on High Performance Computing, Data, and Analytics (HiPC), Hyderabad, India, December 2016. [doi:10.1109/HiPC.2016.045]
-
Terry Cojean. Exploiting Two-Level Parallelism by Aggregating Computing Resources in Task-Based Applications Over Accelerator-Based Machines. In SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016.
-
Terry Cojean. The StarPU Runtime System at Exascale ?. In RESPA workshop at SC16, Salt Lake City, Utah, United States, November 2016.
-
Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines. In HeteroPar’2016 workshop of Euro-Par, Grenoble, France, August 2016.
-
Terry Cojean, Abdou Guermouche, Andra-Ecaterina Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation in task-based applications over accelerator-based multicore machines. In HeteroPar’2016 worshop of Euro-Par, Grenoble, France, August 2016.
-
Vinicius Garcia Pinto, Luka Stanisic, Arnaud Legrand, Lucas Mello Schnorr, Samuel Thibault, and Vincent Danjean. Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach. In VPA - 3rd Workshop on Visual Performance Analysis, Salt Lake City, USA, November 2016. Note: Held in conjunction with SC16. [doi:10.1109/VPA.2016.008]
-
Johan Janzén, David Black-Schaffer, and Andra Hugo. Partitioning GPUs for Improved Scalability. In IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), October 2016. [doi:10.1109/SBAC-PAD.2016.14]
-
Marc Sergent, David Goudin, Samuel Thibault, and Olivier Aumage. Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. In HIPS - 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, Chicago, USA, May 2016. [doi:10.1109/IPDPSW.2016.105]
Internal reports
-
Emmanuel Agullo, Olivier Aumage, Berenger Bramas, Olivier Coulaud, and Samuel Pitoiset. Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method. Research Report RR-8953, Inria, March 2016.
-
Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Martin Khannouz, and Luka Stanisic. Task-based fast multipole method for clusters of multicore processors. Research Report RR-8970, Inria Bordeaux Sud-Ouest, October 2016.
-
E Agullo, L Giraud, A Guermouche, S Nakov, and Jean Roman. Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures. Research Report 8912, Inria Bordeaux Sud-Ouest, May 2016.