Publications of year 2016

Table of Contents

Thesis

  1. Marc Sergent
    Scalability of a task-based runtime system for dense linear algebra applications
    PhD thesis, Université de Bordeaux, December 2016
    [WWW] [PDF] Keyword(s): On MPI Support, High performance computing, Run-time systems, Distributed computing, Task-based programming, Parallel programming models, Calcul haute performance, Supports d'exécution, Calcul distribué, Programmation par tâches, Modèles de programmation parallèle

    @phdthesis{sergent:tel-01483666,
    TITLE = {{Scalability of a task-based runtime system for dense linear algebra applications}},
    AUTHOR = {Sergent, Marc},
    URL = {https://tel.archives-ouvertes.fr/tel-01483666},
    NUMBER = {2016BORD0372},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On MPI Support ; High performance computing ; Run-time systems ; Distributed computing ; Task-based programming ; Parallel programming models ; Calcul haute performance ; Supports d'ex{\'e}cution ; Calcul distribu{\'e} ; Programmation par t{\^a}ches ; Mod{\`e}les de programmation parall{\`e}le},
    PDF = {https://tel.archives-ouvertes.fr/tel-01483666/file/SERGENT_MARC_2016.pdf},
    HAL_ID = {tel-01483666},
    HAL_VERSION = {v1},
    
    }
    

Conference articles

  1. Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, and Suraj Kumar
    Are Static Schedules so Bad ? A Case Study on Cholesky Factorization
    In Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium, IPDPS'16, Chicago, IL, USA, May 2016
    IEEE
    [WWW] [PDF] Keyword(s): On Scheduling, Cholesky Factorization, Accelerators, Heterogeneous Systems, Runtime Systems, Scheduling, Unrelated Machines

    @inproceedings{agullo:hal-01223573,
    TITLE = {{Are Static Schedules so Bad ? A Case Study on Cholesky Factorization}},
    AUTHOR = {Agullo, Emmanuel and Beaumont, Olivier and Eyraud-Dubois, Lionel and Kumar, Suraj},
    URL = {https://hal.inria.fr/hal-01223573},
    ADDRESS = {Chicago, IL, USA},
    PUBLISHER = {{IEEE}},
    BOOKTITLE = {Proceedings of the 30th IEEE International Parallel \& Distributed Processing Symposium, IPDPS'16},
    YEAR = {2016},
    MONTH = May,
    keywords = {On Scheduling; Cholesky Factorization ; Accelerators ; Heterogeneous Systems ; Runtime Systems; Scheduling ; Unrelated Machines},
    PDF = {https://hal.inria.fr/hal-01223573/file/heteroprioCameraReady-ieeeCompatiable.pdf},
    HAL_ID = {hal-01223573},
    HAL_VERSION = {v2},
    
    }
    
  2. Olivier Beaumont, Terry Cojean, Lionel Eyraud-Dubois, Abdou Guermouche, and Suraj Kumar
    Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources
    In International Conference on High Performance Computing, Data, and Analytics (HiPC), Hyderabad, India, December 2016
    [WWW] [PDF] Keyword(s): On Parallel Tasks, STARPU, Scheduling, Linear Algebra, Heterogeneous Platforms, Task-based Scheduling, Cholesky Factorization, Simulation, Resource Aggregation

    @inproceedings{beaumont:hal-01361992,
    TITLE = {{Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources}},
    AUTHOR = {Beaumont, Olivier and Cojean, Terry and Eyraud-Dubois, Lionel and Guermouche, Abdou and Kumar, Suraj},
    URL = {https://hal.inria.fr/hal-01361992},
    BOOKTITLE = {{International Conference on High Performance Computing, Data, and Analytics (HiPC)}},
    ADDRESS = {Hyderabad, India},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On Parallel Tasks; STARPU ; Scheduling ; Linear Algebra ; Heterogeneous Platforms ; Task-based Scheduling ; Cholesky Factorization ; Simulation ; Resource Aggregation},
    PDF = {https://hal.inria.fr/hal-01361992v2/document},
    HAL_ID = {hal-01361992},
    HAL_VERSION = {v1},
    
    }
    
  3. Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier
    Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines
    In HeteroPar'2016 workshop of Euro-Par, Grenoble, France, August 2016
    [WWW] [PDF] Keyword(s): On Parallel Tasks, dense linear algebra, Cholesky, Multicore, accelerator, GPU, heterogeneous computing, task DAG, runtime system

    @inproceedings{cojean:hal-01181135,
    TITLE = {{Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines}},
    AUTHOR = {Cojean, Terry and Guermouche, Abdou and Hugo, Andra and Namyst, Raymond and Wacrenier, Pierre-Andr{\'e}},
    URL = {https://hal.inria.fr/hal-01181135},
    BOOKTITLE = {{HeteroPar'2016 workshop of Euro-Par}},
    ADDRESS = {Grenoble, France},
    YEAR = {2016},
    MONTH = Aug,
    KEYWORDS = {On Parallel Tasks; dense linear algebra ; Cholesky ; Multicore ; accelerator ; GPU ; heterogeneous computing ; task DAG ; runtime system},
    PDF = {https://hal.inria.fr/hal-01181135/file/papier%20%281%29.pdf},
    HAL_ID = {hal-01181135},
    HAL_VERSION = {v3},
    
    }
    
  4. Vinicius Garcia Pinto, Luka Stanisic, Arnaud Legrand, Lucas Mello Schnorr, Samuel Thibault, and Vincent Danjean
    Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach
    In VPA - 3rd Workshop on Visual Performance Analysis, Salt Lake City, USA, November 2016
    Note: Held in conjunction with SC16
    [WWW] [PDF] [doi:10.1109/VPA.2016.008] Keyword(s): On Scheduling, STARPU

    @inproceedings{garciapinto:hal-01353962,
    TITLE = {{Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach}},
    AUTHOR = {Garcia Pinto, Vinicius and Stanisic, Luka and Legrand, Arnaud and Mello Schnorr, Lucas and Thibault, Samuel and Danjean, Vincent},
    URL = {https://hal.inria.fr/hal-01353962},
    NOTE = {Held in conjunction with SC16},
    BOOKTITLE = {{VPA - 3rd Workshop on Visual Performance Analysis}},
    ADDRESS = {Salt Lake City, USA},
    YEAR = {2016},
    MONTH = Nov,
    KEYWORDS = {On Scheduling; STARPU},
    PDF = {https://hal.inria.fr/hal-01353962v2/document},
    HAL_ID = {hal-01353962},
    HAL_VERSION = {v1},
    doi = {10.1109/VPA.2016.008},
    
    }
    
  5. Johan Janzén, David Black-Schaffer, and Andra Hugo
    Partitioning GPUs for Improved Scalability
    In IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), October 2016
    [WWW] [doi:10.1109/SBAC-PAD.2016.14] Keyword(s): On Scheduling

    @InProceedings{JaBlHU2016a,
    author = {Johan Janz{\'e}n and David Black-Schaffer and Andra Hugo},
    title = {{Partitioning GPUs for Improved Scalability}},
    booktitle = {IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)},
    year = 2016,
    KEYWORDS = {On Scheduling},
    DOI = {10.1109/SBAC-PAD.2016.14},
    URL = {http://ieeexplore.ieee.org/abstract/document/7789322/},
    month = Oct
    }
    
  6. Marc Sergent, David Goudin, Samuel Thibault, and Olivier Aumage
    Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System
    In HIPS - 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, Chicago, USA, May 2016
    [WWW] [PDF] [doi:10.1109/IPDPSW.2016.105] Keyword(s): On Memory Control, memory control, task-based run-time systems, compressed linear algebra, distributed computing

    @inproceedings{sergent:hal-01284004,
    TITLE = {{Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System}},
    AUTHOR = {Sergent, Marc and Goudin, David and Thibault, Samuel and Aumage, Olivier},
    URL = {https://hal.inria.fr/hal-01284004},
    BOOKTITLE = {{HIPS - 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments}},
    ADDRESS = {Chicago, USA},
    YEAR = {2016},
    MONTH = May,
    keywords = {On Memory Control; memory control ; task-based run-time systems ; compressed linear algebra ; distributed computing},
    PDF = {https://hal.inria.fr/hal-01284004/file/PID4127657.pdf},
    HAL_ID = {hal-01284004},
    HAL_VERSION = {v1},
    doi = {10.1109/IPDPSW.2016.105},
    
    }
    

Internal reports

  1. Emmanuel Agullo, Olivier Aumage, Berenger Bramas, Olivier Coulaud, and Samuel Pitoiset
    Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method
    Research Report RR-8953, Inria, March 2016
    [WWW] [PDF] Keyword(s): On OpenMP Support on top of StarPU, STARPU, runtime system, parallel programming model, compiler, priority, commutativity, multicore architecture, moteur d'exécution, modèle de programmation parallèle, compilateur, OpenMP 4.0, OpenMP 4.X, priorité, commutativité, architecture multicore

    @techreport{agullo:hal-01372022,
    TITLE = {{Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Bramas, Berenger and Coulaud, Olivier and Pitoiset, Samuel},
    URL = {https://hal.inria.fr/hal-01372022},
    TYPE = {Research Report},
    NUMBER = {RR-8953},
    PAGES = {49},
    INSTITUTION = {{Inria}},
    YEAR = {2016},
    MONTH = Mar,
    KEYWORDS = {On OpenMP Support on top of StarPU; STARPU ; runtime system ; parallel programming model ; compiler ; priority ; commutativity ; multicore architecture ; moteur d'ex{\'e}cution ; mod{\`e}le de programmation parall{\`e}le ; compilateur ; OpenMP 4.0 ; OpenMP 4.X ; priorit{\'e} ; commutativit{\'e} ; architecture multicore},
    PDF = {https://hal.inria.fr/hal-01372022/file/RR-8953.pdf},
    HAL_ID = {hal-01372022},
    HAL_VERSION = {v1},
    
    }
    
  2. Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Martin Khannouz, and Luka Stanisic
    Task-based fast multipole method for clusters of multicore processors
    Research Report RR-8970, Inria Bordeaux Sud-Ouest, October 2016
    [WWW] [PDF] Keyword(s): On Applications, STARPU, multicore processor, runtime system, FMM, cluster, high performance computing (HPC), fast multipole method, hybrid parallelization, task-based programming, MPI, OpenMP

    @techreport{agullo:hal-01387482,
    TITLE = {{Task-based fast multipole method for clusters of multicore processors}},
    AUTHOR = {Agullo, Emmanuel and Bramas, B{\'e}renger and Coulaud, Olivier and Khannouz, Martin and Stanisic, Luka},
    URL = {https://hal.inria.fr/hal-01387482},
    TYPE = {Research Report},
    NUMBER = {RR-8970},
    PAGES = {15 },
    INSTITUTION = {{Inria Bordeaux Sud-Ouest}},
    YEAR = {2016},
    MONTH = Oct,
    KEYWORDS = {On Applications; STARPU ; multicore processor ; runtime system ; FMM ; cluster ; high performance computing (HPC) ; fast multipole method ; hybrid parallelization ; task-based programming ; MPI ; OpenMP},
    PDF = {https://hal.inria.fr/hal-01387482/file/report-8970.pdf},
    HAL_ID = {hal-01387482},
    HAL_VERSION = {v1},
    
    }
    
  3. E Agullo, L Giraud, A Guermouche, S Nakov, and Jean Roman
    Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures
    Research Report 8912, Inria Bordeaux Sud-Ouest, May 2016
    [WWW] [PDF] Keyword(s): High Performance Computing (HPC), multi-GPUs, heterogeneous architectures, task-based model, runtime system, sparse linear systems, Conjugate Gradient., On Applications, StarPU, scheduling

    @techreport{agullo:hal-01316982,
    TITLE = {{Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures}},
    AUTHOR = {Agullo, E and Giraud, L and Guermouche, A and Nakov, S and Roman, Jean},
    URL = {https://hal.inria.fr/hal-01316982},
    TYPE = {Research Report},
    NUMBER = {8912},
    INSTITUTION = {{Inria Bordeaux Sud-Ouest}},
    YEAR = {2016},
    MONTH = May,
    KEYWORDS = {High Performance Computing (HPC) ; multi-GPUs ; heterogeneous architectures ; task-based model ; runtime system ; sparse linear systems ; Conjugate Gradient.},
    PDF = {https://hal.inria.fr/hal-01316982/file/RR-8912.pdf},
    HAL_ID = {hal-01316982},
    HAL_VERSION = {v1},
    KEYWORDS = {On Applications; StarPU, scheduling} 
    }
    

Miscellaneous

  1. Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier
    Resource aggregation for task-based Cholesky Factorization on top of modern architectures
    Note: This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops, November 2016
    [WWW] [PDF] Keyword(s): On Parallel Tasks, Intel Xeon-Phi KNL, heterogeneous computing, GPU, accelerator, Multicore, dense linear algebra, task DAG, Cholesky factorization, runtime system

    @unpublished{cojean:hal-01409965,
    TITLE = {{Resource aggregation for task-based Cholesky Factorization on top of modern architectures}},
    AUTHOR = {Cojean, Terry and Guermouche, Abdou and Hugo, Andra and Namyst, Raymond and Wacrenier, Pierre-Andr{\'e}},
    URL = {https://hal.inria.fr/hal-01409965},
    NOTE = {This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops},
    YEAR = {2016},
    MONTH = Nov,
    KEYWORDS = {On Parallel Tasks; Intel Xeon-Phi KNL ; heterogeneous computing ; GPU ; accelerator ; Multicore ; dense linear algebra ; task DAG ; Cholesky factorization ; runtime system},
    PDF = {https://hal.inria.fr/hal-01409965/file/submission.pdf},
    HAL_ID = {hal-01409965},
    HAL_VERSION = {v1},
    
    }
    

Author: root

Created: 2022-05-18 Wed 08:25

Validate