On MPI Support

Year 2021

  1. Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M Ciorba, Nathan Debardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N Gansterer, Luc Giraud, Dominik Göddeke, Marco Heisig, Fabienne Jézéquel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S Quintana-Ortí, Francesco Rizzi, Ulrich Rüde, Martin Schulz, Fred Fung, Robert Speck, Linda Stals, Keita Teranishi, Samuel Thibault, Dominik Thönnes, Andreas Wagner, and Barbara Wohlmuth
    Resiliency in numerical algorithm design for extreme scale simulations
    International Journal of High Performance Computing Applications, September 2021
    [WWW] [PDF] Keyword(s): On MPI Support, Fault tolerance, Task-based programming, Checkpoint-restart, Buddy in-memory

    @article{agullo:hal-03348787,
    TITLE = {{Resiliency in numerical algorithm design for extreme scale simulations}},
    AUTHOR = {Agullo, Emmanuel and Altenbernd, Mirco and Anzt, Hartwig and Bautista-Gomez, Leonardo and Benacchio, Tommaso and Bonaventura, Luca and Bungartz, Hans-Joachim and Chatterjee, Sanjay and Ciorba, Florina M and Debardeleben, Nathan and Drzisga, Daniel and Eibl, Sebastian and Engelmann, Christian and Gansterer, Wilfried N and Giraud, Luc and G{\"o}ddeke, Dominik and Heisig, Marco and J{\'e}z{\'e}quel, Fabienne and Kohl, Nils and Li, Xiaoye Sherry and Lion, Romain and Mehl, Miriam and Mycek, Paul and Obersteiner, Michael and Quintana-Ort{\'i}, Enrique S and Rizzi, Francesco and R{\"u}de, Ulrich and Schulz, Martin and Fung, Fred and Speck, Robert and Stals, Linda and Teranishi, Keita and Thibault, Samuel and Th{\"o}nnes, Dominik and Wagner, Andreas and Wohlmuth, Barbara},
    URL = {https://hal.inria.fr/hal-03348787},
    JOURNAL = {{International Journal of High Performance Computing Applications}},
    PUBLISHER = {{SAGE Publications}},
    YEAR = {2021},
    MONTH = Sep,
    PDF = {https://hal.inria.fr/hal-03348787/file/2010.13342.pdf},
    KEYWORDS = {On MPI Support ; Fault tolerance ; Task-based programming ; Checkpoint-restart ; Buddy in-memory},
    HAL_ID = {hal-03348787},
    HAL_VERSION = {v1},
    
    }
    

Year 2020

  1. Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher, and Samuel Thibault
    Using Dynamic Broadcasts to improve Task-Based Runtime Performances
    In Euro-Par - 26th International European Conference on Parallel and Distributed Computing, Warsaw, Poland, August 2020
    Rzadca and Malawski, Springer
    [WWW] [PDF] [doi:10.1007/978-3-030-57675-2_28] Keyword(s): On MPI Support, Task-based runtime system, communications, collective, broadcast

    @inproceedings{denis:hal-02872765,
    TITLE = {{Using Dynamic Broadcasts to improve Task-Based Runtime Performances}},
    AUTHOR = {Denis, Alexandre and Jeannot, Emmanuel and Swartvagher, Philippe and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-02872765},
    BOOKTITLE = {{Euro-Par - 26th International European Conference on Parallel and Distributed Computing}},
    ADDRESS = {Warsaw, Poland},
    ORGANIZATION = {{Rzadca and Malawski}},
    PUBLISHER = {{Springer}},
    YEAR = {2020},
    MONTH = Aug,
    DOI = {10.1007/978-3-030-57675-2_28},
    KEYWORDS = {On MPI Support ; Task-based runtime system ; communications ; collective ; broadcast},
    PDF = {https://hal.inria.fr/hal-02872765/file/dynamic_broadcasts.pdf},
    HAL_ID = {hal-02872765},
    HAL_VERSION = {v1},
    
    }
    
  2. Romain Lion and Samuel Thibault
    From tasks graphs to asynchronous distributed checkpointing with local restart
    In 2020 IEEE/ACM 10th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), Atlanta, USA, November 2020
    [WWW] [PDF] [doi:10.1109/FTXS51974.2020.00009] Keyword(s): On MPI Support, Fault tolerance, Task-based programming, Checkpoint-restart, Buddy in-memory

    @inproceedings{lion:hal-02970529,
    TITLE = {{From tasks graphs to asynchronous distributed checkpointing with local restart}},
    AUTHOR = {Lion, Romain and Thibault, Samuel},
    URL = {https://hal.archives-ouvertes.fr/hal-02970529},
    BOOKTITLE = {{2020 IEEE/ACM 10th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS)}},
    ADDRESS = {Atlanta, USA},
    YEAR = {2020},
    MONTH = Nov,
    DOI = {10.1109/FTXS51974.2020.00009},
    KEYWORDS = {On MPI Support ; Fault tolerance ; Task-based programming ; Checkpoint-restart ; Buddy in-memory},
    PDF = {https://hal.archives-ouvertes.fr/hal-02970529/file/2020001221.pdf},
    HAL_ID = {hal-02970529},
    HAL_VERSION = {v1},
    
    }
    

Year 2019

  1. Romain Lion
    Tolérance aux pannes dans l'exécution distribuée de graphes de tâches
    In Conférence d'informatique en Parallélisme, Architecture et Système, Anglet, France, June 2019
    [WWW] [PDF] Keyword(s): On MPI Support, Task-based, Starpu, HPC, Data locality

    @inproceedings{lion:hal-02296118,
    TITLE = {{Tol{\'e}rance aux pannes dans l'ex{\'e}cution distribu{\'e}e de graphes de t{\^a}ches}},
    AUTHOR = {Lion, Romain},
    URL = {https://hal.inria.fr/hal-02296118},
    BOOKTITLE = {{Conf{\'e}rence d'informatique en Parall{\'e}lisme, Architecture et Syst{\`e}me}},
    ADDRESS = {Anglet, France},
    YEAR = {2019},
    MONTH = Jun,
    KEYWORDS = {On MPI Support ; Task-based ; Starpu ; HPC ; Data locality},
    PDF = {https://hal.inria.fr/hal-02296118/file/Compas_Romain_LION_submitted_final.pdf},
    HAL_ID = {hal-02296118},
    HAL_VERSION = {v1},
    
    }
    

Year 2017

  1. Emmanuel Agullo, Olivier Aumage, Mathieu Faverge, Nathalie Furmento, Florent Pruvost, Marc Sergent, and Samuel Thibault
    Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model
    TPDS - IEEE Transactions on Parallel and Distributed Systems, December 2017
    [WWW] [PDF] [doi:10.1109/TPDS.2017.2766064] Keyword(s): On MPI Support, runtime system, sequential task flow, task-based programming, heterogeneous computing, distributed computing, multicore, GPU, Cholesky factorization

    @article{agullo:hal-01618526,
    TITLE = {{Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Faverge, Mathieu and Furmento, Nathalie and Pruvost, Florent and Sergent, Marc and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01618526},
    JOURNAL = {{TPDS - IEEE Transactions on Parallel and Distributed Systems}},
    PUBLISHER = {{Institute of Electrical and Electronics Engineers}},
    MONTH = DEC,
    YEAR = {2017},
    KEYWORDS = {On MPI Support ; runtime system ; sequential task flow ; task-based programming ; heterogeneous computing ; distributed computing ; multicore ; GPU ; Cholesky factorization},
    PDF = {https://hal.inria.fr/hal-01618526/file/tpds14.pdf},
    HAL_ID = {hal-01618526},
    HAL_VERSION = {v1},
    doi = {10.1109/TPDS.2017.2766064},
    
    }
    

Year 2016

  1. Marc Sergent
    Scalability of a task-based runtime system for dense linear algebra applications
    PhD thesis, Université de Bordeaux, December 2016
    [WWW] [PDF] Keyword(s): On MPI Support, High performance computing, Run-time systems, Distributed computing, Task-based programming, Parallel programming models, Calcul haute performance, Supports d'exécution, Calcul distribué, Programmation par tâches, Modèles de programmation parallèle

    @phdthesis{sergent:tel-01483666,
    TITLE = {{Scalability of a task-based runtime system for dense linear algebra applications}},
    AUTHOR = {Sergent, Marc},
    URL = {https://tel.archives-ouvertes.fr/tel-01483666},
    NUMBER = {2016BORD0372},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On MPI Support ; High performance computing ; Run-time systems ; Distributed computing ; Task-based programming ; Parallel programming models ; Calcul haute performance ; Supports d'ex{\'e}cution ; Calcul distribu{\'e} ; Programmation par t{\^a}ches ; Mod{\`e}les de programmation parall{\`e}le},
    PDF = {https://tel.archives-ouvertes.fr/tel-01483666/file/SERGENT_MARC_2016.pdf},
    HAL_ID = {tel-01483666},
    HAL_VERSION = {v1},
    
    }
    

Year 2014

  1. Emmanuel Agullo, Olivier Aumage, Mathieu Faverge, Nathalie Furmento, Florent Pruvost, Marc Sergent, and Samuel Thibault
    Harnessing clusters of hybrid nodes with a sequential task-based programming model
    In 8th International Workshop on Parallel Matrix Algorithms and Applications, July 2014
    [WWW] [PDF] Keyword(s): On MPI Support

    @inproceedings{agullo:hal-01283949,
    TITLE = {{Harnessing clusters of hybrid nodes with a sequential task-based programming model}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Faverge, Mathieu and Furmento, Nathalie and Pruvost, Florent and Sergent, Marc and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01283949},
    BOOKTITLE = {{8th International Workshop on Parallel Matrix Algorithms and Applications}},
    YEAR = {2014},
    MONTH = Jul,
    PDF = {https://hal.inria.fr/hal-01283949/file/pmaa14.pdf},
    HAL_ID = {hal-01283949},
    HAL_VERSION = {v1},
    keywords = {On MPI Support} 
    }
    
  2. Cédric Augonnet, Olivier Aumage, Nathalie Furmento, Samuel Thibault, and Raymond Namyst
    StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators
    Research Report RR-8538, INRIA, May 2014
    [WWW] [PDF] Keyword(s): On MPI Support, StarPU

    @techreport{augonnet:hal-00992208,
    hal_id = {hal-00992208},
    url = {http://hal.inria.fr/hal-00992208},
    title = {{StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators}},
    author = {Augonnet, C{\'e}dric and Aumage, Olivier and Furmento, Nathalie and Thibault, Samuel and Namyst, Raymond},
    language = {Anglais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI},
    type = {Research Report},
    institution = {INRIA},
    number = {RR-8538},
    year = {2014},
    month = May,
    pdf = {http://hal.inria.fr/hal-00992208/PDF/RR-8538.pdf},
    KEYWORDS = {On MPI Support;StarPU} 
    }
    

Year 2012

  1. Cédric Augonnet, Olivier Aumage, Nathalie Furmento, Raymond Namyst, and Samuel Thibault
    StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators
    In Jesper Larsson Träff, Siegfried Benkner, and Jack Dongarra, editors, EuroMPI 2012, volume 7490 of LNCS, September 2012
    Springer
    Note: Poster Session
    [WWW] [PDF] Keyword(s): On MPI Support, StarPU

    @InProceedings{AugAumFurNamThi2012EuroMPI,
    author = {C{\'e}dric Augonnet and Olivier Aumage and Nathalie Furmento and Raymond Namyst and Samuel Thibault},
    title = {{StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators}},
    booktitle = {EuroMPI 2012},
    year = 2012,
    editor = {Jesper Larsson Tr{\"a}ff and Siegfried Benkner and Jack Dongarra},
    volume = {7490},
    series = {LNCS},
    month = SEP,
    note = {Poster Session},
    publisher = {Springer},
    url = {http://hal.inria.fr/hal-00725477},
    pdf = {http://hal.inria.fr/hal-00725477/document},
    KEYWORDS = {On MPI Support;StarPU} 
    }
    

Author: root

Created: 2022-05-18 Wed 08:25
