
Leveraging task-parallelism in message-passing dense matrix factorizations using SMPSs


Abstract

In this paper, we investigate how to exploit task-parallelism during the execution of the Cholesky factorization on clusters of multicore processors with the SMPSs programming model. Our analysis reveals that the major difficulties in adapting the code for this operation in ScaLAPACK to SMPSs lie in algorithmic restrictions and the semantics of the SMPSs programming model, but also that both can be overcome with limited programming effort. The experimental results report considerable gains in performance and scalability of the routine parallelized with SMPSs when compared with conventional approaches to executing the original ScaLAPACK implementation in parallel, as well as with two recent message-passing routines for this operation. In summary, our study opens the door to the possibility of reusing message-passing legacy codes/libraries for linear algebra, by introducing up-to-date techniques like dynamic out-of-order scheduling that significantly upgrade their performance, while avoiding a costly rewrite/reimplementation.

Bibliographic record

  • Source
    Parallel Computing | 2014, Issue 6 | pp. 113-128 | 16 pages
  • Author affiliations

    Centre Internacional de Metodes Numerics en Enginyeria (CIMNE), Parc Mediterrani de la Tecnologia, Esteve Terradas 5, 08860 Castelldefels, Spain, Universitat Politecnica de Catalunya, Jordi Girona 1-3, Edifici C1, 08034 Barcelona, Spain;

    Edinburgh Parallel Computing Centre, University of Edinburgh, UK;

    Barcelona Supercomputing Center (BSC-CNS), 08034 Barcelona, Spain, Artificial Intelligence Research Institute (IIIA), Spanish National Research Council (CSIC), Spain;

    Depto. de Ingenieria y Ciencia de Computadores, Universidad Jaume I (UJI), 12.071 Castellon, Spain;

  • Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA;
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords

    Task parallelism; Message-passing numerical libraries; Linear algebra; Clusters of multi-core processors;

