{"id":893,"date":"2023-08-15T11:16:38","date_gmt":"2023-08-15T18:16:38","guid":{"rendered":"https:\/\/labs.wsu.edu\/scads\/?page_id=893"},"modified":"2025-08-20T16:54:24","modified_gmt":"2025-08-20T23:54:24","slug":"parallel-computing","status":"publish","type":"page","link":"https:\/\/labs.wsu.edu\/scads\/research\/parallel-computing\/","title":{"rendered":"Parallel Computing"},"content":{"rendered":"\n<h1 class=\"wp-block-heading\">Parallel Computing<\/h1>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity\" \/>\n\n\n\n<p>Graph (network) computations are critical kernels in many algorithms in data mining, data analysis, scientific computing, computational science and engineering, etc. In large-scale applications, the graph computations need to be performed in parallel. Parallelizing graph algorithms effectively \u2014 with emphasis on scalability and performance \u2014 is particularly challenging for a variety of reasons: In many graph algorithms runtime is dominated by memory latency rather than processor speed, there exist little computation to hide memory access costs, data locality is poor, and available concurrency is low.<\/p>\n\n\n\n<p>Listed below in reverse chronological order are papers we have written together with a number of different collaborators introducing a range of techniques for dealing with these challenges in the context of a variety graph problems. Much of Assefaw&#8217;s earlier work in this direction focused on distributed-memory platforms. His more recent effort targets the emerging and rapidly growing multicore platforms as well as massively multithreaded platforms. The list also includes his recent works on other combinatorial problems than graph problems and on problems around matrix computations. At SCADS, we are in general interested in exploring the interplay between algorithms, architectures, and applications in developing scalable systems.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity\" \/>\n\n\n\n<h2 class=\"wp-block-heading\">Software<\/h2>\n\n\n\n<p>MPI-based implementations of some of the distributed memory parallel algorithms for the distance-1 and distance-2 coloring problems we developed together with colleagues are publicly available via the&nbsp;<a href=\"http:\/\/www.cs.sandia.gov\/Zoltan\/\">Zoltan<\/a> software toolkit of Sandia National Laboratories.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity\" \/>\n\n\n\n<h2 class=\"wp-block-heading\">Papers<\/h2>\n\n\n<div class=\"wsu-row wsu-row--thirds\" >\r\n    \n<div class=\"wsu-column\"  style=\"\">\r\n\t\n<div class=\"wsu-cta  wsu-text-align--center\" >\n\t<a \t\t\t\t\thref=\"#distributed_mem_architecture\"\t\n\t\tclass=\"wsu-button  wsu-button--style-round\">\n\t\t\t\n\t\tDistributed memory architectures\t\t\t<\/a>\n<\/div>\n\n<\/div>\r\n\n\n<div class=\"wsu-column\"  style=\"\">\r\n\t\n<div class=\"wsu-cta  wsu-text-align--center\" >\n\t<a \t\t\t\t\thref=\"#multi_architecture\"\t\n\t\tclass=\"wsu-button  wsu-button--style-round\">\n\t\t\t\n\t\tMulti-core and multi-threaded architectures\t\t\t<\/a>\n<\/div>\n\n<\/div>\r\n\n\n<div class=\"wsu-column\"  style=\"\">\r\n\t\n<div class=\"wsu-cta  wsu-text-align--center\" >\n\t<a \t\t\t\t\thref=\"#shared_mem_architecture\"\t\n\t\tclass=\"wsu-button  wsu-button--style-round\">\n\t\t\t\n\t\tShared memory architectures\t\t\t<\/a>\n<\/div>\n\n<\/div>\r\n\n<\/div>\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" id=\"distributed_mem_architecture\" \/>\n\n\n\n<h3 class=\"wp-block-heading\">Distributed-memory architectures<strong>&nbsp;<\/strong><\/h3>\n\n\n\n<ul>\n<li>S. Ghosh, M. Halappanavar, A. Kalyanaraman and A.H. Gebremedhin.&nbsp;<strong>Exploring MPI Communication Models for Graph Applications Using Graph Matching as a Case Study,<\/strong>&nbsp;IEEE International Parallel and Distributed Processing Symposium (IPDPS 2019).<br><a href=\"#mpi_graph_matching_2019\">Abstract<\/a><\/li>\n\n\n\n<li>S. Ghosh, M. Halappanavar, A. Tumeo, A. Kalyanaraman and A.H. Gebremedhin.&nbsp;<strong>miniVite: A graph Analytics Benchmarking Tool for Massively Parallel Systems<\/strong>, IEEE International Workshop on Performance Modeling, Benchmarking and Simulation (PMBS&#8217;18), held in conjunction with ACM\/IEEE Supercomputing (SC&#8217;18).<br><a href=\"#minivite_benchmarking_2018\">Abstract<\/a><\/li>\n\n\n\n<li>Sayan Ghosh, Mahantesh Halappanavar, Antonino Tumeo, Ananth Kalyanaraman and Assefaw H. Gebremedhin.&nbsp;<strong><a href=\"https:\/\/s3.wp.wsu.edu\/uploads\/sites\/3315\/2021\/02\/PID5561577.pdf\">Scalable Distributed Memory Community Detection using Vite<\/a><\/strong>. 2018 IEEE High Performance Extreme Computing Conference (HPEC&#8217;18).<br><a href=\"#scalable_vite_2018\">Abstract<\/a><\/li>\n\n\n\n<li>S. Ghosh, M. Halappanavar, A. Tumeo, A. Kalyanaraman, H. Lu, D. Chavarria-Miranda, A. Khan and A. H. Gebremedhin,&nbsp;<strong>Distributed Louvain Algorithm for Graph Community Detection,<\/strong>&nbsp;IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018). Abstract&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Paper in PDF<\/li>\n\n\n\n<li>S. Ghosh, J.R. Hammond, A.J. Pena, P. Balaji, A.H. Gebremedhin and B. Chapman,&nbsp;<strong>One-sided Interface for Matrix Operations using MPI-3 RMA: A Case Study with Elemental&nbsp;<\/strong>,&nbsp;<em>Proceedings of International Conference on Parallel Processing (ICPP 2016), to appear.&nbsp;<\/em><br><a href=\"#icpp_elemental_2016\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/icpp2016-elemental.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>U. Catalyurek, F. Dobrian, A.H. Gebremedhin, M. Halappanavar and A. Pothen,&nbsp;<strong>Distributed-memory Parallel Algorithms for Matching and Coloring<\/strong>, <em>Proceedings of IEEE International Parallel and Distributed Processing Symposium, Workshops and PhD Forums (IPDPSW), Workshop on Parallel Computing and Optimization (PCO&#8217;11), pages 1966\u20131975, 2011&nbsp;<\/em>.<br><a href=\"#ipdps_2011\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"https:\/\/eecs.wsu.edu\/~assefaw\/publications\/ipdps11.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>D. Bozdag, U. Catalyurek, A.H. Gebremedhin, F. Manne, E. Boman and F. Ozgunner,&nbsp;<strong>Distributed-memory Parallel Algorithms for Distance-2 Coloring and Related Problems in Derivative Computation<\/strong>,&nbsp;<em>SIAM Journal on Scientific Computing Vol 32, Issue 4, pp 2418\u20132446, 2010.<\/em><br><a href=\"#sisc_d3_2010\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"https:\/\/eecs.wsu.edu\/~assefaw\/publications\/ParD2Color-SISC.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>D. Bozdag, A.H. Gebremedhin, F. Manne, E. Boman and U. Catalyurek,&nbsp;<strong>A framework for Scalable Greedy Coloring on Distributed Memory Parallel Computers<\/strong>,&nbsp;<em>Journal of Parallel and Distributed Computing Vol 68, No 4, pp 515-535, 2008<\/em>.<br><a href=\"#jpdc_2008\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/jpdc.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>D. Bozdag, U. Catalyurek, A.H. Gebremedhin, F. Manne, E. G. Boman and F. Ozguner,&nbsp;<strong>A Parallel Distance-2 Graph Coloring Algorithm for Distributed Memory Computers<\/strong>,&nbsp;<em>Lecture Notes in Computer Science, vol 3726, 2005, pages 796 \u2013 806,Springer. Proc. of HPCC 2005, Sept 21 \u2013 25, 2005, Sorrento, Italy.<\/em><br><a href=\"#hpcc_2005\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/hpcc.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>E.G. Boman, D. Bozdag, U. Catalyurek, A.H. Gebremedhin and F. Manne,&nbsp;<strong>A Scalable Parallel Graph Coloring Algorithm for Distributed Memory Computers<\/strong>,&nbsp;<em>Lecture Notes in Computer Science, vol 3648 , 2005, pages 241 \u2013 251,Springer. Proc. of EuroPar 2005, 30 Aug \u2013 2 Sept, 2005, Lisboa, Portugal.<\/em><br><a href=\"#europar_2005\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/EuroPar05_coloring.pdf\">Paper in PDF<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" id=\"multi_architecture\" \/>\n\n\n\n<h3 class=\"wp-block-heading\">Multi-core and multi-threaded architectures<\/h3>\n\n\n\n<ul>\n<li>H.Lu, M. Halappanavar, D. Chavarri-a-Miranda, A.H. Gebremedhin, A. Panyala and A. Kalyanaraman,&nbsp;<strong>Algorithms for Balanced Graph Colorings with Applications in Parallel Computing<\/strong>, IEEE Transactions on Parallel and Distributed Systems 28(5), 1240\u20131256, 2017.<br><a href=\"#balanced_tpds_2017\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/BalancedColoring-TPDS17.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>S. Ghosh and A.H. Gebremedhin&nbsp;<strong>Parallelization of Bin Packing on Multicore Systems<\/strong>, 2016 IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC&#8217;16).<br><a href=\"#binpacking_hipc_2016\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/HiPC16.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>R.A. Rossi, D.F. Gleich, A.H. Gebremedhin,&nbsp;<strong>Parallel Maximum Clique Algorithms with Applications to Network Analysis&nbsp;<\/strong>, SIAM Journal on Scientific Computing, Vol 37, Issue 5, pages C589-C618, 2015.<br><a href=\"#pmc_sisc_2015\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/sisc2015-cliques.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>H.Lu, M. Halappanavar, D. Chavarri-a-Miranda, A.H. Gebremedhin and A. Kalyanaraman,&nbsp;<strong>Balanced Coloring for Parallel Computing Applications&nbsp;<\/strong>, Proceedings of IPDPS 2015.<br><a href=\"#balanced_ipdps_2015\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/balancedColoring.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>R.A. Rossi, D.F. Gleich, A.H. Gebremedhin and M.M.A Patwary,&nbsp;<strong>Fast Maximum Clique Algorithms for Large Graphs<\/strong>, Proceedings of WWW2014.<br><a href=\"#pmc_www_2014\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/cliques-WWW2014.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>U. Catalyurek, J. Feo, A.H. Gebremedhin, M. Halappanavar and A. Pothen,&nbsp;<strong>Graph Coloring Algorithms for Multi-core and Massively Multi-threaded Architectures<\/strong>, Parallel Computing 38 (2012), 576-594.<br><a href=\"#parco_2012\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/ParCo-D1Coloring.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>M.M.A. Patwary, A.H. Gebremedhin and A. Pothen,&nbsp;<strong>New Multithreaded Ordering and Coloring Algorithms for Multicore Architectures<\/strong>, In E. Jeannot, R. Namyst and J. Roman, editors, Euro-Par 2011, volume 6853 of Lecture Notes in Computer Science, pages 250\u2013262. Springer, 2011.<br><a href=\"#europar_2011\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/www.eecs.wsu.edu\/~assefaw\/publications\/EuroPar2011.pdf\">Paper in PDF<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" id=\"shared_mem_architecture\" \/>\n\n\n\n<h3 class=\"wp-block-heading\">Shared-memory and coarse-grained multiprocessors&nbsp;<\/h3>\n\n\n\n<ul>\n<li>A.H. Gebremedhin, F.Manne and T. Woods,&nbsp;<strong>Speeding up Parallel Graph Coloring,<\/strong>&nbsp;<em>Lecture Notes in Computer Science, vol 3732, pp 1079-1088, 2005, Springer. Proc. of Para 2004, June 20 \u2013 23, 2004, Lyngby, Denmark.<\/em><br><a href=\"#para_2004\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/para04.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>A. Gebremedhin, I.Guerrin-Lassous, J. Gustedt and J.A. Telle,&nbsp;<strong>Graph Coloring on Coarse Grained Multicomputers,&nbsp;<\/strong><em>Discrete Applied Mathematics, Vol 131, No 1, pp 179\u2013198, 2003.<\/em><br><a href=\"#dam_2003\">Abstract<\/a>&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/coloring-cgm-dam.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>A.H. Gebremedhin, F. Manne and A. Pothen,&nbsp;<strong>Parallel Distance-k Coloring Algorithms for Numerical Optimization,<\/strong>&nbsp;<em>In B. Monien and R. Feldmann (Eds.): EuroPar 2002, Lecture Notes in Computer Science 2400, pp. 912-921, Springer-Verlag 2002.<\/em><br><a href=\"#europar_2002\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/europar.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>A. Gebremedhin and F. Manne,&nbsp;<strong>Scalable Parallel Graph Coloring Algorithms,&nbsp;<\/strong><em>Concurrency: Practice and Expereince, Vol 12, pp1131\u20131146, 2000<\/em>.<br><a href=\"#cpe_2000\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/cpe-color.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>A.H. Gebremedhin, I.G. Lassous, J. Gustedt and J.A. Telle,&nbsp;<strong>Graph Coloring on a Coarse Grained Multiprocessor,<\/strong><em> In Brandes, Ulrik, Wagner and Dorothea (Eds.): WG 2000, Lecture Notes in Computer Science 1928, pp. 184-195, 2000, Springer-Verlag.<\/em><br><a href=\"#wg_2000\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/coloring-cgm-wg00.pdf\">Paper in PDF<\/a><\/li>\n\n\n\n<li>A.H. Gebremedhin and F. Manne,&nbsp;<strong>Parallel Graph Coloring Algorithms using OpenMP,<\/strong>&nbsp;<em>In Proc. of EWOMP&#8217;99, First European Workshop on OpenMP, Sept.30 \u2013 Oct. 1, 1999, Lund, Sweden.<\/em><br><a href=\"#ewomp_1999\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/eecs.wsu.edu\/~assefaw\/publications\/ewomp99.pdf\">Paper in PDF<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Parallel Computing Graph (network) computations are critical kernels in many algorithms in data mining, data analysis, scientific computing, computational science and engineering, etc. In large-scale applications, the graph computations need to be performed in parallel. Parallelizing graph algorithms effectively \u2014 with emphasis on scalability and performance \u2014 is particularly challenging for a variety of reasons: [&hellip;]<\/p>\n","protected":false},"author":5234,"featured_media":0,"parent":861,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"categories":[],"tags":[],"university_category":[],"location":[],"organization":[],"_links":{"self":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/893"}],"collection":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/users\/5234"}],"replies":[{"embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/comments?post=893"}],"version-history":[{"count":16,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/893\/revisions"}],"predecessor-version":[{"id":1863,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/893\/revisions\/1863"}],"up":[{"embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/861"}],"wp:attachment":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/media?parent=893"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/categories?post=893"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/tags?post=893"},{"taxonomy":"wsuwp_university_category","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/university_category?post=893"},{"taxonomy":"wsuwp_university_location","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/location?post=893"},{"taxonomy":"wsuwp_university_org","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/organization?post=893"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}