{"id":399,"date":"2019-04-15T08:08:39","date_gmt":"2019-04-15T06:08:39","guid":{"rendered":"https:\/\/www.kth.se\/blogs\/pdc\/?p=399"},"modified":"2019-04-17T13:55:38","modified_gmt":"2019-04-17T11:55:38","slug":"top500-list-a-brief-introduction","status":"publish","type":"post","link":"https:\/\/www.kth.se\/blogs\/pdc\/2019\/04\/top500-list-a-brief-introduction\/","title":{"rendered":"Top500 list: a brief introduction"},"content":{"rendered":"<div class=\"post-content-wrapper\"><p>You may have heard of the <a href=\"https:\/\/www.top500.org\/\">Top500 list<\/a>. It ranks the world\u2019s 500 most powerful supercomputers based on their performance as measured by the <a href=\"https:\/\/www.top500.org\/project\/linpack\/\">Linpack benchmark<\/a>. Published twice per year (in June and November) since 1993, the Top500 list records the development of supercomputers over the past two to three decades. In addition to performance, the Top500 list also summarises the main characteristics of the supercomputers in the list. Therefore, it contains much richer information than a mere ranking of the supercomputers by performance. In this post, we\u2019ll have a closer look at the Top500 list and relevant topics, including supercomputers, performance, and statistics on processors and co-processors.<\/p>\n<h2>Supercomputer<\/h2>\n<p>The Top500 list is all about supercomputers. It therefore makes sense to have a brief overview of supercomputers before going into the details of the Top500 list. As shown in the image below, a supercomputer usually consists of many cabinets (also called racks), which are each about the size of a fridge. Each cabinet contains a stack of blades (with each blade being about the size of a PC turned sideways). Each blade has several compute nodes mounted in it, with each compute node having one or more multicore processors. For example, PDC\u2019s <a href=\"https:\/\/www.pdc.kth.se\/hpc-services\/computing-systems\/beskow-1.737436\">Beskow<\/a> system has 11 cabinets, 515 blades, 2,060 compute nodes, and a total of 67,456 cores . Because of their extraordinary computational capability, supercomputers have been used in many fields including molecular modelling, quantum mechanics, physical simulations, and climate research. This is vividly reflected in the <a href=\"https:\/\/en.wikipedia.org\/wiki\/Titan_(supercomputer)#\/media\/File:Titan_supercomputer_at_the_Oak_Ridge_National_Laboratory.jpg\">design on the Titan supercomputer<\/a>, which is still among the world\u2019s top 10 most powerful supercomputers (as of the November 2018 Top500 list).<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-401 aligncenter\" src=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/anatomy-625x192.png\" alt=\"\" width=\"625\" height=\"192\" srcset=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/anatomy-625x192.png 625w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/anatomy-768x236.png 768w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/anatomy-900x276.png 900w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/anatomy.png 1200w\" sizes=\"auto, (max-width: 625px) 100vw, 625px\" \/><!--more--><\/p>\n<h2>FLOPS<\/h2>\n<p>The ranking of the supercomputers in the Top500 list is determined by their performance as measured in specific tests. <a href=\"https:\/\/en.wikipedia.org\/wiki\/FLOPS\">FLOPS<\/a>, or number of floating point operations per second, is a common measure of computer performance. However, the speed of a computer in FLOPS can vary depending on the number of <a href=\"https:\/\/en.wikipedia.org\/wiki\/Bit\">bits<\/a> used in the representation of the <a href=\"https:\/\/en.wikipedia.org\/wiki\/Floating-point_arithmetic\">floating point<\/a> numbers in the tests. The tests used to determine the Top500 list measure the computers&#8217; performance when working with 64-bit floating point arithmetic. This is also commonly known as \u201cdouble precision\u201d on many computer systems. Over the years, the performance of the world\u2019s fastest supercomputers have reached teraFLOPS (10<sup>12<\/sup> FLOPS) in 1997 and petaFLOPS (10<sup>15<\/sup> FLOPS) in 2008, and the next milestone is exaFLOPS (10<sup>18<\/sup> FLOPS). A supercomputer that is capable of 1 exaFLOPS or higher would be called an exascale supercomputer, and, when they become available, they are expected to contribute significantly to the advancement of science and technology.<\/p>\n<h2>Performance development<\/h2>\n<p>Some particularly useful information that is available from the Top500 list website highlights the <a href=\"https:\/\/www.top500.org\/statistics\/perfdevel\/\">performance development<\/a> of supercomputer systems. As shown in the figure below, the performance development roughly follows a straight line, indicating that there has been an exponential increase in the performance of supercomputers (since the vertical axis has a logarithmic scale). Note that the triangles correspond to the very top system each year, the squares correspond to the 500<sup>th<\/sup> supercomputer, and the circles denote the performance sum of all the systems in the list. Based on such exponential growth in supercomputing power, one can hope that the world\u2019s fastest supercomputer will achieve exascale performance around 2021.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-422 size-medium\" src=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/perfdevel-e1553879027759-625x539.png\" alt=\"\" width=\"625\" height=\"539\" srcset=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/perfdevel-e1553879027759-625x539.png 625w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/perfdevel-e1553879027759-768x662.png 768w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/perfdevel-e1553879027759-900x776.png 900w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/perfdevel-e1553879027759.png 1176w\" sizes=\"auto, (max-width: 625px) 100vw, 625px\" \/><\/p>\n<p style=\"text-align: center\">Exponential growth of supercomputing power<br \/>\nSource: <a href=\"https:\/\/www.top500.org\/statistics\/perfdevel\/\">https:\/\/www.top500.org\/statistics\/perfdevel\/<\/a><\/p>\n<p>An interesting observation is the somewhat \u201cstepwise\u201d development in the performance of the top machine (indicated by the triangles in the figure). This means that a top machine usually stays at the top for a certain amount of time before it is surpassed by a newer and more powerful one. For instance, the <a href=\"https:\/\/www.top500.org\/system\/177999\">Tianhe-2A<\/a> machine stayed at the top for a total of three years (from 2013 to 2015). It remains the 4<sup>th<\/sup> fastest machine as of November 2018.<\/p>\n<p>The 500<sup>th<\/sup> machine in the list follows a smoother development of performance (as shown by the squares in the figure), and is roughly 100 times slower compared to the top machine. You might have noticed that there is a turning point in 2008 where the increase in performance of the 500<sup>th<\/sup> machine slowed down slightly. In the paper <a href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/7328648\">\u201cThe TOP500 List and Progress in High-Performance Computing\u201d<\/a>, the authors suggested that the faster development in the performance of supercomputers prior to 2008 can be attributed to the increase in both the size of supercomputers (in terms of the number of processors) and in the performance of processor cores. After 2008, the increase in supercomputer size slowed down while the performance development was maintained thanks to the increasing number of cores in the multicore processors, according to Figures 2 and 3 in <a href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/7328648\">the paper<\/a>.<\/p>\n<h2>The Top500 list (Nov 2018)<\/h2>\n<p>At the time of writing, the latest Top500 list is from November 2018. As shown in the figure below, the performances in the Top500 list are listed as R<sub>max<\/sub>\u00a0and R<sub>peak<\/sub>. Note that R<sub>max<\/sub> is the performance that was actually measured and it is the figure that is used as the ranking criterion in the Top500 list. R<sub>peak<\/sub>, on the other hand, is the theoretical peak performance of a particular system which is calculated based on the number and type of processors. On modern supercomputers, R<sub>max<\/sub> is typically around 70% of R<sub>peak<\/sub>. Taking <a href=\"https:\/\/www.olcf.ornl.gov\/olcf-resources\/compute-systems\/summit\/\">Summit<\/a>\u00a0for example, its R<sub>max<\/sub> is 0.14 exaFLOPS while its R<sub>peak<\/sub> reaches 0.2 exaFLOPS.<\/p>\n<p><img decoding=\"async\" class=\" wp-image-423 aligncenter\" src=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top5-625x380.png\" alt=\"\" width=\"666\" srcset=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top5-625x380.png 625w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top5-768x467.png 768w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top5-900x547.png 900w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top5.png 1200w\" sizes=\"(max-width: 625px) 100vw, 625px\" \/><\/p>\n<p style=\"text-align: center\"><img decoding=\"async\" class=\" wp-image-425 aligncenter\" src=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top10-625x335.png\" alt=\"\" width=\"665\" srcset=\"https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top10-625x335.png 625w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top10-768x412.png 768w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top10-900x483.png 900w, https:\/\/www.kth.se\/blogs\/pdc\/files\/2019\/03\/top10.png 1200w\" sizes=\"(max-width: 625px) 100vw, 625px\" \/><span style=\"font-size: 1.125rem\">Top 10 supercomputers as of November 2018<\/span><br \/>\nSource: <a href=\"https:\/\/www.top500.org\/lists\/2018\/11\/\">https:\/\/www.top500.org\/lists\/2018\/11\/<\/a><\/p>\n<p>Among the current top 10 machines, five are in the United States, two in China, and the remaining three are in Switzerland, Japan and Germany respectively. Five of the top 10 machines use NVIDIA GPUs for acceleration. In particular, the top machine, <a href=\"https:\/\/www.olcf.ornl.gov\/olcf-resources\/compute-systems\/summit\/\">Summit<\/a>, is equipped with 27,648 NVIDIA Volta V100 cards. It is worth noting that Summit is also the world\u2019s 3<sup>rd<\/sup> most energy-efficient machine, according to the <a href=\"https:\/\/www.top500.org\/green500\/lists\/2018\/11\/\">Green500 list<\/a>. Two other machines, <a href=\"https:\/\/hpc.llnl.gov\/hardware\/platforms\/sierra\">Sierra<\/a> and <a href=\"https:\/\/abci.ai\/en\/about_abci\/computing_resource.html\">ABCI<\/a>, are also among the top 10 in both the Top500 list and the Green500 list.<\/p>\n<p class=\"p1\">The <a href=\"https:\/\/www.top500.org\/green500\/lists\/2018\/11\/\">Green500 list<\/a> ranks the most energy-efficient supercomputers. It is significant because reducing energy consumption is one of the major challenges being faced in working towards exascale computing. The power constraint for a potential exascale supercomputer (that is, the maximum amount of power that such a system could use) is usually 20-30 megawatts, according to a <a href=\"https:\/\/www.hipeac.net\/news\/6895\/powerstack-a-global-response-to-the-power-management-problem-for-exascale\/\">recent article<\/a> from the HiPEAC (High Performance and Embedded Architecture and Compilation) project. This means that the power efficiency of an exascale supercomputer has to reach 33 gigaFLOPS per watt, which is around twice as efficient as the <a href=\"https:\/\/www.top500.org\/system\/179165\">Shoubu system B<\/a>, which is currently the most energy-efficient system in the Green500 list (November 2018).<\/p>\n<h2>Statistics for (co-)processors<\/h2>\n<p>The Top500 list website also provides <a href=\"https:\/\/www.top500.org\/statistics\/list\/\">list statistics<\/a> for different categories like processor generations, geographical regions, operating systems, and so on. Based on this information, we summarised the processor and accelerator\/co-processor families in terms of the number of supercomputers and the sum of performances, as shown in Tables 1 and 2. It can be seen from the two tables that Intel Xeon processors (including Skylake, Broadwell, Haswell, and IvyBridge) and NVIDIA GPUs (including Volta, Pascal, Kepler, and Fermi) are dominant in the Top500 supercomputers as of November 2018. Such dominance is in both the number of supercomputers and the sum of performances. However, if we look at the most powerful machines (that is, the top 10),\u00a0 the processors and accelerators\/co-processors have much more diverse origins. This leads to some uncertainty in what the world\u2019s first exascale supercomputer will look like.<\/p>\n<p style=\"text-align: center\"><strong>Table 1:<\/strong>\u00a0Processor statistics as of November 2018.<br \/>\nCalculated from\u00a0<a href=\"https:\/\/www.top500.org\/statistics\/list\/\">Top500 list statistics<\/a><\/p>\n<table style=\"width: 100%\">\n<tbody>\n<tr>\n<th style=\"text-align: center\">Processor family<\/th>\n<th style=\"text-align: center\">System count<\/th>\n<th style=\"text-align: center\">Performance sum<br \/>\n(petaFLOPS)<\/th>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Intel Xeon<\/td>\n<td style=\"text-align: center\">476<\/td>\n<td style=\"text-align: center\">984.41<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">IBM Power<\/td>\n<td style=\"text-align: center\">13<\/td>\n<td style=\"text-align: center\">291.99<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Sunway\/Shenwei<\/td>\n<td style=\"text-align: center\">1<\/td>\n<td style=\"text-align: center\">93.02<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Fujitsu SPARC64<\/td>\n<td style=\"text-align: center\">6<\/td>\n<td style=\"text-align: center\">20.93<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">AMD Opteron<\/td>\n<td style=\"text-align: center\">2<\/td>\n<td style=\"text-align: center\">18.76<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Hygon Dhyana<\/td>\n<td style=\"text-align: center\">1<\/td>\n<td style=\"text-align: center\">4.33<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Cavium ThunderX2<\/td>\n<td style=\"text-align: center\">1<\/td>\n<td style=\"text-align: center\">1.53<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p style=\"text-align: center\"><strong>Table 2:<\/strong>\u00a0Accelerator\/Co-processor\u00a0statistics as of November 2018.<br \/>\nCalculated from <a href=\"https:\/\/www.top500.org\/statistics\/list\/\">Top500 list statistics<\/a><\/p>\n<table style=\"width: 100%\">\n<tbody>\n<tr>\n<th style=\"text-align: center\">Accelerator\/Co-processor<br \/>\nfamily<\/th>\n<th style=\"text-align: center\">System count<\/th>\n<th style=\"text-align: center\">Performance sum<br \/>\n(petaFLOPS)<\/th>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">NVIDIA GPU<\/td>\n<td style=\"text-align: center\">128<\/td>\n<td style=\"text-align: center\">507.86<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Matrix-2000<\/td>\n<td style=\"text-align: center\">1<\/td>\n<td style=\"text-align: center\">61.44<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Intel Xeon Phi<\/td>\n<td style=\"text-align: center\">8<\/td>\n<td style=\"text-align: center\">13.76<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">Deep Computing Processor<\/td>\n<td style=\"text-align: center\">1<\/td>\n<td style=\"text-align: center\">4.33<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center\">PEZY-SC series<\/td>\n<td style=\"text-align: center\">2<\/td>\n<td style=\"text-align: center\">2.06<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Summary<\/h2>\n<p>We have briefly introduced the <a href=\"https:\/\/www.top500.org\/\">Top500 list<\/a> and discussed relevant topics. Some key points are summarised below.<\/p>\n<ul>\n<li>The Top500 list ranks supercomputers based on their performance, and is published twice a year (in June and November).<\/li>\n<li>The performance, R<sub>max<\/sub>, is measured by the Linpack benchmark with 64-bit floating point arithmetic.<\/li>\n<li>The next milestone in supercomputer performance is 1 exaFLOPS, or 10<sup>18<\/sup> floating point operations per second.<\/li>\n<li>The development of\u00a0performance roughly follows an exponential growth.<\/li>\n<li>As of November 2018, the top 10 supercomputers are in the United States, China, Switzerland, Japan and Germany.<\/li>\n<li>Processors and accelerators\/co-processors in the most powerful supercomputers have diverse origins.<\/li>\n<\/ul>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>You may have heard of the Top500 list. It ranks the world\u2019s 500 most powerful supercomputers based on their performance as measured by the Linpack benchmark. Published twice per year (in June and November) since 1993, the Top500 list records the development of supercomputers over the past two to three decades. In addition to performance, [&hellip;]<\/p>\n","protected":false},"author":1140,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[4,5],"tags":[22,23],"class_list":["post-399","post","type-post","status-publish","format-standard","hentry","category-benchmark","category-performance","tag-exascale","tag-supercomputer"],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p9W9Im-6r","_links":{"self":[{"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/posts\/399","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/users\/1140"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/comments?post=399"}],"version-history":[{"count":44,"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/posts\/399\/revisions"}],"predecessor-version":[{"id":466,"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/posts\/399\/revisions\/466"}],"wp:attachment":[{"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/media?parent=399"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/categories?post=399"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.kth.se\/blogs\/pdc\/wp-json\/wp\/v2\/tags?post=399"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}