Stories
Slash Boxes
Comments

SoylentNews is people

posted by martyb on Tuesday June 20 2017, @03:34PM   Printer-friendly
from the Is-that-a-Cray-in-your-pocket? dept.

A new list was published on top500.org. It might be noteworthy that the NSA, Google, Amazon, Microsoft etc. are not submitting information to this list. Currently, the top two places are occupied by China, with a comfortable 400% head-start in peak-performance and 370% Rmax performance to the 3rd place (Switzerland). US appears on rank 4, Japan on rank 7, and Germany is not in the top ten at all.

All operating systems in the top-10 are Linux and derivates. It seems obvious that, since it is highly optimized hardware, only operating systems are viable which can be fine-tune (so, either open source or with vendor-support for such customizations). Still I would have thought that, since a lot of effort needs to be invested anyway, maybe other systems (BSD?) could be equally suited to the task.

RankSiteSystemCoresRmax (TFlop/s)Rpeak (TFlop/s)Power (kW)
1China: National Supercomputing Center in WuxiSunway TaihuLight - Sunway MPP, Sunway SW26010 260C 1.45GHz, Sunway - NRCPC10,649,60093,014.6125,435.915,371
2China: National Super Computer Center in GuangzhouTianhe-2 (MilkyWay-2) - TH-IVB-FEP Cluster, Intel Xeon E5-2692 12C 2.200GHz, TH Express-2, Intel Xeon Phi 31S1P - NUDT3,120,00033,862.754,902.417,808
3Switzerland: Swiss National Supercomputing Centre (CSCS)Piz Daint - Cray XC50, Xeon E5-2690v3 12C 2.6GHz, Aries interconnect , NVIDIA Tesla P100 - Cray Inc.361,76019,590.025,326.32,272
4U.S.: DOE/SC/Oak Ridge National LaboratoryTitan - Cray XK7, Opteron 6274 16C 2.200GHz, Cray Gemini interconnect, NVIDIA K20x - Cray Inc.560,64017,590.027,112.58,209
5U.S.: DOE/NNSA/LLNLSequoia - BlueGene/Q, Power BQC 16C 1.60 GHz, Custom - IBM1,572,86417,173.220,132.77,890
6U.S.: DOE/SC/LBNL/NERSCCori - Cray XC40, Intel Xeon Phi 7250 68C 1.4GHz, Aries interconnect - Cray Inc.622,33614,014.727,880.73,939
7Japan: Joint Center for Advanced High Performance ComputingOakforest-PACS - PRIMERGY CX1640 M1, Intel Xeon Phi 7250 68C 1.4GHz, Intel Omni-Path - Fujitsu556,10413,554.624,913.52,719
8Japan: RIKEN Advanced Institute for Computational Science (AICS)K computer, SPARC64 VIIIfx 2.0GHz, Tofu interconnect - Fujitsu705,02410,510.011,280.412,660
9U.S.: DOE/SC/Argonne National LaboratoryMira - BlueGene/Q, Power BQC 16C 1.60GHz, Custom - IBM786,4328,586.610,066.33,945
10U.S.: DOE/NNSA/LANL/SNLTrinity - Cray XC40, Xeon E5-2698v3 16C 2.3GHz, Aries interconnect - Cray Inc.301,0568,100.911,078.94,233

takyon: TSUBAME3.0 leads the Green500 list with 14.110 gigaflops per Watt. Piz Daint is #3 on the TOP500 and #6 on the Green500 list, at 10.398 gigaflops per Watt.

According to TOP500, this is only the second time in the history of the list that the U.S. has not secured one of the top 3 positions.

The #100 and #500 positions on June 2017's list have an Rmax of 1.193 petaflops and 432.2 teraflops respectively. Compare to 1.0733 petaflops and 349.3 teraflops for the November 2016 list.

[Update: Historical lists can be found on https://www.top500.org/lists/. There was a time when you only needed 0.4 gigaflops to make the original Top500 list — how do today's mobile phones compare? --martyb]


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 1, Informative) by Anonymous Coward on Wednesday June 21 2017, @12:27AM

    by Anonymous Coward on Wednesday June 21 2017, @12:27AM (#528807)

    What's your take on the new storage and memory technologies?
    Examples:
    High Bandwidth Memory
    Hybrid Memory Cube
    GDDR6/GDDR5X/DDR5
    3D QLC NAND
    NAND with 64-96 layers
    Intel/Micron 3D XPoint (the only significant post-NAND technology to make it to market)

    and last but probably least,

    helium-filled shingled magnetic recording hard drives (because HAMR is nowhere to be found)

    Memory with a high bandwidth (and a low latency) is becoming more and more critical in HPC. CPU core performance has, for some time now, been increasing faster than memory performance. That is, it's getting harder and harder to keep the cores fed with data. This is often more true for HPC (high-performance computing) workloads than for workloads in many other spaces. Now, there are different technologies which attempt to deliver this bandwidth (don't forget latency) as you point out. I wish I could say more here, but I can't due to NDA concerns. However, perhaps I could paint a very rough picture from publicly available information. HBM ("High Bandwidth Memory") is good stuff compared to DDR, and HBM2+ is better. There are some concerns here, as the HMB stacks use a very wide, parallel bus, and need to be placed very close to the CPU/SoC die. I think we're talking about distances on the order of 1mm or so. There's only so much room close to a CPU/SoC die to place HBM stacks, given this distance requirement. Don't forget that the HBM stacks and the CPU/SoC die probably have to share a (likely silicon) interposer. The nice thing about HMC ("Hybrid Memory Cube"), is that it uses a serial, rather than a parallel interface like HBM. Thus, HMC can be placed further away from the CPU/SoC die(s), and this can increase total memory capacity. The issue here is, you then pay for this extra capacity in terms of latency, as you have to introduce a SerDes step, etc. Also, one needs to think about power consumption: consider the amount of power, on average, that it takes to move one bit of data to/from memory with HBM vs. HMC; think pico Joules per bit. -- I can't speak much to the GDDRX or NAND stuff myself. XPoint sure sounds interesting, but there has been a lot of hype there. I wonder how this will turn out in the near to mid future. Again, keep an eye on power consumption there, this will limit the solution space XPoint can compete in. I don't know much about helium-filled drives either, sorry.

    Starting Score:    0  points
    Moderation   +1  
       Informative=1, Total=1
    Extra 'Informative' Modifier   0  

    Total Score:   1