Wu Feng, co-founder of the Green500, talks about the challenges to reach exascale through energy efficient super computing, with massive parallel processing at the Denver Supercomputing 2017 conference. Wu Feng is a Professor and Turner Fellow of Computer Science with additional appointments in Electrical & Computer Engineering, Health Sciences, and Biomedical Engineering and Mechanics at Virginia Tech (VT). At VT, he directs the Synergy Laboratory, which conducts research at the synergistic intersection of systems software, middleware, and application software; of particular note is his high-performance computing (HPC) research in the areas of green supercomputing, accelerator-based parallel computing, and bioinformatics. Prior to joining VT, he spent seven years at Los Alamos National Laboratory, where he began his journey in green supercomputing in 2001 with Green Destiny, a 240-node supercomputer in 5 square feet and consuming only 3.2 kW of power when booted diskless. This work ultimately created the impetus for the Green500.
Filmed in 4K60 at Supercomputing 2017 in Denver using Panasonic GH5 ($1999 at Amazon.com) on firmware 2.1 (aperture priority, AF continuous tracking) with Leica 12mm f1.4 ($1297 at Amazon.com) with Sennheiser MKE440 stereo shotgun microphone ($325 at Amazon.com), get $25 off renting cameras and lenses with my referral link at https://share.lensrentals.com/x/wWbHqV
Around the year 2000, the convergence on Linux and commodity x86_64 processors provided a homogeneous scientific computing platform which enabled the construction of the Worldwide LHC Computing Grid (WLCG) for LHC data processing. This allowed the High Energy Physics (HEP) community to use a homogeneous software model utilizing the x86_64 architecture. LHC experiments at CERN, in particular ATLAS and CMS, started investigating ARMv8 64-bit (AArch64) architecture for HEP needs. A journey which started in 2013. The LHC community faces a great challenge regarding computing needs in 10 years and has started exploring public clouds, volunteer computing (e.g., LHC@home) and HPC facilities to increase peak computation capacity. This talk will contain information about future (a timeline of 10 years) computation needs for LHC experiments and the more recent progress done by ATLAS, CernVM and CMS teams on using ARMv8 64-bit/AArch64.
You can watch the keynote by David Abdurachmanov and Jakob Blomer here:
David Rusling, Linaro CTO talks Trebble, Servers, HPC, Tiny Linux IoTL, Automotive, Machine Learning
David Rusling says this has been the best Linaro Connect for him thus far in the 7 years since Linaro was started. He talks about how Google recognizes the part Linaro can play to help with Project Trebble, to help keep longer term support for each LTS kernel release also as part of the Linaro Mobile Group. The Linaro Enterprise Day showed how far Linaro has gotten to with all the work coming together towards ARM Servers taking market share in the server market. Kanta Vekaria works towards Linaro's involvment with High Performance Computing (HPC) as she talked about in her keynote Nicolas Pitre is working on making the Internet of Tiny Linux (IoTL) to make Linux suitable for IoT you can see his talk here persuading the kernel developers that making changes that benefit the embedded market. Linaro is very active with Zephyr which is kind of the Linux Kernel of the embedded world, working on it in in the Linaro IoT & Embedded Group (LITE). Talking about the establishment of the Open Source Foundries spin-off of Linaro where they can pursue business opportunities to work more closely together with customers who need help implementing open source on ARM solutions such as the IoT solutions shown in this video also introducing the Associate Membership Level for smaller members such as small to medium companies and Universities to be able to join Linaro in the coming months trying to involve everyone in the open source ecosystem. Linaro also is looking into getting involved with open source for the Automotive market possibly related to the software needed for self-driving cars and more. Linaro getting involved with open source for artificial intelligence, machine learning. You can see my previous videos with David Rusling over the past 5 years here.
Jon Masters says Moores Law may have come to an end and that single threaded performance is not defining the industry anymore because it's not increasing at the same rate that it used to. What is defining the future of the industry is machine learning, accelerators, lots of additional workload optimization that is happening outside of the core. Thus he believes ARM has an opportunity to get into the mainstream server space in the next 12-18 months with the newest powerful ARM Server solutions such as the Cavium ThunderX2 and the Qualcomm Centriq 2400. You can see some of my previous Jon Masters interviews over the past 5 years here.
The MIT Supply Response Supercomputing Lab has been investigating opportunities to get cycles when they are cheapest, either through an innovative sensor system that utilizes a hyperlocal weather monitoring application that watches clouds, or a clever scraping of PUC utility websites to ramp compute resources up when electricity is inexpensive. They are currently testing a number of projects that are based around ARM and utilizes every bit of the energy-aware programmability of big.LITTLE and Slurm Workload Manager.
#DIV/0! Is their Solar-Powered Supercomputing cluster. It is named for the error they got in Excel when they tried to calculate their performance per dollar.
They maintain the Debian ports of every HPC code they can get their hands on (please send some along if you have additions).
IoTNet is the network in Boston and Cambridge which only handles IoT comms. It is low bandwidth, high latency and lossy which they are hoping will keep humans, with their real-time protocols, off. Machines and CPS like it because it is asynchronous, asymmetric and low power. If you have a key dongle for your car you are probably already using the TTN in your city.
Interested parties can contact them at MITARM@mit.edu
This is the Qualcomm 48-core (custom Qualcomm Falkor cores) Centriq 2400 ARM Server reference evaluation board. Featuring 48-cores with 12 DIMM slots of DDR4 RAM memory, dual PCI riser boards for fully customizable setup. At Linaro Connect SFO17, the ARM team is demonstrating an ELK big data demo with ElasticSEarch, Logstash and Kibana for graphic visualization running on Qualcomm Centriq 2400 processor. The Qualcomm Centriq 2400 processor is designed for cloud computing running these microservices in containers seamlessly. There is a 2nd demo of Linuxkit with NGINX running in a container. Both demos use Docker containers running on Ubuntu 16.04.
Daniel Thompson of Linaro talks about the upcoming new ARM Developerbox that was announced at Linaro Connect SFO17. The box is a 24-core ARM Cortex-A53 low-power workstation allowing software development for ARM to be done on ARM. The board is a collaboration between Socionext, Gigabyte and Linaro and is the first 96Boards Enterprise Edition platform to exploit microATX form-factor and feels like a normal PC motherboard, right up to the row of three PCIe slots that are available.
Following his keynote to introduce Socionext and the recently announced multi-core desktop Arm developer platform Socionext Chairman and CEO Yasuo Nishiguchi PhD talks about servers, SoCs and the company's new CPU-to-CPU communication technology DDT (Direct Data Transaction), and looks forward to the launch of the new desktop developer platform in the December-January time frame.
Socionext presents SynQuacer SC2A11 cloud ARM server based on 24-core ARM Cortex-A53 highly integrated low-power server system suitable for edge computing which processes data at the edge of the cloud in the IoT era. Socionext develop high power efficiency processor chip, and it is 60% power reduction on same performance compare to conventional chip. Also it is very good performance in parallel processing, like hadoop. This high power efficiency processor chip will be suitable for various application in the IoT era.
Gigabyte shows a prototype of their upcoming 2U dual socket Cavium ThunderX2 ARM Powered Server optimized form factor with customization options like using AMD or Nvidia GPGPU Cards support, very high IO bandwidth expansions, storage, a lot of RAM, among many other optimizations and features, discusses all the considerations that they are having with their customers for the design, how Gigabyte can be creative about enabling higher value to their customers who many are now looking forward to launch ARM Servers as the ThunderX2 might have a fast enough performance, and they are aiming to provide a value proposition to be the system of choice for large cloud companies and for small companies also. Also working with Redhat, Suse and Linaro, bringing up the ecosystem to enable the cloud server and high performance computing market.