SEATTLE and BARCELONA, Spain , Dec. 07, 2016 -- At the 2016 Neural Information Processing Systems (NIPS) Conference in Barcelona, Spain, global supercomputer leader Cray Inc. (Nasdaq:CRAY) today announced the results of a deep learning collaboration between Cray, Microsoft, and the Swiss National Supercomputing Centre (CSCS) that expands the horizons of running deep learning algorithms at scale using the power of Cray supercomputers.
Running larger deep learning models is a path to new scientific possibilities, but conventional systems and architectures limit the problems that can be addressed, as models take too long to train. Cray worked with Microsoft and CSCS, a world-class scientific computing center, to leverage their decades of high performance computing expertise to profoundly scale the Microsoft Cognitive Toolkit (formerly CNTK) on a Cray® XC50™ supercomputer at CSCS nicknamed “Piz Daint”.
By accelerating the training process, instead of waiting weeks or months for results, data scientists can obtain results within hours or even minutes. With the introduction of supercomputing architectures and technologies to deep learning frameworks, customers now have the ability to solve a whole new class of problems, such as moving from image recognition to video recognition, and from simple speech recognition to natural language processing with context.
Deep learning problems share algorithmic similarities with applications traditionally run on a massively parallel supercomputer. By optimizing inter-node communication using the Cray® XC™ Aries network and a high performance MPI library, each training job can leverage significantly more compute resources – reducing the time required to train an individual model.
“Cray’s proficiency in performance analysis and profiling, combined with the unique architecture of the XC systems, allowed us to bring deep learning problems to our Piz Daint system and scale them in a way that nobody else has,” said Prof. Dr. Thomas C. Schulthess, director of the Swiss National Supercomputing Centre (CSCS). “What is most exciting is that our researchers and scientists will now be able to use our existing Cray XC supercomputer to take on a new class of deep learning problems that were previously infeasible.”
“Applying a supercomputing approach to optimize deep learning workloads represents a powerful breakthrough for training and evaluating deep learning algorithms at scale,” said Dr. Xuedong Huang, distinguished engineer, Microsoft AI and Research. “Our collaboration with Cray and CSCS has demonstrated how the Microsoft Cognitive Toolkit can be used to push the boundaries of deep learning.”
A team of experts from Cray, Microsoft, and CSCS have scaled the Microsoft Cognitive Toolkit to more than 1,000 NVIDIA® Tesla® P100 GPU accelerators on the Cray XC50 supercomputer at CSCS. The result of this deep learning collaboration opens the door for researchers to run larger, more complex, and multi-layered deep learning workloads at scale, harnessing the performance of a Cray supercomputer.
To simplify the building and deploying of deep learning environments in supercomputing, Cray is supporting its Cray XC customers with deep learning toolkits, such as the Microsoft Cognitive Toolkit, that allow customers to run deep learning applications at their fullest potential – at scale on a Cray supercomputer. Fusing high performance computing capability with deep learning is another step forward in Cray’s vision of the convergence of supercomputing and big data.
“Only Cray can bring the combination of supercomputing technologies, supercomputing best practices, and expertise in performance optimization to scale deep learning problems,” said Dr. Mark S. Staveley, Cray’s director of deep learning and machine learning. “We are working to unlock possibilities around new approaches and model sizes, turning the dreams and theories of scientists into something real that they can explore. Our collaboration with Microsoft and CSCS is a game changer for what can be accomplished using deep learning.”
For more information on Cray’s machine learning and deep learning solutions and the Cray XC series of supercomputers, and please visit the Cray website at www.cray.com.
About Cray Inc.
Global supercomputing leader Cray Inc. (Nasdaq:CRAY) provides innovative systems and solutions enabling scientists and engineers in industry, academia and government to meet existing and future simulation and analytics challenges. Leveraging more than 40 years of experience in developing and servicing the world’s most advanced supercomputers, Cray offers a comprehensive portfolio of supercomputers and big data storage and analytics solutions delivering unrivaled performance, efficiency and scalability. Cray’s Adaptive Supercomputing vision is focused on delivering innovative next-generation products that integrate diverse processing technologies into a unified architecture, allowing customers to meet the market’s continued demand for realized performance. Go to www.cray.com for more information.
Cray, and the stylized CRAY mark are registered trademarks of Cray Inc. in the United States and other countries, and XC50 and XC are trademarks of Cray Inc. Other product and service names mentioned herein are the trademarks of their respective owners.
Cray Media:
Nick Davis
206/701-2123
[email protected]
Cray Investors:
Paul Hiemstra
206/701-2044
[email protected]


Apple Turns 50: From Garage Startup to AI Crossroads
Britain Courts Anthropic Amid US Defense Department Dispute
OpenAI Executive Shake-Up Ahead of Anticipated 2026 IPO
Samsung Electronics Eyes Record Q1 Profit Amid AI-Driven Chip Boom
Norma Group Posts Revenue Decline in 2025, Eyes Modest Recovery in 2026
Annie Altman Amends Sexual Abuse Lawsuit Against OpenAI CEO Sam Altman
First Western Ship Transits Strait of Hormuz Since Iran War Began
CTOC Adds 3,000 Doctors, 500 Hospitals Ahead of Liquidity Push
MATCH Act Targets ASML and Chinese Chipmakers in New U.S. Export Crackdown
TSMC Japan's Second Fab to Produce 3nm Chips by 2028
UAE's Largest Natural Gas Facility Suspended After Attack-Triggered Fire
Jefferies Upgrades Sodexo to Buy With €55 Target After Historic CEO Appointment
Nike Beats Q3 Estimates but China Weakness and Margin Pressure Weigh on Outlook
UPS and Teamsters Reach Agreement to Limit Driver Severance Program
SoftwareONE Posts 22.5% Revenue Surge in 2025 on Crayon Acquisition
Microsoft's $10 Billion Japan Investment: AI Infrastructure and Data Sovereignty Push
Microsoft Eyes $7B Texas Energy Deal to Power AI Data Centers 



