Sep
12
2018
--

Nvidia launches the Tesla T4, its fastest data center inferencing platform yet

Nvidia today announced its new GPU for machine learning and inferencing in the data center. The new Tesla T4 GPUs (where the ‘T’ stands for Nvidia’s new Turing architecture) are the successors to the current batch of P4 GPUs that virtually every major cloud computing provider now offers. Google, Nvidia said, will be among the first to bring the new T4 GPUs to its Cloud Platform.

Nvidia argues that the T4s are significantly faster than the P4s. For language inferencing, for example, the T4 is 34 times faster than using a CPU and more than 3.5 times faster than the P4. Peak performance for the P4 is 260 TOPS for 4-bit integer operations and 65 TOPS for floating point operations. The T4 sits on a standard low-profile 75 watt PCI-e card.

What’s most important, though, is that Nvidia designed these chips specifically for AI inferencing. “What makes Tesla T4 such an efficient GPU for inferencing is the new Turing tensor core,” said Ian Buck, Nvidia’s VP and GM of its Tesla data center business. “[Nvidia CEO] Jensen [Huang] already talked about the Tensor core and what it can do for gaming and rendering and for AI, but for inferencing — that’s what it’s designed for.” In total, the chip features 320 Turing Tensor cores and 2,560 CUDA cores.

In addition to the new chip, Nvidia is also launching a refresh of its TensorRT software for optimizing deep learning models. This new version also includes the TensorRT inference server, a fully containerized microservice for data center inferencing that plugs seamlessly into an existing Kubernetes infrastructure.

 

 

Apr
07
2016
--

OpenStack’s Mitaka release focuses on manageability and user experience

DSC09941 (1) The OpenStack Foundation today launched Mitaka, the thirteenth release of its open source enterprise cloud platform. In many ways, this new release shows the growing maturity of the project, which was originally incubated in 2010 by Rackspace and NASA. Instead of lots of major feature additions (though there are still plenty of those), the focus for this release was on making the platform… Read More

Feb
24
2015
--

Mirantis Partners With Google To Bring Kubernetes To OpenStack

gorch_fock_wheel Mirantis, a major player in the OpenStack ecosystem, today announced that it has partnered with Google to bring support for Kubernetes, Google’s open-source project for managing containerized applications, to the OpenStack project. This new project uses OpenStack’s Murano application catalog to make it easier to deploy and configure these new Kubernetes-based clusters and their… Read More

Powered by WordPress | Theme: Aeros 2.0 by TheBuckmaker.com