Categories
Misc

NVIDIA Speech AI Models Deliver Industry-Leading Accuracy and Performance

NVIDIA is driving state-of-the-art performance, efficiency, and accessibility in both speech AI and language models, setting the stage for innovations that are…

NVIDIA is driving state-of-the-art performance, efficiency, and accessibility in both speech AI and language models, setting the stage for innovations that are redefining what’s possible in automatic speech recognition (ASR). NVIDIA Parakeet TDT 0.6B v2 is a 600-million-parameter automatic speech recognition (ASR) model designed for high-quality English transcription. It is currently ranked #

Source

Categories
Misc

NVIDIA Blackwell Delivers up to 2.6x Higher Performance in MLPerf Training v5.0

The journey to create a state-of-the-art large language model (LLM) begins with a process called pretraining. Pretraining a state-of-the-art model is…

The journey to create a state-of-the-art large language model (LLM) begins with a process called pretraining. Pretraining a state-of-the-art model is computationally demanding, with popular open-weights models featuring tens to hundreds of billions parameters and trained using trillions of tokens. As model intelligence grows with increasing model parameter count and training dataset size…

Source

Categories
Misc

Reproducing NVIDIA MLPerf v5.0 Training Scores for LLM Benchmarks

The previous post, NVIDIA Blackwell Delivers up to 2.6x Higher Performance in MLPerf Training v5.0, explains how the NVIDIA platform delivered the fastest time…

The previous post, NVIDIA Blackwell Delivers up to 2.6x Higher Performance in MLPerf Training v5.0, explains how the NVIDIA platform delivered the fastest time to train across all seven benchmarks in this latest MLPerf round. This post provides a guide to reproduce the performance of NVIDIA MLPerf v5.0 submissions of Llama 2 70B LoRA fine-tuning and Llama 405B pretraining.

Source

Categories
Misc

How 1X Technologies’ Robots Are Learning to Lend a Helping Hand

Humans learn the norms, values and behaviors of society from each other — and Bernt Børnich, founder and CEO of 1X Technologies, thinks robots should learn like this, too. “For robots to be truly intelligent and show nuances like being careful around your pet, holding the door open for an elderly person and generally behaving
Read Article

Categories
Misc

Maximizing OpenMM Molecular Dynamics Throughput with NVIDIA Multi-Process Service

Molecular dynamics (MD) simulations model atomic interactions over time and require significant computational power. However, many simulations have small…

Source

Categories
Misc

Streamline Trade Capture and Evaluation with Self-Correcting AI Workflows

An illustration of a female sitting at a computer looking at trade trends.The success of LLMs in chat and digital assistant applications is sparking high expectations for their potential in business process automation. While achieving…An illustration of a female sitting at a computer looking at trade trends.

The success of LLMs in chat and digital assistant applications is sparking high expectations for their potential in business process automation. While achieving human-level reliability in such workflows has been challenging, it has highlighted key areas for improvement and fueled ongoing innovation. Despite reliability challenges, there’s tremendous business potential in automating workflows…

Source

Categories
Misc

Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training

A decorative image.With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision…A decorative image.

With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision training, which strategically employs lower precision formats like brain floating point 16 (BF16) for computationally intensive operations while retaining the stability of 32-bit floating-point (FP32) where needed, has been a key strategy for…

Source

Categories
Misc

NVIDIA Blackwell Delivers Breakthrough Performance in Latest MLPerf Training Results

NVIDIA is working with companies worldwide to build out AI factories — speeding the training and deployment of next-generation AI applications that use the latest advancements in training and inference. The NVIDIA Blackwell architecture is built to meet the heightened performance requirements of these new applications. In the latest round of MLPerf Training — the
Read Article

Categories
Misc

NVIDIA RTX Blackwell GPUs Accelerate Professional-Grade Video Editing

4:2:2 cameras — capable of capturing double the color information compared with most standard cameras — are becoming widely available for consumers. At the same time, generative AI video models are rapidly increasing in functionality and quality, making new tools and workflows possible. NVIDIA RTX GPUs based on the NVIDIA Blackwell architecture include dedicated hardware
Read Article

Categories
Misc

CodeAgents + Structure: A Better Way to Execute Actions