5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse

In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up to 14x on x86-based NVIDIA H100 Tensor Core GPUs and 28x on the NVIDIA GH200 Superchip. In this post, we shed light on KV cache reuse techniques and best practices that can drive even further TTFT speedups. Introduction to KV cache  LLM models are rapidly being adopted for many tasks, including question-answering, and code generation. To generate a response, these models begin by converting the user’s prompt into tokens, which are then … Read more

Tags:

Tribute to the Renowned Artist Behind Star Wars, Lord of the Rings, and Magic: The Gathering

Greg Hildebrandt and his brother achieved great success in the fantasy and sci-fi art genres, and now we’ve lost them both.   In 1976, the Hildebrandt duo hit the ground running with the J.R.R. Tolkien calendar, which became a bestseller at the time, and their vibrant colors and vision of Middle-earth have inspired many subsequent Tolkien calendars and books, and their style has inspired many other artists. They gave the Hobbits big feet, although this was not mentioned in the books, and yes, they were the reason the Hobbits in Peter Jackson’s films had bigger feet. The brothers were then … Read more

Tags:

Asus ‘Turbo Game Mode’ arrives on its AM5 motherboards — second CCD and SMT toggles arrive for up to a 35% performance boost on X3D chips

Asus has released a new BIOS update for select AMD motherboards, introducing the “Turbo Game Mode” designed to optimize gaming performance on a range of Ryzen processors. This mode is aimed at users seeking the best possible experience from AMD’s high-core CPUs by adjusting core usage. Specifically, it disables one of the CPU’s two chiplets (CCD) and turns off Simultaneous Multi-Threading (SMT), optimizing single-threaded performance for games that do not benefit from high core counts. As confirmed by Computer Base, the update is said to be available for Asus’s ROG X870E Crosshair Hero motherboard. However, other models like the X670, … Read more

Tags:

Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan

Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve surgical workflows. One such challenge is efficiently combining multi-modal imaging data, such as preoperative 3D patient images with intra-operative video. This is key to providing surgeons with real-time, accurate guidance during minimally invasive or robotic-assisted procedures.  In this post, we walk you through the use of state-of-the-art AI and imaging techniques, with a highlight of  ImFusion’s integration of NVIDIA Holoscan for real-time sensor processing, AI, and I/O. We explore how NVIDIA Holoscan enabled us to double the pipeline … Read more

Tags:

Roccat Vulcan TKL Pro RGB Gaming Keyboard Review

PerformancePSU Verdict: 4.5/5 View on Amazon From the Reviewer The Roccat Vulcan TKL Pro is a well-constructed keyboard that would complement almost any PC gaming setup. Its highly responsive Titan Optical Switches are amazing to use and perform great in all genres of video games. With that being said, we didn’t feel like the typing experience or build quality was nice enough to warrant the price tag when coupled with the fact it doesn’t have wireless connectivity like other keyboards in the price bracket however, this keyboard is aimed at gamers and with that in mind is does gaming exceptionally … Read more

Tags:

Qualcomm says its Snapdragon Elite benchmarks show Intel didn’t tell the whole story in its Lunar Lake marketing

At its Snapdragon Summit in Maui, Qualcomm is taking Intel to task. The company, which released its Snapdragon X processors over the summer, is using its home turf to claim that its chips are faster than Intel’s newly released Intel Core Ultra Series 2 “Lunar Lake” processors, and suggested that Intel didn’t tell the whole story in its marketing. Qualcomm presented a series of benchmarks; Some of them were repeats of what we’ve seen before, now with Lunar Lake (and AMD’s Strix Point) added, while others were designed specifically to disprove Intel’s claims that Lunar Lake offers “the fastest CPU … Read more

Tags:

Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries

Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft’s TuringMM visual embedding model that maps images and text into a shared high-dimensional space. Operating on billions of images across the web, performance is critical.  This post details efforts to optimize the TuringMM pipeline using NVIDIA TensorRT and NVIDIA acceleration libraries like CV-CUDA and nvImageCodec. These efforts resulted in a 5.13x speedup and significant TCO reduction. We share how we worked with the Microsoft Bing team to tackle optimization of their core embeddings pipelines that power internet-scale … Read more

Tags:

Prebuilt Gaming PC Beginners Buyers Guide

So, you are thinking of buying a new prebuilt gaming pc for yourself or a loved one, however, there is one small problem; You have no idea where to start – don’t worry friend, we’ve got you. A Full RGB Gaming PC So what will you learn about in our guide? It can be overwhelming knowing where to start when you’re in the market for a new gaming PC especially if you have never brought one before but, never fear, by the time you have finished reading this short prebuilt gaming PC buyers guide, you will be armed with the … Read more

Tags:

SLI and Crossfire: What Is It, Do I Need It, Is It Dead?

When Do You Need SLI and Crossfire? When shopping for a video card, you should always get the single best card you can afford, unless you are planning on running at resolution that requires SLI to play games at the desired quality (which is generally 4K at high/max quality). I started with a single GTX 760 2GB, added another GTX 760 2GB later on for SLI. I then traded the GTX 760s for a single GTX 970 4GB. Later on, I added another GTX 970 4GB for SLI GTX 970 4GB. This is what you see in the picture. What … Read more

Tags:

What are Motherboard VRMs?

Motherboard VRMs are the often overlooked spec that has direct and often drastic impacts on system stability, performance, and longevity. What are they exactly though, why are they important, and how do you pick a motherboard with suitable VRMs for your build? Read on to learn more. Motherboard VRMs – A Quick Explainer VRM stands for Voltage Regulator Module. It consists of a set of motherboard embedded components that form a circuit which takes the 12-volt output generated by the power supply and converts and regulates it to meet the specific voltage requirements of the CPU, GPU, and RAM. Steady … Read more

Tags: