FERRAMENTAS LINUX: Resultados da pesquisa ROCm
Mostrando postagens classificadas por data para a consulta ROCm. Ordenar por relevância Mostrar todas as postagens
Mostrando postagens classificadas por data para a consulta ROCm. Ordenar por relevância Mostrar todas as postagens

sábado, 14 de março de 2026

AMD’s Critical Linux 7.0 Patch Fixes RDNA4 Idle Power Drain After AI Workloads

 

AMD

AMD is rolling out a critical Linux kernel patch and new MES firmware to resolve severe idle power consumption issues on next-gen RDNA4 (Radeon RX 9000 series) GPUs. Following workloads like AI inference with Llama.cpp, these graphics cards were stuck at 100% utilization, leading to abnormal energy use. Here is the technical breakdown of the fix heading to Linux 7.0.

quinta-feira, 12 de março de 2026

The Paradigm Shift: Running LLMs on AMD Ryzen AI NPUs with Linux

 

AMD

Unlock the full potential of AMD Ryzen AI NPUs on Linux. Our in-depth guide covers the revolutionary Lemonade 10.0 and FastFlowLM integration, enabling efficient LLM inference. Learn about kernel requirements, supported Ryzen AI 300/400 hardware, and how this shifts the paradigm for open-source AI development on edge devices.

quarta-feira, 4 de março de 2026

AMD Quietly Unlocks GPU Performance Analysis: ROCprof Trace Decoder Goes Open-Source

 


AMD has quietly open-sourced the ROCprof Trace Decoder, a critical component for GPU performance analysis. This MIT-licensed tool unlocks hardware-level thread tracing on Instinct and Radeon GPUs, providing kernel developers with unprecedented visibility into wave execution. 

terça-feira, 24 de fevereiro de 2026

The Linux Kernel's Colossus: How the AMDGPU Driver Surpassed Six Million Lines of Code

 


Explore the exponential growth of the AMDGPU driver in Linux Kernel 7.0, now exceeding six million lines of code. This deep-dive analyzes the architectural reasons behind this expansion, the role of auto-generated headers, and what this dominance means for the open-source ecosystem and enterprise computing.

segunda-feira, 9 de fevereiro de 2026

Master AMD's Peak Tops Limiter (PTL) for Superior AI/ML Power & Thermal Management

 

AMD


Discover how AMD's new Peak Tops Limiter (PTL) in the AMDGPU/AMDKFD Linux drivers enables granular control over Instinct accelerator computational throughput. This in-depth guide covers sysfs controls, ROCm APIs, and kernel parameters for optimizing power efficiency and thermal budgets in high-performance computing and AI workloads. Learn implementation strategies for data centers and research labs.

terça-feira, 27 de janeiro de 2026

Decoding AMD’s LLVM GFX13 Commit: The First Official Glimpse of RDNA5 Architecture

 

AMD

The LLVM 23 Git codebase now includes the initial AMDGPU GFX13 target, signaling development for AMD's next-gen RDNA5 graphics architecture. This article provides a deep technical analysis of GFX13's implications, its evolution from GFX12/RDNA4, and what it means for future GPU programming, high-performance computing, and competitive benchmarking against NVIDIA. Read our expert breakdown.

segunda-feira, 26 de janeiro de 2026

Master Your Linux GPU: LACT 0.8.4 Unleashes Advanced Control for AMD & Intel Graphics


Hardware


LACT 0.8.4 is here, offering the premier GUI control panel for AMD Radeon & Intel Arc Linux overclocking and GPU management. Dive into enhanced UI, Docker support, and advanced sensor telemetry. Discover why this open-source tool is essential for Linux gaming and compute optimization. Download now on GitHub.

sexta-feira, 23 de janeiro de 2026

AMD Radeon RDNA4 GPU Performance Surges with New Mesa 26.1 OpenGL Driver Optimizations

 

Radeon

AMD Radeon RDNA4 GPUs receive substantial OpenGL performance optimizations in Mesa 26.1 with enhanced buffer operations, image clears, and MSAA resolves. Expert Gallium3D improvements by AMD's Marek Olšák target GFX12 architecture for gaming, professional visualization, and compute workloads on Linux systems.

quinta-feira, 22 de janeiro de 2026

AMD ROCm 7.2.0 Launch: Official RDNA4 & RDNA3 GPU Support, New Tools, and Performance Optimizations

 


AMD's ROCm 7.2.0 is now officially available, expanding open-source GPU compute support to new RDNA4 graphics cards like the Radeon AI PRO R9600D and RX 9060 XT LP, while finally adding RDNA3 support. Discover the performance enhancements, new HIP APIs, and the beta launch of the ROCm Optiq visualization platform in this comprehensive release analysis.

PyTorch 2.10 Release: A Comprehensive Guide to GPU Acceleration, Performance Optimizations, and Deep Learning Enhancements

 

AI


PyTorch 2.10 introduces major upgrades for Intel, AMD, and NVIDIA GPU acceleration, Python 3.14 compatibility, and advanced kernel optimizations. Explore performance benchmarks, key features, and enterprise AI implications in this detailed technical analysis. 

domingo, 18 de janeiro de 2026

AMDGPU & AMDKFD Linux Driver Patches: A Deep Dive into RDNA 4 Prep & 340MHz HDMI Boost for Linux 6.20~7.0

 

Radeon


Explore the latest AMDGPU & AMDKFD Linux kernel patches for Linux 6.20~7.0. Dive deep into RDNA 3.5/4.0 (GFX12.1) support, SMU 15 updates, 340MHz HDMI clock fixes for 4K/8K displays, and ARM64 server optimizations. A must-read for Linux sysadmins, PC builders, and HPC developers.

sexta-feira, 16 de janeiro de 2026

Burn 0.20 Unleashed: A New Era for High-Performance AI with Rust and CubeK

 

AI

Burn 0.20, the Rust-based deep learning framework, launches with CubeK & CubeCL, enabling peak AI performance on NVIDIA CUDA, AMD ROCm, Apple Metal, WebGPU & CPU. See benchmarks vs. LibTorch and explore the future of unified, efficient ML kernels. Read the full technical analysis.

sábado, 10 de janeiro de 2026

AMD Lays Linux Foundation for RDNA 4 & Next-Gen AI GPUs: A Deep Dive into Kernel 7.0 Driver Updates

 

Radeon

 AMD's latest Linux kernel driver update, targeting Linux 7.0, introduces foundational support for next-gen RDNA 4 (GC 12.1) and enhanced RDNA 3.5 (GC 11.5.4) GPUs, plus new NPU integration via SMU 15. This in-depth analysis covers the IP block enablement strategy, ROCm compute improvements, and what it signals for AMD's 2026-2027 graphics and AI accelerator roadmap. 

domingo, 4 de janeiro de 2026

Revolutionizing GPU Memory Management: AMD’s Batch Userptr Allocation for High-Performance Computing

 

Radeon

Explore AMD's breakthrough batch userptr allocation in the KFD kernel driver, enhancing GPU memory management for fragmented workloads. Learn how this ROCm innovation boosts HPC & AI performance with contiguous GPU VA mapping, reducing syscall overhead. Full technical analysis inside.

sexta-feira, 26 de dezembro de 2025

NVIDIA's Strategic Move: CUDA Tile IR Goes Open-Source Under Apache 2.0

 

NVIDIA

 NVIDIA open-sources CUDA Tile IR, an MLIR-based compiler infrastructure for GPU kernel optimization. Explore the technical implications for AMD, Intel, & AI accelerators, its impact on cross-vendor portability like ZLUDA, and why this 2026 roadmap shift matters for developers. Full analysis inside.

quarta-feira, 19 de novembro de 2025

MLPerf Client v1.5 Linux Support: Experimental Build Analysis and Cross-Platform AI Benchmarking

 

AI

MLPerf Client v1.5 introduces experimental Linux CLI support with OpenVINO acceleration, expanding AI PC benchmarking beyond Windows and macOS. Explore its capabilities and limitations for local LLM inference performance testing on client hardware. Learn about this industry-standard benchmark from MLCommons.

quinta-feira, 13 de novembro de 2025

Red Hat Enterprise Linux 10.1 is Here: A New Era for AI and Enterprise Computing

 

Red Hat

Red Hat Enterprise Linux 10.1 simplifies AI with vendor-validated GPU drivers, offers systemd soft-reboots for less downtime, and enhances security with post-quantum cryptography. Discover how to accelerate your enterprise IT.

quinta-feira, 30 de outubro de 2025

AMD XDNA Driver Update Unveils NPU3A Silicon and Strategic Shift Towards Linux Upstreaming

 

AMD

Explore AMD's new XDNA 202610.2.21.17 driver with NPU3A support & Linux upstreaming to XRT. This in-depth analysis covers Ryzen AI's architecture, what user pointer allocation means for performance, and the future of NPU computing on Linux.

quinta-feira, 16 de outubro de 2025

Ollama Breaks New Ground: Experimental Vulkan API Support Unlocks Broader GPU Access for LLMs

 


Ollama 0.12.6-rc0 introduces experimental Vulkan API support, expanding GPU compatibility for LLMs like Llama 3 and Gemma 3 on AMD and Intel hardware. This guide covers the technical implications for AI inferencing and machine learning workflows. 

PyTorch 2.9 Release Unleashes Broader Hardware Support and Performance Gains for AI Developers

 

AI


PyTorch 2.9 release enhances AI development with expanded AMD ROCm & Intel XPU support, simplified installation via wheel variants, and new features like symmetric memory and FlexAttention. Explore the performance upgrades for multi-GPU and edge computing.