Understanding your system's specifications is fundamental when working with Large Language Models. Determining the capabilities of your own hardware, including Video RAM (VRAM) and system RAM, is essential before attempting to download or run models. This knowledge ensures you have the necessary resources to run a specific LLM.This section provides practical guidance on how to find the amount of VRAM available on your Graphics Processing Unit (GPU) and the total system RAM installed on your computer for common operating systems.Finding Your GPU's VRAMThe amount of VRAM is often the most immediate limiting factor for running larger LLMs locally. Here’s how to check it:On Windows:Task Manager: The easiest way is often through the Task Manager.Right-click the taskbar and select "Task Manager", or press Ctrl+Shift+Esc.Navigate to the "Performance" tab.Click on your GPU in the left-hand pane (it might be labeled GPU 0, GPU 1, etc.). If you have multiple GPUs (e.g., integrated and dedicated), make sure you select the dedicated one, which is usually used for intensive tasks like running LLMs.Look for "Dedicated GPU memory" near the bottom. This value represents your VRAM. You'll often see both the total amount and the current usage.DirectX Diagnostic Tool:Press Win+R, type dxdiag, and press Enter.Wait for the tool to collect information.Go to the "Display" tab (or "Render" tab if you have multiple GPUs).Look for "Display Memory (VRAM)" or a similar field. Note that this might sometimes combine VRAM with shared system memory; the Task Manager often provides a clearer view of dedicated VRAM.GPU Manufacturer Software: If you have NVIDIA or AMD GPUs, their respective control panels (NVIDIA Control Panel or AMD Radeon Software) usually display hardware information, including VRAM, often under a "System Information" or similar section.On macOS:About This Mac: This provides a quick overview.Click the Apple menu () in the top-left corner.Select "About This Mac".In the window that appears, look for the "Graphics" line. It will typically list the GPU model and the amount of VRAM (e.g., "AMD Radeon Pro 5700 XT 16 GB").System Information: For more detail:Click the Apple menu () > "About This Mac".Click the "System Report..." button.In the left sidebar, under "Hardware", select "Graphics/Displays".The details for your GPU(s) will be shown on the right, including the "VRAM (Total)" or "VRAM (Dynamic, Max)" depending on the GPU type (dedicated vs. integrated/unified memory). For Apple Silicon Macs (M1, M2, M3 series), the memory listed here is unified memory shared between the CPU and GPU, functioning effectively as both RAM and VRAM.On Linux:NVIDIA GPUs (nvidia-smi): If you have an NVIDIA GPU and the appropriate drivers installed, the nvidia-smi command is the standard tool.Open a terminal window.Type nvidia-smi and press Enter.Look at the top section of the output. It will show the GPU name and the total memory (e.g., "16384MiB" for 16GB VRAM). It also shows current memory usage.+-----------------------------------------------------------------------------+ | NVIDIA-SMI 525.125.06 Driver Version: 525.125.06 CUDA Version: 12.0 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 NVIDIA GeForce ... On | 00000000:01:00.0 Off | N/A | | 30% 40C P8 15W / 350W | 10MiB / 16384MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ ...Example output of the nvidia-smi command, showing 16384MiB (16GB) of total VRAM.AMD GPUs (radeontop or lspci + System Info):For a quick overview of usage, you might use radeontop (may need installation).To identify the card and then check system details: Use lspci | grep -i vga or lspci | grep -i display to identify your AMD GPU model.System monitoring tools or graphical system information utilities available in your distribution's desktop environment (like GNOME System Monitor or KDE System Monitor) often report VRAM under display or hardware sections. The command glxinfo | grep "Video memory" might also work if Mesa utils are installed.General (lspci and System Monitors): The lspci command combined with graphical system monitoring tools available in most desktop environments (like System Monitor on Ubuntu/GNOME) can often provide details about the detected graphics hardware.Finding Your System RAMSystem RAM is needed to load the operating system, applications, and potentially parts of the model or data if VRAM is insufficient (though this is much slower).On Windows:Task Manager:Open Task Manager (Ctrl+Shift+Esc).Go to the "Performance" tab.Click on "Memory".The total amount of installed RAM is displayed prominently near the top right (e.g., "16.0 GB").System Information:Press Win+R, type msinfo32, and press Enter.Look for "Installed Physical Memory (RAM)" in the System Summary.On macOS:About This Mac:Click the Apple menu () > "About This Mac".The "Memory" line directly shows the installed RAM (e.g., "16 GB 2667 MHz DDR4"). For Apple Silicon Macs, this is the total unified memory.System Information:Click the Apple menu () > "About This Mac" > "System Report...".Under "Hardware", select "Memory". It shows details about the installed RAM modules and the total amount.On Linux:free command: A standard command-line tool.Open a terminal.Type free -h and press Enter. The -h flag shows the values in human-readable format (e.g., GiB for Gibibytes).Look at the "Mem:" row under the "total" column.~$ free -h total used free shared buff/cache available Mem: 31Gi 5.8Gi 18Gi 1.2Gi 7.4Gi 24Gi Swap: 2.0Gi 0B 2.0GiExample output of free -h, showing approximately 31GiB of total system RAM.htop or similar tools: Tools like htop (often needs installation via sudo apt install htop or sudo yum install htop) provide a more interactive view of system resources, including total RAM shown typically at the top.Graphical System Monitors: Most Linux desktop environments include a graphical system monitor (e.g., GNOME System Monitor, KSysGuard) that displays total RAM, usually on a "Resources" or "System" tab.By following these steps, you can quickly determine the VRAM and RAM available on your system. Comparing these specifications against the estimated requirements for an LLM (using the methods discussed earlier in this chapter) will give you a clear idea of whether your hardware is suitable for running that particular model effectively. Remember that besides memory, the GPU's processing power (related to its model and architecture, like CUDA cores or Tensor cores for NVIDIA) also significantly impacts performance, though VRAM is often the first bottleneck you'll encounter.