Virtual Memory Deep Dive: Mechanisms, Paging, and OS-Level Implementation

Virtual memory is a fundamental abstraction in modern operating systems that decouples application memory from the underlying physical hardware. It enables process isolation, memory overcommitment, and sophisticated paging strategies that make multitasking possible on systems with limited RAM.

In this article, we explore the underlying architecture, components, and behaviors of virtual memory with a focus on Linux internals and memory management units (MMUs).

What is Virtual Memory?

Virtual memory (VM) is a memory management capability that provides an application with the illusion of a large, contiguous address space, even though the physical memory (RAM) may be fragmented or insufficient to hold all active pages simultaneously.

Modern CPUs support virtual memory through hardware-level address translation, using the Memory Management Unit (MMU) and page tables.

Core Concepts in Virtual Memory

1. Virtual vs. Physical Address Space

Virtual Address (VA): The logical address used by the process, starting typically from 0x00000000.
Physical Address (PA): The real location in RAM.
The MMU, in collaboration with the OS, translates VA → PA via a page table hierarchy (e.g., 4-level page tables in x86_64).

2. Pages and Page Frames

Virtual memory is split into fixed-size blocks called pages (typically 4 KB, 2 MB, or 1 GB on x86).
Physical memory is similarly divided into page frames.
The page table tracks VA → PA mappings and access permissions (read/write/execute).

3. Page Table Entries (PTEs)

Each entry contains: - Frame number - Valid bit - Dirty bit (has the page been written to?) - Accessed bit (for LRU-like algorithms) - Protection bits (R/W/X/U/S)

4. Page Faults

Triggered when a process accesses a virtual address that is not currently mapped.
The OS traps into a page fault handler and may:
- Load the page from swap
- Allocate a new zero page
- Terminate the process (segfault) if invalid

5. Translation Lookaside Buffer (TLB)

Hardware cache for recent VA→PA translations
On a context switch or TLB miss, the CPU must walk the page tables
TLB shootdowns are required in multiprocessor systems during address space updates

Linux Virtual Memory Implementation

Linux uses a demand-paging model backed by a combination of: - Anonymous pages (heap, stack, malloc) - File-backed pages (shared libraries, memory-mapped files) - Swap (secondary storage)

Each process has its own virtual memory layout:

+-----------------------------+ 0xC0000000
| Kernel space (for kernel)  |
+-----------------------------+ 0xBFFFFFFF
| Stack                      |
| Heap                       |
| .bss .data .text           |
| mmap() regions             |
+-----------------------------+ 0x00000000

You can examine a process’s layout: bash cat /proc/$(pidof bash)/maps

Page Tables in Linux (x86_64):

PGD → PUD → PMD → PTE (4-level hierarchy)

Use pagemap and vmstat interfaces for deeper analysis: bash cat /proc/self/pagemap vmstat -s

Page Replacement and Swapping

Linux uses the Least Recently Used (LRU) approximation via an active/inactive list strategy: - kswapd daemon handles background reclamation - Swappable pages moved to disk (/proc/swaps)

View current swap usage: bash swapon -s free -h

Python Simulation: Paging Behavior (Conceptual Model)


class VirtualMemory:
    def init(self, ramsize):
        self.pagetable = {}
        self.ram = [None] * ramsize
        self.disk = {}
        self.lruqueue = []

def access_page(self, page_id):
    if page_id in self.page_table:
        print(f"Hit: Page {page_id} in frame {self.page_table[page_id]}.")
    else:
        print(f"Page fault: Loading page {page_id}.")
        self.handle_page_fault(page_id)
    self._touch_lru(page_id)

def handle_page_fault(self, page_id):
    if None in self.ram:
        frame = self.ram.index(None)
    else:
        evict_page = self.lru_queue.pop(0)
        frame = self.page_table[evict_page]
        print(f"Evicting page {evict_page} from frame {frame}.")
        del self.page_table[evict_page]
    self.ram[frame] = page_id
    self.page_table[page_id] = frame

def _touch_lru(self, page_id):
    if page_id in self.lru_queue:
        self.lru_queue.remove(page_id)
    self.lru_queue.append(page_id)


vm = VirtualMemory(3)
for pid in [1,2,3,1,4,2,5]:
    vm.access_page(pid)

SEO Keywords

virtual memory deep dive, page tables linux, how virtual memory works in linux, x86_64 page tables, mmu address translation, tlb shootdown, virtual memory internals, page fault handler, swap memory linux

Conclusion

Virtual memory is the foundation of process abstraction in modern computing. From address translation to page faults, and TLB caches to swap-backed paging, it powers the multitasking, isolation, and stability of contemporary operating systems. Mastery of these internals is critical for systems programmers, kernel developers, and performance engineers alike.

In future posts, we’ll look at implementing page replacement algorithms in detail and dive into kernel memory allocators like slab and buddy systems.

Abhisheyk Gaur on Blogger