The 5G Core: A Service-Based Architecture (SBA)

The evolution from the 4G Evolved Packet Core (EPC) to the 5G Core network represents the most radical architectural transformation in the history of telecommunications. The 4G EPC was a rigid, monolithic architecture. It relied heavily on expensive, proprietary, hardware-specific boxes provided by legacy telecom vendors (e.g., physical MME routers or specialized SGW hardware). If a network operator wanted to increase the capacity of their network to handle a sudden influx of users, they had to physically purchase, rack-mount, and wire a new proprietary hardware server. This made scaling the network agonizingly slow, incredibly inflexible, and financially prohibitive.

The 5G Core completely dismantles this hardware-centric paradigm by adopting a cloud-native Service-Based Architecture (SBA). The core network is no longer defined by physical boxes, but by distributed software microservices communicating over standard IT protocols like HTTP/2 and REST APIs. This massive transition is realized through two foundational, intertwined technologies: NFV and SDN.

Network Function Virtualization (NFV)

Network Function Virtualization (NFV) takes the rigid, monolithic network functions of the 4G core—such as authentication (AUSF), session management (SMF), and policy control (PCF)—and extracts them entirely from the underlying proprietary hardware. These functions are rewritten as independent, virtualized software instances known as Virtual Network Functions (VNFs).

Crucially, these software instances do not require telecom-specific hardware. They are designed to run on standard, commercial off-the-shelf (COTS) x86 cloud servers in standard datacenters. By divorcing the software from the hardware, operators can treat their massive cellular network exactly like an AWS cloud environment.

Software-Defined Networking (SDN)

Software-Defined Networking (SDN) works in tandem with NFV by cleanly separating the control plane from the user plane.

The Control Plane: This is the complex “brain” of the network, containing the routing logic, policies, and algorithms deciding exactly where packets should go.
The User Plane: This is the “brawn” of the network, consisting of the dumb, high-speed physical switches that rapidly forward the data packets according to the rules set by the brain.

SDN centralizes the control plane into a software controller that can dynamically program the user plane switches across the entire country using open APIs (like OpenFlow).

Together, NFV and SDN provide unprecedented agility. If a massive sporting event requires localized data capacity, the operator does not need to deploy hardware. They simply use software APIs to instantly spin up hundreds of new virtualized User Plane Functions (UPFs) on standard servers near the stadium, dynamically route the local traffic through them using SDN, and seamlessly tear them down when the event ends, saving massive capital expenditure.

Multi-Access Edge Computing (MEC)

The migration to the cloud introduced a severe limitation regarding latency. This limitation is solved by moving the cloud to the physical edge of the network.

The Problem with Traditional Mobile Cloud Computing (MCC)

Traditional Mobile Cloud Computing (MCC) relies on a highly centralized architecture. When a mobile device needs to execute a computationally heavy task—such as natural language processing for voice assistants, or rendering a complex 3D scene—it lacks the local CPU power to do so efficiently. It offloads this task over the cellular network and across the public internet to a massive, centralized hyperscale datacenter (such as an AWS facility in Virginia or Ireland).

While the compute power at the datacenter is effectively infinite, the data must travel thousands of miles at the speed of light in fiber optic cables. The round-trip delay (latency) incurred traversing cross-country backbones, optical amplifiers, and multiple internet exchange points can easily exceed 50 to 100 milliseconds. While acceptable for downloading a web page, this latency renders real-time, interactive applications sluggish and unusable.

The MEC Solution: Bringing the Cloud to the Tower

Multi-Access Edge Computing (MEC) completely decentralizes this architecture to eliminate the speed-of-light penalty. MEC takes the high-performance compute and storage capabilities of the massive cloud datacenter and physically moves them to the extreme “edge” of the network.

Mini-datacenters (MEC servers) are physically co-located at the base of the 5G cell towers, or within the very first local neighborhood switching centers.

Enabling Ultra-Low Latency Applications (AR/VR)

This architectural shift is the singular enabling factor for ultra-low latency applications, most notably Augmented Reality (AR) and Autonomous Driving.

AR requires the system to render virtual 3D objects onto a live camera feed instantly as the user moves their head. Human biology dictates that if the “Motion-to-Photon” latency exceeds 20 milliseconds, the visual lag causes a sensory disconnect resulting in severe nausea and motion sickness.

With traditional MCC, the 100ms round-trip latency makes cloud-rendered AR physically sickening to use. With MEC, the mobile device offloads the heavy rendering task to the MEC server located at the local cell tower. The data only travels a few miles over a dedicated, uncongested fiber link, rather than traversing the country. This physical proximity slashes the round-trip network latency to a mere 1-5 milliseconds. The MEC server renders the complex 3D frame instantly and streams it back to the headset, making immersive, real-time Augmented Reality computationally viable on thin, lightweight mobile headsets without causing nausea.

Device-to-Device (D2D) Communication

Device-to-Device (D2D) communication fundamentally alters the topology and routing philosophy of cellular networks, blurring the lines between infrastructure and ad-hoc networking.

Bypassing the Infrastructure

Traditionally, in a cellular network, all communication must be routed through the central infrastructure. If User A is standing three feet away from User B and sends them a high-resolution picture, the data must travel from User A’s phone, up to the cell tower, down through the core network routers, back up to the cell tower, and finally down to User B. This “tromboning” effect wastes massive bandwidth on both the uplink and downlink radio channels and introduces unnecessary latency for locally relevant data.

D2D allows the two nearby mobile phones to completely bypass the base station infrastructure and transmit the data directly to each other peer-to-peer using the licensed cellular spectrum. The base station only acts as a control plane manager, setting up the connection and allocating the frequencies, but the actual heavy data payload flows directly between the devices. This drastically reduces latency, saves battery power (as devices transmit over 3 feet instead of 3 miles), and massively unburdens the cellular core network.

Spectrum Allocation: Overlay vs. Underlay

The central engineering challenge of D2D is how to allocate radio spectrum to these peer-to-peer links without breaking the macro cellular network.

“Overlay” (Dedicated Mode) Allocation

In Overlay allocation, the network operator strictly partitions the total spectrum. They reserve a dedicated chunk of frequencies exclusively for D2D links, and macro cellular users communicating with the tower are mathematically forbidden from using them.

Advantage: There is zero risk of interference between D2D users and macro cellular users.
Disadvantage: Massive spectrum waste. If no users in the cell are currently utilizing D2D features, those dedicated frequencies sit completely idle, crippling the overall system capacity of the tower.

“Underlay” (Shared Mode) Allocation

In Underlay allocation, efficiency is prioritized over absolute isolation. D2D users are allowed to transmit directly to each other using the exact same frequencies that are simultaneously being used by regular macro cellular users connecting to the tower.

Advantage: Maximizes spectral efficiency, as absolutely no bandwidth is wasted or left idle.
Disadvantage: Severe interference risks. If User A is transmitting to User B via D2D on Frequency X, and User C (located far away) is attempting to transmit to the Base Station on that same Frequency X, User A’s D2D transmission acts as catastrophic Co-Channel Interference to the Base Station. If User A transmits with too much power, their D2D signal will completely drown out User C’s signal at the Base Station receiver.
The Necessity of Power Control: Therefore, Underlay mode makes strict, centralized power control absolutely mandatory. The Base Station must actively monitor and strictly cap the maximum transmit power of all D2D links in the cell to ensure their signals decay rapidly and remain beneath the noise floor of the macro network receivers.

The 5G Core: A Service-Based Architecture (SBA)#

Network Function Virtualization (NFV)#

Software-Defined Networking (SDN)#

Multi-Access Edge Computing (MEC)#

The Problem with Traditional Mobile Cloud Computing (MCC)#

The MEC Solution: Bringing the Cloud to the Tower#

Enabling Ultra-Low Latency Applications (AR/VR)#

Device-to-Device (D2D) Communication#

Bypassing the Infrastructure#

Spectrum Allocation: Overlay vs. Underlay#

“Overlay” (Dedicated Mode) Allocation#

“Underlay” (Shared Mode) Allocation#