Which port must be reachable for a Prisma SD-WAN ION device to register and stay online with the controller?

Correct answer: d) TCP 443 from the controller port to the Prisma SD-WAN controller. d. The ION reaches the cloud controller over outbound TCP 443 from its controller port; if blocked, the device shows offline. UDP 4500 is IPsec NAT-T data plane, TCP 22 is SSH admin, and there is no per-peer 8443 requirement for controller registration.

At an Infosys campus, Vikram defines a custom app for an internal portal on 172.20.5.0/24 over TCP 8080, but flows still show unknown-tcp. What should he check first?

Correct answer: a) That the App Definition includes the 172.20.5.0/24 network and TCP 8080, and is bound to a category. a. A flow falls back to unknown-tcp when no App Definition matches it, so the first check is that the custom definition carries the right prefix, port, and category. MTU, NAT, and BGP affect reachability and forwarding, not L7 classification.

Divya needs interactive Zoom traffic at a Wipro branch to ride broadband first and fail to MPLS only on degradation. Which object does she configure?

Correct answer: c) A Path Policy rule for the Zoom app with broadband primary, MPLS secondary, and an SLA profile. c. Transport order plus SLA-driven failover for a named app is exactly a Path Policy rule. QoS only shapes bandwidth, ZBFW only permits/denies, and a static route cannot do app-aware, quality-based steering.

An HCL site has two equal-cost data-center summaries from two hubs. Aman wants the secondary hub used only on failure. What is the cleanest change on the IONs?

Correct answer: b) Adjust the route cost/metric so the primary hub's prefix is preferred, secondary as backup. b. Path preference between equal-cost prefixes is solved by tuning the route cost/metric so one is primary and the other stays as failover. MTU does not change route preference, deleting the route removes redundancy, and Security Zones govern firewalling, not routing preference.

A TCS branch ION shows online, all WAN links up, but a few SaaS apps are slow only in the evening. Per-app metrics show MPLS jitter spiking while broadband stays clean, yet traffic never moves. What does this indicate?

Correct answer: d) The Path Policy rule for those apps has no strict SLA profile, so brownout failover never triggers. d. The device is online and links are up, so steering is the issue: without a tight SLA profile on the app's Path Policy rule, the ION tolerates the degraded MPLS instead of failing to broadband. A down controller would not stop forwarding, ZBFW drops would block not slow, and session keys rotate automatically.

At a Pune BFSI hub, branches reach the DC fine but new branch-to-branch traffic for one app is blackholed. Routing is correct and tunnels are up. The app's flows hit the firewall and stop. What is the most likely root cause?

Correct answer: a) A Zone-Based Firewall rule denies that app between the two branch Security Zones. a. Routing and tunnels are healthy and the flow reaches the firewall then stops, which points to a ZBFW deny between the branch zones. A lost controller would not blackhole forwarding, a missing App Definition yields unknown classification not a drop, and QoS would slow, not blackhole.

Which port must be reachable for a Prisma SD-WAN ION device to register and stay online with the controller?

Correct answer: TCP 443 from the controller port to the Prisma SD-WAN controller. d. The ION reaches the cloud controller over outbound TCP 443 from its controller port; if blocked, the device shows offline. UDP 4500 is IPsec NAT-T data plane, TCP 22 is SSH admin, and there is no per-peer 8443 requirement for controller registration.

At an Infosys campus, Vikram defines a custom app for an internal portal on 172.20.5.0/24 over TCP 8080 , but flows still show unknown-tcp . What should he check first?

Correct answer: That the App Definition includes the 172.20.5.0/24 network and TCP 8080 , and is bound to a category. a. A flow falls back to unknown-tcp when no App Definition matches it, so the first check is that the custom definition carries the right prefix, port, and category. MTU, NAT, and BGP affect reachability and forwarding, not L7 classification.

Divya needs interactive Zoom traffic at a Wipro branch to ride broadband first and fail to MPLS only on degradation. Which object does she configure?

Correct answer: A Path Policy rule for the Zoom app with broadband primary, MPLS secondary, and an SLA profile. c. Transport order plus SLA-driven failover for a named app is exactly a Path Policy rule. QoS only shapes bandwidth, ZBFW only permits/denies, and a static route cannot do app-aware, quality-based steering.

An HCL site has two equal-cost data-center summaries from two hubs. Aman wants the secondary hub used only on failure. What is the cleanest change on the IONs?

Correct answer: Adjust the route cost/metric so the primary hub's prefix is preferred, secondary as backup. b. Path preference between equal-cost prefixes is solved by tuning the route cost/metric so one is primary and the other stays as failover. MTU does not change route preference, deleting the route removes redundancy, and Security Zones govern firewalling, not routing preference.

A TCS branch ION shows online , all WAN links up, but a few SaaS apps are slow only in the evening. Per-app metrics show MPLS jitter spiking while broadband stays clean, yet traffic never moves. What does this indicate?

Correct answer: The Path Policy rule for those apps has no strict SLA profile, so brownout failover never triggers. d. The device is online and links are up, so steering is the issue: without a tight SLA profile on the app's Path Policy rule, the ION tolerates the degraded MPLS instead of failing to broadband. A down controller would not stop forwarding, ZBFW drops would block not slow, and session keys rotate automatically.

At a Pune BFSI hub, branches reach the DC fine but new branch-to-branch traffic for one app is blackholed. Routing is correct and tunnels are up. The app's flows hit the firewall and stop. What is the most likely root cause?

Correct answer: A Zone-Based Firewall rule denies that app between the two branch Security Zones. a. Routing and tunnels are healthy and the flow reaches the firewall then stops, which points to a ZBFW deny between the branch zones. A lost controller would not blackhole forwarding, a missing App Definition yields unknown classification not a drop, and QoS would slow, not blackhole.

Neha sees a single branch ION show offline in the controller, but users at that site still browse and reach the DC over VPN. The WAN underlay is healthy. What is happening, and what is the operational impact?

Correct answer: Only the control plane to the controller is broken; forwarding, policy, and VPNs continue, with session keys rotating up to 72 hours. b. Control and data planes are independent, so a controller-unreachable ION keeps forwarding, enforces existing policy, and holds its VPNs while rotating per-VPN session keys hourly for up to 72 hours. You only lose config pushes and live analytics, not connectivity or encryption.

Karthik adds a third LTE transport at a Hyderabad SOC branch. Links show up, but the LTE VPN to peers never forms while MPLS and broadband VPNs are fine. Which design check best isolates the fault?

Correct answer: Verify the LTE interface is bound to a defined WAN Network with a valid discovered public IP for tunnel endpoints. a. Auto-VPN only builds when each WAN interface maps to a defined WAN Network with a usable public-IP endpoint; an unmapped or mistyped LTE interface has no tunnel endpoint. VLANs and LAN ports are downstream, ZBFW ICMP does not gate VPN formation, and QoS shapes existing traffic rather than building tunnels.

A Bangalore ITES architect proposes putting the Prisma SD-WAN controller inline in the data path for tighter control. As the reviewer, how should you judge this and why?

Correct answer: Reject it; the controller is a separate management plane and an inline design would break the resilience that lets IONs run when the controller is unreachable. b. Prisma SD-WAN's value rests on control/data plane separation: the cloud controller never forwards packets, so branches survive controller loss. Putting it inline destroys that resilience and adds a single point of failure. The objection is architectural, not just hub-scope or cost.

Two designs for a Mumbai bank: Design A enforces app permit/deny purely with Path Policies; Design B uses Zone-Based Firewall for permit/deny and Path Policy only for steering. Which is sound and why?

Correct answer: Design B, because permit/deny is the firewall's job and Path Policy only chooses transports for already-permitted flows. c. Path Policy cannot block traffic; it only steers permitted flows across transports by SLA. Security permit/deny belongs to the Zone-Based Firewall, so Design B uses each object for its real purpose. The two are not interchangeable, and 'fewer objects' does not justify mis-using a steering construct as a control.

Prisma SD-WAN (CloudGenix ION) Interview QnA

Content-specific feature visual for this lesson: use it as the 60-second map before reading the full detail.

Why this matters — SD-WAN that watches the app, not the cable

Think of an ambulance driver in Mumbai traffic. A bad driver only checks whether a road is open — if it has tarmac, he takes it. A great driver watches whether traffic on that road is actually moving and reroutes the second it slows, before the patient suffers. Legacy SD-WAN is the first driver: it picks a path because the link is up. Prisma SD-WAN is the second: it measures the application — loss, latency and jitter per flow — and moves traffic the moment quality drops, even while the link still shows green.

Interviewers probe this because most candidates can recite "IPSec tunnel over two ISPs" but cannot explain how an ION decides a path is browning out, why the Controller never sits in the data path, or what happens to the branch when the cloud is unreachable. Those three ideas separate a button-pusher from an engineer who can design and defend a fabric.

Scenario · Sneha — L2 network engineer at a Pune BFSI

Sneha is in a final-round panel. The interviewer says: "Both your branch ISPs show up, but the Bangalore agents complain the Salesforce app is laggy. Your ION is not failing over. Why, and what do you check?" She freezes — she has only ever watched interface state, never per-app quality.

The fix is a mental model: Prisma SD-WAN steers on application health, so she should be looking at the path's loss/latency/jitter against the app's SLA, the active Path policy, and the flow browser — not the link LED. Learn that model here and the panel question becomes a layup.

1. Prisma SD-WAN Architecture & ION

Prisma SD-WAN (the product formerly called CloudGenix) splits cleanly into a cloud Controller (management + control plane) and the ION appliances that sit in the branch and DC and forward traffic. Get the control/data separation crisp — it is the first thing a panel checks.

Q1 What is an ION device in Prisma SD-WAN, and what role does it play?L1

An ION (Instant-On Network) is the data-plane appliance Prisma SD-WAN places at every branch and data center. It terminates the WAN circuits, builds the secure fabric overlay (IPSec) to other IONs, measures per-path quality, and enforces the policies the Controller pushes down. IONs come as physical models (for example the 1200-S, 3200, 9000 series for DC) and as a vION virtual form factor for AWS, Azure, GCP and VMware. The key point: the ION forwards every packet. The cloud Controller never touches user traffic — it only programs the ION.

ION = the data-plane box; physical + vION; Controller is not in the data path.

Q2 A screening question: 'CloudGenix, Prisma SD-WAN, Controller, Strata Cloud Manager, ION — which of these are the same thing?' Untangle the names.L1

This is a deliberate name trap to check you've worked with the product across its renames, not just read one old blog. CloudGenix was the original company Palo Alto acquired in 2020; the product is now Prisma SD-WAN — same technology, new name. The Controller is the legacy term for the cloud management plane; that role now lives inside Strata Cloud Manager (SCM), Palo Alto's unified cloud management console, so 'Controller' and 'SCM' point at the same control plane. ION (Instant-On Network) is unchanged — still the edge appliance. So when Rohan reads a 2019 CloudGenix doc that says 'log into the Controller and claim the ION', the modern equivalent is 'log into Strata Cloud Manager and claim the ION'. Knowing the old and new names map to the same thing is exactly what the interviewer is fishing for.

CloudGenix=Prisma SD-WAN; Controller=now Strata Cloud Manager; ION unchanged — old/new names map to same thing.

Q3 What is the Prisma SD-WAN Controller and where does it sit?L1

The Controller is the multi-tenant cloud control and management plane. You log into it for the whole lifecycle: claiming IONs, defining sites, building Path/QoS/Security/NAT policy stacks, and viewing analytics. It computes control-plane decisions — which paths exist, what policies apply — and pushes that state to each ION. Crucially it is out of band: it is not a packet forwarder and not a tunnel concentrator. If a branch loses reach to the Controller, the ION keeps forwarding on its last-known policy, so the branch does not go down just because the cloud is unreachable.

Cloud, out-of-band control/management plane; branch survives Controller loss.

Q4 Explain the control-plane / data-plane separation and why it makes Prisma SD-WAN resilient.L2

The data plane lives entirely on the ION: tunnel build, packet forwarding, per-flow steering, and quality measurement all happen locally. The control plane (policy intent, topology, analytics) lives in the cloud Controller and is pushed to the ION. Because forwarding decisions are pre-programmed into the ION, traffic does not need a round-trip to the cloud per packet. So if the WAN path to the Controller dies — or the Controller has a maintenance window — Aditya's branch in a Hyderabad SOC keeps running on cached policy and the fabric stays up. This is the same principle as a router still forwarding when its management VLAN is down. It also means the cloud is not a single point of failure for user traffic.

Forwarding is local + cached; cloud loss ≠ branch outage; no per-packet cloud round-trip.

Q5 What is the AppFabric / secure fabric overlay, and how is it built between IONs?L2

The secure fabric is the overlay of IPSec tunnels (VPNLinks) that Prisma SD-WAN auto-builds between IONs over every usable WAN circuit. Each ION advertises its WAN interfaces to the Controller; the Controller decides which IONs should peer and the IONs establish encrypted tunnels directly between themselves — the Controller is not a hub in the data path. Over each underlay circuit (MPLS, broadband, LTE) you get a separate measured path, so the same branch-to-DC pair may have three or four parallel fabric paths. Application traffic then rides whichever fabric path currently meets the app's SLA. This auto-meshing is why you do not hand-configure crypto maps per site.

Auto-built IPSec overlay (VPNLinks) directly ION-to-ION; one measured path per underlay circuit.

Q6 How does a branch ION differ from a data-center ION in role and sizing?L2

Functionally both run the same software, but their role and scale differ. A branch ION (for example a 1200-S at a Chennai ITES floor) terminates a handful of circuits, serves local users, and offers Direct Internet Access (DIA). A DC ION (3200/9000 class) is the aggregation point: it terminates fabric tunnels from many branches, sits in front of DC services, and usually peers with the core via BGP. In the Controller you group DC IONs into a Data Center group so branches can be pointed at "the DC" rather than at individual boxes — giving you N+1 redundancy at the hub. So the difference is throughput, tunnel scale and topology position, not feature set.

Same software, different scale/position; DC ION = aggregation + BGP + DC group redundancy.

Q7 Walk through how a brand-new ION is claimed and onboarded to the Controller.L2

This is the Zero-Touch Provisioning (ZTP) flow. The ION ships with firmware that, on first boot with internet, reaches the Prisma SD-WAN cloud and registers. In the Controller, Priya claims the device by its serial number, which binds it to her tenant. She then assigns it to a site and applies a configuration profile — interface roles (WAN/LAN), circuit labels, policy bindings. The Controller pushes that down, the ION builds its fabric tunnels, and it goes online. The branch staff only had to rack it and plug in the circuits — no CLI. If claiming fails, the usual culprits are the device having no internet reach to the cloud, a wrong/duplicate serial, or it already being claimed in another tenant.

ZTP: register → claim by serial → bind to site → push profile; field staff do no CLI.

Q8 A panel asks: 'If the Controller is in the cloud, how do you forward traffic when an ION cannot reach it? Design for that.' How do you answer?L3

I separate steady state from failure. In steady state the Controller pushes policy and topology to the ION; nothing about forwarding depends on a live cloud session. On Controller unreachability, the ION keeps forwarding on its last-known-good policy and keeps its fabric tunnels up, because tunnels and steering are local. What you lose is the ability to make new config changes and to stream fresh analytics — not connectivity. To harden this I would: give the ION more than one path to reach the cloud (DIA over a second ISP), make the local breakout policy explicit so critical SaaS still has a route, and confirm DNS/NTP locally so the box does not get stuck on a control dependency. So the honest answer is: the branch does not go dark — it goes 'frozen-but-forwarding' — and good design makes even the control-channel loss unlikely.

Frozen-but-forwarding model; loss = no new config/analytics, not loss of traffic; redundant cloud reach.

Legend untrusted / attacker trusted / corporate inspection / policy point the key "aha" node allowed

Thesis: the Controller never carries your traffic. Look for the split — cloud Controller runs the control plane, branch and DC IONs build the data-plane fabric directly between themselves.

Prisma SD-WAN terms an interviewer will probe

🖥️

ION device

tap to flip

The CloudGenix appliance (or VM) at a branch or DC. It forwards traffic and enforces policy — so a dead Controller never stops live flows.

☁️

Cloud Controller

tap to flip

The management brain at controller.cloudgenix.com. Pushes policy and collects telemetry only — it never carries your data traffic.

🔗

Secure fabric

tap to flip

Auto-built IPsec mesh between IONs. This is the real data plane, so branches talk directly without hairpinning through the DC.

🎯

AppFabric / app-defined

tap to flip

Routing driven by each app's live SLA, not static metrics. So a brownout on one link steers that app to a healthy path instantly.

🚪

DIA

tap to flip

Direct Internet Access — trusted SaaS exits straight from the branch. Lower latency for O365 because traffic skips the DC backhaul.

🛡️

Prisma Access tie-in

tap to flip

Risky or general web is sent to Prisma Access for SWG, CASB and threat scan. So one fabric covers both SD-WAN and SASE security.

Quick check · inline mini-quiz #1

Sneha, an L2 at a Pune BFSI, sees a branch ION 3200 go red in the controller. The site WAN circuits are fine and users still reach SaaS apps over the VPN tunnels. The Controller Connection field shows offline. What is the single most likely cause?

a) The data plane crashed, so all VPN tunnels are down b) Outbound TCP 443 from the controller port to the Prisma SD-WAN controller is blocked, breaking only the control plane c) The site's App Definitions were deleted from the controller d) BGP to the LAN core flapped, taking the device offline

Correct: b. Prisma SD-WAN fully separates control and data planes. The ION reaches the controller over TCP 443 from its controller port; if that path is filtered, the device shows offline but keeps forwarding traffic and holds its VPNs. The data plane is independent (a), App Definitions live in the controller and would not flip the device offline (c), and LAN BGP affects routing, not controller reachability (d).

2. App-Defined Fabric & Policies

The thing that sells Prisma SD-WAN is that it steers on application health, not raw link state. This section is where panels separate people who memorised 'two tunnels and a failover' from people who understand path/QoS/security/NAT stacks and DIA design.

Q9 What does 'app-defined SD-WAN' mean, and how is it different from link-state SD-WAN?L2

Link-state SD-WAN fails over when an interface goes down or a probe says the tunnel is dead — a binary up/down view. App-defined SD-WAN measures the actual quality each application is getting — loss, latency and jitter per path — and compares it to that app's SLA. So even if both ISPs at Rahul's Mumbai bank branch are up, if the path carrying Microsoft Teams crosses the jitter threshold, Prisma SD-WAN moves the Teams flows to the better path while leaving bulk file-copy where it is. The decision is per-application, not per-link. That is the headline difference: it steers on the experience the app sees, so it catches brownouts that a link-up check is blind to.

Per-app loss/latency/jitter vs SLA, not interface up/down; catches brownouts.

Q10 How does Prisma SD-WAN identify applications?L1

It uses an application catalog of thousands of pre-defined apps plus the ability to define custom applications. An app definition can be matched by Layer 3/4 tuple (IP ranges, ports, protocol), by domain/URL for SaaS, and by deep inspection of the first packets of a flow. SaaS apps like Salesforce or Office 365 come with their published prefix/domain sets so the ION knows which destinations belong to them. For an in-house app at an Infosys campus you build a custom app definition with its server IPs and ports. Once a flow is classified, every policy stack — path, QoS, security — can reference that application by name instead of by raw IP.

Built-in catalog + custom apps; matched by L3/L4 + domain/URL; policy references app by name.

Q11 Explain the policy stacks in Prisma SD-WAN and the order they evaluate.L2

Policy is organised as separate stacks, each a set of rules you bind to a site: Path (which path/transport a flow takes), QoS (priority and bandwidth treatment), Security (zone-based allow/deny at the branch), and NAT (source/destination translation for breakout). A flow is first classified to an application, then evaluated against the Path policy to pick the egress path, the QoS policy to set its queue/priority, the NAT policy if it breaks out locally, and the Security policy for zone enforcement. Keeping them as distinct stacks means Divya can change steering for an app without touching its QoS or security — clean separation of concerns. You bind a stack to many sites, so one edit rolls out fleet-wide.

Four stacks (Path/QoS/Security/NAT); classify-then-evaluate; stacks bind to many sites.

Q12 What is a Path policy, and what knobs does it give you?L2

A Path policy decides which transport an application's traffic should ride and how it should behave when quality degrades. For each app (or app group) you set the preferred path (e.g. send ERP over MPLS, send YouTube via DIA), allowed/forbidden circuits, and the action on SLA breach — move to the next-best path or even replicate. You also tie it to the app's network-context SLA so the ION knows what 'good' means. For example Karthik sends voice with a tight jitter SLA preferring broadband-1, falling back to broadband-2, and never onto LTE unless both fail. The Path policy is the heart of app-defined steering — it is where 'business intent' becomes a forwarding rule.

Per-app preferred/forbidden paths + SLA-breach action; encodes business intent as steering.

Q13 What are circuit labels (categories) and why must they be spelt consistently across sites?L2

Circuit labels (categories) are the names you give each WAN transport — for example INET-Primary, MPLS-Core, LTE-Backup — and they come in public and private categories (you get up to 32 of each). They are not cosmetic: the fabric uses labels to decide which circuits build tunnels to which, and Path policies reference traffic by label rather than by a specific site's interface, so one global policy applies fleet-wide. That global reach is also the trap. If Suresh tags the Wipro Pune branch's broadband as INET-Primary but a colleague tags the Chennai branch's as INET-Prim, the policy that says 'prefer INET-Primary' simply does not match the misspelt site — and it fails silently, no error. So a disciplined, documented label taxonomy is a real operational requirement, not a nicety.

Labels drive tunnel-building + global policy matching by name; a misspelt label silently breaks the policy at that site.

Q14 A new Path policy was created but traffic still isn't being steered. Where do most people forget to look?L2

The classic miss is binding. In Prisma SD-WAN you build a policy stack (Path or QoS), but a stack does nothing until it is bound to the site(s) that should use it. Engineers build a beautiful Path policy, save it, and assume it is live — but if it was never bound to the Infosys Bangalore branch, that branch keeps running its old/default policy and the new rules never apply, with no error shown. So my first check when 'the policy isn't working' is: is the right stack bound to this site, and is the rule I expect actually the one being hit? Other near-misses are rule order (a broader earlier rule shadows mine) and the app not being classified the way I assumed. But unbound stack is the number-one silent failure, and it is the first thing I verify in the site's configuration before touching anything else.

A policy stack must be BOUND to the site to take effect; unbound = silent no-op; also check rule order + classification.

Q15 After tightening an app's SLA, the link keeps flip-flopping between two circuits every few seconds. What's happening and how do you fix it?L3

That's path flapping, and the usual cause is an SLA threshold set too tight relative to the circuits' normal variance. If you tell Prisma SD-WAN that an app must never exceed, say, 20 ms jitter, but both broadband circuits routinely brush 18–22 ms, the ION keeps deciding the active path 'breached', steers to the alternate, that one then breaches too, and it oscillates — hurting the very app you were protecting and churning sessions. The fix is to set the SLA against measured reality: look at each path's baseline loss/latency/jitter in the metrics or AIOps view, then set the threshold with sensible headroom above normal variance so only a genuine brownout trips it. You can also dampen the reaction so a single brief spike doesn't cause an immediate move. The senior lesson for the panel: an SLA that is too aggressive is as harmful as one that is too loose — tune it to the link's real behaviour, not to an aspirational number.

Too-tight SLA vs normal variance = oscillation/flapping; fix = baseline the paths, set headroom + dampening, not aspirational numbers.

Q16 Compare Direct Internet Access (DIA) with backhauling, and when you choose each.L2

DIA means the branch breaks out to the internet locally through its own ISP — best for trusted SaaS like Office 365 or Zoom, because it cuts the hairpin and gives the lowest latency. Backhaul sends internet traffic across the fabric to a DC or regional hub, then out through centralized security. You backhaul when you need centralized inspection, a fixed egress IP for an allow-listed partner, or when local breakout is not trusted. The modern answer combines both: DIA for sanctioned SaaS, and for the rest you either backhaul to the DC or — better — service-chain the branch to Prisma Access so you get cloud-delivered security without the DC hairpin. So the trade-off is latency/cost (DIA wins) versus centralized control (backhaul wins), and SASE largely resolves it.

DIA = local breakout, low latency for SaaS; backhaul = centralized inspection/fixed egress; SASE bridges them.

Q17 What is a Performance policy (and compliance/transit policies), and when would you use one?L3

Beyond basic path steering, Prisma SD-WAN lets you express performance/compliance intent. A Performance policy ties an application to its required loss/latency/jitter SLA and the remediation when the active path violates it — so you are not just picking a path once, you are continuously holding it to a number. Compliance-style rules let you mandate, for example, that PCI traffic at a Mumbai bank must only traverse MPLS and never break out locally, regardless of which path is 'best'. Transit considerations cover how branch-to-branch traffic flows through a hub when there is no direct path. The senior point: business rules ('regulated traffic stays on private transport') override pure performance optimisation, and the policy stacks let you encode that hierarchy explicitly rather than hoping the algorithm agrees.

Performance = continuous SLA enforcement; compliance overrides 'best path' for regulated traffic.

Q18 Microsoft Teams at a Bangalore ITES is fine on broadband-1 but choppy on broadband-2; both links are up. Design the policy so users never feel it.L3

First, ensure Teams is correctly classified as its own application (Microsoft's published endpoints, real-time-media sub-app). Then in the Path policy set Teams' preferred path to whichever circuit currently meets its SLA, with the action-on-breach set to immediately move to the next-best path — so a jitter spike on broadband-2 evicts Teams in sub-second, not after a tunnel goes down. Attach a Performance/SLA profile with a tight jitter and loss threshold (voice/video is jitter-sensitive). In QoS, mark Teams real-time traffic to the highest priority queue so it is protected during congestion. Finally I'd verify in the flow browser that Teams flows are actually sitting on the good path and watch the path's measured jitter. The combination — accurate classification, tight SLA, fast breach action, and QoS priority — means the ION steers around the choppy link before the user notices.

Classify Teams → tight SLA + fast breach action in Path policy → QoS priority → verify in flow browser.

Thesis: the app's health picks the path, not a static route. Watch the MPLS jitter cross the Teams SLA — the ION steers the live call to broadband mid-stream, no flow drop.

▶ Watch a Teams call survive a brownout — Karthik at a Wipro branch

You will watch the ION measure both links, catch MPLS jitter crossing the Teams SLA, and steer the live call to broadband without dropping it.

① CALL UP Karthik joins a Teams call. The ION pins the flow to MPLS — jitter 9ms, well inside SLA.

▼

② MEASURE The ION continuously probes both paths and tracks latency / jitter / loss per app, live.

▼

③ JITTER SPIKE MPLS jitter climbs 9ms → 55ms. The Teams app SLA caps jitter at 30ms.

▼

④ BROWNOUT The ION marks MPLS = brownout for Teams — not a full outage, but unfit for voice.

▼

⑤ STEER Path policy re-pins the live Teams flow to broadband mid-call. No reconnect, no drop.

▼

⑥ LOGGED The Controller records the steer event with timestamps — Karthik never noticed a thing.

Press Play to start. Each Next advances one stage.

Pause & Predict #1

Aditya rolls out a new Custom-ERP app at a Chennai ITES site. The app uses TCP 9443 to a DC server, but in the controller it keeps getting classified as unknown-tcp and rides the default path, ignoring his ERP Path Policy. Predict the cause and the fix.

The App Definition is incomplete, so the flow never matches. The ION classifies first by deep inspection, then by the IP/port hints in the custom App Definition. If the definition is missing the server prefix and the TCP 9443 port (or the domain), the flow falls back to unknown-tcp and the Path Policy rule keyed on Custom-ERP never fires. Fix: edit the custom App Definition to add the DC server network and TCP 9443, bind it to the right App Category, then confirm with dump app-flows on the ION that new sessions now show Custom-ERP and hit the intended transport.

3. Routing, Topology & Connectivity

Steering is the glamour; routing is the plumbing. Panels want to see you understand underlay vs overlay, how BGP fits at the DC, and how branches reach data-center groups across the secure fabric.

Q19 Distinguish the underlay from the overlay in Prisma SD-WAN.L1

The underlay is the raw transport — your MPLS, broadband and LTE circuits and whatever routing gets the ION's WAN interface reachable (often a default route or BGP/static to the ISP). The overlay is the secure fabric of IPSec tunnels Prisma SD-WAN builds on top of those circuits, plus the app-aware steering that rides it. User and site prefixes are carried inside the overlay; the underlay just has to deliver tunnel packets between ION WAN IPs. Keeping the two separate is the whole point: the underlay can be any messy mix of ISPs, while the overlay presents one consistent, encrypted, policy-driven fabric to applications.

Underlay = physical circuits + ISP reachability; overlay = IPSec fabric + app steering on top.

Q20 How does BGP fit into a Prisma SD-WAN deployment, and where is it typically used?L2

BGP is the main dynamic-routing tool, used most at the DC IONs and at branches with their own routed LAN. A DC ION typically runs eBGP/iBGP with the core or a firewall to learn DC service prefixes and to advertise the branch prefixes it learns over the fabric. At a branch, the ION can BGP-peer with an internal L3 switch to learn LAN networks dynamically instead of static routes. Prisma SD-WAN then redistributes between the fabric and BGP so that a route learned at one site is reachable from another. The senior nuance: you control which prefixes you advertise into the fabric and into BGP so you do not leak DC routes everywhere or create asymmetric paths.

BGP mainly at DC IONs (to core/firewall) + branch LANs; redistribution into fabric; control advertised prefixes.

Q21 What is a Data Center group, and why point branches at the group instead of a single ION?L2

A Data Center group bundles two or more DC IONs (often across two physical data centers) into one logical hub. Branches build fabric tunnels to the group, so traffic can land on whichever DC ION is healthy. If the primary DC ION or its circuit fails, branch traffic shifts to the secondary member without you re-pointing every site — that is your hub-level N+1/active-active redundancy. It also lets you do controlled DC failover and maintenance: drain one member, let branches ride the other. Pointing a branch at a single ION would make that one box a single point of failure for the whole site's DC-bound and backhauled traffic, which no panel wants to hear you design.

Group = redundant DC IONs as one hub; branches survive a DC-ION/circuit failure; enables drains.

Q22 Describe common branch topologies — full mesh, hub-and-spoke, and partial mesh — and the trade-offs.L2

Hub-and-spoke sends all branch traffic through a DC/regional hub — simplest to secure and route, but branch-to-branch traffic hairpins through the hub, adding latency. Full mesh builds direct fabric tunnels between every branch — best for branch-to-branch apps (VoIP between a Pune and a Chennai office) but tunnel and policy count explodes as sites grow. Partial mesh is the pragmatic middle: hub-and-spoke as the base, plus direct branch-to-branch tunnels only where the traffic justifies it. In Prisma SD-WAN you express this through how you bind branches to DC groups and whether you enable direct branch peering. For most enterprises I'd default to partial mesh — centralized control with on-demand direct paths for latency-sensitive site-to-site flows.

Hub-spoke (simple, hairpin), full mesh (low latency, scale cost), partial mesh as the practical default.

Q23 How does prefix advertisement work across the fabric — how does a branch learn DC and other-branch routes?L2

Each ION advertises the networks behind it (its LAN prefixes, statically or via BGP) to the Controller's view of the fabric, and those prefixes are distributed to the IONs that need them. So a DC ION advertises DC service subnets; a branch ION advertises its local 10.x LAN. Branches reach the DC over the fabric using the DC's advertised prefixes, and reach each other (in a mesh) the same way. You can summarise and filter what gets advertised so you do not flood every branch with every route — for instance only advertise the DC service block to branches, not inter-branch management subnets. The principle is the same as any routing domain: advertise reachability, control scope, avoid loops and asymmetry.

IONs advertise their prefixes into the fabric; distributed to who needs them; summarise/filter scope.

Q24 When would you keep a 'standard VPN' or IPSec to a non-ION peer alongside the secure fabric, and how do you route it?L3

The auto-built secure fabric only exists between IONs. When you must reach something that is not an ION — a partner extranet, a legacy firewall, a cloud security stack, or a third-party SD-WAN — you configure a Standard VPN: a manually defined IPSec tunnel from the ION to that peer, with its own crypto parameters and the remote networks you route over it. You then steer the relevant application traffic onto that standard VPN via the Path policy, while normal site-to-site traffic stays on the fabric. The design care points are: keep the standard VPN's reachable prefixes scoped so you do not blackhole fabric routes, and treat it as a path the policy can prefer or avoid. So fabric = ION-to-ION automation; standard VPN = the escape hatch to the non-ION world.

Standard VPN = manual IPSec to a non-ION peer; steer via Path policy; scope its prefixes carefully.

Thesis: pick the egress by app trust + security need. Use this when the interviewer asks DIA vs backhaul vs Prisma Access — match the traffic class to the right exit.

🖥️ This is the screen you'll use — Prisma SD-WAN → Policies → Path → Add Rule. (Recreated for clarity — your console matches this.)

sdwan.paloaltonetworks.com

Prisma SD-WAN → Policies → Path → Add Rule

1Rule NameSteer-Teams-Voice-Realtime

2ApplicationMicrosoft Teams (Realtime / Voice)

·Network ContextCorporate-Users

·Service & DC GroupsInternet-DIA-Group

·PathPrimary: Broadband · Secondary: MPLS

·ActionSteer on SLA breach (jitter > 30ms)

Save Rule

Quick check · inline mini-quiz #2

At a Bangalore ITES hub, Rahul advertises the same 10.50.0.0/16 data-center summary into the Secure Fabric from two hub IONs for redundancy. Branches suddenly prefer the slower hub. What knob fixes the path preference cleanly?

a) Shut one hub ION down so only the fast hub advertises the route b) Raise the MTU on the slow hub's WAN interface c) Tune the prefix cost / metric on the data-center routes so the faster hub is preferred, keeping both as backup d) Disable Application Reachability Probes on all branches

Correct: c. With dual hubs both advertising the same summary, branches pick by route metric/cost, so you adjust the prefix cost so the faster hub wins while the second hub stays as failover. Shutting a hub (a) kills the redundancy you built. MTU (b) does not change route preference. Disabling probes (d) blinds path quality and breaks app-aware steering.

Pause & Predict #2

Neha brings up a second WAN at a Hyderabad SOC branch. The ION sees both circuits up, but the Secure Fabric VPN over the new broadband stays down while the MPLS tunnel is fine. Predict the cause and the fix.

The new circuit's WAN interface is missing its WAN Network / public-IP binding, so VPN endpoints never form. Prisma SD-WAN builds auto-VPN between sites only when each WAN interface is mapped to a defined WAN Network (Internet or Private) with the correct NAT/public-IP discovery. An unmapped or wrongly typed broadband interface has no usable tunnel endpoint, so the overlay stays down even though the underlay link is up. Fix: bind the broadband interface to the correct Internet WAN Network, confirm the discovered public IP, and verify the tunnel turns green and dump vpn on the ION lists the new peer with rotating session keys.

4. Integration & Security (Prisma Access / CloudBlades)

SD-WAN alone is half a SASE story. Panels for senior roles want to see you connect Prisma SD-WAN to cloud-delivered security — Prisma Access and CloudBlades — and reason about branch security and onboarding at scale.

Q25 How does Prisma SD-WAN integrate with Prisma Access, and what does that give you?L2

Prisma SD-WAN provides the WAN edge and app-aware steering; Prisma Access provides cloud-delivered security (firewall, threat prevention, URL filtering, CASB, ZTNA). Integrated, the branch ION service-chains internet- and app-bound traffic to the nearest Prisma Access location instead of backhauling to a DC for inspection. Together they form SASE: networking from SD-WAN, security from Prisma Access, both cloud-managed. So when Neha's Hyderabad branch user opens a risky URL, the ION steers that flow to Prisma Access where full inspection happens, while sanctioned SaaS can still take DIA. The win is consistent security policy everywhere without putting a next-gen firewall in every closet, and without the latency of hairpinning all internet traffic through HQ.

ION steers to Prisma Access for cloud security = SASE; no NGFW per branch, no DC hairpin.

Q26 Design 'Unified SASE' for a 200-branch retailer: how do Prisma SD-WAN and Prisma Access actually combine, including service chaining?L3

Unified SASE is its own design domain, not just 'turn on a feature' — networking and security become one managed fabric. The split: Prisma SD-WAN (the IONs) owns transport, app-aware steering and on-box segmentation; Prisma Access owns cloud-delivered security (FW, IPS, URL filtering, CASB, ZTNA). I onboard each branch ION to its nearest Prisma Access location — typically Mumbai or Bengaluru for India — over redundant IPSec tunnels, advertising branch prefixes with private BGP and ECMP across two tunnels for resilience. Then I service-chain by intent in the Path/security policy: sanctioned SaaS (Office 365, Salesforce) takes DIA locally; all general web and risky traffic is steered through Prisma Access for inspection before egress; internal app traffic rides the AppFabric to the DC. The two planes share one management surface (Strata Cloud Manager), so security policy is consistent at all 200 sites without a next-gen firewall in any closet. The headline answer: SD-WAN decides which path, Prisma Access decides what's safe, and service chaining is how a flow is handed from one to the other per business intent.

Unified SASE = SD-WAN (path) + Prisma Access (security), one mgmt plane; onboard via redundant IPSec+ECMP+private BGP; chain per-intent (DIA for SaaS, inspect general web).

Q27 What are CloudBlades, and give a concrete example.L2

CloudBlades are Prisma SD-WAN's framework for integrating third-party (and Palo Alto) cloud services into the fabric without local hardware or manual tunnel config. You enable a CloudBlade in the Controller and it programmatically wires the branches to that service. The classic example is the Zscaler CloudBlade: enable it, and Prisma SD-WAN automatically builds the IPSec/GRE tunnels and steering from your branches to the nearest Zscaler ZIA node, so internet traffic gets Zscaler inspection with no per-site tunnel building. Other CloudBlades cover services like cloud security and monitoring integrations. The point for a panel: CloudBlades turn 'integrate 200 branches with a cloud security vendor' from a manual tunnel project into a toggle plus policy.

CloudBlade = API-driven integration of a cloud service (e.g. Zscaler) with auto tunnels/steering; no per-site manual config.

Q28 Explain Zero-Touch Provisioning (ZTP) and why it matters at scale.L2

ZTP lets you deploy an ION without sending an engineer or touching a CLI. The box is pre-claimed to the tenant in the Controller; when local staff rack it and connect a circuit with internet, it calls home, authenticates, and pulls its full configuration and policy. At scale this is the difference between rolling out 300 branches in weeks versus months — Aman in the NOC stages every device's config centrally, and the branch electrician just plugs in cables. It also reduces human error: every site gets the same vetted profile. The caveats a senior should mention: the first hop must have internet/DHCP to reach the cloud, and you need a process for the rare device that fails to claim (serial mismatch, already-claimed, or no cloud reachability).

ZTP = pre-claim + call-home pull of config; mass rollout, fewer errors; needs first-hop internet.

Q29 What is AIOps / autonomous operation in Prisma SD-WAN, and how would you use it?L3

AIOps for Prisma SD-WAN applies analytics and ML across your fabric's telemetry to surface problems and capacity issues before they page you. It does things like baseline each path's normal loss/latency/jitter and flag anomalies, predict circuit capacity exhaustion, correlate an app-degradation event to a specific path or site, and recommend remediation. In practice I'd use it for proactive ops: instead of waiting for a Chennai ITES user to complain about Teams, AIOps shows the path that is trending toward its jitter SLA and which branches are affected, so the team acts on a trend rather than an outage. The honest framing for a panel: it is decision-support and early warning built on the per-app telemetry the IONs already collect — it shortens mean-time-to-detect, it does not replace an engineer's judgement on the fix.

ML over fabric telemetry: anomaly/capacity prediction + correlation; proactive ops, lowers MTTD.

Q30 What security do you actually get on the ION at the branch, versus what you push to the cloud?L3

The ION is an SD-WAN edge, not a full next-gen firewall, so I split the job. On the ION you get the fabric's IPSec encryption, zone-based security policy (allow/deny between LAN/WAN/fabric zones), and segmentation so guest, IoT and corporate stay separated at the branch. To the cloud you push the heavy security — threat prevention, IPS, URL filtering, DLP, CASB, ZTNA — by service-chaining to Prisma Access or a CloudBlade like Zscaler. So the design is: enforce basic segmentation and encryption locally for low latency, and steer anything needing deep inspection to cloud security. For a regulated Mumbai bank I'd be explicit that PCI segments are isolated on the ION and that all internet egress is inspected in the cloud, never broken out raw.

ION = IPSec + zone policy + segmentation; deep inspection (IPS/URL/DLP) chained to Prisma Access/CloudBlade.

Q31 Explain service chaining in this context and a failure mode to watch for.L3

Service chaining means the ION forwards selected traffic through an external security service before it reaches the internet or the destination — typically Prisma Access or a Zscaler CloudBlade. You define in the Path/security policy which apps must traverse the chain (e.g. all general web), and the ION tunnels those flows to the service node, which inspects and forwards. The failure mode to watch: if the chained service node or its tunnel goes down, you must decide fail-open vs fail-closed. Fail-closed (drop traffic if security is unreachable) is safest for a regulated branch but risks an outage; fail-open keeps users working but sends traffic uninspected. A senior answer says: pick per traffic class — fail-closed for sensitive/regulated flows, and provide a redundant security node or second nearest Prisma Access location so the question rarely arises. Always test the failover, do not assume it.

Chain = steer traffic through cloud security first; design fail-open/closed per class + redundant nodes; test it.

Q32 A NetDevOps-leaning JD asks for 'scripting and APIs for network automation' on Prisma SD-WAN. What automation surface exists and how would you use it?L3

Prisma SD-WAN is fully API-driven — the Controller / Strata Cloud Manager exposes a REST API, and Palo Alto ships a Python SDK (commonly the prisma-sase / CloudGenix SDK) plus reference scripts so the GUI is just one client over that same API. For a fleet role I'd use it to script repetitive config — bulk-claim and bind IONs to sites, push standard Path/QoS policy stacks across hundreds of branches, and template circuit labels so a new site is provisioned by code, not clicks. I'd also pull telemetry programmatically — path metrics, flow data, site health — into a dashboard or alerting pipeline, and wire it into CI/CD so policy changes are reviewed and version-controlled like code (NetDevOps). A concrete example: a Python job that reads a CSV of new Wipro branches and creates each site, assigns its ION, applies the regional policy stack, and verifies the device came online — turning a day of manual work into a reviewed, repeatable run. The panel point: know that everything in the UI is reachable by API, and that scripted, idempotent config is how you operate at scale without drift.

REST API + Python SDK (prisma-sase/CloudGenix) behind the GUI; script bulk claim/site/policy config + pull telemetry; version-control policy (NetDevOps), idempotent at scale.

Quick check · inline mini-quiz #3

Priya must let only the HR-Payroll app reach the DC and block lateral RDP between two branch LAN zones on a Mumbai bank's IONs. Which Prisma SD-WAN construct does she use?

a) A Path Policy with a performance SLA profile b) A Zone-Based Firewall (Security) policy using Security Zones and App Definitions c) A QoS / bandwidth allocation rule d) A NAT policy on the internet circuit

Correct: b. Lateral and app-to-DC permit/deny is the job of the ION's stateful Zone-Based Firewall, written against Security Zones plus App Definitions. Path Policy (a) only steers a permitted flow across transports by SLA. QoS (c) shapes bandwidth, it does not block. NAT (d) rewrites addresses and never enforces a zone-to-zone deny.

5. Troubleshooting & Ops

This is where the role is really decided. Anyone can say 'check the link'. The panel wants the person who reaches for the flow browser, reads loss/latency/jitter, and distinguishes a brownout from a blackout under pressure.

Q33 Define a path brownout versus a path blackout, and how Prisma SD-WAN reacts to each.L2

A blackout is total path failure — the tunnel is down, packets are dropped completely. A link-state SD-WAN catches this fine, and so does Prisma SD-WAN: it stops using that path immediately. A brownout is the dangerous one — the path is still up but quality has degraded: loss climbs, latency or jitter spikes, so the app suffers while the LED stays green. Legacy SD-WAN is blind to brownouts; Prisma SD-WAN's continuous per-app measurement detects when a path crosses the application's SLA and steers the affected flows off the browned-out path to a better one, even though no link went down. So in interview language: blackout = link down, easy; brownout = link up but bad, and app-defined steering is precisely what handles it.

Blackout = link down (everyone catches it); brownout = up-but-degraded (only app-defined steering catches it).

Q34 Users report an app is slow but both ISPs show 'up'. Walk me through your troubleshooting.L2

I stop looking at the LED and look at quality and steering. First, open the flow browser and find the affected app's flows — see which path they are actually on and what loss/latency/jitter that path is currently delivering. If the path is browning out, I check why steering did not move them: is the app classified correctly, is there a Path policy for it, is its SLA threshold set, and are other paths actually healthy to move to? Then I check whether the issue is on the WAN path or beyond — is it every site hitting that SaaS (provider/Prisma Access side) or just this branch (local circuit)? I confirm against the per-path metrics and AIOps anomaly view. The structured answer is: classify the flow, read its path metrics, verify the policy and SLA, and localise branch-vs-DC-vs-provider — not 'reboot the ION'.

Flow browser → which path + its metrics → check classification/policy/SLA → localise branch vs provider.

Q35 What is the flow browser and how do you use it day to day?L2

The flow browser is the Controller's live/near-live view of individual application flows traversing the fabric. For each flow it shows the source/destination, the classified application, which path/circuit it is using, and the path's measured performance. Day to day I use it to answer 'where is this traffic actually going and is it healthy?' — confirming that Teams is on the good circuit, that a backhauled app is taking the DC path I expect, or that a flow I thought should be DIA is wrongly riding the fabric. When a user complains, the flow browser turns a vague 'it's slow' into a concrete 'this flow is on broadband-2 which has 4% loss'. It is the single most useful screen for app-level troubleshooting, which is exactly why panels ask if you know it exists.

Live per-flow view: app + path + metrics; turns 'it's slow' into a concrete path/loss number.

Q36 An ION won't claim to the Controller. What do you check, in order?L2

I work outward from the box. 1) Internet reachability: the ION's WAN interface needs a working uplink — DHCP/IP, gateway, and DNS to resolve the cloud, plus the relevant outbound ports allowed by any upstream firewall. 2) Serial / claim state: confirm the serial number I'm claiming matches the device and that it is not already claimed in another tenant — a common cause for refurbished or re-deployed units. 3) Time / NTP: a badly wrong clock breaks the TLS handshake to the cloud. 4) Firmware: very old firmware may not know the current cloud endpoints. 5) Physical/interface role: make sure the cable is in a WAN-role port that actually has the path out. I verify each from the local console/status before escalating — most 'won't claim' cases are upstream internet, DNS, time, or a duplicate/already-claimed serial.

Internet+DNS+ports → serial/already-claimed → NTP/time → firmware → correct WAN port.

Q37 Failover is not happening even though one path looks degraded. How do you root-cause it?L3

I check the chain that enables steering, link by link. First, is the degraded path actually crossing the app's SLA threshold, or is it just 'a bit worse'? If the threshold is loose, Prisma SD-WAN correctly stays put — so I read the real loss/latency/jitter versus the configured SLA. Second, is there a Path policy matching this app at all, with an action-on-breach that says move? A flow with no specific policy may ride a default that does not steer aggressively. Third, are the alternative paths healthy — failover needs a better path to move to; if both circuits are degraded, there is nowhere good to go. Fourth, is the app classified as I assume (custom app misconfigured = wrong policy applied). Fifth, check whether a compliance/forbidden-path rule is pinning the traffic deliberately. So the root-cause is almost always: SLA not actually breached, no/loose policy, no healthy alternate, or misclassification — and the flow browser plus the policy config tell me which.

Verify SLA actually breached → policy + breach-action present → healthy alternate exists → classification → compliance pin.

Q38 How do you measure and interpret link quality — loss, latency, jitter — and which matters for which app?L3

Prisma SD-WAN continuously probes each fabric path and reports packet loss, latency (round-trip delay) and jitter (variation in delay). The interpretation is app-specific. Voice and video (Teams, Zoom, SIP) are most hurt by jitter and loss — even 1–2% loss or rising jitter makes calls choppy, while moderate latency is tolerable. Interactive/transactional apps (SSH, RDP, a trading front-end at a Pune BFSI) are latency-sensitive. Bulk transfer (backups, file sync) mostly cares about throughput and tolerates higher latency. So I set each app's SLA against the metric that actually hurts it — tight jitter/loss for real-time media, tight latency for interactive, looser for bulk — and let the path policy steer on the right number. Reading the metric without mapping it to the app's sensitivity is the rookie mistake.

Loss+latency+jitter per path; jitter/loss kills voice/video, latency kills interactive, bulk tolerant; SLA per app sensitivity.

Q39 Give your standard playbook for triaging 'the branch is having WAN problems' on Prisma SD-WAN.L3

I run a consistent loop. 1) Scope it: one app, one site, or many? Many sites to one SaaS points at the provider or the cloud security path, not the branch. 2) Read the paths: in the Controller check each circuit's state and its loss/latency/jitter — is this a blackout (down) or brownout (up-but-bad)? 3) Flow browser: find the affected flows, confirm which path they ride and whether steering moved them. 4) Policy check: is the right Path/QoS policy applied, SLA sane, alternate path healthy? 5) Underlay: if a circuit itself is bad, validate the ISP handoff — is it the ION or the carrier? 6) Onboarding/health: confirm the ION is claimed, online, on current firmware, and reaching the Controller. 7) AIOps: check for a flagged anomaly or capacity trend that explains it. I document the path metrics that prove the conclusion. This ordering — scope, paths, flows, policy, underlay, device, AIOps — keeps me from rebooting boxes randomly and gives the panel a repeatable method.

Repeatable loop: scope → path metrics (brown/black) → flow browser → policy → underlay/ISP → device health → AIOps; evidence-led.

Thesis: five things every Prisma SD-WAN engineer must recall on demand. Skim these tiles before the interview — defaults, ports, and the fabric facts that trip candidates up.

Pause & Predict #3

Karthik gets a ticket: voice quality at a Flipkart warehouse site drops every evening. The MPLS path shows rising jitter, yet the ION keeps voice on MPLS instead of moving it to the healthy broadband. Predict the cause and the fix.

The Path Policy for the voice App has no (or too-loose) performance SLA thresholds, so brownout steering never triggers. App-aware path selection only fails a transport away when the measured jitter/latency/loss crosses the SLA bound set in the Path Policy's network-context rule. If the voice rule lacks a strict SLA profile, the ION tolerates the degraded MPLS instead of switching. Fix: attach a tight SLA profile (for example jitter and loss limits sized for RTP) to the voice Path Policy rule with broadband as the secondary path, then watch the App link health flip to broadband when MPLS browns out, verified in the per-app path metrics.

⚡ Prisma SD-WAN last-minute cheat-sheet

Control vs DataController = cloud control/mgmt, never in data path. ION = forwards every packet. Cloud loss = branch frozen-but-forwarding, not down.

The big differentiatorApp-defined steering = per-app loss / latency / jitter vs SLA. Catches brownout (up-but-bad) that link-state SD-WAN misses.

Policy stacksPath (which transport) · QoS (priority) · Security (zones) · NAT (breakout). Classify app → evaluate stacks. Bind a stack to many sites.

DIA vs backhaulDIA = local breakout, low latency for trusted SaaS. Backhaul = centralized inspection / fixed egress. SASE (Prisma Access) bridges both.

RoutingUnderlay = ISP circuits. Overlay = IPSec fabric. BGP mainly at DC IONs to core/firewall. Point branches at a DC group, not one ION.

IntegrationPrisma Access = cloud security chain (SASE). CloudBlade = toggle-on integration (e.g. Zscaler ZIA tunnels auto-built). ZTP = claim by serial → call home.

Brownout vs blackoutBlackout = link down (everyone catches it). Brownout = link up + degraded → only app-defined steering moves the flow.

Troubleshoot orderScope → path loss/latency/jitter → flow browser → policy/SLA → alternate path healthy? → underlay/ISP → ION claimed+online → AIOps.

Glossary — terms an interviewer will probe

ION: Instant-On Network — the Prisma SD-WAN data-plane appliance (physical or vION) at a branch/DC that forwards traffic and builds the fabric.
vION: Virtual ION form factor for AWS, Azure, GCP and VMware deployments.
Controller: The cloud, out-of-band control and management plane that programs IONs but never forwards user traffic.
Secure fabric: The auto-built IPSec overlay of tunnels (VPNLinks) connecting IONs directly to each other.
VPNLink: A single measured IPSec tunnel/path between two IONs over one underlay circuit.
App-defined SD-WAN: Steering based on per-application loss/latency/jitter versus an SLA, not just interface up/down.
Path policy: Policy stack that picks the transport per app and defines the action when its SLA is breached.
DIA: Direct Internet Access — the branch breaks out to the internet locally instead of backhauling.
Data Center group: Two or more DC IONs bundled as one redundant logical hub that branches connect to.
Standard VPN: A manually configured IPSec tunnel from an ION to a non-ION peer (partner, firewall, third-party).
Brownout: A path that is still up but degraded (high loss/latency/jitter) so the app suffers.
Blackout: A path that is fully down — packets dropped completely.
Flow browser: Controller screen showing live per-flow detail: app, path/circuit and that path's measured performance.
CloudBlade: API-driven integration that wires branches to a cloud service (e.g. Zscaler) without per-site manual tunnels.
Prisma Access: Palo Alto's cloud-delivered security (FW, IPS, URL, CASB, ZTNA) that SD-WAN service-chains to for SASE.
ZTP: Zero-Touch Provisioning — an ION calls home and pulls its config after being claimed, no CLI needed.

Ask the AI Tutor — six interviewer follow-ups

🤖 Ask the AI Tutor

Tap any question — instant context-aware answer. The follow-ups your panel lobs after a textbook answer.

Pre-curated from Palo Alto docs + community threads. For deeper, live questions, ask at chat.techclick.in.

Lock it in — explain it in your own words

📝 Self-explain · 2 minutes

In two sentences, explain the difference between a Path Policy and a Zone-Based Firewall rule on a Prisma SD-WAN ION, and why a panel cares that you do not confuse them.

📩 Spaced recall · 7 days, 21 days

Forgetting curve says half of this leaves your head in 7 days. Opt in and we'll send 3 micro-Qs on day 7 and day 21.

Quiz me on this in 7 days & 21 days

Sources cited inline (re-checked 2026-06)

Palo Alto Networks — Prisma SD-WAN product & administration documentation: docs.paloaltonetworks.com/prisma/prisma-sd-wan
Palo Alto Networks — Prisma SD-WAN (CloudGenix) ION device datasheets and form factors: paloaltonetworks.com/sase/sd-wan
Palo Alto Networks — CloudBlades and Zscaler integration guide: docs.paloaltonetworks.com/prisma/prisma-sd-wan/prisma-sd-wan-admin/cloudblades
Palo Alto Networks — Prisma Access / SASE reference architecture: docs.paloaltonetworks.com/prisma/prisma-access
Palo Alto Networks LIVEcommunity — Prisma SD-WAN troubleshooting & flow browser threads: live.paloaltonetworks.com
Reddit r/paloaltonetworks — practitioner discussions on ION claiming, brownout steering and CloudBlades: reddit.com/r/paloaltonetworks
Palo Alto Networks Certified — PCNSE / SD-WAN learning blueprint topics: paloaltonetworks.com/services/education/certification
RFC 4301 (IPSec architecture) and RFC 4271 (BGP-4) — underlying protocols used by the fabric and DC routing: rfc-editor.org

Next lesson · Prisma SD-WAN — Hands-on Path & QoS policy lab

Now turn theory into config: build a Path policy that steers Teams off a browning-out circuit, attach an SLA profile, and verify the move in the flow browser. We will also wire a Zscaler CloudBlade and confirm branch breakout end to end.

📚 All lessons 🧪 Practice exam 💬 Ask deeper Qs

📩 Quiz me on this in 7 days. Opt in and we'll email you 3 micro-questions from this lesson at Day 1, Day 7 and Day 30 — spaced repetition is how it sticks. Un-tick any time.

Prisma SD-WAN Interview Q&A — ION, App-Fabric & Real Troubleshooting

🎯 By the end of this lesson you'll be able to

Pick your weak spot — jump straight to it

Architecture & ION

App-Defined Fabric

Routing & Topology

Integration + Troubleshooting

Why this matters — SD-WAN that watches the app, not the cable

1. Prisma SD-WAN Architecture & ION

Prisma SD-WAN terms an interviewer will probe

2. App-Defined Fabric & Policies

▶ Watch a Teams call survive a brownout — Karthik at a Wipro branch

3. Routing, Topology & Connectivity

4. Integration & Security (Prisma Access / CloudBlades)

5. Troubleshooting & Ops

⚡ Prisma SD-WAN last-minute cheat-sheet

Glossary — terms an interviewer will probe

Ask the AI Tutor — six interviewer follow-ups

🤖 Ask the AI Tutor

Lock it in — explain it in your own words

📝 Self-explain · 2 minutes

📩 Spaced recall · 7 days, 21 days

📋 Final assessment — 10 questions, 70% to pass

Sources cited inline (re-checked 2026-06)

Next lesson · Prisma SD-WAN — Hands-on Path & QoS policy lab