Most engineers think…
Most people assume DLP only watches data leaving — email, web uploads, USB. That misses the biggest pile of risk: the sensitive files already sitting on old shares, in SharePoint, in databases and on laptops, often with wide-open permissions.
Forcepoint DLP Discovery is how you find and clean that up. It scans data at rest, classifies it with the same policies as inline DLP, and optionally acts on it. The on-prem crawler handles network targets and fingerprinting, the DLP Endpoint agent scans laptops even offline, and cloud discovery covers sanctioned SaaS. File filters keep scans incremental, and an action plan — audit, label, encrypt, quarantine, unshare or a remediation script — turns a finding into a fix.
① What 'data at rest' discovery is — and why it matters
Inline DLP watches data that is moving or being used right now. Discovery is the other half: it goes looking for sensitive content that is just sitting there — old project exports on a file share, customer records in a database, a PST full of contracts on a laptop. The job is three steps: scan the data at rest, classify it against your policies, and optionally enforce an action on what breaches.
Why it matters: stale, forgotten sensitive files are where breaches come from. A spreadsheet of PAN and bank details on an openly-shared folder is a quiet liability until Discovery flags it. Two components do the work: the on-prem crawler performs network discovery, and Forcepoint DLP Endpoint performs endpoint discovery — and crucially, Discovery reuses the same classifiers and fingerprints as inline DLP, so you are not re-authoring policy.
Forcepoint DLP Discovery is best described as…
② Choosing targets — network, endpoint and cloud
There are three flavours of discovery, each with its own task wizard. Network discovery is run by the crawler and reaches file shares, SharePoint (on-prem and online), Exchange (on-prem and online), IBM Domino servers, databases (SQL Server, Oracle, etc.) and Outlook PST files. Endpoint discovery uses the DLP Endpoint agent to scan local drives on managed Windows and macOS devices — even offline, with no server connection needed. Cloud discovery scans sanctioned SaaS such as Office 365 / SharePoint Online, OneDrive and Box.
What each target needs
Each network task takes targets, credentials, file filters, classifiers and a schedule. Cloud discovery has prerequisites you must name: the Cloud Applications license, the DLP Cloud Applications service connected, and the apps defined in the Forcepoint CASB portal. You build all of these under Policy Management ▸ Discovery Policies.
The on-prem agent that performs network discovery and fingerprinting scans. Add standalone crawler instances to parallelise large scans.
The endpoint agent scans local drives on managed Windows/macOS devices — even offline — and syncs incidents back when reconnected.
Fingerprinting for discovery: Files covers files, directories and SharePoint; Database covers tables, views and CSVs. Only partial hashes are stored.
What fires on a match — audit, label, encrypt, copy, move/quarantine, unshare or run a Python remediation script fed the incident as XML.
Network targets (shares, SharePoint, Exchange, Domino, databases, PST) go to the crawler; laptops and desktops — including off-network ones — go to endpoint discovery; sanctioned SaaS goes to cloud discovery, which needs the Cloud Applications license and a connected CASB service.
Which component performs network discovery of file shares, SharePoint and databases?
③ Scheduling, file filters and incremental scans
A naive scan of a multi-terabyte share every night is a non-starter. File Filtering is how you scope a task so it only touches relevant, changed data. Filter by type (e.g. *.doc;*.xls;*.pdf, with an Except list), by size (larger or smaller than X), and by age — Within a period, More than N months old, or a From…To date range — so re-scans only re-read recently modified files.
Full-scan triggers and timestamps
On the endpoint Advanced page you also choose when a full scan fires: Only on policy update, On policy update or fingerprinting classifier update, or Always. Enable Preserve original access time so the scan does not change a file's last-accessed timestamp and mislead backup or archiving tools. For precision, Discovery can use fingerprinting: PreciseID Files fingerprints files, directories and SharePoint, while PreciseID Database fingerprints DB tables, views and CSVs — and only partial hashes are stored, never the original data.
A nightly full scan of a multi-terabyte share will never finish and will hammer the server. Use File Filtering by type, size and age, set full scans to run only on policy/fingerprint update, and enable Preserve original access time so backups aren't misled.
▶ Watch a discovery scan find and quarantine a PAN spreadsheet
How one File System discovery task scans a share end-to-end. Press Play for the healthy path, then Break it to see the classic failure.
You must avoid re-scanning millions of unchanged files every night on a huge share. What do you use?
④ Remediation — from audit to action, done safely
Finding a breach is half the job; the action plan decides what happens next. The safe default is audit only — log the incident and review it in Reporting ▸ Discovery. From there actions escalate: apply a file label (Microsoft Information Protection or Boldon James), encrypt or apply rights/DRM, copy, move/quarantine (often leaving a tombstone note), or delete.
Cloud actions and scripts
Cloud remediation offers Permit, Safe copy, Quarantine, Quarantine with note, Unshare external and Unshare all. For network and endpoint, you Run a remediation script — a Python script that receives each incident as an XML file and performs the copy/move/encrypt/delete/notify logic. Endpoints need the Python language interpreter installed to run scripts. The failure mode everyone hits is jumping straight to delete or quarantine before reviewing in audit — you destroy or hide files that turn out to be false positives.
Arjun Mehta, infosec lead at Northstar Analytics, Pune, faces this
An internal audit finds spreadsheets full of customer PAN and bank details sitting on an old, openly-shared \\fileserv01\projects file share.
Years of project hand-offs dumped sensitive exports onto a legacy file share with broad permissions and no scanning.
In Security Manager he creates a File System Discovery task pointed at the share, attaches his PII/financial classifiers, and on File Filtering scopes it to *.xls;*.csv;*.pdf — then reviews the hits.
Main ▸ Policy Management ▸ Discovery Policies ▸ Network + Main ▸ Reporting ▸ DiscoveryHe sets the action plan to Run remediation script, moving breaching files to a locked \\quarantine folder with a tombstone note, and applies a rights/encryption action to live working files; he also tightens the share permissions.
A re-run with an age = Within last 7 days filter shows zero new breaches, and a planted test file with a fake PAN is correctly quarantined.
Never set a brand-new discovery task straight to delete or quarantine. Run it in audit, review the real hits in Reporting ▸ Discovery, then escalate to label/encrypt/quarantine for genuine breaches — and re-scan to confirm zero new hits.
Which cloud remediation action removes only external sharing while keeping internal access?
🤖 Ask the AI Tutor
Tap any question — instant, scoped to this lesson. No login, no waiting.
Pre-curated from vendor docs + community Q&A, scoped to this lesson. For a live prod issue, paste your export into chat.techclick.in.
📝 Wrap-up assessment — six more
You've answered 4 inline. Six left. 70% (7 of 10) marks the lesson complete on your profile. Tap Submit all answers at the end.
🧠 In your own words
Type one line: why is Discovery needed when you already run inline DLP on email, web and the endpoint? Then compare with the expert version.
🗣 Teach a friend
Best way to lock it in — explain it in one line to a teammate. Tap to generate a paste-ready summary.
📖 Glossary
- Discovery
- Scanning data at rest, classifying it against policy, and optionally enforcing an action on what breaches.
- Network discovery
- Crawler-driven scan of file shares, SharePoint, Exchange, Domino, databases and PST files.
- Endpoint discovery
- DLP Endpoint agent scan of local drives on managed devices — works even when the device is offline.
- Cloud discovery
- Scan of sanctioned cloud apps via the DLP Cloud Applications / CASB service; needs the Cloud Applications license.
- Crawler
- The on-prem agent that performs network discovery and fingerprinting; add standalone instances to scale large scans.
- PreciseID Files / Database
- Fingerprinting for discovery — Files covers files, directories and SharePoint; Database covers tables, views and CSVs. Only partial hashes are stored.
- File Filtering
- Task settings that scope a scan by file type, size and modification age so re-scans stay incremental.
- Action plan
- The set of actions run on a match — audit, label, encrypt, copy, move/quarantine, unshare or a remediation script.
- Remediation script
- A Python script handed each incident as an XML file; performs move, encrypt, delete, rights or notify logic.
- Tombstone / replacement note
- A placeholder message left where a quarantined file used to be, telling users where it went and why.
📚 Sources
- Forcepoint Help — What is discovery? (data at rest scanning overview). help.forcepoint.com/dlp
- Forcepoint Help — Forcepoint data discovery options: network, endpoint and cloud. help.forcepoint.com
- Forcepoint Help — Scheduling discovery tasks and discovery policies. help.forcepoint.com
- Forcepoint Help — Endpoint discovery task wizard: File Filtering & Advanced (full-scan triggers, preserve access time). help.forcepoint.com
- Forcepoint Help — Configuring cloud discovery scans (DLP Cloud Applications / CASB). help.forcepoint.com
- Forcepoint — Data Loss Prevention datasheet / brochure. forcepoint.com/product/data-loss-prevention-dlp
What's next?
Found and cleaned up your data at rest? Next, go deep on the classifiers that decide what actually matches — regex, dictionaries, EDM, IDM, PreciseID fingerprinting and OCR — and why fingerprinting crushes false positives.