Extract text from MIME message
The “Extract text from a MIME message” page extracts the text from an uploaded email and returns it as a downloadable text file. This helps when creating DLP rules because it shows exactly which text the DLP scanner sees when it scans for patterns.
The uploaded file should be a “raw” email in MIME format, i.e., the headers and the body. The maximum allowed size of the uploaded email is 10 MB.
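The upload requirements can be checked locally before using the page. The following is a minimal sketch using only the Python standard library; it is not part of the product. The function name `check_raw_email` and the constant `MAX_UPLOAD_BYTES` are hypothetical, and the sketch assumes that the 10 MB limit means 10 × 1024 × 1024 bytes, which the page does not specify.

```python
from email import message_from_bytes
from email.policy import default

# Assumption: the documented 10 MB limit is interpreted as 10 MiB.
MAX_UPLOAD_BYTES = 10 * 1024 * 1024


def check_raw_email(raw: bytes) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []

    # Size check against the documented upload limit.
    if len(raw) > MAX_UPLOAD_BYTES:
        problems.append("file is larger than 10 MB")

    # A raw MIME message starts with a header block followed by the body.
    # If the parser finds no headers at all, the file is probably not raw MIME.
    msg = message_from_bytes(raw, policy=default)
    if len(msg.keys()) == 0:
        problems.append("no headers found; is this a raw MIME message?")

    return problems
```

For example, `check_raw_email(open("message.eml", "rb").read())` returns an empty list for a file that contains headers and stays under the size limit (`message.eml` is a placeholder file name).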
All words are extracted from the uploaded MIME file and the extracted text is normalized: Unicode normalized, converted to lower case, tabs replaced by spaces, excessive spaces removed, words from the “skip list” removed, etc. (see Text normalization for more information).
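The normalization steps listed above can be illustrated with a short sketch. This is not the scanner's actual implementation: the function name `normalize`, the contents of `SKIP_LIST`, the choice of the NFKC normalization form, and the order of the steps are all assumptions made for illustration. The Text normalization section remains the authoritative description.

```python
import re
import unicodedata

# Hypothetical skip list; the real list is defined by the product configuration.
SKIP_LIST = {"the", "a", "an", "and"}


def normalize(text: str, skip_list: set[str] = SKIP_LIST) -> str:
    """Apply the listed normalization steps to already extracted text."""
    # Unicode normalization (assumption: NFKC; the page does not name the form).
    text = unicodedata.normalize("NFKC", text)
    # Convert to lower case.
    text = text.lower()
    # Replace tabs with spaces.
    text = text.replace("\t", " ")
    # Collapse runs of spaces into a single space.
    text = re.sub(r" {2,}", " ", text).strip()
    # Remove words that appear on the skip list.
    words = [w for w in text.split(" ") if w and w not in skip_list]
    return " ".join(words)
```

With this sketch, `normalize("The\tQuick   BROWN  fox")` returns `"quick brown fox"`: the tab and extra spaces are collapsed, the text is lowercased, and the skip-list word "the" is dropped. A DLP pattern written against the extracted text should therefore match the normalized form rather than the original casing and spacing.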