Supported File Types
This technical note describes which file types the Lexbe eDiscovery Platform can process and host, and what processing it supports for each.
Upon upload, the Lexbe eDiscovery Platform (LEP) creates a document record and hosts all native files in the ORIGINAL tab of the Doc Viewer. The native file can be viewed directly within the Doc Viewer where applicable, or downloaded and opened in its native application on a local computer.
Through automatic processing, LEP converts all supported files to PDF. If a file cannot be converted to PDF (for example, a media file or an incompatible format), it receives a placeholder instead.
LEP can both extract text from a native file and OCR the converted PDF and other image files (e.g., JPG, PNG, TIFF, BMP, image-based PDFs). Combining the two methods yields a more complete search index and acts as a safeguard to capture all available text. For native files that cannot be converted to PDF, the available text is extracted and added to the search index. For image-based files, text is recognized through OCR and added to the search index. For some files, text is both extracted and OCR'd. For more information, see Uber Index.
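The routing described above can be sketched as a small decision function. This is only an illustration, not LEP's actual implementation: the extension groups, the Plan type, and the plan_processing function are hypothetical names, and the real per-type behavior is defined in the spreadsheet referenced in this note.

```python
from dataclasses import dataclass
from pathlib import Path

# Hypothetical extension groups, for illustration only.
# The authoritative lists are in the attached spreadsheet.
IMAGE_EXTS = {".jpg", ".png", ".tiff", ".bmp"}
CONVERTIBLE_EXTS = {".docx", ".xlsx", ".pptx", ".txt"}


@dataclass
class Plan:
    pdf: str            # "converted" or "placeholder"
    extract_text: bool  # pull text directly from the native file
    ocr: bool           # recognize text from page images


def plan_processing(filename: str) -> Plan:
    """Decide how a file would be indexed under the rules in this note."""
    ext = Path(filename).suffix.lower()
    if ext in IMAGE_EXTS:
        # Image-based files: text comes from OCR.
        # Assumption: images also receive a converted PDF.
        return Plan(pdf="converted", extract_text=False, ocr=True)
    if ext in CONVERTIBLE_EXTS:
        # Supported natives: extract native text and OCR the converted PDF.
        return Plan(pdf="converted", extract_text=True, ocr=True)
    # Not convertible (media, incompatible formats): placeholder,
    # plus whatever text can be extracted from the native file.
    return Plan(pdf="placeholder", extract_text=True, ocr=False)
```

For example, plan_processing("clip.mp4") returns a plan with a placeholder PDF and text extraction only, while plan_processing("memo.docx") requests both extraction and OCR.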
LEP also offers a "DeNIST" process on request to further filter data sets. The "NIST" in DeNIST refers to the National Institute of Standards and Technology, which maintains a list of known software files as part of its National Software Reference Library (NSRL) project and updates it several times per year. Through the DeNIST process, LEP compares all ESI in the collection against the NSRL list and removes files that match entries on the list. These are known system files and are therefore unlikely to contain relevant information. For more information, see DeNIST.
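A minimal sketch of this kind of filtering is shown below. It assumes matching is done by file hash, which is how the NSRL reference data is commonly used. This note does not state how LEP performs the comparison, so the sha1_of and denist functions and the known_hashes set are hypothetical.

```python
import hashlib
from pathlib import Path


def sha1_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the uppercase hex SHA-1 of a file, read in chunks."""
    digest = hashlib.sha1()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest().upper()


def denist(paths, known_hashes):
    """Split files into (kept, removed) by membership in known_hashes.

    known_hashes is assumed to be a set of uppercase SHA-1 strings
    loaded from an NSRL reference data set (loading not shown).
    """
    kept, removed = [], []
    for path in paths:
        if sha1_of(path) in known_hashes:
            removed.append(path)  # matches a known system file
        else:
            kept.append(path)     # may contain relevant information
    return kept, removed
```

Because the comparison here uses content hashes rather than file names, a renamed system file would still be removed, and a user document that happens to share a name with a system file would be kept.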
The spreadsheet attached at the bottom of the page outlines all file types recognized by LEP and details the functionality LEP can provide with regard to each type.
The first tab, “Supported for Processing”, details specific file types and the processing elements supported for each. The second tab, “Recognized File Types”, lists over 1,000 extensions that LEP can recognize and host. The columns of the first tab are described below.