Viewing the Processing Details

The Processing Details page appears when Nebula begins uploading files. It can also be accessed from the History page (Import > Dashboard).

To view details of an import collection

  1. Click the Action icon for the collection you want to view, then click Details.
  2. View the following information:
    • Status
    • Start Time
    • Stop Time
    • Elapsed

    Details section:

    Overview of the documents located during processing, as well as the number that were culled into Review.

    1. Discovered: Number of files in the collection's input.
    2. Exceptions: Number of exceptions during processing into Cull or a matter. (A report is available for these documents.)
       Note: A document can have more than one exception that prevents it from processing.

    3. Input Size: Size in GB of the original (compressed) data.
    4. Exploded: Number of documents present after extraction of compressed files, such as ZIPs and RARs, including email attachments.
    5. NIST/Excluded: Number of deNISTed documents and number of excluded files (for example, JPGs smaller than 5 KB, signature line images, and so on).
    6. Output Size: Size in GB of the uncompressed data.
    7. Processed: Total number of files processed (including containers, extracted documents, embedded documents, and attachments).
    8. Need OCR: Number of documents flagged as having image layers without text.
    9. Duplicates: Number of family duplicates within the collection.
    10. Containers: Number of PSTs, ZIPs, RARs, and other container files found in the data set.
    11. OCRed: Number of files successfully OCRed (whether within Nebula or imported from an external application).
    12. Exported Duplicates: Number of family duplicates of data within the collection that have been exported.
    13. Extracted: Total number of documents extracted and searchable.
    14. Promoted to Review: Final number of documents promoted to the review matter.

    Charts section:

    Bar graph of documents imported into Nebula.

    1. Custodian Summary
    2. Doc Types Application Summary

    Document Sources section:

    1. Path
    2. Custodian
    3. Output path

    Operations section:

    1. Export Metadata
    2. HTML Applicable Files in the Collection
    3. Detect Language on the Collection
    4. Reindex Applicable Files in the Collection
    5. OCR Applicable Files in the Collection
    6. Export files that need OCR in the Collection
    7. Import files into Collection as OCR
    8. Named Entity Detection on the Collection
    9. Sentiment Analysis on the Collection

    Processing operations section:

    1. Text Email Headers
    2. Max Spreadsheet Size
    3. Exclude Attached Images
    4. Use System Date
    5. De-NIST
    6. Ignore System Dates after Collection Date
    7. Explode Embedded
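
As a rough illustration of how some of the summary fields relate, the sketch below derives an elapsed time and an expansion ratio from values copied off the page. It assumes that Elapsed is simply Stop Time minus Start Time and that comparing Output Size to Input Size is a useful sanity check. The timestamp format, the sample values, and the function name are hypothetical and are not taken from Nebula.

```python
from datetime import datetime

# Assumed timestamp format; adjust to match what the page actually displays.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def elapsed_and_expansion(start, stop, input_gb, output_gb):
    """Return (elapsed time, output/input size ratio) for one collection."""
    elapsed = datetime.strptime(stop, TIME_FORMAT) - datetime.strptime(start, TIME_FORMAT)
    # Guard against an empty collection to avoid dividing by zero.
    ratio = output_gb / input_gb if input_gb else None
    return elapsed, ratio


# Hypothetical values typed in from the Processing Details page.
elapsed, ratio = elapsed_and_expansion(
    "2024-01-15 09:00:00", "2024-01-15 11:30:00", input_gb=4.0, output_gb=10.0
)
print("Elapsed:", elapsed)
print("Expansion ratio:", ratio)
```

A ratio well above 1 is expected when the input consists mostly of compressed containers.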

To reprocess an import collection

  1. On the Processing Details page, click Reprocess.
  2. On the Upload Files page, click Restart.

To generate a report for an import collection

  1. On the Processing Details page, click Reports.
  2. In the Create Report window, select the Report Type you want to generate:
    • Exclusion Report: Summary of documents that were not imported, usually because they were duplicates or did not meet minimum requirements.
    • Exception Report: Summary of documents that did not process due to an issue with the file.
    • OCR Report: Summary of documents that were OCR’d due to the RUN option.
  3. Click CSV.
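
Once a report has been downloaded as CSV, it can be summarized outside Nebula. The sketch below counts exceptions per document, which is useful because a single document can have more than one exception. The column names ("Document ID", "Exception") and the sample rows are assumptions for illustration only; check the header row of the actual export and adjust accordingly.

```python
import csv
import io
from collections import Counter


def exceptions_per_document(csv_file, id_column="Document ID"):
    """Count how many exception rows each document has.

    id_column is a hypothetical column name; use the one in your export.
    """
    reader = csv.DictReader(csv_file)
    return Counter(row[id_column] for row in reader)


# Made-up sample standing in for a downloaded Exception Report.
sample = io.StringIO(
    "Document ID,Exception\n"
    "DOC-001,Password protected\n"
    "DOC-001,Corrupt attachment\n"
    "DOC-002,Unsupported format\n"
)

counts = exceptions_per_document(sample)
# Documents with more than one exception.
multi = {doc: n for doc, n in counts.items() if n > 1}

print(len(counts), "documents,", sum(counts.values()), "exceptions")
print("Documents with multiple exceptions:", multi)
```

To run it against a real export, replace the sample with an opened file, for example `open("exception_report.csv", newline="")`, where the file name is whatever you saved the report as.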