Intro
The document scanner leverages Google’s ML Kit, allowing users to digitize physical documents with an on-device flow. Users can select automatic or manual capture and easily edit, retake, or delete specific or all scanned documents.
Document scanner using Google’s ML Kit
Anatomy
A. Camera View
The camera view enables users to focus on and capture their scan through either manual or auto mode. The shutter button allows users to manually capture their scans regardless of the enabled mode. The user can also utilize their phone’s flash to brighten the environment in which they are scanning.
B. Gallery View
The gallery view provides a view of all the captured scans, and enables users to filter, clean, crop, rotate, retake, and delete their scans. Once the user feels the scans are sufficient, the “Done” button saves all the scans in the gallery and prompts the user to the next page in their in-app journey.
Anatomy of the document scanner camera view (left) and the gallery view (right)
Behavior and Interaction
Automatic Capture
Automatically scanning a document
Manual Capture
Manually scanning a document
Add Another Scan
Triggering the document scanner to add a scan
Crop and Rotate
Cropping and rotating the scan
Filter
Using the filter options from the gallery view
Clean
Cleaning up the document by erasing visible fingers in the scan
Retake & Delete
Deleting the scan by tapping the top left “Cancel” button