Intro
The OCR scan takes advantage of mobile technology to give users a more efficient way to enter data. The scanner identifies specific text on an object or document based on an app-specific data structure and pre-fills the form.
Mobile OCR scan screen (left); tablet OCR scan screen (right)
Anatomy
The OCR scanner comes as a full-screen modal that includes an app bar, scan window, and alignment guide.
A. App Bar
The app bar provides persistent access to two main actions:
- “Close” icon on the left to exit the scanner view
- Flash toggle button on the right to turn the camera’s flash on or off for a clearer view of the target content
B. Scan Window
The scan window occupies the rest of the screen and is covered by a semi-transparent overlay. Users can see through the overlay to point the camera at the target content.
C. Alignment Guide
The alignment guide is a frame that helps users position the camera at the right angle and distance. The OCR scanner processes the content inside the alignment guide area. The shape of the frame can be customized to match the target object, for example, a business card or receipt.
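The relationship between the guide frame and the processed content can be illustrated with a small, platform-neutral sketch. It is not the SDK's implementation: the names Rect, TextBlock, guide_rect, and blocks_in_guide are hypothetical, the 10% margin is an arbitrary choice, and the sketch assumes the recognizer reports a bounding box for each piece of text in the same pixel coordinates as the preview.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle in preview-pixel coordinates (hypothetical type)."""
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, other: Rect) -> bool:
        # True only when `other` lies fully inside this rectangle.
        return (self.left <= other.left and self.top <= other.top
                and self.right >= other.right and self.bottom >= other.bottom)


@dataclass(frozen=True)
class TextBlock:
    """Hypothetical recognizer output: a piece of text and its bounding box."""
    text: str
    bounds: Rect


def guide_rect(screen_w: float, screen_h: float,
               aspect_ratio: float, margin: float = 0.1) -> Rect:
    """Center a guide frame with the given width/height ratio on the screen."""
    width = screen_w * (1 - 2 * margin)
    height = width / aspect_ratio
    # Shrink the frame if it would be taller than the available height.
    max_height = screen_h * (1 - 2 * margin)
    if height > max_height:
        height = max_height
        width = height * aspect_ratio
    left = (screen_w - width) / 2
    top = (screen_h - height) / 2
    return Rect(left, top, left + width, top + height)


def blocks_in_guide(blocks: list[TextBlock], guide: Rect) -> list[TextBlock]:
    """Keep only the text blocks that fall entirely inside the guide frame."""
    return [block for block in blocks if guide.contains(block.bounds)]
```

A wide aspect ratio (for example, 1.75) would suit a business card, while a ratio below 1 would suit a tall receipt. Requiring a block to lie fully inside the frame is one possible policy; an app could equally accept blocks that merely overlap it.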
Behavior and Interaction
Trigger
Launch the OCR scan through the menu
Scanning
Scanning view
Confirmation
Information gathered through the OCR scan is automatically entered into the form.
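How recognized text might be mapped onto form fields can be sketched as follows. This is a minimal illustration under stated assumptions, not the scanner's actual logic: the field names, the regular expressions, and the prefill_form helper are all hypothetical stand-ins for an app-specific data structure, here one for a business card.

```python
import re

# Hypothetical app-specific structure: form field name -> pattern to look for.
BUSINESS_CARD_FIELDS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "phone": re.compile(r"\+?\d[\d\s().-]{6,}\d"),
}


def prefill_form(recognized_lines, field_patterns):
    """Return {field: value} using the first line that matches each pattern."""
    form = {}
    for field, pattern in field_patterns.items():
        for line in recognized_lines:
            match = pattern.search(line)
            if match:
                form[field] = match.group(0)
                break  # Stop at the first match for this field.
    return form


# Example: lines as a recognizer might return them for a business card.
scanned = ["Jane Doe", "jane.doe@example.com", "Tel: +1 (555) 010-2000"]
values = prefill_form(scanned, BUSINESS_CARD_FIELDS)
```

Fields without a match are simply left out of the result, so the form can keep them empty for the user to complete manually.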
Resources
Development: MlKitTextDetectionView, Optical Character Recognition (OCR)
SAP Fiori for iOS: OCR Scanner