Skip to end of metadata
Go to start of metadata
- About
- Azure Video OCR utilizes the results produced by Azure Video Indexer, including the main indexer file and the video OCR artifacts, and generates the standard AMP Video OCR JSON.
- It has been added as a tool on AMP's Galaxy and performs video OCR on the input videos.
- The output produced by this tool is a JSON consisting of the text and the corresponding bounding box information on each frame in the input.
- Source Code
- galaxy/tools/amp_vocr/azure_video_ocr.xml : This is the configuration file that details the tools usage, its inputs, outputs, version, and other things.
- galaxy/tools/amp_vocr/azure_video_ocr.py : This is a python wrapper to generate AMP Video OCR JSON based on Azure Video indexer JSON and Video OCR Artifacts JSON output from Azure Video Indexer.
- galaxy/tools/amp_schema/video_ocr.py : Classes used to construct AMP Video OCR json output.
- Running
- The tool can be invoked from Galaxy UI as other tools, it needs be used as the next MGM taking outputs from Azure Video Indexer with the include_ocr flag set to true.
- Parameters
- input_video: The same input video file used by Azure Video Indexer
- azure_video_index: Azure Video Index JSON output from Azure Video Indexer
- azure_artifact_ocr: Azure Artifact OCR JSON output from Azure Video Indexer
- Outputs
- amp_vocr: The standardized AMP Video OCR JSON
{"serverDuration": 42, "requestCorrelationId": "c18790c02ea330ce"}