List of Metadata Generation Mechanisms (MGMs)
Audio and Text
- Speech-to-text
  - AWS Transcribe
  - Kaldi
    - Local
    - HPC
- Forced Alignment
  - Gentle
- Named Entity Recognition
  - AWS Comprehend
  - spaCy
- Vocabulary Tagging
- Segmentation
  - INA Speech Segmenter
- Music Program OCR
- Applause Detection
  - Acoustic Classification Segmentation
Video
- Video OCR
  - MS Azure Video Indexer
  - Tesseract + FFmpeg
- Shot Detection
  - MS Azure Video Indexer
  - PySceneDetect
- Facial Recognition
  - Python face_recognition
- Contact Sheet Generation
  - Based on a given time interval
  - Based on a total number of evenly spaced frames
  - Based on the output of Shot Detection (the middle frame of each shot)
  - Based on the output of Facial Recognition
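
The first three Contact Sheet Generation strategies listed above each reduce to choosing a list of timestamps at which to grab frames. The following is a minimal sketch of that selection step only, not the platform's actual implementation. The function names, the use of seconds as the unit, and the `(start, end)` tuple format for shots are assumptions made for illustration. The Facial Recognition strategy is omitted because the shape of that MGM's output is not described here.

```python
from typing import List, Tuple


def frames_by_interval(duration: float, interval: float) -> List[float]:
    """Timestamps every `interval` seconds, starting at 0 and stopping before `duration`."""
    if interval <= 0:
        raise ValueError("interval must be positive")
    times = []
    n = 0
    while n * interval < duration:
        times.append(n * interval)
        n += 1
    return times


def frames_evenly_spaced(duration: float, count: int) -> List[float]:
    """`count` timestamps, one at the midpoint of each of `count` equal slices."""
    if count <= 0:
        raise ValueError("count must be positive")
    step = duration / count
    return [step * (i + 0.5) for i in range(count)]


def frames_from_shots(shots: List[Tuple[float, float]]) -> List[float]:
    """Middle timestamp of each shot; `shots` is a hypothetical list of (start, end) pairs."""
    return [(start + end) / 2 for start, end in shots]
```

The resulting timestamps could then be passed to a frame extractor such as FFmpeg and the frames tiled into a sheet. Taking slice midpoints in `frames_evenly_spaced` is one design choice among several; it avoids sampling the very first and very last frames, which are often black.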
Other Documentation
This page will grow as more MGMs are evaluated.