Speech-to-Text > Transcript Editor > Forced Aligner
Summary:
{ } |
{ } |
Official documentation: Gentle on Github
Language: REST API or Python on command line
Description:
Cost: Free (open source)
Social impact:
Notes:
Two options for installation:
Input formats
Audio (mp3, wav, possibly other formats) and transcript (plain text).
curl -F "audio=@audio.mp3" -F "transcript=@words.txt" "http://localhost:8765/transcriptions?async=false"# ORpython3 align.py audio.mp3 words.txt |
{ "transcript": "Now, let me looking at the Congress, uh, as one of the institutions in trouble, uh, to some degree, not the same degree as others, perhaps, but still part of the whole mail.", "words": [ { "alignedWord": "now", "case": "success", "end": 38.29, "endOffset": 3, "phones": [ { "duration": 0.12, "phone": "n_B" }, { "duration": 0.01, "phone": "aw_E" } ], "start": 38.16, "startOffset": 0, "word": "Now" }, { "alignedWord": "let", "case": "success", "end": 38.65, "endOffset": 8, "phones": [ { "duration": 0.05, "phone": "l_B" }, ... ] } |
Official documentation: <link>
Language:
Description:
Cost: <$ OR Free (open source)>
Social impact:
Notes: