[toc]
Autosub是一个用于自动语音识别和字幕生成的实用程序。它以视频或音频文件为输入,执行语音活动检测以找到语音区域,向Google Web Speech API发出并行请求以生成这些区域的转录,(可选)将其翻译为其他语言,最后保存结果磁盘字幕。它支持多种输入和输出语言,并且当前可以产生SRT格式或JSON字幕。
-
科学上网。当前购买的香港服务器,保证稳定性
-
需要绑定一张 VISA信用卡
已成功 申请 KEY ,可以调用如下URL测试 key是否有效
https://www.googleapis.com/language/translate/v2?key=you-key&source=en&target=de&q=Hello%20world { "data": { "translations": [ { "translatedText": "Hallo Welt" } ] } }
-
Install ffmpeg
1. yum install ffmpeg -y 安装ffmpeg 2. ffmpeg 检查是否安装成功
-
Run pip install autosub
1. python -V 查看版本 2. pip install autosub 安装 3. pip list 查看安装列表 4. autosub -h 检查是否安装成功
-
帮助
$ autosub -h usage: autosub [-h] [-C CONCURRENCY] [-o OUTPUT] [-F FORMAT] [-S SRC_LANGUAGE] [-D DST_LANGUAGE] [-K API_KEY] [--list-formats] [--list-languages] [source_path] positional arguments: source_path Path to the video or audio file to subtitle optional arguments: -h, --help show this help message and exit -C CONCURRENCY, --concurrency CONCURRENCY Number of concurrent API requests to make -o OUTPUT, --output OUTPUT Output path for subtitles (by default, subtitles are saved in the same directory and name as the source path) -F FORMAT, --format FORMAT Destination subtitle format -S SRC_LANGUAGE, --src-language SRC_LANGUAGE Language spoken in source file -D DST_LANGUAGE, --dst-language DST_LANGUAGE Desired language for the subtitles -K API_KEY, --api-key API_KEY The Google Translate API key to be used. (Required for subtitle translation) --list-formats List all available subtitle formats --list-languages List all available source/destination languages
-
支持语言列表
[root@izj6c46jnvhw9ajs6itxowz ~]# autosub --list-languages List of all languages: af Afrikaans ar Arabic az Azerbaijani be Belarusian bg Bulgarian bn Bengali bs Bosnian ca Catalan ceb Cebuano cs Czech cy Welsh da Danish de German el Greek en English eo Esperanto es Spanish et Estonian eu Basque fa Persian fi Finnish fr French ga Irish gl Galician gu Gujarati ha Hausa hi Hindi hmn Hmong hr Croatian ht Haitian Creole hu Hungarian hy Armenian id Indonesian ig Igbo is Icelandic it Italian iw Hebrew ja Japanese jw Javanese ka Georgian kk Kazakh km Khmer kn Kannada ko Korean la Latin lo Lao lt Lithuanian lv Latvian mg Malagasy mi Maori mk Macedonian ml Malayalam mn Mongolian mr Marathi ms Malay mt Maltese my Myanmar (Burmese) ne Nepali nl Dutch no Norwegian ny Chichewa pa Punjabi pl Polish pt Portuguese ro Romanian ru Russian si Sinhala sk Slovak sl Slovenian so Somali sq Albanian sr Serbian st Sesotho su Sudanese sv Swedish sw Swahili ta Tamil te Telugu tg Tajik th Thai tl Filipino tr Turkish uk Ukrainian ur Urdu uz Uzbek vi Vietnamese yi Yiddish yo Yoruba zh-CN Chinese (Simplified) zh-TW Chinese (Traditional) zu Zulu
-
对比原视频生成中文SRT字幕来看,语音识别不是很准,可能需要对原视频降噪处理、ffmpeg调优、人工调整。
-
中文字幕不准导致英文翻译也不准,可以选择手改SRT字幕后再进行翻译。Google文档翻译
-
也可对比 BingLingGroup-autosub 看看效果,升级autosub,python版本
-
其他方案
第一遍MAC安装运行时,主要问题是代理不稳定导致请求超时,后面上了香港服务器一遍打通。
小问题:
-
autosub -h 找不到,需要
sudo安装 -
代理网络问题














