A simple local speech recognition service - V2EX
    jianchang512 Jan 1, 2024 5197 views
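    A local, offline speech recognition service built on the open-source openai-whisper model and Flask. I mainly use it myself as a replacement for Baidu speech recognition.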
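A minimal sketch of what a Flask wrapper around openai-whisper can look like. This is not the project's actual code: the `/transcribe` route, the `file` form field, the `build_payload` helper, and the default `base` model are all assumptions made for illustration. It needs the `flask` and `openai-whisper` packages plus `ffmpeg` on the PATH.

```python
import os
import tempfile


def build_payload(result):
    """Reduce a Whisper result dict to the fields this sketch returns."""
    return {
        "language": result.get("language"),
        "text": result["text"].strip(),
        "segments": [
            {"start": seg["start"], "end": seg["end"], "text": seg["text"].strip()}
            for seg in result["segments"]
        ],
    }


def create_app(model_name="base"):
    """Build a Flask app with one hypothetical POST /transcribe route."""
    # Imported lazily so build_payload can be used without these packages.
    import whisper
    from flask import Flask, jsonify, request

    app = Flask(__name__)
    # Weights are fetched on first use and cached locally; later runs are offline.
    model = whisper.load_model(model_name)

    @app.route("/transcribe", methods=["POST"])
    def transcribe():
        upload = request.files.get("file")  # hypothetical form field name
        if upload is None:
            return jsonify(error="missing 'file' field"), 400

        # Whisper reads from a path, so spool the upload to a temporary file.
        suffix = os.path.splitext(upload.filename or "")[1]
        with tempfile.NamedTemporaryFile(suffix=suffix, delete=False) as tmp:
            upload.save(tmp)
            path = tmp.name
        try:
            result = model.transcribe(path)
        finally:
            os.remove(path)

        return jsonify(build_payload(result))

    return app
```

To try it locally, something like `create_app().run(host="127.0.0.1", port=5000)` starts the development server, and a multipart POST with an audio file in the `file` field returns the transcript as JSON.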

    GitHub: https://github.com/jianchang512/stt

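    This is a local speech-to-text tool that runs offline. Built on the open-source openai-whisper model, it recognizes human speech in video or audio files and converts it to text. Output is available as JSON, as SRT subtitles with timestamps, or as plain text. Once self-deployed, it can replace OpenAI's speech recognition API or Baidu speech recognition; accuracy is roughly on par with the official OpenAI API.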
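The three output formats mentioned above can be illustrated with a short sketch. It assumes segments shaped like the ones openai-whisper returns (dicts with `start`, `end` in seconds, and `text`); the function names and sample data are hypothetical and not taken from the project.

```python
import json


def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(cues)


def to_text(segments):
    """Join the segment texts, one per line, without timestamps."""
    return "\n".join(seg["text"].strip() for seg in segments)


def to_json(segments):
    """Serialize start, end, and text per segment, keeping non-ASCII text readable."""
    rows = [
        {"start": seg["start"], "end": seg["end"], "text": seg["text"].strip()}
        for seg in segments
    ]
    return json.dumps(rows, ensure_ascii=False, indent=2)


# Hypothetical sample data for illustration only.
sample = [
    {"start": 0.0, "end": 2.5, "text": " Hello world."},
    {"start": 2.5, "end": 5.0, "text": " Second line."},
]
print(to_srt(sample))
```

SRT uses a comma before the milliseconds and numbers cues from 1, which is why the timestamp helper does not simply reuse a standard time formatter.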


    10 replies    2024-11-28 02:00:58 +08:00
    #1  lloovve  Jan 2, 2024 via iPhone
    How good is the recognition quality? Can it be deployed on Linux?
    #2  kkstart  Jan 2, 2024
    Nice. How well does it work?
    #3  tqyq88  Jan 2, 2024
    https://github.com/SYSTRAN/faster-whisper This one blows OpenAI's original implementation away on performance.
    #4  eatgrass  Jan 2, 2024
    https://huggingface.co/spaces/Xenova/whisper-web
    Runs directly in the browser, zero deployment.
    #5  JNian  Jan 2, 2024
    Has the author considered adding speaker diarization?
    #6  buyno1  Mar 22, 2024
    Are there any requirements for the Windows version? What are the hardware requirements?
    #7  buyno1  Mar 23, 2024
    @tqyq88 Is there an alternative to Colab for deploying the project you mentioned?
    #8  buyno1  Mar 23, 2024
    @eatgrass I tried it with a 19-second MP3 and got an error.
    #9  chopin1998519  Aug 15, 2024
    #10  WizardLeo  Nov 28, 2024
    @eatgrass Pretty impressive, this works really well.