python-自动化篇-运维-语音识别

本文介绍: □ 安装完 speech_recognition 之后是不支持中文的，需要在 Sphinx 语音识别工具包里面下载对应的普通话升学和语言模型。使用名为 pyttsx 的 python 包，你可以将文本转换为语音。语音识别技术，也被称为自动语音识别，目标是以电脑自动将人类的语音内容转换为相应的文字和文字转换为语音。使用 SpeechLib，可以从文本文件中获取输入，再将其转换为语音。在 python 中，你也可以使用 SAPI 来做文本到语音的转换。

语音识别技术，也被称为自动语音识别，目标是以电脑自动将人类的语音内容转换为相应的文字和文字转换为语音。

使用名为 pyttsx 的 python 包，你可以将文本转换为语音。直接使用 pip 就可以进行安装，命令如下：

pip install pyttsx3

【示例】使用 pyttsx 实现文本转换语音

import pyttsx3 as pyttsx 
engine=pyttsx.init() 
engine.say('你好 pyttsx') 
engine.runAndWait()

在 python 中，你也可以使用 SAPI 来做文本到语音的转换。

【示例】使用 SAPI 实现文本转换语音

from win32com.client import Dispatch 
msg="你好 SAPI" 
speaker = Dispatch('SAPI.SpVoice') 
speaker.Speak(msg) 
del speaker

pip install comtypes

from comtypes.client import CreateObject 
engine=CreateObject("SAPI.SpVoice") 
stream=CreateObject('SAPI.SpFileStream') 
from comtypes.gen import SpeechLib 
infile='demo.txt' 
outfile='demo_audio.wav'
stream.Open(outfile,SpeechLib.SSFMCreateForWrite) engine.AudioOutputStream=stream 
f=open(infile,'r',encoding='utf-8') 
theText=f.read() 
f.close() 
engine.speak(theText) 
stream.close()

pip install PocketSphinx 
pip install SpeechRecognition

import speech_recognition as sr 
audio_file='demo_audio.wav' 
r=sr.Recognizer() 
with sr.AudioFile(audio_file) as source: 
audio =r.record(source) try: 
	# print('文本内容：',r.recognize_sphinx(audio,language="zh_CN")) 
	print('文本内容：',r.recognize_sphinx(audio)) 
except Exception as e: 
	print(e)

import pyttsx3 as pyttsx
engine=pyttsx.init()
engine.say('好好学习')
engine.runAndWait()

from win32com.client import Dispatch
speaker=Dispatch('SAPI.SpVoice')
speaker.Speak('大家好')
del speaker

from comtypes.client import CreateObject
from comtypes.gen import SpeechLib
engine=CreateObject('SAPI.SpVoice')
stream=CreateObject('SAPI.SpFileStream')
infile='demo.txt'
outfile='demo_audio.wav'
stream.open(outfile,SpeechLib.SSFMCreateForWrite)
engine.AudioOutputStream=stream
#读取文本内容
f=open(infile,'r',encoding='utf-8')
theText=f.read()
f.close()
engine.speak(theText)
stream.close()

import speech_recognition as sr
audio_file='demo_audio.wav'
r=sr.Recognizer()
#打开语音文件
with sr.AudioFile(audio_file) as source:
    audio=r.record(source)

#将语音转换为文本
# print('文本内容：',r.recognize_sphinx(audio))
print('文本内容：',r.recognize_sphinx(audio,language='zh-CN'))