Python腳本輕松實現(xiàn)檢測麥克風(fēng)功能

更新時間：2025年08月15日 08:41:21 作者：dongab_9

在進行音頻處理或開發(fā)需要使用麥克風(fēng)的應(yīng)用程序時,確保麥克風(fēng)功能正常是非常重要的,本文將介紹一個簡單的Python腳本,能夠幫助我們檢測本地麥克風(fēng)的功能,需要的可以參考下

輕松檢測麥克風(fēng)功能

在進行音頻處理或開發(fā)需要使用麥克風(fēng)的應(yīng)用程序時，確保麥克風(fēng)功能正常是非常重要的。本文將介紹一個簡單的Python腳本，它能夠幫助我們檢測本地麥克風(fēng)的功能，確保我們的設(shè)備能夠正常錄音。

腳本介紹

下面是一個名為sound_check.py的Python腳本，它使用sounddevice庫來檢測和測試麥克風(fēng)，同時使用soundfile庫來保存錄音文件。

功能概述

獲取麥克風(fēng)列表：腳本首先會列出所有可用的麥克風(fēng)設(shè)備。
選擇麥克風(fēng)設(shè)備：用戶可以從列表中選擇一個麥克風(fēng)進行測試。
錄音：腳本將使用選定的麥克風(fēng)進行錄音，時長為5秒。
保存錄音：錄音完成后，腳本會將錄音保存為WAV文件。

一、Python環(huán)境準備

在開始之前，請確保你的Python環(huán)境已經(jīng)安裝了sounddevice和soundfile這兩個庫。如果沒有安裝，可以通過以下命令進行安裝（清華鏡像源下載）：

pip install sounddevice soundfile -i https://pypi.tuna.tsinghua.edu.cn/simple

二、代碼解析

以下是sound_check.py腳本的詳細代碼解析：

# 導(dǎo)入所需的庫
import sounddevice as sd
import soundfile as sf

# 定義測試麥克風(fēng)的函數(shù)
def test_microphone(device_index=None, output_filename="output.wav"):
    # 設(shè)置錄音參數(shù)
    duration = 5  # 錄音時長（秒）
    fs = 44100  # 采樣頻率

    # 獲取麥克風(fēng)列表
    devices = sd.query_devices()
    # 如果提供了設(shè)備索引并且有效，則使用指定的麥克風(fēng)
    if device_index is not None and device_index < len(devices):
        print(f"Using microphone: {devices[device_index]['name']}")
    else:
        print("Using default microphone.")

    # 獲取并設(shè)置麥克風(fēng)支持的采樣率
    supported_rates = devices[device_index]['default_samplerate']
    if supported_rates != fs:
        print(f"Adjusting sample rate to {int(supported_rates)} Hz (supported by the device).")
        fs = int(supported_rates)  # 確保采樣率是整數(shù)

    print("Recording...")
    # 使用sounddevice錄制聲音
    recording = sd.rec(int(duration * fs), samplerate=fs, channels=1, device=device_index)

    # 等待錄音完成
    sd.wait()

    print("Recording finished.")

    # 保存錄音為WAV文件
    sf.write(output_filename, recording, fs)

    print(f"File saved as {output_filename}")

# 主函數(shù)入口
if __name__ == "__main__":
    # 獲取麥克風(fēng)列表并打印
    devices = sd.query_devices()
    for i, device in enumerate(devices):
        print(f"{i}: {device['name']}")

    # 用戶輸入選擇麥克風(fēng)設(shè)備索引
    device_index = int(input("Enter the index of the microphone to use: "))
    test_microphone(device_index)

三、使用方法

運行腳本，它會自動列出所有可用的麥克風(fēng)設(shè)備。
根據(jù)列表，輸入你想要測試的麥克風(fēng)的索引號。
輸入索引號后回車，腳本將開始錄音，并在5秒后保存錄音文件。

通過這個簡單的腳本，可以輕松地檢測本地麥克風(fēng)設(shè)備是否工作正常，并且能夠保存錄音以供進一步分析。無論是在開發(fā)過程中還是日常使用中，這個腳本工具都能提供極大的便利。

四、知識擴展

python調(diào)用麥克風(fēng)和揚聲器，并調(diào)用百度實時語音轉(zhuǎn)文字

1. 導(dǎo)入必要的模塊和配置百度的 SDK

import time
import queue
import sounddevice as sd
import numpy as np
from aip import AipSpeech
import sys
 
# 百度云配置信息
APP_ID = '你的 App ID'  # 替換為實際的 APP ID
API_KEY = '你的 Api Key'  # 替換為實際的 API KEY
SECRET_KEY = '你的 Secret Key'  # 替換為實際的 SECRET KEY
 
client = AipSpeech(APP_ID, API_KEY, SECRET_KEY)
 
# Queue to hold the recorded audio data
audio_queue = queue.Queue()
speaker_queue = queue.Queue()
 
# Callback function to capture audio data from microphone
def audio_callback(indata, frames, time, status):
    if status:
        print(status, file=sys.stderr)
    audio_queue.put(indata.copy())
 
# Callback function to capture audio data from speaker
def speaker_callback(indata, frames, time, status):
    if status:
        print(status, file=sys.stderr)
    speaker_queue.put(indata.copy())

2. 實現(xiàn)實時語音識別類

class RealTimeSpeechRecognizer:
    def __init__(self, client, name):
        self.client = client
        self.name = name
 
    def send_audio(self, audio_data):
        result = self.client.asr(audio_data, 'pcm', 16000, {
            'dev_pid': 1537,
        })
        if result.get('err_no') == 0:
            print(f"{self.name} 識別結(jié)果: {result['result']}")
        else:
            print(f"{self.name} 錯誤: {result['err_msg']}")
 
# 調(diào)用百度的語音轉(zhuǎn)文字的接口
def recognize_speech(audio_data, recognizer):
    audio_data = np.concatenate(audio_data)
    recognizer.send_audio(audio_data.tobytes())

3. 開始音頻流并處理音頻數(shù)據(jù)

def start_audio_stream(mic_recognizer, speaker_recognizer, speaker_device_index):
    with sd.InputStream(callback=audio_callback, channels=1, samplerate=16000, dtype='int16') as mic_stream, \
            sd.InputStream(callback=speaker_callback, channels=1, samplerate=16000, dtype='int16', device=speaker_device_index) as spk_stream:
        print("Recording audio... Press Ctrl+C to stop.")
        mic_audio_buffer = []
        speaker_audio_buffer = []
        try:
            while True:
                while not audio_queue.empty():
                    mic_audio_buffer.append(audio_queue.get())
                while not speaker_queue.empty():
                    speaker_audio_buffer.append(speaker_queue.get())
 
                if len(mic_audio_buffer) >= 10:
                    recognize_speech(mic_audio_buffer, mic_recognizer)
                    mic_audio_buffer = []  # Clear buffer after sending
 
                if len(speaker_audio_buffer) >= 10:
                    recognize_speech(speaker_audio_buffer, speaker_recognizer)
                    speaker_audio_buffer = []  # Clear buffer after sending
 
                time.sleep(0.1)
        except KeyboardInterrupt:
            print("Stopping audio recording.")

4. 主程序入口

if __name__ == "__main__":
    speaker_device_index = 8  # 使用 pulse 設(shè)備（索引 8）來捕獲揚聲器輸出
 
    mic_recognizer = RealTimeSpeechRecognizer(client, "麥克風(fēng)接收：")
    speaker_recognizer = RealTimeSpeechRecognizer(client, "揚聲器接收：")
 
    start_audio_stream(mic_recognizer, speaker_recognizer, speaker_device_index)

到此這篇關(guān)于Python腳本輕松實現(xiàn)檢測麥克風(fēng)功能的文章就介紹到這了,更多相關(guān)Python檢測麥克風(fēng)內(nèi)容請搜索腳本之家以前的文章或繼續(xù)瀏覽下面的相關(guān)文章希望大家以后多多支持腳本之家！

您可能感興趣的文章: