Python中的频率分析

我试图用Python来检索现场音频输入的主频。目前我正在尝试使用笔记本电脑内置麦克风的音频流，但在测试下面的代码时，我得到的结果非常糟糕。Python中的频率分析

# Read from Mic Input and find the freq's 
    import pyaudio 
    import numpy as np 
    import bge 
    import wave 

    chunk = 2048 

    # use a Blackman window 
    window = np.blackman(chunk) 
    # open stream 
    FORMAT = pyaudio.paInt16 
    CHANNELS = 1 
    RATE = 1920 

    p = pyaudio.PyAudio() 
    myStream = p.open(format = FORMAT, channels = CHANNELS, rate = RATE, input = True, frames_per_buffer = chunk) 

    def AnalyseStream(cont): 
     data = myStream.read(chunk) 
     # unpack the data and times by the hamming window 
     indata = np.array(wave.struct.unpack("%dh"%(chunk), data))*window 
     # Take the fft and square each value 
     fftData=abs(np.fft.rfft(indata))**2 
     # find the maximum 
     which = fftData[1:].argmax() + 1 
     # use quadratic interpolation around the max 
     if which != len(fftData)-1: 
      y0,y1,y2 = np.log(fftData[which-1:which+2:]) 
      x1 = (y2 - y0) * .5/(2 * y1 - y2 - y0) 
      # find the frequency and output it 
      thefreq = (which+x1)*RATE/chunk 
      print("The freq is %f Hz." % (thefreq)) 
     else: 
      thefreq = which*RATE/chunk 
      print("The freq is %f Hz." % (thefreq)) 

    # stream.close() 
    # p.terminate()

的代码是从this question，与波形文件的傅立叶分析涉及蚕食。它是在当前的模块化结构中，因为我正在使用Blender Game Environment（因此在顶部导入bge）来实现它，但是我非常确定我的问题在于AnalyseStream模块。

任何意见，你可以提供将不胜感激。

更新：我得到正确的值，每隔一次又一次，但他们很少发现不正确的值（< 10Hz）。那个和程序真的很慢。

来源

2012-01-31 Dave Moore

1920的采样率看起来很腥。更典型的音频采样率是8000或44100.你用什么样的声音来进行正确性测试？如果它不是来自正弦波发生器，那么您听到的音高和频率峰值可能会非常不同。 – hotpaw2 2012-01-31 16:31:51

你好，发现实时分析的FFT的最大计算速度有点慢。

如果您不能使用复杂的波形来查找频率，那么您可以使用基于时域的任何方法，例如性能会更好的过零点。

在去年我做了一个简单的函数来计算过零点的频率。

#Eng Eder de Souza 01/12/2011 
#ederwander 
from matplotlib.mlab import find 
import pyaudio 
import numpy as np 
import math 


chunk = 1024 
FORMAT = pyaudio.paInt16 
CHANNELS = 1 
RATE = 44100 
RECORD_SECONDS = 20 


def Pitch(signal): 
    signal = np.fromstring(signal, 'Int16'); 
    crossing = [math.copysign(1.0, s) for s in signal] 
    index = find(np.diff(crossing)); 
    f0=round(len(index) *RATE /(2*np.prod(len(signal)))) 
    return f0; 


p = pyaudio.PyAudio() 

stream = p.open(format = FORMAT, 
channels = CHANNELS, 
rate = RATE, 
input = True, 
output = True, 
frames_per_buffer = chunk) 

for i in range(0, RATE/chunk * RECORD_SECONDS): 
    data = stream.read(chunk) 
    Frequency=Pitch(data) 
    print "%f Frequency" %Frequency

ederwander

来源

2012-02-02 21:20:37 ederwander

另外还有一点计算博士伦-Scargle周期图，并是自v0.10.0可用的功能scipy.signal.lombscargle。即使对于不均匀采样的信号，该方法也应该工作。看起来这个数据的意思是必须减去这个方法才能正常工作，尽管这在文档中没有提到。更多信息可以在scipy参考指南中找到： http://docs.scipy.org/doc/scipy/reference/tutorial/signal.html#lomb-scargle-periodograms-spectral-lombscargle

来源

2012-06-07 16:15:27

Python中的频率分析

回答

相关问题