This section includes information for using the pure Python API of bob.ap.
Bases: bob.ap.Spectrogram
Ceps(sampling_frequency, [win_length_ms=20., [win_shift_ms=10., [n_filters=24, [n_ceps=19, [f_min=0., [f_max=4000., [delta_win=2, [pre_emphasis_coeff=0.95, [mel_scale=True, [dct_norm=True]]]]]]]]]]) -> new Ceps Ceps(other) -> new Ceps
Objects of this class, after configuration, can extract the cepstral coefficients from 1D audio array/signals.
Parameters:
A factor by which the cepstral coefficients are multiplied
The integer delta value used for computing the first and second order derivatives
Tells whether we compute a spectrogram or energy bands
Tells whether we use the energy or the square root of the energy
The energy flooring threshold
The maximum frequency of the filter bank
The minimum frequency of the filter bank
Computes the shape of the output features, given the size of an input array or an input array.
Parameters:
This method always returns a 2-tuple containing the shape of output features produced by this extractor.
Tells whether we use the log triangular filter or the triangular filter
Tells whether cepstral features are extracted on a linear (LFCC) or Mel (MFCC) scale
The number of cepstral coefficients
The number of filter bands
The coefficient used for the pre-emphasis
The sampling frequency/frequency rate
The normalized window length w.r.t. the sample frequency
The window length of the cepstral analysis in milliseconds
The normalized window shift w.r.t. the sample frequency
The window shift of the cepstral analysis in milliseconds
Tells if we add the first derivatives to the output feature
Tells if we add the second derivatives to the output feature
Tells if we add the energy to the output feature
Bases: bob.ap.FrameExtractor
Energy(sampling_frequency, [win_length_ms=20., [win_shift_ms=10.]]) -> new Energy Energy(other) -> new Energy
Objects of this class, after configuration, can extract the energy of frames extracted from a 1D audio array/signal.
Parameters:
The energy flooring threshold
Computes the shape of the output features, given the size of an input array or an input array.
Parameters:
This method always returns a 2-tuple containing the shape of output features produced by this extractor.
The sampling frequency/frequency rate
The normalized window length w.r.t. the sample frequency
The window length of the cepstral analysis in milliseconds
The normalized window shift w.r.t. the sample frequency
The window shift of the cepstral analysis in milliseconds
Bases: object
FrameExtractor(sampling_frequency, [win_length_ms=20., [win_shift_ms=10.]]) -> new FrameExtractor FrameExtractor(other) -> new FrameExtractor
This class is a base type for classes that perform audio processing on a frame basis. It can be instantiated from Python.
Objects of this class, after configuration, can extract audio frame from a 1D audio array/signal. You can instantiate objects of this class by passing a set of construction parameters or another object of which the base type is FrameExtractor.
Parameters:
Computes the shape of the output features, given the size of an input array or an input array.
Parameters:
This method always returns a 2-tuple containing the shape of output features produced by this extractor.
The sampling frequency/frequency rate
The normalized window length w.r.t. the sample frequency
The window length of the cepstral analysis in milliseconds
The normalized window shift w.r.t. the sample frequency
The window shift of the cepstral analysis in milliseconds
Bases: bob.ap.Energy
Spectrogram(sampling_frequency, [win_length_ms=20., [win_shift_ms=10., [n_filters=24, [f_min=0., [f_max=4000., [pre_emphasis_coeff=0.95, [mel_scale=True]]]]]]]) -> new Spectrogram Spectrogram(other) -> new Spectrogram
Objects of this class, after configuration, can extract the spectrogram from 1D audio array/signals.
Parameters:
Tells whether we compute a spectrogram or energy bands
Tells whether we use the energy or the square root of the energy
The energy flooring threshold
The maximum frequency of the filter bank
The minimum frequency of the filter bank
Computes the shape of the output features, given the size of an input array or an input array.
Parameters:
This method always returns a 2-tuple containing the shape of output features produced by this extractor.
Tells whether we use the log triangular filter or the triangular filter
Tells whether cepstral features are extracted on a linear (LFCC) or Mel (MFCC) scale
The number of filter bands
The coefficient used for the pre-emphasis
The sampling frequency/frequency rate
The normalized window length w.r.t. the sample frequency
The window length of the cepstral analysis in milliseconds
The normalized window shift w.r.t. the sample frequency
The window shift of the cepstral analysis in milliseconds