aubiopitch(1) [debian man page]

AUBIOPITCH(1)						      General Commands Manual						     AUBIOPITCH(1)

NAME
aubiopitch -- a command line tool to extract pitch candidates from sound files.

SYNOPSIS
aubiopitch [options] -i soundfile

DESCRIPTION
aubiopitch attempts to extract the fundamental frequency of sound files. When no pitch candidate is found, the output is 0. Five fundamental frequency extraction methods are available. Results can be printed in Hertz or in MIDI pitch. A plotting module is available via the Gnuplot Python package.

OPTIONS
This program follows the usual GNU command line syntax, with long options starting with two dashes (`--'). A summary of options is included below.

-i --input filein
    input sound file

-m --mode pitch_detection_mode
    pitch detection mode [default=mcomb] mcomb|yin|fcomb|schmitt

-u --units unitmode
    output pitch in units [default=Hz] freq|midi|cent|bin

-B --bufsize=BUFSIZE
    buffer size [default=1024]

-H --hopsize=HOPSIZE
    overlap size [default=512]

-t --threshold=THRESHOLD
    pitch threshold (for yin) [default=0.1]

-s --silence=SILENCE
    silence threshold [default=-70]

-D --delay=DELAY
    number of seconds to take back [default=0]

-S --smoothing=frames
    temporal smoothing using a median filter of N frames [default=0]

-M --pitchmax=max
    maximum pitch value to look for (Hz) [default=20000]

-l --pitchmin=min
    minimum pitch value to look for (Hz) [default=20]

-n --note
    NOT IMPLEMENTED: output notes

-q --quiet
    be quiet

-h --help
    show a summary of options

-v --verbose
    make lots of noise [default]

Plotting options

When the Python interface to Gnuplot is installed, the following additional options are available.

-p --plot
    draw plot of the pitch track

-T --plottruth
    draw plot of the ground truth pitch track

-x --xsize=SIZE
    define horizontal plot size [default=1.]

-y --ysize=SIZE
    define vertical plot size [default=1.]

-O --outplot=OUTPLOT
    save plot to output.{ps,eps,png,svg} instead of displaying it

BUGS
For now the program has only been tested on audio signals sampled at 44.1 kHz.

SEE ALSO
aubioonset(1), aubiotrack(1), aubionotes(1), aubiocut(1)

AUTHOR
This manual page was written by Paul Brossier (piem@altern.org). Permission is granted to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 or any later version published by the Free Software Foundation.

On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL.

AUBIOPITCH(1)
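
The DESCRIPTION above says results can be printed in Hertz or in MIDI pitch, and that 0 is output when no pitch candidate is found. As a rough illustration of how the two units relate, the sketch below uses the conventional equal-temperament mapping (A4 = 440 Hz = MIDI note 69). The reference tuning and the function names hz_to_midi and midi_to_hz are assumptions made for this example; the man page does not state which formula aubiopitch uses internally.

```python
import math

A4_HZ = 440.0   # assumed reference tuning, not stated in the man page
A4_MIDI = 69.0  # conventional MIDI note number for A4


def hz_to_midi(freq_hz: float) -> float:
    """Convert a frequency in Hz to a fractional MIDI note number.

    A value of 0 (or less) is treated as "no pitch found" and is
    passed through as 0, mirroring the behaviour described above.
    """
    if freq_hz <= 0:
        return 0.0
    return A4_MIDI + 12.0 * math.log2(freq_hz / A4_HZ)


def midi_to_hz(midi_note: float) -> float:
    """Inverse mapping: fractional MIDI note number back to Hz."""
    if midi_note <= 0:
        return 0.0
    return A4_HZ * 2.0 ** ((midi_note - A4_MIDI) / 12.0)


if __name__ == "__main__":
    for f in (0.0, 220.0, 440.0, 880.0):
        print(f, "Hz ->", hz_to_midi(f), "MIDI")
```

Each octave doubles the frequency and adds 12 to the MIDI number, which is why the conversion is logarithmic in base 2.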

Check Out this Related Man Page

LPANAL(1)						  The Canonical Csound Reference						 LPANAL(1)

NAME
lpanal - Performs both linear predictive and pitch-tracking analysis on a soundfile.

DESCRIPTION
Linear predictive analysis for the Csound Linear Predictive Coding (LPC) Resynthesis opcodes.

SYNTAX
csound -U lpanal [flags] infilename outfilename
lpanal [flags] infilename outfilename

INITIALIZATION
lpanal performs both lpc and pitch-tracking analysis on a soundfile to produce a time-ordered sequence of frames of control information suitable for Csound resynthesis. Analysis is conditioned by the control flags below. A space is optional between the flag and its value.

-a -- [alternate storage] asks lpanal to write a file with filter pole values rather than the usual filter coefficient files. When lpread / lpreson are used with pole files, automatic stabilization is performed and the filter should not get wild. (This is the default in the Windows GUI) - Changed by Marc Resibois.

-s srate -- sampling rate of the audio input file. This will override the srate of the soundfile header, which otherwise applies. If neither is present, the default is 10000.

-c channel -- channel number sought. The default is 1.

-b begin -- beginning time (in seconds) of the audio segment to be analyzed. The default is 0.0.

-d duration -- duration (in seconds) of the audio segment to be analyzed. The default of 0.0 means to the end of the file.

-p npoles -- number of poles for analysis. The default is 34, the maximum 50.

-h hopsize -- hop size (in samples) between frames of analysis. This determines the number of frames per second (srate / hopsize) in the output control file. The analysis framesize is hopsize * 2 samples. The default is 200, the maximum 500.

-C string -- text for the comments field of the lpfile header. The default is the null string.

-P mincps -- lowest frequency (in Hz) of pitch tracking. -P0 means no pitch tracking.

-Q maxcps -- highest frequency (in Hz) of pitch tracking. The narrower the pitch range, the more accurate the pitch estimate. The defaults are -P70, -Q200.

-v verbosity -- level of terminal information during analysis.

    o 0 = none
    o 1 = verbose
    o 2 = debug

    The default is 0.

EXAMPLES
lpanal -a -p26 -d2.5 -P100 -Q400 audiofile.test lpfil22

will analyze the first 2.5 seconds of file "audiofile.test", producing srate/200 frames per second, each containing 26-pole filter coefficients and a pitch estimate between 100 and 400 Hertz. Stabilized (-a) output will be placed in "lpfil22" in the current directory.

File Format

Output is a file comprising an identifiable header plus a set of frames of floating point analysis data. Each frame contains four values of pitch and gain information, followed by npoles filter coefficients. The file is readable by Csound's lpread.

lpanal is an extensive modification of Paul Lansky's lpc analysis programs.

AUTHORS
Barry Vercoe
    MIT Media Lab
    Author.

Dan Ellis
    MIT Media Lab, Cambridge, Massachusetts
    Author.

COPYRIGHT
5.10 08/01/2011 LPANAL(1)
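
The -h hopsize and File Format descriptions above give enough arithmetic to work out the shape of an analysis file: srate / hopsize frames per second, an analysis frame of hopsize * 2 samples, and four pitch/gain values plus npoles coefficients per frame. The sketch below only restates that arithmetic with the documented defaults and limits. The function name lpanal_frame_layout and its return keys are hypothetical and are not part of Csound.

```python
def lpanal_frame_layout(srate: float = 10000, hopsize: int = 200, npoles: int = 34) -> dict:
    """Restate the frame arithmetic from the lpanal man page.

    Defaults follow the documented ones: srate 10000 (used when neither
    -s nor the soundfile header supplies a rate), hopsize 200, npoles 34.
    """
    if not 0 < hopsize <= 500:
        raise ValueError("hopsize must be between 1 and 500 samples")
    if not 0 < npoles <= 50:
        raise ValueError("npoles must be between 1 and 50")
    return {
        # number of analysis frames written per second of audio
        "frames_per_second": srate / hopsize,
        # analysis window length in samples
        "framesize": hopsize * 2,
        # four pitch/gain values followed by npoles filter coefficients
        "values_per_frame": 4 + npoles,
    }


if __name__ == "__main__":
    # Layout for the man page example: -p26 with the default hop size
    print(lpanal_frame_layout(srate=44100, hopsize=200, npoles=26))
```

With the defaults this gives 50 frames per second, a 400-sample analysis frame, and 38 values per frame.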