Skip to content

VAMP Plugin outputs description

Mateusz Korzeniowski edited this page Sep 14, 2018 · 4 revisions

bbc-vamp-plugins:bbc-energy

  • URL: https://github.com/bbc/bbc-vamp-plugins
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Moving average window size (1.0 seconds)
    • Moving average percentile (3.0)
    • Dip threshold (3.0)
    • Low Energy threshold (1.0)
  • Outputs:
    • rmsenergy
      • Description: As described above, the term average power refers to the average value of the instantaneous power waveform over time. As this is typically derived from the root mean square (RMS) of the sine wave voltage
      • Summary: Sound energy of all the frequencies at given timestamp
      • Usefulness: 3
    • rmsdelta
      • Description: Difference between RMS of previous and current blocks
      • Comment: Change of sound energy
      • Usefulness: 2
    • lowenergy
      • Description: Percentage of track which is below the low energy threshold
      • Usefulness: 1
    • average
      • Description: Mean of RMS values over moving average window
      • Usefulness: 2
    • pdip
      • Description: Probability of the RMS energy dipping below threshold
      • Usefulness: 1

bbc-vamp-plugins:bbc-intensity

  • URL: https://github.com/bbc/bbc-vamp-plugins
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Sub-bands (7.0)
    • Window shape (Hann)
  • Outputs:
    • intensity
      • Description: Sum of the FFT bin absolute values
      • Comment: Similar to RMS
      • Usefulness: 2
    • intensity-ratio
      • Description: Sum of each sub-bands absolute values
      • Comment: Intensity of each of sub-bands
      • Usefulness: 3

bbc-vamp-plugins:bbc-peaks

bbc-vamp-plugins:bbc-rhythm

  • URL: https://github.com/bbc/bbc-vamp-plugins
  • Block size: 1024
  • Step size: 256
  • Parameter (default):
    • Sub-bands (7)
    • Threshold (1)
    • Moving avg window len (200 frames)
    • Onset peak window length (6 frames)
    • Min BPM 12
    • Max BPM 300
  • Outputs:
    • onset_curve
      • Description: Onset detection curve
      • Comment: accent strengths of an rhythm section
      • Usefulness: 3
    • average
      • Description: ?
      • Comment: Moving average of an rhythm accent strengths
      • Usefulness: 3
    • diff
      • Description: Difference between onset and average
      • Comment: Similar to onset_curve, but only highest points are available
      • Usefulness: 1
    • onset
      • Description: ?
      • Comment: Similar to onset_curve, but binary (just points in time)
      • Usefulness: 1
    • avg-onset-freq
      • Description: Rate of onsets per minute
      • Usefulness: 2
    • rhythm-strength
      • Description: Average value of peaks in onset curve
      • Usefulness: 2
    • autocor
      • Description: Autocorrelation of onset detection curve
      • Comment: Does not work?
      • Usefulness: 1
    • mean-correlation-peak
      • Description: Mean of the peak autocorrelation values
      • Comment: Same as above but averaged
      • Usefulness: 1
    • peak-valley-ratio
      • Description: Ratio of the mean correlation peak to the mean correlation valley
      • Usefulness: 1
    • tempo
      • Description: Overall tempo of the track in BPM
      • Usefulness: 2

bbc-vamp-plugins:bbc-spectral-contrast

  • URL: https://github.com/bbc/bbc-vamp-plugins
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • Alpha (0.02)
    • Sub-bands (7)
  • Outputs:
    • valleys
      • Description: Valley of the spectrum
      • Comment: Values per sub-band, similar to intensity but less reasonable
      • Usefulness: 2
    • peaks
      • Description: Peak of the spectrum
      • Comment: Spectrum peaks per sub-band, more reasonable than valleys
      • Usefulness: 3
    • mean
      • Description: Mean of the spectrum
      • Comment: Average spectrum per sub-band (bass bands have higher values)
      • Usefulness: 2

bbc-vamp-plugins:bbc-spectral-flux

  • URL: https://github.com/bbc/bbc-vamp-plugins
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Use L2 norm over L1 (False)
  • Outputs:
    • spectral-flux
      • Description: ?
      • Comment: Similar to RMS / Intensity but averaged(?)
      • Usefulness: 2

bbc-vamp-plugins:bbc-speechmusic-segmenter

  • URL: https://github.com/bbc/bbc-vamp-plugins
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Resolution (256)
    • Change threshold (0.08)
    • Decision threshold (0.27)
    • Minimum music segment length (0.0)
    • Margin (14.0)
  • Outputs:
    • segmentation
      • Description: Segmentation
      • Comment: Splits music and spoken word (binary segments)
      • Usefulness: 1
    • skewness
      • Description: Detection function
      • Comment: Probability of spoken word
      • Usefulness: 2

beatroot-vamp:beatroot

  • URL: https://code.soundsoftware.ac.uk/projects/beatroot-vamp
  • Block size: 2048
  • Step size: 441
  • Parameter (default):
    • Pre-Margin factor (0.15)
    • Post-Margin factor (0.3)
    • Maximum Change (0.2)
    • Expiry Time (10.0)
  • Outputs:
    • beats
      • Description: Identify beat locations in music
      • Comment: Binary points for beat locations
      • Usefulness: 2

cqvamp:cqchromavamp

  • URL: https://github.com/cannam/constant-q-cpp
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Lowest Contributing Octave (0)
    • Contributing Octave Count (7)
    • Tuning Frequency (440 Hz)
    • Bins per Octave (36)
  • Outputs:
    • chromagram
      • Description: Extract Constant-Q spectrogram with a constant ratio of centre frequency to resolution from the audio, then wrap it around into a single-octave chromagram
      • Comment: Intensity of a sound on particular note (notes range: 0-35; 1 - C, 4 - C#, 7 - D, …, 34 - B)
      • Usefulness: 3

cqvamp:cqvamp

  • URL: https://github.com/cannam/constant-q-cpp
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Minimum Frequency (110 Hz)
    • Maximum Frequency (14700 Hz)
    • Bins per Octave (36 bins)
    • Interpolation (Linear)
  • Outputs:
    • constantq
      • Description: Extract a spectrogram with constant ratio of centre frequency to resolution from the input audio, specifying the frequency range in Hz
      • Comment: Intensity of a sound in particular frequency range (288 separate ranges)
      • Usefulness: 1

cqvamp:cqvampmidi

  • URL: https://github.com/cannam/constant-q-cpp
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Minimum Pitch (36 MIDI units)
    • Maximum Pitch (96 MID units)
    • Tuning Frequency (440 Hz)
    • Bins per Octave (36)
    • Interpolation (Linear)
  • Outputs:
    • constantq
      • Description: Extract spectrogram with constant ratio of centre frequency to resolution from the input audio, specifying the frequency range in MIDI pitch units
      • Comment: Intensity of a sound on particular note (180 notes; 0 - C#2, 1 - D2, …, 180 - C7)
      • Usefulness: 3

match-vamp-plugin-2:match

match-vamp-plugin:match

mtg-melodia:melodia

  • URL: https://www.upf.edu/web/mtg/melodia
  • Block size: 2048
  • Step size: 128
  • Parameter (default):
    • Program (Polyphonic)
    • Min Freq: 55 Hz
    • Max Freq: 1760 Hz
    • Voicing Tolerance: 0.2
    • Monophonic Noise Filter: 0
  • Outputs:
    • melody
      • Description: Estimates the melody pitch in polyphonic music; segments without melody are indicated by zero or negative values
      • Comment: ? Very weird chart
      • Usefulness: 1

mtg-melodia:melodiaviz

nnls-chroma:chordino

  • URL: http://www.isophonics.net/nnls-chroma
  • Block size: 16384
  • Step size: 2048
  • Parameter (default):
    • Use approximate transcription (NNLS)
    • Bass noise threshold (0.0 %)
    • Tuning mode (global tuning)
    • Spectral whitening (1.0)
    • Spectral shape (0.7)
    • Boost N (0.1)
  • Outputs:
    • simplechord
      • Description: Chord Estimate: Estimated chord times and labels
      • Comment: Segments of chords (just labels without values)
      • Usefulness: 2
    • chordnotes
      • Description: Note representation of Chord Estimate
      • Comment: Estimates of notes in the chord
      • Usefulness: 2
    • harmonicchange
      • Description: An indication of the likelihood of harmonic change
      • Comment: Probability of chord change
      • Usefulness: 2
    • loglikelihood
      • Description: Logarithm of the likelihood value of the simple chord estimate
      • Comment: As harmonicchange, but less reasonable
      • Usefulness: 1

nnls-chroma:nnls-chroma

  • URL: http://www.isophonics.net/nnls-chroma
  • Block size: 16384
  • Step size: 2048
  • Parameter (default):
    • Use approximate transcription (NNLS)
    • Bass noise threshold (0.0 %)
    • Tuning mode (global tuning)
    • Spectral whitening (1.0)
    • Spectral shape (0.7)
    • Chroma normalization (None)
  • Outputs:
    • logfreqspec
      • Description: Log-Frequency Spectrum (constant Q) that is obtained by cosine filter mapping
      • Comment: As tunedlogfreqspec, but more blurry
      • Usefulness: 0
    • tunedlogfreqspec
      • Description: Log-Frequency Spectrum (constant Q) that is obtained by cosine filter mapping, then its tuned using the estimated tuning frequency
      • Comment: Very precise map of sound intensity with lot of gaps between single notes; no labels; values range: 0-256(?)
      • Usefulness: 1
    • semitonespectrum
      • Description: A semitone-spaced log-frequency spectrum derived from the third-of-a-semitone-spaced tuned log-frequency spectrum
      • Comment: Semitone estimate (values 0-84, no labels)
      • Usefulness: 1
    • chroma
      • Description: Tuning-adjusted chromagram from NNLS approximate transcription, with an emphasis on the medium note range
      • Comment: Estimate of note intensity (values 0-12; 0 - A, 1 - Bb, …, 12 - Ab) with emphasis on mid-band
      • Usefulness: 3
    • basschroma
      • Description: Tuning-adjusted bass chromagram from NNLS approximate transcription, with an emphasis on the bass note range
      • Comment: Estimate of note intensity (values 0-12; 0 - A, 1 - Bb, …, 12 - Ab) with emphasis on bass-band
      • Usefulness: 3
    • bothchroma
      • Description: Tuning-adjusted chromagram from NNLS approximate transcription, with an emphasis on the medium note range
      • Comment: Estimate of note intensity (values 0-24; 0 - A bass, 1 - Bb bass, …, 13 - A, …, 24 - Ab)
      • Usefulness: 2

nnls-chroma:tuning

  • URL: http://www.isophonics.net/nnls-chroma
  • Block size: 16384
  • Step size: 8192
  • Parameter (default):
    • Bass noise threshold (0.0%)
  • Outputs:
    • tuning
      • Description: Returns singe-label containing estimated concert pitch tuning in Hz
      • Comment: ?
      • Usefulness: 0
    • localtuning
      • Description: Returns per-analysis-frame tuning estimate
      • Comment: Tuning estimate in each point (would be nice for large block size?)
      • Usefulness: 2

pyin:localcandidatepyin

  • URL: https://code.soundsoftware.ac.uk/projects/pyin
  • Block size: 2048
  • Step size: 256
  • Parameter (default):
    • Yin threshold distribution (Beta)
    • Output estimates classified as unvoiced? (No)
    • Use non-standard precise YIN timing (slow) (False)
  • Outputs:
    • pitchtrackcandidates
      • Description: Monophonic pitch and note tracking based on probabilistic Yin extension
      • Comment: ? Did not finish
      • Usefulness: 0

pyin:pyin

  • URL: https://code.soundsoftware.ac.uk/projects/pyin
  • Block size: 2048
  • Step size: 256
  • Parameter (default):
    • Yin threshold distribution (Beta)
    • Output estimates classified as unvoiced? (No)
    • Use non-standard precise YIN timing (slow) (False)
    • Suppress low amplitude pitch estimates (0.1)
    • Onset sensitivity (0.7)
    • Duration pruning threshold (0.1)
  • Outputs:
    • f0candidates
      • Description: Estimated fundamental frequency candidates
      • Comment: Estimated frequency at point (Hz)
      • Usefulness: 1
    • f0probs
      • Description: Probability of estimated fundamental frequency candidates
      • Comment: Probability of f0candidates
      • Usefulness: 0
    • voicedprob
      • Description: Probability that signal is voiced according to Probabilistic Yin
      • Comment: ? Seems unreasonable
      • Usefulness: 1
    • candidatesalience
      • Description: Candidate salience
      • Comment: ? Seems unreasonable
      • Usefulness: 1
    • smoothedpitchtrack
      • Description: ?
      • Comment: ?
      • Usefulness: 1
    • notes
      • Description: Derived fixed-pitch note frequencies
      • Comment: Notes estimate; pointless
      • Usefulness: 0

pyin:yin

  • Outputs:
    • f0
    • periodicity
    • rms
    • salience

qm-vamp-plugins:qm-adaptivespectrogram

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Number of resolutions (3)
    • Smallest resolution (512)
    • Decimation factor (No decimation)
    • Omit alternate solutions (False)
    • Multi-threaded processing (True)
  • Outputs:
    • output
      • Description: Adaptive Spectrogram produces a composite spectrogram from a set of series of short-time Fourier transforms at differing resolutions. Values are selected from these spectrograms by repeated subdivision by time and frequency in order to maximise an entropy function across each column
      • Comment: Notes mixed with percussion?
      • Usefulness: 1

qm-vamp-plugins:qm-barbeattracker

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • Beats per bar (4)
    • Alpha (0.9)
    • Tempo hint (120 BPM)
    • Constrain Tempo (False)
  • Outputs:
    • beats
      • Description: The estimated beat locations, returned as a single feature, with timestamp but no value, for each beat, labelled with the number of that beat within the bar (e.g. consecutively 1, 2, 3, 4 for 4 beats to the bar)
      • Comment: Detects beats in bars, labels are beat count in a bar
      • Usefulness: 3
    • bars
      • Description: The estimated bar line locations, returned as a single feature, with timestamp but no value, for each bar
      • Comment: Detects bars
      • Usefulness: 2
    • beatcounts
      • Description: The estimated beat locations, returned as a single feature, with timestamp and a value corresponding to the number of that beat within the bar. This is similar to the Beats output except that it returns a counting function rather than a series of instants
      • Comment: Counts beats in bar
      • Usefulness: 0
    • beatsd
      • Description: The new-bar likelihood function used in bar line estimation
      • Comment: Likelihood of new bar
      • Usefulness: 1

qm-vamp-plugins:qm-chromagram

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 16384
  • Step size: 2048
  • Parameter (default):
    • Minimum Pitch (36 MIDI units)
    • Maximum Pitch (96 MIDI units)
    • Tuning Frequency (440 Hz)
    • Bins per Octave (12 bins)
    • Normalization (None)
  • Outputs:
    • chromagram
      • Description: Output of a chromagram, as a single vector per process block
      • Comment: Note detection in given point (0-12 values; 0-C, 12-B)
      • Usefulness: 3
    • chromameans
      • Description: Mean values of chromagram bins across duration of the input audio
      • Comment: Some kind of averaged notes from overall track?
      • Usefulness: 3

qm-vamp-plugins:qm-constantq

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 16384
  • Step size: 2048
  • Parameter (default):
    • Minimum Pitch (36 MIDI units)
    • Maximum Pitch (84 MIDI units)
    • Tuning Frequency (440 Hz)
    • Bins per Octave (12 bins)
    • Normalization (False)
  • Outputs:
    • constantq
      • Description: Extract a spectrogram with constant ratio of centre frequency to resolution from the input audio
      • Comment: Sound energy on given note (across octaves)
      • Usefulness: 2

qm-vamp-plugins:qm-dwt

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Scales (10)
    • Wavelet (Haar)
    • Threshold (0.0)
    • Absolute values (False)
  • Outputs:
    • wcoeff
      • Description: (Discrete Wavelet Transform) Visualization by scalogram
      • Comment: ?
      • Usefulness: 1?

qm-vamp-plugins:qm-keydetector

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 32768
  • Step size: 32768
  • Parameter (default):
    • Tuning Frequency (440 Hz)
    • Window Length (10 chroma frames)
  • Outputs:
    • tonic
      • Description: Tonic of the estimated key (from C=1 to B=12)
      • Comment: Segments of keys (F#, Ab etc)
      • Usefulness: 2
    • mode
      • Description: Estimates the key of music; major or minor mode of the estimated key (major = 0, minor = 1)
      • Comment: Binary segments; either minor or major; can be combined with notes!
      • Usefulness: 2
    • key
      • Description: Estimated key (from C major = 1 to B major = 12 and C minor = 13 to B minor = 24)
      • Comment: Combination of mode and tonic; seems very useful!
      • Usefulness: 3
    • keystrength
      • Description: Correlation of the chroma vector with stored key profile for each major and minor key
      • Comment: value of 13 is empty; 1-12 is major, 14-25 is minor; seems weird but may be useful
      • Usefulness: 2

qm-vamp-plugins:qm-mfcc

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 2048
  • Step size: 1024
  • Parameter (default):
    • Number of Coefficients (20)
    • Power of Mel Amplitude Logs (1)
    • Include C0 (True)
  • Outputs:
    • coefficients
      • Description: MFCC Values
      • Comment: ?
      • Usefulness: 1
    • means
      • Description: Mean values of MFCCs across duration of audio input
      • Comment: Same as above but single values for each MFCC
      • Usefulness: ?

qm-vamp-plugins:qm-onsetdetector

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • Program (None, General purpose, Soft onsets, Percussive onsets)
    • Onset Detection Function Type (Complex Domain)
    • Onset Detector Sensitivity (50%)
    • Adaptive Whitening (False)
  • Outputs:
    • onsets
      • Description: Perceived note onset positions
      • Comment: Binary points indicating note/percussion etc.
      • Usefulness: 0
    • detection_fn
      • Description: Probability function of note onset likelihood
      • Comment: As onsets but with probability per point
      • Usefulness: 1
    • smoothed_df
      • Description: Smoothed probability function used for peak-picking
      • Comment: As detection_fn but smoothed
      • Usefulness: 2

qm-vamp-plugins:qm-segmenter

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 28800
  • Step size: 9600
  • Parameter (default):
    • Number of segment-types (10)
    • Feature Type (Hybrid (Constant-Q))
    • Minimum segment duration (4)
  • Outputs:
    • segmentation
      • Description: Divide the tack into a sequence of consistent segments
      • Comment: Shows repeatable segments of a song
      • Usefulness: 3

qm-vamp-plugins:qm-similarity

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 4096
  • Step size: 2048
  • Parameter (default):
    • Feature Type (Timbre and Rhythm; Timbre; Chroma; Chroma and Rhythm; Rhythm)
  • Outputs:
    • distancematrix
      • Description: Distance matrix for similarity metric. Smaller = more similar. Should be assymetrical.
      • Comment: 4 values, 2 per channel
      • Usefulness: 1?
    • distancevector (Distance from first channel)
      • Description: Distance vector for similarity of each channel to the first channel. Smaller = more similar.
      • Comment: Two values (one per channel)
      • Usefulness: 1
    • sorteddistancevector
      • Description:
      • Comment: ?
      • Usefulness: ?
    • means
      • Description: Means of the feature bins. Feature time (sec) corresponds to input channel. Number of bins depends on selected feature type.
      • Comment: ?
      • Usefulness: ?
    • variances
      • Description: Variances of the feature bins. Feature time (sec) corresponds to input channel. Number of bins depends on selected feature type.
      • Comment: ?
      • Usefulness: ?
    • beatspectrum
      • Description: Rhythmic self-similarity vectors (beat spectra) for the input channels. Feature time (sec) corresponds to input channel. Not returned if rhythm weighting is zero.
      • Comment: ?
      • Usefulness: ?

qm-vamp-plugins:qm-tempotracker

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 1114
  • Step size: 557
  • Parameter (default):
    • Beat Tracking Method (New)
    • Onset detection function type (Complex Domain)
    • Adaptive Whitening (False)
    • Alpha (0.9)
    • Tempo Hint (120 BPM)
    • Constrain Tempo (False)
  • Outputs:
    • beats
      • Description: Estimated metrical beat locations
      • Comment: HANGS
      • Usefulness: ?
    • detection_fn
      • Description: Probability function of note onset likelihood
      • Comment: HANGS
      • Usefulness: ?
    • tempo
      • Description: Locked tempo estimates
      • Comment: HANGS
      • Usefulness: ?

qm-vamp-plugins:qm-tonalchange

  • URL: https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html
  • Block size: 16384
  • Step size: 2048
  • Parameter (default):
    • Gaussian smoothing (5 frames)
    • Chromagram minimum pitch (32 MIDI units)
    • Chromagram maximum pitch (108 MIDI units)
    • Chromagram tuning frequency 440 Hz)
  • Outputs:
    • tcstransform (Transform to 6D Tonal Content Space)
      • Description: Representation of content in a six-dimensional tonal space
      • Comment: ?
      • Usefulness: 1?
    • tcfunction (Tonal Change Detection Function)
      • Description: Estimate of the likelihood of a tonal change occurring within each spectral frame
      • Comment: ? Flat line
      • Usefulness: 2?
    • changepositions (Tonal Change Positions)
      • Description: Estimated locations of tonal changes
      • Comment: ?
      • Usefulness: 1?

qm-vamp-plugins:qm-transcription

segmentino:segmentino

silvet:silvet

  • URL: https://code.soundsoftware.ac.uk/projects/silvet
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Processing mode (Intensive)
    • Instrument (Multiple or unknown instruments)
    • Return fine pitch estimates (False)
  • Outputs:
    • notes (Note Transcription)
      • Description: Overall note transcription. Each note has time, duration, estimated fundamental frequency and synthetic MIDI velocity (0-127) estimated from strength of the pitch in the mixture
      • Comment: MIDI notes (with frequencies and durations)
      • Usefulness: 2
    • onsets (Note onsets)
      • Description: Note onsets with durations. These can be calculated sooner than complete notes, because it isn't necessary to wait for a note to finish before returning its feature. Each event has time, estimated fundamental frequency in Hz, and synthethic MIDI velocity.
      • Comment: Same as notes, but with less detail
      • Usefulness: 1
    • onoffsets (Note onsets and offsets)
      • Description: Note onsets and offsets as separate events (Offsets are represented in the same way but with a velocity of 0)
      • Comment: Same as onsets but with offets, too
      • Usefulness: 1
    • timefreq (Time-frequency distribution)
      • Description: Filtered constant-Q time-frequency distribution as used in input to the expectation-maximisation algorithm
      • Comment: Seems very useful in determining "richness" of sound; constant-q with clearly visible notes and holes
      • Usefulness: 2
    • pitchactivation (Pitch activation distribution)
      • Description: Pitch activation distribution resulting from expectation-maximisation algorithm, prior to note extraction
      • Comment: Clearly visible melody of a track; values 1-87 (1 - A0, 88 - C8)
      • Usefulness: 2
    • chroma (Pitch chroma distribution)
      • Description: Pitch chroma distribution formed by wrapping the un-thresholded pitch activation distribution into a single octave of semitone bins
      • Comment: same as pitchactivation, but for single octave; values 1-12 (1-A, 12 - G#)
      • Usefulness: 2
    • templates
      • Description: Constant-Q spectral templates the selected instrument pack
      • Comment: ?
      • Usefulness: 0

tempogram:tempogram

  • URL: https://code.soundsoftware.ac.uk/projects/vamp-tempogram
  • Block size: 2048
  • Step size: 1024
  • Parameter (default):
    • Novelty Curve Spectrogram Compression Constant (1000)
    • Novelty Curve Minimum DB (-74)
    • Tempogram Window Length (1024)
    • Tempogram Hopsize (64)
  • Outputs:
    • cyclicTempogram
      • Description: Cyclic tempogram calculated by octave folding the DFT tempogram
      • Comment: ?
      • Usefulness: 1
    • tempogramDFT
      • Description: Tempogram calculated using Discrete Fourier Transform
      • Comment: ?
      • Usefulness: 1
    • tempogramACT
      • Description: Tempogram calculated using auto-correlation method
      • Comment: ?
      • Usefulness: 1
    • nc
      • Description: Novelty curve underlying the tempogram calculations
      • Comment: Flat line?
      • Usefulness: ?

ua-vamp-plugins:mf0ua

  • URL: https://grfia.dlsi.ua.es/cm/projects/drims/softwareVAMP.php
  • Block size: 4458
  • Step size: 557
  • Parameter (default):
    • Algorithm (EURASIP12)
    • Maximum polyphony (6)
    • Minimum fundamental frequency (38 Hz)
    • Maximum fundamental frequency (2100 Hz)
    • Minimum note duration (0.02 s)
  • Outputs:
    • mf0ua
      • Description: Multiple fundamental frequency estimation for polyphonic music transcription
      • Comment: Notes in time with duration, frequency and note classification
      • Usefulness: 2

ua-vamp-plugins:onsetsua

  • URL: https://grfia.dlsi.ua.es/cm/projects/drims/softwareVAMP.php
  • Block size: 4458
  • Step size: 2229
  • Parameter (default):
    • Sensitivity (0.18)
  • Outputs:
    • odf (Onset Detection Function)
      • Description: Note onset detection using a one-semitone filterbank
      • Comment: Function of likelihood that in given timestamp, a note is set
      • Usefulness: 2
    • onsets
      • Description: Multiple fundamental frequency estimation for polyphonic music transcription
      • Comment: Note onset detection using a one-semitone filterbank
      • Usefulness: 1

vamp-example-plugins:amplitudefollower

vamp-example-plugins:fixedtempo

  • URL: https://www.vamp-plugins.org/plugin-doc/vamp-example-plugins.html
  • Block size: 256
  • Step size: 64
  • Parameter (default):
    • Minimum estimated tempo (50 BPM)
    • Maximum estimated tempo (190 BPM)
    • Input duration to study (10 s)
  • Outputs:
    • tempo
      • Description: Study a short section of audio and estimate its tempo, assuming the tempo is constant
      • Comment: Single-line function of first few seconds of track
      • Usefulness: 1
    • candidates
      • Usefulness: 0
    • detectionfunction
      • Usefulness: 0
    • acf
      • Usefulness: 0
    • filtered_acf
      • Usefulness: 0

vamp-example-plugins:percussiononsets

  • URL: https://www.vamp-plugins.org/plugin-doc/vamp-example-plugins.html
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • Energy rise threshold (3 dB)
    • Sensitivity (40%)
  • Outputs:
    • onsets
      • Description: Percussive note onset locations
      • Comment: Binary markings of percussion onsets
      • Usefulness: 2
    • detectionfunction
      • Description: Detect percussive note onsets by identifying broadband energy rises
      • Comment: Probability of percussion onset at given time
      • Usefulness: 2

vamp-example-plugins:powerspectrum

vamp-example-plugins:spectralcentroid

  • URL: https://www.vamp-plugins.org/plugin-doc/vamp-example-plugins.html
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • logcentroid
      • Description: Centroid of the log weighted frequency spectrum
      • Comment: Center of frequency?
      • Usefulness: 2
    • linearcentroid
      • Description: Centroid of the linear frequency spectrum
      • Comment: As logcentroid, but more diversed
      • Usefulness: 2

vamp-example-plugins:zerocrossing

  • Outputs:
    • counts
      • Usefulness: 0
    • zerocrossings
      • Description: Location of zero-crossing points
      • Usefulness: 0

vamp-libxtract:amdf

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • amdf (Average Magnitude Difference Function)
      • Description: Extract the AMDF of an audio signal
      • Comment: ?
      • Usefulness: 1?
    • asdf (Average Squared Difference Function)
      • Description: Extract the ASDF of an audio signal
      • Comment: ?
      • Usefulness: 1?

vamp-libxtract:autocorrelation

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • autocorrelation
      • Description: Extract the autocorrelation of an audio signal
      • Comment: ?
      • Usefulness: 1?

vamp-libxtract:average_deviation

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • average_deviation
      • Description: Extract the average deviation of a range of values
      • Comment: ?
      • Usefulness: 1?

vamp-libxtract:bark_coefficients

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • bark_coefficients
      • Description: Extract bark coefficients from an audio spectrum
      • Comment: ? Row range 1-26, don't know if it's frequency range
      • Usefulness: 2

vamp-libxtract:crest

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • crest (Spectral Crest Measure)
      • Description: Extract Spectral Crest Measure from an audio spectrum
      • Comment: Almost flat through whole song up to the end, where higher-frequency noise is visible
      • Usefulness: 1

vamp-libxtract:dct

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • dct (Discrete Cosine Transform)
      • Description: Extract DCT of an audio signal
      • Comment: ?
      • Usefulness: 1?

vamp-libxtract:f0

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • f0 (Fundamental Frequency)
      • Description: Extract Fundamental Frequency of an audio signal
      • Comment: Weird way: fundamental frequency is detected at almost the end of the track (441 Hz), throughout the track it's 0
      • Usefulness: 1

vamp-libxtract:failsafe_f0

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • failsafe_f0
      • Description: Extract Fundamental Frequency of an audio signal
      • Comment: Same as f0
      • Usefulness: 1

vamp-libxtract:flatness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • flatness
      • Description: Extract the spectral flatness of an audio spectrum
      • Comment: Similar to crest output
      • Usefulness: 1

vamp-libxtract:harmonic_spectrum

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak threshold (10.0 %)
    • Harmonic threshold (0.1)
  • Outputs:
    • amplitudes
      • Description: Extract the harmonics from an audio spectrum
      • Comment: 3D harmonics, rows 0-256
      • Usefulness: 1

vamp-libxtract:highest_value

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • highest_value
      • Description: Extract the highest value from a given range
      • Comment: 2D function
      • Usefulness: 1

vamp-libxtract:irregularity_j

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • irregularity_j
      • Description: Extract the irregularity (type I) of an audio spectrum
      • Comment: 2D function
      • Usefulness: 2

vamp-libxtract:irregularity_k

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • irregularity_k
      • Description: Extract the irregularity (type II) of an audio spectrum
      • Comment: 2D function
      • Usefulness: 2

vamp-libxtract:kurtosis

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • kurtosis
      • Description: Extract the kurtosis of a range of values
      • Comment: 2D function
      • Usefulness: 2

vamp-libxtract:loudness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • loudness
      • Description: Extract the loudness of an audio signal from its spectrum
      • Comment: 2D function
      • Usefulness: 2

vamp-libxtract:lowest_value

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • lowest_value
      • Description: Extract the lowest value from a given range
      • Comment: 2D function
      • Usefulness: 1

vamp-libxtract:mean

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • mean
      • Description: Extract the mean of a given range
      • Comment: 2D function
      • Usefulness: 1

vamp-libxtract:mfcc

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Minimum frequency (80.0 Hz)
    • Maximum frequency (18000.0 Hz)
    • # Mel Frequency Bands (40)
    • Lowest coefficient returned (0)
    • Highest coefficient returned (20)
    • MFCC Type (Equal Gain)
  • Outputs:
    • mfcc
      • Description: Extract MFCC from an audio spectrum
      • Comment: 3D function
      • Usefulness: 1

vamp-libxtract:noisiness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
    • Harmonic Threshold (0.1)
  • Outputs:
    • noisiness
      • Description: Extract MFCC from an audio spectrum
      • Comment: Zero values(!)
      • Usefulness: 0

vamp-libxtract:nonzero_count

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
  • Outputs:
    • nonzero_count
      • Description: Extract the number of non-zero elements in an input spectrum
      • Comment: 2d function
      • Usefulness: 0

vamp-libxtract:odd_even_ratio

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
    • Harmonic Threshold (0.1)
  • Outputs:
    • odd_even_ratio
      • Description: Extract the odd-to-even harmonic ratio of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:peak_spectrum

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
  • Outputs:
    • amplitudes
      • Description: Extract the spectral peaks from an audio spectrum
      • Comment: 3d function
      • Usefulness: 1

vamp-libxtract:rms_amplitude

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • rms_amplitude
      • Description: Extract the RMS amplitude of an audio signal
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:rolloff

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Rolloff Threshold (90.0 %)
  • Outputs:
    • rolloff
      • Description: Extract the rolloff point of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:sharpness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • sharpness
      • Description: Extract the spectral sharpness of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:skewness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • skewness
      • Description: Extract the skewness of a range of values
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:smoothness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • smoothness
      • Description: Extract the smoothness of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_centroid

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spectral_centroid
      • Description: Extract the spectral centroid of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_inharmonicity

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
  • Outputs:
    • spectral_inharmonicity
      • Description: Extract the inharmonicity of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_kurtosis

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spectral_kurtosis
      • Description: Extract the kurtosis of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_skewness

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spectral_skewness
      • Description: Extract the skewness of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_slope

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spectral_slope
      • Description: Extract the spectral slope of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_standard_deviation

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spectral_standard_deviation
      • Description: Extract the standard deviation of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectral_variance

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spectral_variance
      • Description: Extract the variance of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:spectrum

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • Type (Magnitude Spectrum)
    • Include DC (False)
    • Normalise (False)
  • Outputs:
    • amplitudes
      • Description: Extract the variance of an audio spectrum
      • Comment: 3d function
      • Usefulness: 2

vamp-libxtract:spread

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • spread
      • Description: Extract the spectral spread of an audio spectrum
      • Comment: 2d function
      • Usefulness: 3

vamp-libxtract:standard_deviation

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • standard_deviation
      • Description: Extract the standard deviation of a range of values
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:sum

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • sum
      • Description: Extract the sum of values in a given range
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:tonality

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • tonality
      • Description: Extract the tonality of an audio spectrum
      • Comment: List of all 0s(?)
      • Usefulness: 0

vamp-libxtract:tristimulus_1

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
    • Harmonic Threshold (0.1)
  • Outputs:
    • tristimulus_1
      • Description: Extract the tristimulus (type I) of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:tristimulus_2

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
    • Harmonic Threshold (0.1)
  • Outputs:
    • tristimulus_2
      • Description: Extract the tristimulus (type II) of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:tristimulus_3

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • Peak Threshold (10.0 %)
    • Harmonic Threshold (0.1)
  • Outputs:
    • tristimulus_3
      • Description: Extract the tristimulus (type III) of an audio spectrum
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:variance

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 1024
  • Parameter (default):
    • (None)
  • Outputs:
    • variance
      • Description: Extract the variance of a range of values
      • Comment: 2d function
      • Usefulness: 2

vamp-libxtract:zcr

  • URL: https://github.com/jamiebullock/LibXtract
  • Block size: 1024
  • Step size: 512
  • Parameter (default):
    • (None)
  • Outputs:
    • zcr
      • Description: Extract the zero crossing rate of an audio signal
      • Comment: 2d function
      • Usefulness: 1

Toolbox:

Clone this wiki locally