An Objective Measure of Quality for Time-Scale Modification of Audio

Article


Roberts, Timothy and Paliwal, Kuldip K.. 2021. "An Objective Measure of Quality for Time-Scale Modification of Audio." The Journal of the Acoustical Society of America. 149 (3), pp. 1843-1854. https://doi.org/10.1121/10.0003753
Article Title

An Objective Measure of Quality for Time-Scale Modification of Audio

ERA Journal ID1297
Article CategoryArticle
AuthorsRoberts, Timothy and Paliwal, Kuldip K.
Journal TitleThe Journal of the Acoustical Society of America
Journal Citation149 (3), pp. 1843-1854
Number of Pages12
Year2021
PublisherAIP Publishing
Place of PublicationUnited States
ISSN0001-4966
1520-8524
Digital Object Identifier (DOI)https://doi.org/10.1121/10.0003753
Web Address (URL)https://pubs.aip.org/asa/jasa/article-abstract/149/3/1843/973703/An-objective-measure-of-quality-for-time-scale?redirectedFrom=fulltext
Abstract

Objective evaluation of audio processed with time-scale modification (TSM) remains an open problem. Recently, a dataset of time-scaled audio with subjective quality labels was published and used to create an initial objective measure of quality (OMOQ). In this paper, an improved OMOQ for time-scaled audio is proposed. The measure uses handcrafted features and a fully connected network to predict subjective mean opinion scores (SMOS). Basic and advanced perceptual evaluation of audio quality features are used in addition to nine features specific to TSM artefacts. Six methods of alignment are explored with interpolation of the reference magnitude spectrum to the length of the test magnitude spectrum giving the best performance. The proposed measure achieves a mean root mean square error of 0.490 and a mean Pearson correlation of 0.864 to SMOS, equivalent to the 97th and 82nd percentiles of the subjective sessions, respectively. The proposed measure is used to evaluate TSM algorithms, finding that Elastique gives the highest objective quality for solo instrument and voice signals, whereas the identity phase-locking phase vocoder gives the highest objective quality for music signals and the best overall quality. The objective measure is available online at https://www.github.com/zygurt/TSM.

KeywordsTime Scale Modification; Quality; Neural Network; Audio
Contains Sensitive ContentDoes not contain sensitive content
ANZSRC Field of Research 2020400607. Signal processing
461104. Neural networks
Public Notes

This article may be downloaded for personal use only. Any other use requires prior permission of the author and AIP Publishing. This article appeared in J. Acoust. Soc. Am. 149, 1843–1854 (2021) and may be found at https://doi.org/10.1121/10.0003753.

Byline AffiliationsGriffith University
Permalink -

https://research.usq.edu.au/item/zz13w/an-objective-measure-of-quality-for-time-scale-modification-of-audio

Download files


Published Version
JASA OMOQ Roberts-Published.pdf
License: Used with permission
File access level: Anyone

  • 3
    total views
  • 1
    total downloads
  • 3
    views this month
  • 1
    downloads this month

Export as

Related outputs

Underground LoRa sensor node for bushfire monitoring
Herring, Ben, Sharp, Tony, Roberts, Tim, Fastier-Wooller, Jarred, Kelly, Greg, Sahin, Oz, Thiel, David, Dao, Dao and Woodfield, Peter L.. 2022. "Underground LoRa sensor node for bushfire monitoring." Fire Technology. 58 (3), pp. 1087-1095. https://doi.org/10.1007/s10694-022-01224-3
Deep learning-based single-ended quality prediction for time-scale modified audio
Roberts, Timothy, Nicolson, Aaron and Paliwal, Kuldip K.. 2021. "Deep learning-based single-ended quality prediction for time-scale modified audio." Journal of the Audio Engineering Society. 69 (9), pp. 644-655. https://doi.org/10.17743/jaes.2021.0031
Design of Objective Quality Measures for Time-Scale Modification of Audio
Roberts, Timothy. 2021. Design of Objective Quality Measures for Time-Scale Modification of Audio. PhD Thesis Doctor of Philosophy. Griffith University. https://doi.org/10.25904/1912/4070
A Time-Scale Modification Dataset with Subjective Quality Labels
Roberts, Timothy and Paliwal, Kuldip K. 2020. "A Time-Scale Modification Dataset with Subjective Quality Labels." The Journal of the Acoustical Society of America. 148 (1), pp. 201-210. https://doi.org/10.1121/10.0001567
Time-scale modification using fuzzy epoch-synchronous overlap-add (FESOLA)
Roberts, Timothy and Paliwal, Kuldip K.. 2019. "Time-scale modification using fuzzy epoch-synchronous overlap-add (FESOLA)." 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). New Paltz, United States 20 - 23 Oct 2019 United States. IEEE (Institute of Electrical and Electronics Engineers). https://doi.org/10.1109/waspaa.2019.8937258