PhD studentships in AI for Sound - Automatic sound labelling for broadcast audio

The AI for Sound project in the Centre for Vision, Speech and Signal Processing (CVSSP) at the University of Surrey is offering the following PhD studentships in AI for Sound, available from 1 October 2021:
(1) Automatic sound labelling for broadcast audio

The aim of this project is to develop new methods for automatic labelling of sound environments and events in broadcast audio, helping production staff to find and search through content and helping the general public to access archive content. To determine the most useful terminology for these two user groups, the project will combine interviews and user profiling, analysis of audio search datasets, and categorisation by audio experts. It will develop a taxonomy of labels and examine the similarities and differences between the two groups.

The project will also investigate the application of a labelled library in a production environment, examining workflows with common broadcast tools, then integrating and evaluating prototype systems. In addition, it will investigate methods for automatic subtitling of non-speech sounds, such as end-to-end encoder-decoder models with alignment, which map the acoustic signal directly to text sequences.

Working with BBC R&D, the student will develop software tools to demonstrate the results, especially for broadcasting and the management of audiovisual archive data, and benchmark the results against human-assigned tags and descriptions of audio content. Using archive data provided by BBC R&D, the student will engage with audio production and research experts through Expert Panels, and with potential end users through Focus Groups. As part of this PhD, you will have the opportunity for close day-to-day collaboration with the BBC as a member of the R&D Audio Team.

We particularly encourage applications from candidates with disabilities, Black, Asian and Minority Ethnic candidates, and female candidates, as these groups are underrepresented throughout our area. We also welcome enquiries from self-funded and part-funded candidates.

For informal enquiries on opportunities related to AI for Sound, please contact Prof Mark Plumbley (m.plumbley@surrey.ac.uk).

More information and how to apply: https://www.surrey.ac.uk/fees-and-funding/studentships/automatic-sound-labelling-broadcast-audio

Posted on 16th July 2021 in Careers in acoustics, Communication and Room Acoustics, Early Career
