Finding the WHO, WHAT & WHEN in audio
Oxford Wave Research (OWR) is a leading R&D company based in Oxford, UK, specialising in audio and speech processing, voice biometrics and deep learning-related product development. Our team has many years of experience developing solutions for law enforcement, military as well as other agencies both in the UK and around the world.
Inclusive security using voice biometrics and Microsoft Identity
Inclusive security for the visually impaired, those with difficulty reading or understanding a language, or don’t have access to a dedicated personal device
This is the fascinating use case for which our partner WhoIAM has been using our voice biometrics technology. This has just been featured in the latest edition of the Microsoft Azure Identity partner integration video that uses our speaker biometrics-based authentication to make identity security design more inclusive. We at Oxford Wave Research support this laudable goal all the way!
As Ajith Alexander, head of product management at WhoIAM, writes in the Microsoft Azure AD identity blog:
“Using voice biometrics for verification is also a powerful tool for implementing inclusive security. Human voices are readily available, can be recorded in a contactless way, and do not require specialized hardware. Our voice carries an imprint of our identity that comes through regardless of what we’re saying, what language we’re speaking, or where we’re speaking from. This makes voice biometrics an ideal choice for catering to users who are visually impaired, have difficulty reading or understanding a language, don’t have access to a dedicated personal device (residents at assisted-living communities, shift-workers), or live in less developed areas that rely on fixed phone lines. …Creatively solving for flexible, inclusive user verification ensures we can log in previously marginalized customers securely without identity verification being a frustrating experience.”Ajith Alexander, Head of Product Management, WhoIAM
Assessing perceived voice similarity using ASR methods incorporating phonetic features can have major implications for voice lineups, voice banking, even forensic speaker recognition - A joint work with OWR and Cambridge University.https://t.co/hpKeY9jjfw#ECR2021 @CamLangSci pic.twitter.com/M2xCflKN3R— Oxford Wave Research (@OxfordWave) June 16, 2021
Security & Policing 2021- The Virtual one!
Virtual Event, Virtual Stand, even Virtual Sweets, but the same real people!The OWR team, Nikki, Ekrem, Oscar and Anil warmly welcome you to join us at the first ever virtual Security & Policing event taking place 9-11 March 2020. This year's event coincides with our 10 year anniversary and we are proud to be sharing with you our state-of-the-art desktop and ‘on-device’ speaker recognition and audio processing software in use at the forefront of the voice biometric field.
Inclusive security for the visually impaired, those with difficulty reading or understanding a language, or don’t have a dedicated personal device - thrilled for our tech to be featured in a Microsoft Identity partner video with https://t.co/KnguTii238 https://t.co/v8X8MHqYaj pic.twitter.com/VfWQ15eGev— Oxford Wave Research (@OxfordWave) May 21, 2021
Oxford Wave Research PhD Studentship at Cambridge
Collaborative PhD studentship between Oxford Wave Research, the University of Cambridge, and the Cambridge Trust
We are delighted that Oxford Wave Research Ltd and the University of Cambridge, in collaboration with the Cambridge Trust, have established a new PhD studentship based at Selwyn College, Cambridge. The award enables a student to undertake a PhD in Theoretical and Applied Linguistics, commencing in October 2020. The studentship was open to UK and EU candidates of outstanding academic potential, and covers tuition fees and maintenance for three years. The studentship is in the area of forensic phonetics, the application of phonetic analysis to criminal cases, often where the identity of a speaker is in question, either due to an incriminating recording (e.g. hoax call, ransom demand, telephone threat, etc.) or due to a witness having heard a speech event at a crime scene. Forensic phonetics uses both traditional phonetic and automatic (machine-based) techniques. The PhD project aims to consider the relationships between traditional phonetic analyses and automatic speaker recognition (computer-based identification and recognition of the identity behind a voice). The studentship will include collaborative opportunities for the student to gain industry experience and to conduct research in conjunction with Oxford Wave Research, an audio processing and voice-biometrics company which specialises in developing solutions for law enforcement agencies in forensic voice comparison. The student’s research will consider both human and machine-based, algorithmic selection of different groups of speakers for various forensic analyses based on different criteria and the implication of the selections of these groups in the evaluation of the strength of evidence. These criteria include voice similarity perceived by human listeners and demographic features such as gender, language, age, regional accent. Further, the research will attempt to evaluate how the human or automatic, machine-based selection of databases can result in algorithmic bias.
“We are delighted to be working with a leading audio-processing and voice biometrics company that has such a strong track record of developing solutions in the forensic speech and audio arena. Cambridge has a well-established tradition of research excellence and innovation in forensic phonetics and the opportunity to bring automatic speaker recognition techniques to complement our acoustic-phonetic and perceptual approaches represents an exciting new line of investigation for our Phonetics Lab.” Dr Kirsty McDougall, University of Cambridge
“This studentship overseen by Dr McDougall at the Phonetics Laboratory in Cambridge represents an incredible opportunity for us to formally collaborate with one of the best-regarded forensic phonetics research groups in the country, with an enduring legacy of fundamental and important research work. We look forward to the exciting research collaboration planned with the laboratory in this studentship that has important implications for how forensic casework involving speech is done in the future and which will help the legal system by providing timely, just and balanced analysis.” Dr Anil Alexander, CEO of Oxford Wave Research
Current award-holderThe recipient of this studentship in 2020 is Ms Linda Gerlach. Linda obtained her undergraduate degree in Language and Communication at Philipps University Marburg, Germany, and went on to complete her masters degree in Speech Science with a focus on phonetics at the same university. For her masters thesis titled “A study on voice similarity ratings: humans versus machines”, she worked in collaboration with the University of Cambridge during an internship at Oxford Wave Research (2018-2019).
About University of Cambridge Phonetics LaboratoryThe University of Cambridge Phonetics Laboratory is based in the university’s Theoretical and Applied Linguistics Section, Faculty of Modern and Medieval Languages, and accommodates a strong community of teaching and research staff, research students, a number of affiliated researchers in phonetics, and a lab manager. As well as hosting an extensive programme of research in forensic phonetics, the lab fosters research in phonetics and phonology across a diverse range of topics including speech production and perception, language acquisition, psycholinguistics, prosody, tone, sociophonetics, and language variation and change. Recent funded projects in forensic phonetics include DyViS, VoiceSim and IVIP.
About Oxford Wave ResearchOxford Wave Research (OWR) is a specialised audio R&D company with expertise in voice biometrics, speaker diarization, audio fingerprinting and audio enhancement. The OWR team have contributed to major government projects, nationally and internationally. OWR has been particularly successful in bringing practical applications of state-of-the-art academic research algorithms to usable commercial products for law enforcement, military and other agencies. OWR’s solutions are used by law enforcement and forensic laboratories across the world including the UK, Germany, Netherlands, France, Canada, Switzerland. OWR are the creators of the well-established forensic voice comparison system 'VOCALISE', used in forensic audio labs across the world, as well as ‘WHISPERS’ which is a powerful networked ‘one to many’ voice comparison system.
Oxford Wave Research publications at ODYSSEY 2020
Two of our publications at the ODYSSEY 2020 Speaker and Language Recognition Workshop
Two of our collaborative papers, one on voice spoofing detection, and the other on the effects of device variability on forensic speaker comparison, are appearing at this week’s virtual ODYSSEY 2020 Speaker and Language Recognition Workshop. Video presentations for both papers are now available on the workshop website: http://www.odyssey2020.org/
The full papers, along with the rest of the conference proceedings, can be found at: https://www.isca-speech.org/archive/Odyssey_2020/
In our paper with Bence Halpern (PhD student, University of Amsterdam), “Residual networks for resisting noise: analysis of an embeddings-based spoofing countermeasure,” we propose a new embeddings-based method of spoofed speech detection using Constant Q-Transform (CQT) features and a Dilated ResNet Deep Neural Network (DNN) architecture. The novel CQT-GMM-DNN approach, which uses the DNN embeddings with a Gaussian Mixed Model (GMM) classifier, performs favourably compared to the baseline system in both clean and noisy conditions. We also present some ‘explainable audio’ results, which provide insight into the information the DNN exploits for decision-making. This study shows that reliable detection of spoofed speech is increasingly possible, even in the presence of noise.
See a blog post from Bence (including some explainable audio examples) here: https://karkirowle.github.io/publication/odyssey-2020
In our paper with David van der Vloed (from the Netherlands Forensic Institute), “Exploring the effects of device variability on forensic speaker comparison using VOCALISE and NFI-FRIDA, a forensically realistic database,” we investigate the effect of recording device mismatch on forensic speaker comparison with VOCALISE. Using the forensically-realistic NFI-FRIDA database, consisting of speech simultaneously-recorded on multiple devices (e.g. close-mic, far-mic, and telephone intercept, as seen in the data collection image), we demonstrate that while optimal performance is achieved by matching the relevant population recording device to the case data recording device, it is not necessary to match the precise device; broadly matching the device type is sufficient. This study presents a research methodology for how a forensic practitioner can corroborate their subjective judgment of the ‘representativeness’ of the relevant population in forensic speaker comparison casework.
One day to go! The 2021 Security & Policing Event starts tomorrow and we’re looking forward to ‘virtually’ seeing you there! Here’s a sneak peek till then!https://t.co/8CEICM7Aai#oxfordwaveresearch #biometrics #voicebiometrics #securityandpolicing #forensicaudio— Oxford Wave Research (@OxfordWave) March 8, 2021
Do face coverings affect identifying voices?
Vlog: Do face coverings affect identifying voices?
A small experiment using VOCALISE and PHONATE
In these recent months of 2020, like many others around the world, we have found ourselves adjusting to the new normal of wearing masks in various places like supermarkets and other public spaces. We found ourselves (minorly) annoyed that some biometric identification, like face recognition, doesn't quite work when wearing masks. This made us wonder how well voice biometric solutions could work when speakers are wearing masks, and we decided to perform a small experiment to analyse this.
Over the last few weeks, we have been performing some small-scale tests of our VOCALISE and PHONATE software against speech spoken from behind a mask. We have found our systems to be quite robust to masked speech - they are able to recognise speakers across different mask-wearing conditions well.
The video below explains our experiment and discusses our findings. We hope that you find it interesting![embed]https://www.youtube.com/watch?v=NUSD-TWTCQY&feature=youtu.be[/embed] Download Transcript