Skip to yearly menu bar Skip to main content


Short Presentation
in
Affinity Workshop: LXAI Research @ NeurIPS 2020

Towards forensic speaker identification in Spanish using triplet loss

Ivan Vladimir Meza Ruiz


Abstract:

This work explores the use of a triplet loss deep network setting for the forensic identification of speakers in Spanish. Within the framework we train a convolutional network to produce vector representations of speech spectrogram slices. Then we test how similar are these vectors for a given speaker and how dissimilar are compared with other speakers. Based on these metrics we propose the calculation of the Likelihood Radio which is a cornerstone for forensic identification.

Chat is not available.