$ 0 0 I would suggest using Mp3FileReader (or NLayer) to decode the file. Once you have the raw PCM samples, you'll need to do voice analysis. I'm not familiar with the state of the art there, but there's plenty of documentation out there: Google. Good luck!