$ 0 0 Knowing who is speaking is a really hard algorithm to write. It cannot be done on the basis of pitch alone.