Hearing different "tones" may be actually hearing different timbres, that is different distributions of audio power in the overtones. I believe the app uses an FFT (Fast Fourier Transform algorithm) to sort thorough this and identify the fundamental.
As for accuracy in measuring length, the tension determination is proportional to length so a 5% error in length will give 5% error in tension, to first order anyway.