testset = top 1000 lines of ../val/GlobalVoices.src.shuffled!