testset = second top 5000 lines of ../val/Tatoeba.src.shuffled!