### File names:

all.wav: generated by all 12 WSJ ASR (ID:1-12) guidance, without speaker guidance.
enroll.wav: the enrollment speech that used for speaker.
gt.wav: ground truth speech.
spk_6: generated by 6 ASR (ID 1-6) guidance, with speaker guidance.
spk_all.wav: generated by all 12 WSJ ASR (ID:1-12) guidance,  with speaker guidance.