There is some risk that Whisper transcribed inaccurately, but that’s less model collapse and more “the dataset is bad”.
There is some risk that Whisper transcribed inaccurately, but that’s less model collapse and more “the dataset is bad”.