It's because the Neo Geo CD system only has 1 megabyte of ram for V rom data (sound samples). For most games to fit, you have to trim the game's V rom data to be smaller than 1 megabyte.
Then when the game needs to access any of that sample data that got trimmed off, your speakers get filled with that glitchy mess instead of the intended sound.
The way to fix this would be to program load screens in the middle of the game, but that's too much effort. That's why most conversions are only for the smaller games, that fit within the Neo Geo CD's ram limitations.