I'm not sure if it's still the case, but CAPTCHA isn't as random as it seems. The words/phrases come from the computers (and companies) which are digitising books. The words you are trying to guess are words or phrases from scanned text which the computer couldn't make out, or for which it wants confirmation that its guess is correct.
Again, this may not be correct anymore, but that's certainly what it used to do.