If it's human entered data, then you'll spend too much time trying to code around the exceptions.
Try:
Regular expression to extract the zip code
Zip code lookup (via appropriate government DB) to get the correct address
Get an intern to manually verify the new data matches the old