Please can someone help me parse these links from an HTML page
Your regular expression is looking at ALL tags. "handle" is always used as "/dspace/handle" etc. so you can use something like this to scrape the urls you're looking for:
Pattern pattern = Pattern.compile("