I\'ve written background InputStream (and OutputStream) implementations that wrap other streams, and read ahead on a background thread, primarily allow         
        
I'd be interested. I've thought through a similar project, but couldn't figure out how to handle pieces that finish compression out of order.
Sounds interesting. I've never run across anything that does this out of the box but it makes perfect sense to try and use an idle core for the compression if it's available.
Perhaps you could make use of Commons I/O - it is a well tested lib which could help handle some of the more boring stuff and let you focus on extending the cool parallel parts. Maybe you could even contribute your code to the Commons project ;-)