I know it is bad style to answer one's own question, but I found a nice BSD-licensed API with a number of different implementations. It is called Cognitive Foundry and is developed at a US national lab. It also comes with implementations of significance tests, clustering, statistical utilities, and a text package.