I am currently working on a project where a data matching algorithm needs to be implemented. An external system passes in all data it knows about a customer, and the syste
What about a machine learning approach? Create distances per item.
These become your input space. Build a training set of correctly matched customers based on these distances. Run it through your favourite machine learning algorithm. Get the parameters for your decision function, which reflect the strength of a match. Tune. Apply to new cases. Go to the bank.
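A rough sketch of the steps above, in case it helps. Everything concrete here is my assumption rather than part of the original suggestion: the field names (`name`, `email`, `postcode`), the sample records, the use of `difflib.SequenceMatcher` as the per-item distance, and a small hand-rolled logistic regression standing in for "your favourite" learner. I also added non-matching pairs as negative examples, since a classifier needs both classes to learn a boundary.

```python
import math
from difflib import SequenceMatcher

FIELDS = ("name", "email", "postcode")  # hypothetical field names


def field_distance(a, b):
    """Distance in [0, 1]: 0.0 for identical strings, larger for less overlap."""
    if not a or not b:
        return 1.0  # assumption: a missing value counts as maximally distant
    return 1.0 - SequenceMatcher(None, a.lower(), b.lower()).ratio()


def distance_vector(rec_a, rec_b):
    """One distance per field; this vector is the input space."""
    return [field_distance(rec_a.get(f, ""), rec_b.get(f, "")) for f in FIELDS]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(X, y, lr=0.5, epochs=500):
    """Logistic regression fitted with plain stochastic gradient descent."""
    weights, bias = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            p = sigmoid(bias + sum(w * x for w, x in zip(weights, xi)))
            err = p - yi
            weights = [w - lr * err * x for w, x in zip(weights, xi)]
            bias -= lr * err
    return weights, bias


def match_score(weights, bias, rec_a, rec_b):
    """Decision function: estimated probability that two records match."""
    x = distance_vector(rec_a, rec_b)
    return sigmoid(bias + sum(w * xj for w, xj in zip(weights, x)))


# Made-up training data: each tuple holds two spellings of the same customer.
same_person = [
    ({"name": "John Smith", "email": "john.smith@example.com", "postcode": "AB1 2CD"},
     {"name": "Jon Smith", "email": "john.smith@example.com", "postcode": "AB1 2CD"}),
    ({"name": "Maria Garcia", "email": "m.garcia@example.com", "postcode": "XY9 8ZZ"},
     {"name": "Maria Garcia", "email": "mgarcia@example.com", "postcode": "XY98ZZ"}),
    ({"name": "Wei Chen", "email": "wchen@example.org", "postcode": "LM4 5NP"},
     {"name": "Wei Chen", "email": "wei.chen@example.org", "postcode": "LM4 5NP"}),
]

# Label 1 for true matches, label 0 for records of two different customers.
pairs = [(a, b, 1) for a, b in same_person]
pairs += [(same_person[i][0], same_person[j][1], 0)
          for i in range(3) for j in range(3) if i != j]

X = [distance_vector(a, b) for a, b, _ in pairs]
y = [label for _, _, label in pairs]
weights, bias = train(X, y)

# Apply to a new case: a slightly misspelled record for a known customer.
incoming = {"name": "J. Smith", "email": "john.smith@example.com", "postcode": "AB12CD"}
print(match_score(weights, bias, same_person[0][0], incoming))
```

The learned weights indicate how much each field's distance counts against a match, and the score is what you would threshold. Choosing that threshold on held-out labelled pairs is the "Tune" step; with real data you would want far more than nine training pairs and probably a field-specific distance for each item.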