I just joined and have started working on a project, and I'm wondering whether it has already been done. I have a DB that stores info about users: login ID, first name, last name, employee ID, email, etc. I've been asked to devise an algorithm to do some type of fuzzy match so that whenever we import a new user, we can compare the login ID against the other data elements to see if it's the same person. For example: jdoe has an 80% probability of matching an entry with first name john and last name doe.
So we would have a set of rules and pattern matching based on 5 or 6 data elements.
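To make the idea concrete, here is a rough sketch of the kind of rule I have in mind, using only Python's standard `difflib`. The function names and the list of login patterns are assumptions made up for illustration, not an existing library, and the score is a string similarity between 0 and 1, not a calibrated probability.

```python
from difflib import SequenceMatcher


def candidate_logins(first_name, last_name):
    """Build login IDs a person with this name might plausibly have.

    The patterns below are hypothetical rules; adjust them to local conventions.
    """
    first = first_name.strip().lower()
    last = last_name.strip().lower()
    if not first or not last:
        return []
    return [
        first[0] + last,     # jdoe
        first + last,        # johndoe
        first + "." + last,  # john.doe
        last + first[0],     # doej
        first + last[0],     # johnd
    ]


def login_match_score(login_id, first_name, last_name):
    """Return the best similarity (0.0 to 1.0) between the login and any candidate."""
    login = login_id.strip().lower()
    candidates = candidate_logins(first_name, last_name)
    if not login or not candidates:
        return 0.0
    # SequenceMatcher.ratio() is 1.0 for identical strings and 0.0 when
    # the two strings share no characters.
    return max(SequenceMatcher(None, login, c).ratio() for c in candidates)
```

Something like `login_match_score("jdoe", "John", "Doe")` would give the top score because `jdoe` is one of the generated patterns. The other data elements (employee ID, email, and so on) could each get a similar scoring rule, with the results combined by weights that would need tuning against known matches.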
Does anyone know if this has been done, and are there any references or open source code that could help?