MySQL Forums
Forum List  »  PHP

Re: Checking for exact match in row
Posted by: Rick James
Date: July 12, 2009 07:02PM

"close to" is a non-trivial research project.

One approach is the come up with a "canonical form" of each word and build a lookup table that maps the canonical form to the real form(s).
* Strip trailing stuff
* Strip optional prefixes
* Pick one of multiple variants
* remove spaces, apostrophes, etc

Some thoughts on names:
Mac... -> Mc...
O' -> O
Ian/Juan/Jack -> John
Dick/Rich/Rick -> Richard

SOUNDEX() is a crude way to get a canonical form, but it depends somewhat on English pronunciation.

Another trick is to remove all vowels, on the presumption that they are the most likely to be wrong.

Options: ReplyQuote


Subject
Written By
Posted
Re: Checking for exact match in row
July 12, 2009 07:02PM


Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.