I have a Rails app with a contact list that needs to interface with Outlook
and at least one other external data source. My fear is that as CSVs get
thrown at my Web app, I will end up with subtly different items referring to
the same contact.
I’ve come up with several alternate approaches to this but thought I’d ask
if anyone else has already faced this problem. FWIW, here are two
approaches I felt might work:
1. Tag contacts that have already been sync’ed with Outlook. Strangely,
   Outlook does not provide any unique identifier with its contacts,
   so this would have to be done in some custom field. Ack!
2. Use a proximity or fuzzy match to determine whether the same contact is
   being updated. So, “Sam Smith” and “Sammy Smith” might be the same contact,
   but “Sam Jones” would not be. The user could then manually resolve the
   ambiguous matches.
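For (1), here is a rough sketch of what I have in mind, in plain Ruby rather than against my actual models. Everything named here is an assumption for illustration: "SyncId" is a hypothetical column standing in for whatever custom Outlook field would carry the tag, and the contacts Hash stands in for the contacts table.

```ruby
require "csv"

# "SyncId" is a hypothetical column name standing in for whatever custom
# Outlook field ends up holding the tag.
SYNC_COLUMN = "SyncId"

# contacts: Hash of sync id => attribute Hash (a stand-in for the contacts table).
# Updates tagged rows in place and returns the rows that carried no tag,
# which cannot be matched directly and need some other strategy.
def import_tagged(csv_text, contacts)
  untagged = []
  CSV.parse(csv_text, headers: true).each do |row|
    id = row[SYNC_COLUMN].to_s.strip
    if id.empty?
      untagged << row.to_h
    else
      attrs = row.to_h.reject { |key, _| key == SYNC_COLUMN }
      contacts[id] = (contacts[id] || {}).merge(attrs)
    end
  end
  untagged
end
```

Rows that come back untagged are the ones that would still need fuzzy matching or manual review.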
Regarding (2), ferret seems like a good way to get a Levenshtein distance
for my existing data, as the data can be indexed as it is added, economizing
on the matching hassle later.
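To make the fuzzy-match idea concrete, here is a minimal plain-Ruby sketch of Levenshtein distance used to flag likely duplicates. It does not use ferret at all; the helper names and the threshold of 2 are my own assumptions for illustration, and a real threshold would need tuning against actual data.

```ruby
# Classic dynamic-programming edit distance, keeping one row at a time.
# Comparison is case-insensitive.
def levenshtein(a, b)
  a = a.downcase
  b = b.downcase
  prev = (0..b.length).to_a
  a.each_char.with_index(1) do |ca, i|
    curr = [i]
    b.each_char.with_index(1) do |cb, j|
      cost = ca == cb ? 0 : 1
      # deletion, insertion, substitution (or match)
      curr << [prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost].min
    end
    prev = curr
  end
  prev.last
end

# max_distance is an arbitrary threshold picked for illustration.
# Returns the existing names close enough to be offered for manual review.
def possible_duplicates(name, existing_names, max_distance: 2)
  existing_names.select { |other| levenshtein(name, other) <= max_distance }
end
```

With these definitions, levenshtein("Sam Smith", "Sammy Smith") is 2 and levenshtein("Sam Smith", "Sam Jones") is 5, so only the first pair would be flagged as a possible duplicate.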
Anyone have any thoughts or experience with this?
Sent from the RubyOnRails Users mailing list archive at Nabble.com.