Problem definition: implementing a solution for online customer matching for a large telecommunications company servicing more than 10 million customers.
Customer Relationship Management applications face a major challenge in implementing accurate and timely customer identification for several reasons:
- customer reps misspelling customer name and addresses
- large number of records (ranging anywhere from few millions to hundreds of millions) drives search response times higher
- potential for fraud in case of identity misrepresentation
There are few lessons learned from this:
- don't trust legacy data, use a data cleansing tool to validate and normalize
- no matter how sophisticated, no out of box tool offers a comprehensive solution
- performance tuning is critical and will drive a lot the design and implementation, as I found in performance testing with real live data volumes, only few weeks before going live ...
