A Python script for generating duplicate data to test the performance of record linkage and master data management systems.