Robin Linacre – Rapid deduplication and fuzzy matching of large datasets using Splink



www.pydata.org

Data deduplication is a ubiquitous data quality problem that most data people will encounter at some point in …


Amazon Affiliate Disclaimer

“As an Amazon Associate I earn from qualifying purchases.”
