RecordLinkage: Record Linkage Functions for Linking and Deduplicating Data Sets

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain. For details, see our paper "The RecordLinkage Package: Detecting Errors in Data" Sariyar M / Borg A (2010) <doi:10.32614/RJ-2010-017>.

Version: 0.4-12.1
Depends: R (≥ 3.5.0), DBI, RSQLite (≥ 1.0.0), ff
Imports: e1071, rpart, ada, ipred, stats, evd, methods, data.table (≥ 1.7.8), nnet, xtable
Suggests: RUnit, knitr
Published: 2020-08-25
Author: Murat Sariyar [aut, cre], Andreas Borg [aut]
Maintainer: Murat Sariyar <murat.sariyar at bfh.ch>
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
URL: https://journal.r-project.org/archive/2010-2/RJournal_2010-2_Sariyar+Borg.pdf
NeedsCompilation: yes
Materials: NEWS
CRAN checks: RecordLinkage results

Documentation:

Reference manual: RecordLinkage.pdf
Vignettes: Classes for record linkage of big data sets
Record Linkage with Extreme Value Theory
Supervised Classification
Weight-based deduplication

Downloads:

Package source: RecordLinkage_0.4-12.1.tar.gz
Windows binaries: r-devel: RecordLinkage_0.4-12.1.zip, r-devel-UCRT: RecordLinkage_0.4-12.1.zip, r-release: RecordLinkage_0.4-12.1.zip, r-oldrel: RecordLinkage_0.4-12.1.zip
macOS binaries: r-release (arm64): RecordLinkage_0.4-12.1.tgz, r-release (x86_64): RecordLinkage_0.4-12.1.tgz, r-oldrel: RecordLinkage_0.4-12.1.tgz
Old sources: RecordLinkage archive

Reverse dependencies:

Reverse imports: fobitools
Reverse enhances: SoundexBR

Linking:

Please use the canonical form https://CRAN.R-project.org/package=RecordLinkage to link to this page.