Package: refinr 0.3.3

refinr: Cluster and Merge Similar Values Within a Character Vector

These functions take a character vector as input, identify and cluster similar values, and then merge each cluster so that its values become identical. They implement the key collision and n-gram fingerprint algorithms from the open source tool OpenRefine <https://openrefine.org/>. For details on both algorithms, see <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
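To make the two clustering ideas concrete, here is a minimal base R sketch of a key collision fingerprint and an n-gram fingerprint, loosely following the OpenRefine description linked above. It is not refinr's implementation: the helper names `fingerprint`, `ngram_fingerprint`, and `merge_by_key` are hypothetical, and the sketch skips refinements such as ASCII normalization of accented characters.

```r
# Illustrative sketch only; these helpers are NOT part of refinr.

# Key collision fingerprint: lowercase, strip punctuation,
# then sort and de-duplicate the whitespace-separated tokens.
fingerprint <- function(x) {
  x <- tolower(trimws(x))
  x <- gsub("[[:punct:][:cntrl:]]", "", x)
  tokens <- strsplit(x, "[[:space:]]+")
  vapply(tokens, function(tok) {
    tok <- tok[nzchar(tok)]
    paste(sort(unique(tok), method = "radix"), collapse = " ")
  }, character(1))
}

# N-gram fingerprint: strip punctuation and whitespace, then sort and
# de-duplicate all character n-grams and paste them together.
ngram_fingerprint <- function(x, n = 2) {
  x <- gsub("[[:punct:][:cntrl:][:space:]]", "", tolower(x))
  vapply(x, function(s) {
    if (nchar(s) < n) return(s)
    starts <- seq_len(nchar(s) - n + 1)
    grams <- substring(s, starts, starts + n - 1)
    paste(sort(unique(grams), method = "radix"), collapse = "")
  }, character(1), USE.NAMES = FALSE)
}

# Merge step: values that share a key form a cluster, and every member
# is replaced by the most frequent original spelling in that cluster.
merge_by_key <- function(x, key_fun = fingerprint) {
  keys <- key_fun(x)
  reps <- vapply(split(x, keys),
                 function(v) names(which.max(table(v))),
                 character(1))
  unname(reps[match(keys, names(reps))])
}

x <- c("Acme Pizza, Inc.", "ACME PIZZA INC", "pizza, acme inc",
       "Acme Pizza, Inc.", "Bob's Burgers")
merged <- merge_by_key(x)
print(data.frame(original = x, key = fingerprint(x), merged = merged))
```

The first four values share the key `acme inc pizza`, so they collapse onto the most common spelling, while the last value is left alone. Passing `key_fun = ngram_fingerprint` to the same merge step clusters on character n-grams instead of whole tokens, which tolerates differences in spacing.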

Authors: Chris Muir [aut, cre]

refinr_0.3.3.tar.gz
refinr_0.3.3.zip (r-4.7), refinr_0.3.3.zip (r-4.6), refinr_0.3.3.zip (r-4.5)
refinr_0.3.3.tgz (r-4.6-x86_64), refinr_0.3.3.tgz (r-4.6-arm64), refinr_0.3.3.tgz (r-4.5-x86_64), refinr_0.3.3.tgz (r-4.5-arm64)
refinr_0.3.3.tar.gz (r-4.7-arm64), refinr_0.3.3.tar.gz (r-4.7-x86_64), refinr_0.3.3.tar.gz (r-4.6-arm64), refinr_0.3.3.tar.gz (r-4.6-x86_64)
refinr_0.3.3.tgz (r-4.6-emscripten)
manual.pdf | manual.html
card.svg | card.png
refinr/json (API)
NEWS

# Install 'refinr' in R:
install.packages('refinr', repos = c('https://chrismuir.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/chrismuir/refinr/issues

Uses libs:
  • c++ – GNU Standard C++ Library v3

approximate-string-matching, clustering, data-cleaning, data-clustering, fuzzy-matching, ngram, openrefine, cpp

6.95 score, 103 stars, 172 scripts, 245 downloads, 2 exports, 3 dependencies

Last updated from: a323b46787. Checks: 13 OK. Indexed: yes.

Target                 Result  Time
linux-devel-arm64      OK      132
linux-devel-x86_64     OK      142
source / vignettes     OK      180
linux-release-arm64    OK      170
linux-release-x86_64   OK      142
macos-release-arm64    OK      205
macos-release-x86_64   OK      240
macos-oldrel-arm64     OK      158
macos-oldrel-x86_64    OK      427
windows-devel          OK      151
windows-release        OK      113
windows-oldrel         OK      104
wasm-release           OK      117

Exports: key_collision_merge, n_gram_merge
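As a rough illustration of how the two exported functions might be chained, the sketch below passes a small character vector through `key_collision_merge` and then `n_gram_merge`, relying on default arguments only. The sample data is made up, and the `requireNamespace` guard is an addition so the snippet still runs (returning the input unchanged) when refinr is not installed. The merged values are deliberately not shown, because they depend on the package version and its defaults.

```r
# Made-up sample data with several spellings of the same business name.
x <- c("Acme Pizza, Inc.", "ACME PIZZA COMPANY", "pizza, acme llc",
       "Acme Pizza, Inc.")

if (requireNamespace("refinr", quietly = TRUE)) {
  # Apply key collision clustering first, then n-gram clustering,
  # using the default arguments of both functions.
  merged <- refinr::n_gram_merge(refinr::key_collision_merge(x))
} else {
  # Fallback so the example still runs without refinr installed.
  merged <- x
}

print(data.frame(original = x, merged = merged))
```

Both functions take a character vector and return one of the same length, so the result can be placed next to the original column for review before overwriting anything.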

Dependencies: Rcpp, stringdist, stringi

Intro to refinr

Rendered from refinr-vignette.Rmd using knitr::rmarkdown on May 14 2026.

Last update: 2022-04-23
Started: 2018-01-05