Package: refinr 0.3.3

refinr: Cluster and Merge Similar Values Within a Character Vector

These functions take a character vector as input, identify and cluster similar values, and then merge each cluster so that its values become identical. The functions implement the key collision and n-gram fingerprint algorithms from the open source tool OpenRefine <https://openrefine.org/>. Both algorithms are described in detail at <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
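To make the key collision idea concrete, here is a minimal base R sketch. It is not the package's implementation: the helpers `fingerprint_key()` and `merge_by_key()` are hypothetical names introduced for illustration. The key used here is an assumption (lowercase, strip punctuation, then sort the unique word tokens), and the sketch omits other normalization steps a full implementation might apply, such as transliterating accented characters.

```r
# Hypothetical helper (not part of refinr): build a simplified
# fingerprint key for each string.
fingerprint_key <- function(x) {
  x <- tolower(trimws(x))                  # normalize case and outer whitespace
  x <- gsub("[[:punct:]]", "", x)          # drop punctuation
  tokens <- strsplit(x, "[[:space:]]+")    # split into word tokens
  vapply(
    tokens,
    function(tok) paste(sort(unique(tok[nzchar(tok)])), collapse = " "),
    character(1),
    USE.NAMES = FALSE
  )
}

# Hypothetical helper (not part of refinr): values that collide on the same
# key form a cluster; each value is replaced by the most frequent original
# value in its cluster.
merge_by_key <- function(x) {
  keys <- fingerprint_key(x)
  representative <- tapply(x, keys, function(v) names(which.max(table(v))))
  as.character(representative[keys])
}

x <- c("Acme Pizza, Inc.", "ACME PIZZA INC", "Acme Pizza, Inc.",
       "Pizza Acme Inc.", "Bob's Burgers")
merged <- merge_by_key(x)
print(merged)
```

In this sketch the first four values share the key `acme inc pizza` and are merged to the most common spelling, while `Bob's Burgers` is left alone. An n-gram fingerprint follows the same pattern but builds the key from sorted unique character n-grams instead of word tokens.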

Authors: Chris Muir [aut, cre]

refinr_0.3.3.tar.gz
refinr_0.3.3.zip (r-4.5), refinr_0.3.3.zip (r-4.4), refinr_0.3.3.zip (r-4.3)
refinr_0.3.3.tgz (r-4.5-x86_64), refinr_0.3.3.tgz (r-4.5-arm64), refinr_0.3.3.tgz (r-4.4-x86_64), refinr_0.3.3.tgz (r-4.4-arm64), refinr_0.3.3.tgz (r-4.3-x86_64), refinr_0.3.3.tgz (r-4.3-arm64)
refinr_0.3.3.tar.gz (r-4.5-noble), refinr_0.3.3.tar.gz (r-4.4-noble)
refinr_0.3.3.tgz (r-4.4-emscripten), refinr_0.3.3.tgz (r-4.3-emscripten)
refinr.pdf | refinr.html
refinr/json (API)
NEWS

# Install 'refinr' in R:
install.packages('refinr', repos = c('https://chrismuir.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/chrismuir/refinr/issues

Uses libs:
  • c++: GNU Standard C++ Library v3

approximate-string-matching, clustering, data-cleaning, data-clustering, fuzzy-matching, ngram, openrefine, cpp

6.80 score, 104 stars, 121 scripts, 323 downloads, 2 exports, 3 dependencies

Last updated 12 months ago from: a323b46787. Checks: 12 OK. Indexed: yes.

Target              Result  Latest binary
Doc / Vignettes     OK      Mar 09 2025
R-4.5-win-x86_64    OK      Mar 09 2025
R-4.5-mac-x86_64    OK      Mar 09 2025
R-4.5-mac-aarch64   OK      Mar 09 2025
R-4.5-linux-x86_64  OK      Mar 09 2025
R-4.4-win-x86_64    OK      Mar 09 2025
R-4.4-mac-x86_64    OK      Mar 09 2025
R-4.4-mac-aarch64   OK      Mar 09 2025
R-4.4-linux-x86_64  OK      Mar 09 2025
R-4.3-win-x86_64    OK      Mar 09 2025
R-4.3-mac-x86_64    OK      Mar 09 2025
R-4.3-mac-aarch64   OK      Mar 09 2025

Exports: key_collision_merge, n_gram_merge

Dependencies: Rcpp, stringdist, stringi

Intro to refinr

Rendered from refinr-vignette.Rmd using knitr::rmarkdown on Mar 09 2025.

Last update: 2022-04-23
Started: 2018-01-05