Package: refinr 0.3.3
refinr: Cluster and Merge Similar Values Within a Character Vector
These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
Authors:
refinr_0.3.3.tar.gz
refinr_0.3.3.zip(r-4.5)refinr_0.3.3.zip(r-4.4)refinr_0.3.3.zip(r-4.3)
refinr_0.3.3.tgz(r-4.5-x86_64)refinr_0.3.3.tgz(r-4.5-arm64)refinr_0.3.3.tgz(r-4.4-x86_64)refinr_0.3.3.tgz(r-4.4-arm64)refinr_0.3.3.tgz(r-4.3-x86_64)refinr_0.3.3.tgz(r-4.3-arm64)
refinr_0.3.3.tar.gz(r-4.5-noble)refinr_0.3.3.tar.gz(r-4.4-noble)
refinr_0.3.3.tgz(r-4.4-emscripten)refinr_0.3.3.tgz(r-4.3-emscripten)
refinr.pdf |refinr.html✨
refinr/json (API)
NEWS
# Install 'refinr' in R: |
install.packages('refinr', repos = c('https://chrismuir.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/chrismuir/refinr/issues
approximate-string-matchingclusteringdata-cleaningdata-clusteringfuzzy-matchingngramopenrefinecpp
Last updated 12 months agofrom:a323b46787. Checks:12 OK. Indexed: yes.
Target | Result | Latest binary |
---|---|---|
Doc / Vignettes | OK | Mar 09 2025 |
R-4.5-win-x86_64 | OK | Mar 09 2025 |
R-4.5-mac-x86_64 | OK | Mar 09 2025 |
R-4.5-mac-aarch64 | OK | Mar 09 2025 |
R-4.5-linux-x86_64 | OK | Mar 09 2025 |
R-4.4-win-x86_64 | OK | Mar 09 2025 |
R-4.4-mac-x86_64 | OK | Mar 09 2025 |
R-4.4-mac-aarch64 | OK | Mar 09 2025 |
R-4.4-linux-x86_64 | OK | Mar 09 2025 |
R-4.3-win-x86_64 | OK | Mar 09 2025 |
R-4.3-mac-x86_64 | OK | Mar 09 2025 |
R-4.3-mac-aarch64 | OK | Mar 09 2025 |
Exports:key_collision_mergen_gram_merge
Dependencies:Rcppstringdiststringi
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Value merging based on Key Collision | key_collision_merge |
Value merging based on ngram fingerprints | n_gram_merge |
Cluster and Merge Similar Values Within a Character Vector | refinr-package refinr |