Description
Harry is a small tool for comparing strings. The tool supports several common distance and kernel functions for strings as well as some exotic similarity measures. The focus of Harry lies on implicit similarity measures, that is, comparison functions that do not give rise to an explicit vector space. Examples of such similarity measures are the Levenshtein distance, the Jaro-Winkler distance or the spectrum kernel.
During operation Harry loads a set of strings from input, computes the specified similarity measure and writes a matrix of similarity values to output. The similarity measure can be computed based on the granularity of bytes, bits and tokens (words) contained in the strings. The configuration of this process, such as the input format, the similarity measure and the output format, are specified in a configuration file and can be additionally refined using command-line options.
Installation Instructions
Instructions not filled in by author. Author knows what to do. Everybody else should avoid this repo.
Active Releases
The following unofficial repositories are provided as-is by owner of this project. Contact the owner directly for bugs or issues (IE: not bugzilla).
Release | Architectures | Repo Download |
---|---|---|
EPEL 7 | x86_64 (0)* | EPEL 7 (36 downloads) |
EPEL 8 | x86_64 (0)* | EPEL 8 (38 downloads) |
Fedora 38 | x86_64 (0)* | Fedora 38 (0 downloads) |
Fedora 39 | x86_64 (0)* | Fedora 39 (20 downloads) |
Fedora 40 | x86_64 (0)* | Fedora 40 (9 downloads) |
Fedora 41 | x86_64 (0)* | Fedora 41 (2 downloads) |
Fedora rawhide | x86_64 (6)* | Fedora rawhide (37 downloads) |
* Total number of downloaded packages.