gram@lemmy.ml to RustEnglish · 4 months agotextdistance.rs, Rust crate with 25+ algorithms for comparing strings. Now with no_std support!github.comexternal-linkmessage-square3fedilinkarrow-up158arrow-down10cross-posted to: rust
arrow-up158arrow-down1external-linktextdistance.rs, Rust crate with 25+ algorithms for comparing strings. Now with no_std support!github.comgram@lemmy.ml to RustEnglish · 4 months agomessage-square3fedilinkcross-posted to: rust
minus-squareDeebsterlinkfedilinkarrow-up4·edit-24 months agoToken-based string distances looks like exactly what I need for my current side project - I’m using Levenshtein but I should be comparing based on words, not characters. I just need to figure out which (if any) of these does what I need. Edit: looks like the Python version has that information: https://github.com/life4/textdistance?tab=readme-ov-file#algorithms
minus-squaregram@lemmy.mlOPlinkfedilinkEnglisharrow-up2·4 months agoIn Python version, pass the list of words directly into the algorithm, and it will compare words. In Rust version, use Algorithm.for_words: https://docs.rs/textdistance/1.1.0/textdistance/trait.Algorithm.html#method.for_words
Token-based string distances looks like exactly what I need for my current side project - I’m using Levenshtein but I should be comparing based on words, not characters.
I just need to figure out which (if any) of these does what I need.
Edit: looks like the Python version has that information: https://github.com/life4/textdistance?tab=readme-ov-file#algorithms
In Python version, pass the list of words directly into the algorithm, and it will compare words. In Rust version, use Algorithm.for_words:
https://docs.rs/textdistance/1.1.0/textdistance/trait.Algorithm.html#method.for_words