imara-diff feat: word diffs

Adds a word diffing feature for each hunk. The following questions need to be answered to move this out of draft state:

EDIT: all done!

The current implementation is based on bytes. This is consistent with how strings are used internally in other places. Is this useful for word-diffs? We could also use .chars() or .graphemes(true) (via this crate). It's also possible to offer several of these options. (EDIT: we now use words)
The current implementation reuses the diff, but it reallocates the bytes of the underlying strings for every hunk. Is this acceptable? If no, which alternative should we go with? (EDIT: no longer applies since we no longer use bytes)
The current implementation reuses the diff, but it reallocates the InternedInput for every hunk. Is this acceptable? If no, which alternative should we go with? (EDIT: we now reuse the interner)
The current implementation estimates the token count to be 256 for byte token sources. Please let me know if that is an unreasonable heuristic for byte token sources. (EDIT: no longer applies since we no longer use bytes)
The current implementation always uses the Myers algo. Should we offer a second method which performs a minimal diff? (EDIT: no, we can add that later if requested)
Is there anything else I am missing?

Closes #1.

Sep 27 '25 11:09 KnorpelSenf

@pascalkuthe @Byron I have addressed all comments. This is ready for review now.

Nov 07 '25 17:11 KnorpelSenf

@pascalkuthe is there a way for to me make this easier to review?

Nov 21 '25 20:11 KnorpelSenf

left one cooment otherwise lgtm

Nov 21 '25 21:11 pascalkuthe

Thanks! Can you allow CI to run?

Nov 21 '25 22:11 KnorpelSenf

Seems the build failed due to the usual need to deref

Nov 21 '25 23:11 pascalkuthe

Let's try again

Nov 22 '25 08:11 KnorpelSenf