转换差异以降价带删除线？差异

2023-09-11 03:46:28 作者：偷偷靠近你

我想差异的输出转换（在降价文件）降价与＆LT;罢工＆GT; 和＆LT; EM＆GT; 标签，这样我可以看到什么了被移离或添加到文档的新版本。（这种治疗是法律文件很常见的。）

I'd like to convert the output of diff (on a Markdown file) to Markdown with <strike> and <em> tags, so that I can see what has been removed from or added to a new version of a document. (This kind of treatment is very common for legal documents.)

的希望的输出例：

为什么的我们学习编程语言的？的不是的的不的顺序到...

Why do weWe study programming languages? notNot in order to ...

其中的许多困难的是，差异的输出是面向行的，在这里我想看到个别字的差异。有没有人有建议，要使用什么算法，或建立在什么软件？

One of the many difficulties is that diff's output is line-oriented, where I want to see differences in individual words. Does anyone have suggestions as to what algorithm to use, or what software to build on?

推荐答案

使用 wdiff 。它已经做了你要找的字的字比较;它的输出转换成降价应该只是一些简单的规律EX pressions。

Use wdiff. It already does the word-by-word comparison you're looking for; converting its output to markdown should take just a few simple regular expressions.

例如：

$ cat foo
Why do we study programming languages?  Not in order to
$ cat bar
We study programming languages not in order to
$ wdiff foo bar
[-Why do we-]{+We+} study programming [-languages?  Not-] {+languages not+} in order to
$ wdiff foo bar | sed 's|\[-|<em>|g;s|-]|</em>|g;s|{+|<strike>|g;s|+}|</strike>|g'
<em>Why do we</em><strike>We</strike> study programming <em>languages?  Not</em> <strike>languages not</strike> in order to

编辑：其实，wdiff有一些选项，使其更容易：

Actually, wdiff has some options that make it even easier:

$ wdiff -w '<em>' -x '</em>' -y '<strike>' -z '</strike>' foo bar
<em>Why do we</em><strike>We</strike> study programming <em>languages?  Not</em> <strike>languages not</strike> in order to

上一篇：算法来呈现文本水平二元杂交树/ ASCII格式算法、文本、水平、格式

下一篇：常见的策略来处理货币密集软舍入误差？误差、密集、货币、策略

相关推荐

精彩图集

精彩推荐

图片推荐

摩洛哥女子杀死男友碎尸，干成“人肉炒饭”