You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When using the following test HTML files as input...
$ cat old.html
<html>
<body>
some <div>text and more</div> text
</body>
</html>
$ cat new.html
<html>
<body>
some <div class='red'>text</div> and more <strong>text</strong>
</body>
</html>
$ graphtage old.html new.html
<html>
<body>
some <̟d̟i̟v̟ ̟c̟l̟a̟s̟s̟=̟"̟r̟e̟d̟"̟>̟t̟e̟x̟t̟<̟/̟d̟i̟v̟>̟
<̟s̟t̟r̟o̟n̟g̟>̟t̟e̟x̟t̟<̟/̟s̟t̟r̟o̟n̟g̟>̟
<̶d̶i̶v̶>̶t̶e̶x̶t̶ ̶a̶n̶d̶ ̶m̶o̶r̶e̶<̶/̶d̶i̶v̶>̶
</body>
</html>
+ screenshot:
..., as you can see, the text and more is missing from the diff generated by graphtage.
I've tried some other diff tools and it seems and none of them had any success with correctly processing these two files for some reason (many are using the same core algorithm I suppose). Is there some kind of general issue with processing text not enclosed in tags (as in, and more is between two elements, but not enclosed in any tag (apart from the parent <body> tag) itself)?
I have also tried surrounding and more in a <p> tag in new.html, which resulted in this mess:
When using the following test HTML files as input...
+ screenshot:
data:image/s3,"s3://crabby-images/9a22c/9a22c5f5b86b31ae9048da7ab2fa863acfd9dfb1" alt="image"
..., as you can see, the text
and more
is missing from the diff generated by graphtage.I've tried some other diff tools and it seems and none of them had any success with correctly processing these two files for some reason (many are using the same core algorithm I suppose). Is there some kind of general issue with processing text not enclosed in tags (as in,
and more
is between two elements, but not enclosed in any tag (apart from the parent<body>
tag) itself)?I have also tried surrounding
and more
in a<p>
tag innew.html
, which resulted in this mess:+ screenshot:
data:image/s3,"s3://crabby-images/185b6/185b60325b33b3b2144041999577f06454681df4" alt="image"
What's happening?
The text was updated successfully, but these errors were encountered: