I am searching to remove duplicates in a document using Regex or something similar; to remove the following:
First Line
<Important text /><Important text />Other random words
I need to remove the duplicates of <some text/>
and keep everything else remain as it is. The text may or may not be on multiple lines.
It will need to work off of several different words but use the < > tags.
EDIT:
I do not know what the words will be. Some will be nested inside < > tags and some will not be. I will need to remove all duplicates that repeat one after each other something like:
<text/><text/><words/><words/><words/>
And the output should be:
<text/><words/>