I have a bunch of email conversations collected in one big HTML file. Each email within this document has a long tail with previous messages quoted that I want to remove. The quoted stuff is within <BLOCKQUOTE> and </BLOCKQUOTE> tags so I just need to find a way that some kind of application (maybe an HTML editor) can delete everything between these tags. Doing this manually would be much of a pain, since there are almost 700 of these tags in my file.
Does anyone have an idea how I could batch remove that stuff?
I tried find/replace ..but what do I have to put in "find" field? I dont think that it works, but maybe you could give me a hint?
And the xml parser sounds interesting. ive never done that. Does anyone know how I have to set it up? I mean basically it just needs to look for the first blockquote tag and then delete everything until it finds the closing tag.
Does anyone have an idea how I could batch remove that stuff?
I tried find/replace ..but what do I have to put in "find" field? I dont think that it works, but maybe you could give me a hint?
And the xml parser sounds interesting. ive never done that. Does anyone know how I have to set it up? I mean basically it just needs to look for the first blockquote tag and then delete everything until it finds the closing tag.