0
Hi I'd like to get this into Excel Columns of
Phrase, Translation, %YES
<tr>
<td>Phrase in a language</td>
<td>Translation</td>
<td>
<span id="Accuracy">
<a href="javascript:YES(#####);"><img src="/images/YES.gif" border="0"></a>(70%)
<a href="javascript:NOT(#####)"><img src="/images/NOPE.gif" border="0"></a>(30%)
</span>
</td>
</tr>
I'm using Notepad++ to get it into .csv and remove text between < >
<.*?>
And in the end this is what I get
Phrase in a language
Translation
(##%)
(##%)
After I tried to remove line breaks and replace them with commas
[\r\n]+
There is of course a lot of phrases and translations, this is just one of many, so...
Any ideas how to easily make it into the three columns, please?
No need for the second %, but it's easy to just delete a whole column in excel I guess if it can't be parsed out.
Thanks
Parsing HTML with regex is a hard job. HTML and regex are not good friends. Use a parser, it is simpler, faster and much more maintainable. – Toto – 2019-09-06T08:32:45.807