BaumGeist@lemmy.ml to Programmer Humor@lemmy.ml · 2 years agoThe best answer on StackOverflow: Using RegEx to parse HTMLstackoverflow.comexternal-linkmessage-square28linkfedilinkarrow-up152arrow-down11
arrow-up151arrow-down1external-linkThe best answer on StackOverflow: Using RegEx to parse HTMLstackoverflow.comBaumGeist@lemmy.ml to Programmer Humor@lemmy.ml · 2 years agomessage-square28linkfedilink
minus-squaremoriquende@lemmy.worldlinkfedilinkarrow-up0·2 years agoYou can’t parse every html opening tag with regex, because a html opening tag doesn’t have a set structure. How would you match, with regex, this opening tag? <mytag myattribute="<value of \"myattribute\">" >
minus-squareschnurrito@discuss.tchncs.delinkfedilinkarrow-up0arrow-down1·2 years agoIs this valid HTML? My understanding is that that attribute value needs to be escaped, i.e. <value of \"myattribute\">.
minus-squaremoriquende@lemmy.worldlinkfedilinkarrow-up1·2 years agoThe quote must not be escaped when you start with a single quote. The rest doesn’t. This is valid and tested: <img alt='my "<img>"'>
??? Non sequitur
You can’t parse every html opening tag with regex, because a html opening tag doesn’t have a set structure. How would you match, with regex, this opening tag?
<mytag myattribute="<value of \"myattribute\">" >Is this valid HTML? My understanding is that that attribute value needs to be escaped, i.e.
<value of \"myattribute\">.The quote must not be escaped when you start with a single quote. The rest doesn’t. This is valid and tested:
<img alt='my "<img>"'>