Avatar of sharingsunshine
sharingsunshine
Flag for United States of America asked on

What regex will remove duplicate rel="nofolow" tags?

I had this question after viewing Python error - Need Help.

I created this regex to remove the duplicate rel="nofollow" tags using grep in TextWrangler but I am not clear how to add this into the Python regex code.

rel="nofollow"(\s|\n|\n\r)rel="nofollow"

Open in new window


replace with
rel="nofollow"

Open in new window

PythonMac OS XRegular Expressions

Avatar of undefined
Last Comment
sharingsunshine

8/22/2022 - Mon
ASKER CERTIFIED SOLUTION
pepr

Log in or sign up to see answer
Become an EE member today7-DAY FREE TRIAL
Members can start a 7-Day Free trial then enjoy unlimited access to the platform
Sign up - Free for 7 days
or
Learn why we charge membership fees
We get it - no one likes a content blocker. Take one extra minute and find out why we block content.
Not exactly the question you had in mind?
Sign up for an EE membership and get your own personalized solution. With an EE membership, you can ask unlimited troubleshooting, research, or opinion questions.
ask a question
pepr

I have noticed a bug in the original page:
<a 1="" href="http://www.theherbsplace.com/" imageanchor=" rel="nofollow" style="...

Open in new window


Notice the 1="" and the imageanchor=" without the enclosing double quote.
sharingsunshine

ASKER
Thanks for the help.  On the other exceptions you pointed out I will just have to fix them as I find them.
Your help has saved me hundreds of hours of internet surfing.
fblack61