Solved

function to remove tags and/or tag attributes as I specify, and add tag attributes as I specify

Posted on 2006-11-20
8
186 Views
Last Modified: 2013-11-19
Hi,

I need a php function that can allow me to remove tags & tag attributes into/from a string.

I should give the function the string to be processed and an array containing the attributes and attribute properties to be removed.

For example when a tag (<a>) has a style attribute, I want to be able to remove specific style properties for example border & padding, but leave the unspecified ones in tact - and if no attribute properties are specified, the entire attribute needs to be removed.

The function should also remove tags that I specify, including closing tags, and incorrectly opened / closed tags.

The function should work with xhtml & html

The second function should be able to add attributes to specified tags within the given string.  For example: After I stripped all attributes from my <a> tag in the given string, I want to be able to add a rel attribute with for example a nofollow and an external attribute property.

All functions should automatically remove all double tags, hacking attempts, etc... for example it should remove all tags except those I specify as safe, similar to the php strip_tags function.  It should check for things like these:

<!-- I want to put a <div>tag</div> -->
<!-- Or a carrot > -->
<![CDATA[what about this! It's to protect from HTML characters like <tag>, > and so on in XML, no?]]> -->

If I'm not exactly clear on everything, please let me know, but I think this is pretty clear.  The functions will be used primarily to process blog replies.

Thanks for any help!
0
Comment
Question by:nelwa
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 4
  • 2
8 Comments
 
LVL 6

Expert Comment

by:aescnt
ID: 17986010
Hi,

If you want to clean up 3rd party HTML (like for blog comments), you'll love this project: http://pixel-apes.com/safehtml/

I don't think it does nofollow or style-stripping, but it does some essential things to HTML to make it 'safe.' (XSS issues and such.) After passing your 3rd party HTML through it, I'm sure you can use regex afterwards to add nofollow and remove styles.
0
 
LVL 1

Author Comment

by:nelwa
ID: 17986075
ok, yes so that function will work to remove xss and such, but now I need something to add attributes and attribute properties to certain tags...
0
 
LVL 1

Author Comment

by:nelwa
ID: 18002449
ok, so is nobody going to help me with this??
0
Online Training Solution

Drastically shorten your training time with WalkMe's advanced online training solution that Guides your trainees to action. Forget about retraining and skyrocket knowledge retention rates.

 
LVL 1

Author Comment

by:nelwa
ID: 18002506
All I need now is a function to remove all tag attributes, and another function that can add attributes with properties to the tags I specify... any help appreciated.

Thanks
0
 
LVL 6

Expert Comment

by:aescnt
ID: 18002516
Sorry, the code required for that is rather too complicated to write quickly.
0
 
LVL 1

Author Comment

by:nelwa
ID: 18002840
I have found a solution to my problem: http://cyberai.com/inputfilter/

Thanks
0
 
LVL 1

Accepted Solution

by:
Computer101 earned 0 total points
ID: 18256196
PAQed with points refunded (500)

Computer101
EE Admin
0

Featured Post

Enroll in May's Course of the Month

May’s Course of the Month is now available! Experts Exchange’s Premium Members and Team Accounts have access to a complimentary course each month as part of their membership—an extra way to increase training and boost professional development.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Introduction Knockoutjs (Knockout) is a JavaScript framework (Model View ViewModel or MVVM framework).   The main ideology behind Knockout is to control from JavaScript how a page looks whilst creating an engaging user experience in the least …
Have you tried to learn about Unicode, UTF-8, and multibyte text encoding and all the articles are just too "academic" or too technical? This article aims to make the whole topic easy for just about anyone to understand.
Viewers will learn one way to get user input in Java. Introduce the Scanner object: Declare the variable that stores the user input: An example prompting the user for input: Methods you need to invoke in order to properly get  user input:
Viewers will learn about the regular for loop in Java and how to use it. Definition: Break the for loop down into 3 parts: Syntax when using for loops: Example using a for loop:

737 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question