Solved

how do you get these � Marks in your text?

Posted on 2016-09-25
9
59 Views
Last Modified: 2016-10-11
Hi There,
All over my news pages here http://www.bizgro.jobs/news/ you can see rouge glyphs like �
How on earth do they get there and how do I remove them without actually going through every single page and removing them?
Thanks in advance, A
0
Comment
Question by:Amanda Watson
9 Comments
 
LVL 82

Accepted Solution

by:
Dave Baldwin earned 250 total points
ID: 41815374
They indicate a mismatch in character sets.  Your page is set as UTF-8 but the articles are encoded as Western or Latin-1.  When I switch the encoding in Firefox to Western, those glyphs disappear.  Probably the most common reason for that is copying and pasting text that was generated in Word which uses Windows-1252 which is a Latin / Western character encoding.
0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 41815394
There's a solution suggested here: http://wpfab.com/clean-up-weird-characters-in-your-wordpress-posts/

Back up your site first, or limit it to one post, just in case it does something unexpected.

UPDATE wp_posts SET post_content = REPLACE(post_content, '�', '');

Open in new window

0
 
LVL 11

Author Comment

by:Amanda Watson
ID: 41815411
Shall I enter that code into SQL in the database via phpmyAdmin?
0
 
LVL 37

Expert Comment

by:Geert Gruwez
ID: 41815416
this has nothing to do with Delphi
Tag removed
0
IT, Stop Being Called Into Every Meeting

Highfive is so simple that setting up every meeting room takes just minutes and every employee will be able to start or join a call from any room with ease. Never be called into a meeting just to get it started again. This is how video conferencing should work!

 
LVL 11

Author Comment

by:Amanda Watson
ID: 41815421
Well I ran the query and 0 rows were affected so they remain?
Now what, any ideas?
0
 
LVL 48

Expert Comment

by:PortletPaul
ID: 41815521
It may not be as simple as a single update query in the database. There seem to be several reasons for those glyphs including quotes, hyphens, apostrophes and possibly more than just those.

Here is an example (but for some parts I am guessing)
(as is)
�Marketing is telling the truth attractively� � as heard from Peter Daniels. To me, 
marketing is opening yourself up to the world – which is risky � and being proud of what you
 can give and serve others with. I also believe that if a business is not marketing it will have 
poor customer service. We should always be willing to give with no expectation of a return. 
That doesn�t mean we should never ask for remuneration for our goods and services. It just
 means that we have enough of our own self-esteem that we don�t need outward 
affirmation (paid or unpaid) that we have given enough.

Open in new window


(to be)
"Marketing is telling the truth attractively" as heard from Peter Daniels.

To me, marketing is opening yourself up to the world – which is risky -  and being proud of what you can give and serve others with. I also believe that if a business is not marketing it will have poor customer service. We should always be willing to give with no expectation of a return. That doesn't mean we should never ask for remuneration for our goods and services. It just means that we have enough of our own self-esteem that we don't need outward affirmation (paid or unpaid) that we have given enough.

In that example some of the glyphs relate to possible quotation marks or perhaps bullets while others relate to hyphenation or apostrophes.

Most likely is that someone is using a WYSIWYG editor (such as Word) and pasting into into the Wordpress text boxes assuming all formatting is universal (which is not true).

That practice will have to stop if you are to solve this from happening again and again.
2
 
LVL 16

Assisted Solution

by:DansDadUK
DansDadUK earned 250 total points
ID: 41815883
I agree with the analysis of the problem by previous responders, indicating that the most likely cause is "... a mismatch in character sets ..." and "... glyphs relate to possible quotation marks or perhaps bullets while others relate to hyphenation or apostrophes ...".

The ISO-8859-1 Latin-1 character set is an exact subset of the Unicode character set.

But the Windows Latin-1 (CP1252) character set (which is a 'superset' of ISO-8859-1) is not an exact subset of Unicode.

In ISO-8859-1 (and Unicode), the code-point range 0x80 -> 0x9F is reserved for the (little-used) non-graphic C1 control-code characters.

But the (frequently used) Windows Latin-1 (CP1252) character set uses this range to define various additional graphic characters, including 'smart quotes', and 'dot' and 'dash' characters:

C1 range in CP1252
So (as others have said) if text, encoded using this character set includes such characters, is pasted directly into a page which is expecting the character set to be the UTF-8 encoding of Unicode, then these characters will map to the Unicode "REPLACEMENT CHARACTER", used to replace an unknown or unrepresentable character.
0
 
LVL 11

Author Comment

by:Amanda Watson
ID: 41816987
Thanks for the explanation.   Any idea how to remove them easily as per the question?
0
 
LVL 48

Expert Comment

by:PortletPaul
ID: 41817035
As I attempted to demonstrate, it probably isn't as simple as a single update query.

I suggest you try manual correction on one or two, and you will then understand that if you replace all those glyphs with a single character you will still have a large problem to solve. Perhaps worse than before by the way as it will be harder to identify.

Sorry.
0

Featured Post

Maximize Your Threat Intelligence Reporting

Reporting is one of the most important and least talked about aspects of a world-class threat intelligence program. Here’s how to do it right.

Join & Write a Comment

Who says nothing in life is free? WordPress.com is a freebie. WordPress.org's downloadable publishing platform is free. Heck, even WordPressMU is free. WordPress is an open source project, which means it can be used on any personal or commerc…
In order to have all security and back ups taken care of, WordPress users can sign up for services with WP Engine.
The purpose of this video is to demonstrate how to exclude a particular blog category from the main blog page. This is can be used when a category already has its own tab, or you simply want certain types of posts not to show up on the main blog. …
The purpose of this video is to demonstrate how to Test the speed of a WordPress Website. Site Speed is an important metric of a site’s health. Slow site speed can result in viewers leaving your site quickly and not seeing your content. This…

746 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

9 Experts available now in Live!

Get 1:1 Help Now