HtmlEncode and Curly Quotes, from Mysql to Ajax to Textarea, back to Mysql

Posted on 2012-04-03
Medium Priority
Last Modified: 2012-12-09
I need help on properly ENCODING the following:

1 - grab a record in MySQL with French Characters and curly braces
2 - pass it via ajax to a textarea
3 - view all foreign characters normally inside textarea
4 - edit text and send it back for update via ajax to MySQL

Can you provide a simple example on how to grab this text, edit it, and update it with proper encoding.

Je m’apelle François, J’ai “tois enfants”
Gérard et à “wow” c’est bon àâçéèêëïîôùù

This may be simple to a seasoned programmer, but it's been kicking  my you know what...

I tried htmlentities() before sending to ajax but that didn't work, help.
Question by:dimsouple
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 5
  • 2
  • 2
  • +1

Accepted Solution

designatedinitializer earned 2000 total points
ID: 37804118
The fundamental thing to have in mind is to use UTF-8 encoding. Use UTF-8 in your database and in your php files.

In your ajax, use some serialization function.
I assume you are using jQuery. (if you're not, then you should).
If so, here's an example of an ajax request:

			type	: "POST",
			cache	: false,
			url		: "../participar.php",
			data	: $("#prizeForm").serializeArray(),
			success	: function(data) {

Open in new window

And here is a snippet of the PHP code that receives the ajax request.
You simply treat it as POST:
if(array_key_exists('action', $_POST)) {	switch($_POST['action'])
		case "alterar":
			// This is an AJAX request from the main window
			$user = new user();
			$user = $session->getVar("user");
			if(!is_a($user,"user")) {
				// Logout
				die("A sua sessão terminou. Por favor faça login novamente.");
			$name     = trim($_POST['altNome']);
			$password = trim($_POST['altPass']);


Open in new window

Then be careful to use mysql_real_escape_string() on all user input, before inserting into the database.

Assisted Solution

designatedinitializer earned 2000 total points
ID: 37804126
IMPORTANT: in your text editor, be sure to change your php files' encoding to "UTF-8, with no BOM"!
(Simply put, the BOM is a bunch of non-visible garbage that gets into the start of your file and can mess with your request headers and spawn misterious errors)
LVL 82

Expert Comment

ID: 37804215
... and if you already have records in an other encoding(latin-1/ISO 8859-1), you should consider this data as corrupted
Are You Using the Best Web Development Editor?

The worlds of web hosting and web development are constantly evolving. Every year we see design trends change, coding standards adapt and new frameworks/CMS created. With such a quick pace of change it’s easy to get lost trying to keep up.

See if your editor made the list.

LVL 111

Expert Comment

by:Ray Paseur
ID: 37810390
You do not need unicode for western european characters.  ISO-8859-1 works perfectly.  The central issue with this or any other encoding problem is getting consistency across the platforms.  This article explains some of it.

See http://www.laprbass.com/RAY_temp_dimsouple.php
<?php // RAY_temp_dimsouple.php

$html = <<<HTML
<!DOCTYPE html>
<html dir="ltr" lang="en-US">
<meta charset="iso-8859-1" />
<title>Accented Characters in ISO-8859-1</title>
Je m’apelle François, J’ai "tois enfants"
Gérard et à "wow" c’est bon àâçéèêëïîôùù

echo $html;

Open in new window


Expert Comment

ID: 37810497
@Ray: Of course ISO-8859-1 encodes french diacritics and such, but there are strong reasons for ditching it in favor of utf-8 (as Joel does in the article you posted...)
LVL 111

Expert Comment

by:Ray Paseur
ID: 37810607
The one reason I would be careful about ditching any ANSI font goes to the need for consistency across all the levels of the platform.  This means the data base, the file system, things that were stored in cookies, client keyboard input, JavaScript, values created inside scripts, HTML, etc.  Any of these things may come with the legacy assumption that they are all single-byte characters.  That assumption may lead to encoding collisions, and in my experience the resulting encoding collisions are very difficult to explain since the conversion to UTF-8 may be difficult for financial managers to understand.  A common response goes something like, "You did what?  It was working before.  Why did you eff with it?"

Expert Comment

ID: 37810629
I do agree with you on this: if it is working, there's no need to fix it.
However, if you are starting something from scratch, always go with Unicode.

Author Comment

ID: 37816468
Thank you all so much. the part about the data being corrupted is no lie. because I failed to specify the charset in the old pages, the form input were coming in in many different formats.

now I've changed everything to UTF-8 and unfortunately, some of the data is in other format.

I've found out that this does the trick on the coruppted data


Assisted Solution

designatedinitializer earned 2000 total points
ID: 37817122
People like me are eagerly (not that much, but anyway...) awating for the next major release of PHP, which supposedly is going to have native support for Unicode, down to variable names and other language tokens.
Meanwhile, we use UTF-8 and we are careful to specify utf-8 files with no BOM.

Other useful features are the utf8_encode and utf8_decode PHP functions, and in MySQL, the ability to specify the character encoding down to the SELECT level: you can have different SELECT statements specify different character encodings.
One other thing to keep in mind is that the character encoding is not the collation (some people tend to confuse these two).
LVL 82

Expert Comment

ID: 37833084

Featured Post

WordPress Tutorial 3: Plugins, Themes, and Widgets

The three most common changes you will make to your website involve the look (themes), the functionality (plugins), and modular elements (widgets).

In this article we will briefly define each again, and give you directions on how to install them.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Nothing in an HTTP request can be trusted, including HTTP headers and form data.  A form token is a tool that can be used to guard against request forgeries (CSRF).  This article shows an improved approach to form tokens, making it more difficult to…
Introduction This article is intended for those who are new to PHP error handling (https://www.experts-exchange.com/articles/11769/And-by-the-way-I-am-New-to-PHP.html).  It addresses one of the most common problems that plague beginning PHP develop…
This tutorial will teach you the core code needed to finalize the addition of a watermark to your image. The viewer will use a small PHP class to learn and create a watermark.
The viewer will learn how to create a basic form using some HTML5 and PHP for later processing. Set up your basic HTML file. Open your form tag and set the method and action attributes.: (CODE) Set up your first few inputs one for the name and …
Suggested Courses

764 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question