UTF-8 Character to Unicode

Hi Experts;

I have a UTF-8 Character that I need to convert to unicode so that it can be shown correctly in a web browser:

I have this character ® that should be converted to = '®' so that the browser can interpret it correctly. Is there an easier way to convert it in Java?

Thank you.
simple21Asked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Mick BarryJava DeveloperCommented:
try this:

String escaped = "&#"+Character.digit(c, 10)+";";
0
simple21Author Commented:
Hi Objects,

It returns: '&#-1;' What I need is: '®' for ®, here is where I get the table:
http://www.tony-franks.co.uk/UTF-8.htm

Here is my code:


public class CharacterEncodingUtil {
 
	public CharacterEncodingUtil()  {
        System.out.println(toUnicodeEscaped('®'));
	}
	
	public static String toUnicodeEscaped(char c)  {
		String unicodeChar = "&#" + Character.digit(c, 10) +";";
        return unicodeChar;
  }
	public static void main(String[] args) {
		new CharacterEncodingUtil();
	}
}

Open in new window

0
Mick BarryJava DeveloperCommented:
actually this should be all you need

                String unicodeChar = "&#" + ((int)c) +";";
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
Build an E-Commerce Site with Angular 5

Learn how to build an E-Commerce site with Angular 5, a JavaScript framework used by developers to build web, desktop, and mobile applications.

CEHJCommented:
Try
String unicodeChar = "&#" + (int)c +";";

Open in new window

0
simple21Author Commented:
Thanks for all the help one last question, if you may. Which class can I use to check if there is a UTF-8 character in a string?


String utf = "This is a test company ® private limited".

Open in new window

0
CEHJCommented:
What you mean is 'how can i check if there are any 'special' characters in a string?' They're all UTF-8 essentially. A rough but workable way is to examine, where 'c' is the character in question,

boolean special = c >= 0x7F;
0
CEHJCommented:
>>where 'c' is the character in question,

i.e. 'c' is of type char
0
Mick BarryJava DeveloperCommented:
> Which class can I use to check if there is a UTF-8 character in a string?

there isn't anything standard.  If you know what special chars you are expecting then you can use replaceAll() to replace them.
commans lang also has a variety of methods for escaping strings
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Java

From novice to tech pro — start learning today.