Solved

Detect language of a string

Posted on 2013-05-20
Last Modified: 2013-05-20
Hello,

Is it possible to detect the language of a string?

For example:

"Hello" -> English
"über" -> German
"καλημέρα" -> Greek

I understand that words from another language that use only the letters of the English alphabet cannot be identified this way (e.g. "Guten Tag"). I am asking only about strings that contain characters specific to a language, like the examples above.

Thank you very much!
Question by:infodigger

Assisted Solution

by:ienaxxx
ienaxxx earned 250 total points
ID: 39180973
AFAIK there should be something for this in the Google Translate API.
Not sure if that is the kind of thing you were looking for...
HTH

Accepted Solution

by:Ray Paseur
Ray Paseur earned 250 total points
ID: 39181018
Each language has a "signature" that can be detected from its vocabulary. However, detection is not 100% accurate, and the risk of error goes way up on shorter strings because the orthography evident in a short string is rarely unique. The Google API occasionally suggests that my computer code is written in Dutch, for example. I think letter-only detection from a single word would be nearly useless except for a very few languages. For example, the U-umlaut (diaeresis) appears in the Hungarian, Karelian, Turkish, Uyghur Latin, Estonian, Azeri, Turkmen, Crimean Tatar and Tatar Latin alphabets, as well as in German.
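
To make the point concrete, here is a minimal Python sketch (not part of the original answer) of what character-only detection can and cannot tell you. It reads the script from each letter's Unicode name, which is enough to spot Greek, and then looks letters up in a small hint table. The LETTER_HINTS table and the function names are hypothetical and deliberately incomplete; the "ü" entry just repeats some of the languages listed above.

```python
import unicodedata

# Hypothetical, incomplete hint table: languages whose alphabets use a letter.
LETTER_HINTS = {
    "ü": ["Azeri", "Estonian", "German", "Hungarian", "Turkish", "Turkmen"],
    "ß": ["German"],
}


def scripts_in(text):
    """Return the scripts seen in text, taken from the first word of each
    letter's Unicode name (e.g. 'GREEK', 'LATIN')."""
    scripts = set()
    for ch in unicodedata.normalize("NFC", text):
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            if name:
                scripts.add(name.split()[0])
    return scripts


def guess_languages(text):
    """Return a sorted list of candidate languages, or [] if there is no hint."""
    # A string written purely in the Greek script is a strong signal.
    if scripts_in(text) == {"GREEK"}:
        return ["Greek"]
    # Otherwise intersect the hints of every "special" letter that appears.
    candidates = None
    for ch in unicodedata.normalize("NFC", text).lower():
        if ch in LETTER_HINTS:
            hint = set(LETTER_HINTS[ch])
            candidates = hint if candidates is None else candidates & hint
    return sorted(candidates) if candidates else []


if __name__ == "__main__":
    for sample in ["Hello", "über", "καλημέρα", "Guten Tag"]:
        print(sample, "->", guess_languages(sample))
```

With this table, "καλημέρα" comes back as Greek only, while "über" comes back with several candidates and "Hello" and "Guten Tag" come back with none. That is the ambiguity described above: a distinctive script narrows things down well, a single accented Latin letter usually does not.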

See http://en.wikipedia.org/wiki/Language_identification