Solved

How can I enable Unicode support in Apache Solr?

Posted on 2010-09-04
2
1,149 Views
Last Modified: 2012-05-10
I want to enable the Unicode in Solr. Updating the index does not give me an error. But as soon as I try to search some Chinese text, I get an error. I have added the following line to my schema.

The following exception occurs as a result of adding the quoted filter:

org.apache.solr.common.SolrException: Error loading class 'solr.CollationKeyFilterFactory'
at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:375)
at org.apache.solr.core.SolrResourceLoader.newInstance(SolrResourceLoader.java:390)
at org.apache.solr.util.plugin.AbstractPluginLoader.create(AbstractPluginLoader.java:84)
at org.apache.solr.util.plugin.AbstractPluginLoader.load(AbstractPluginLoader.java:141)
at org.apache.solr.schema.IndexSchema.readAnalyzer(IndexSchema.java:835)
<filter class="solr.CollationKeyFilterFactory" language="" strength="primary"/>

Open in new window

0
Comment
Question by:raheelasadkhan
2 Comments
 
LVL 15

Accepted Solution

by:
samri earned 500 total points
Comment Utility
Hi,

I think unicode is support is already enabled in Apache.

#
# Specify a default charset for all content served; this enables
# interpretation of all content as UTF-8 by default.  To use the
# default browser choice (ISO-8859-1), or to allow the META tags
# in HTML content to override this choice, comment out this
# directive:
#
AddDefaultCharset UTF-8


http://www.seapine.com/kb/questions/1383/Configuring+Apache+to+Display+Unicode+Characters+in+TestTrack+Pro+Web

http://httpd.apache.org/docs/2.2/mod/mod_mime.html#addcharset
http://httpd.apache.org/docs/2.2/mod/core.html#adddefaultcharset

Some links on Unicode support for Solaris:
http://developers.sun.com/dev/gadc/unicode/solaris/index.html
http://www.cl.cam.ac.uk/~mgk25/unicode.html#solaris
0
 

Author Closing Comment

by:raheelasadkhan
Comment Utility
Thanks.
0

Featured Post

Maximize Your Threat Intelligence Reporting

Reporting is one of the most important and least talked about aspects of a world-class threat intelligence program. Here’s how to do it right.

Join & Write a Comment

As Wikipedia explains 'robots.txt' as -- the robot exclusion standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a websit…
In Solr 4.0 it is possible to atomically (or partially) update individual fields in a document. This article will show the operations possible for atomic updating as well as setting up your Solr instance to be able to perform the actions. One major …
Excel styles will make formatting consistent and let you apply and change formatting faster. In this tutorial, you'll learn how to use Excel's built-in styles, how to modify styles, and how to create your own. You'll also learn how to use your custo…
In this seventh video of the Xpdf series, we discuss and demonstrate the PDFfonts utility, which lists all the fonts used in a PDF file. It does this via a command line interface, making it suitable for use in programs, scripts, batch files — any pl…

762 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

6 Experts available now in Live!

Get 1:1 Help Now