?
Solved

MySQL fulltext search benchmark

Posted on 2003-11-19
7
Medium Priority
?
796 Views
Last Modified: 2008-02-01
Anyone has a benchmark on MySQL's fulltext indexing and search?

My plan is to index 4M of "contact info" records, each with different fields (name, address, phone #...etc), where name and address field has to be fulltext indexed. However, I need to have some idea about the performance.

Thx.
0
Comment
Question by:aboywong
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
  • 2
7 Comments
 
LVL 17

Expert Comment

by:Squeebee
ID: 9788293
I can't actually point to an independant benchmark, I can report that large sites are using it, and those sites have big speed concerns. Of course, the best thing to do it test it yourself to ensure that the benchmark is relevant.
0
 
LVL 1

Author Comment

by:aboywong
ID: 9794162
can you name which site is using mySQL's fulltext search? and how many records they are running...?
0
 
LVL 17

Expert Comment

by:Squeebee
ID: 9794212
Yahoo. As for records I am not sure.

Source: http://jeremy.zawodny.com/blog/archives/000576.html

Quote: There's anywhere from a few hundred thousand to 5 million of them.
0
 
LVL 1

Author Comment

by:aboywong
ID: 9808421
Squeebee,

  Thx a lot. That link really provide some useful info. However, I think I still need to run my own benchmark....
By the way, I got MySQL 4.1.0 installed (via rpm), and I tried to use utf8 as the default character set and met the following error when login:

mysql: File '/usr/share/mysql/charsets/?.conf' not found (Errcode: 2)
mysql: Character set '#33' is not a compiled character set and is not specified in the '/usr/share/mysql/charsets/Index' file

Any hints about it? Does it mean utf8 is not supported (but it should, as in the documentation...)

0
 
LVL 17

Accepted Solution

by:
Squeebee earned 400 total points
ID: 9810408
I would reccomend going to 4.1.1 for utf8 support, as it is one of the newest features. And yes, I reccomend running your own benchmark.
0

Featured Post

Prepare for your VMware VCP6-DCV exam.

Josh Coen and Jason Langer have prepared the latest edition of VCP study guide. Both authors have been working in the IT field for more than a decade, and both hold VMware certifications. This 163-page guide covers all 10 of the exam blueprint sections.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Introduction Since I wrote the original article about Handling Date and Time in PHP and MySQL several years ago, it seemed like now was a good time to update it for object-oriented PHP.  This article does that, replacing as much as possible the pr…
Load balancing is the method of dividing the total amount of work performed by one computer between two or more computers. Its aim is to get more work done in the same amount of time, ensuring that all the users get served faster.
In this video, Percona Solution Engineer Rick Golba discuss how (and why) you implement high availability in a database environment. To discuss how Percona Consulting can help with your design and architecture needs for your database and infrastr…
In this video, Percona Solutions Engineer Barrett Chambers discusses some of the basic syntax differences between MySQL and MongoDB. To learn more check out our webinar on MongoDB administration for MySQL DBA: https://www.percona.com/resources/we…

752 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question