Solved

How can I optimize this mysql database for better performance?

Posted on 2015-02-09
2
133 Views
Last Modified: 2015-02-12
Hello there.  I'm trying to figure out the best way of optimizing a very large database so that the queries go as fast as possible and need some expert advice because i've never dealt with a database this large.  Ok..so here are the technical aspects of the mysql database:

database size is 92.5 gigabytes
1 table called "logs" with 265,321,000 rows!  yes...265.3 million rows

index length: 34.45 GB (36,987,469,824)
data length: 48.53 GB (52,105,838,592)

here are the fields in the table:
/*
Navicat MySQL Data Transfer

Source Server         : localhost-mydbase
Source Server Version : 50623
Source Host           : localhost:3307
Source Database       : mydbase

Target Server Type    : MYSQL
Target Server Version : 50623
File Encoding         : 65001

Date: 2015-02-09 08:45:05
*/

SET FOREIGN_KEY_CHECKS=0;

-- ----------------------------
-- Table structure for logs
-- ----------------------------
DROP TABLE IF EXISTS `logs`;
CREATE TABLE `logs` (
  `id` bigint(255) unsigned NOT NULL AUTO_INCREMENT,
  `username` varchar(255) DEFAULT NULL,
  `allowed` int(255) DEFAULT NULL,
  `url` longtext,
  `dtime` datetime DEFAULT NULL,
  `category` varchar(255) DEFAULT NULL,
  `compname` varchar(255) DEFAULT NULL,
  `srcip` varchar(100) DEFAULT NULL,
  PRIMARY KEY (`id`),
  KEY `user` (`username`),
  KEY `allow` (`allowed`),
  KEY `dtime1` (`dtime`),
  KEY `cat1` (`category`),
  KEY `compname1` (`compname`),
  KEY `srcip1` (`srcip`)
) ENGINE=InnoDB AUTO_INCREMENT=265790858 DEFAULT CHARSET=utf8;

First off, what server specs should i be looking for in order to query a database of this size?  memory..etc etc...

Most queries against this database are going to return all fields looking for a specific username within a specified date range.  Occasionally there will be some queries against the url table using a '%LIKE%' clause that will look for URL's that contain a specified domain...such as, give me the date and time, username and ip address for the user that visited espn.com from September of last year to September of this year.

This database will continue to grow and i'm sure there is a limit somewhere that will force me to create another table or re-structure this thing.  I've just never dealt with this much data before and really need to hear from people who understand mysql and dealing with a database of this size.  What can I do to make this quick and responsive?  Are my indexes setup properly?  should the database be broken up into separate tables?  Just need to know (thinking towards the future) the best way to structure this beast so I can get the fastest possible results from the queries I'm needing.  Any suggestions or modifications are greatly appreciated as I can destroy this table and redo the entire thing if I need to so all suggestions are very welcomed!

Oh almost forgot....should i even be using mysql for this?  i can use MS sql server if need be but you know, free is really awesome!

Also, this query took 156 seconds which is just too long:
SELECT `logs`.id, `logs`.username, `logs`.allowed, `logs`.url, `logs`.dtime, `logs`.category, `logs`.compname, `logs`.srcip FROM `logs` WHERE `logs`.username = 'myuser' AND `logs`.dtime > '2014-12-28 02:13:46' AND `logs`.dtime < '2015-01-05 12:06:29'

Thanks very much in advance!
0
Comment
Question by:linuxrox
  • 2
2 Comments
 
LVL 32

Accepted Solution

by:
Daniel Wilson earned 500 total points
ID: 40598559
Your table is too large to perform SELECTs quickly. Some kind of restructuring is in order.

Possibly some kind of aggregation of the records.  Possibly partitioning so that each month has its own table.
0
 
LVL 32

Assisted Solution

by:Daniel Wilson
Daniel Wilson earned 500 total points
ID: 40598565
0

Featured Post

Networking for the Cloud Era

Join Microsoft and Riverbed for a discussion and demonstration of enhancements to SteelConnect:
-One-click orchestration and cloud connectivity in Azure environments
-Tight integration of SD-WAN and WAN optimization capabilities
-Scalability and resiliency equal to a data center

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

If you get continual lockouts after changing your Active Directory password, there are several possible reasons.  Two of the most common are using other devices to access your email and stored passwords in the credential manager of windows.
While rebooting windows server 2003 server , it's showing "active directory rebuilding indices please wait" at startup. It took a little while for this process to complete and once we logged on not all the services were started so another reboot is …
This is used to tweak the memory usage for your computer, it is used for servers more so than workstations but just be careful editing registry settings as it may cause irreversible results. I hold no responsibility for anything you do to the regist…
Get a first impression of how PRTG looks and learn how it works.   This video is a short introduction to PRTG, as an initial overview or as a quick start for new PRTG users.

821 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question