Solved

Parsing time zone database

Posted on 2013-10-30
10
762 Views
Last Modified: 2013-11-11
We have time zone objects in our code that we fill from a time zone file that we maintain manually. This is silly because there is a nice one (Olson's) that already exists on Linux and in glibc so it should be readily available to our compiler.

What I'm trying to do is parse through the time zone database to fill our time zone structures. I'm having trouble figuring out the best way to do this. I don't really want to parse the entire file and match all the current rules to the current time zone definitions etc. I only care about the current rules. I assume there is a better way to do this than starting from scratch with the tz database text files. I'm trying to produce something like this for all of them:
Zone name        offset DST? dst_mo   dst_day  dst_hr  std_mo std_day std_hr
UTC                0:00    0
America/New_York  -5:00    1      3    Sun>=8       2      11  Sun>=1      2
Europe/Paris      +1:00    1      3   lastSun       1      10 lastSun      1

Open in new window

Isn't there some good way to call into time.h or something to get this? I can't seem to find one.
0
Comment
Question by:TommySzalapski
  • 6
  • 4
10 Comments
 
LVL 13

Expert Comment

by:frankhelk
ID: 39620192
I havn't seen the original files, but it can't be that complicated to parse them.
Could you attach one of these files ?

Other idea: Maybe there's something useful in the TimeZoneInfo class (MS C#/C++) ?
0
 
LVL 37

Author Comment

by:TommySzalapski
ID: 39621769
I don't need to attach them, they are publicly available and already on your computer (even in you use MS which it appears you do, although I think they hide it somewhat).
IANA officially maintains them.
http://www.iana.org/time-zones

I use Linux (note the zones) so MS and C# don't really help.

Again. I certainly could write a parser for the files, but since they are built into GCC, surely there is a better way.
0
 
LVL 13

Expert Comment

by:frankhelk
ID: 39623700
Ooops - mybe I was not fully awake when reading the question. Sorry.

Have you checked out the ressouce links there ?

http://www.iana.org/time-zones/repository/tz-link.html

I've just read over them, but it looks like there's some useful code there ...
0
 
LVL 37

Author Comment

by:TommySzalapski
ID: 39624245
But if all of that data is already built into glibc, then why should I be reinventing the wheel and writing a parser for a flat text file?
0
 
LVL 13

Expert Comment

by:frankhelk
ID: 39624526
OK - I reverse my thought about parsing ... just use the already invented wheel and utilize the routines/classes in glibc (I've not looked over those routines, but I bet they're an equal replacement for the C#/VC classes I've mentioned while being on the Windows rail ...).

While thinking abut, how about that: The files from the IANA site (the TZ database) contains a makfile to generate binary tz files, which I presume glibc reads.

How about just adding creating some kind of addendum with your own time zone to one of the original tz files and appending it to the dl'd file automatically when new IANA files are available. When you subsequently compile the files with the IANA makefile to get the binary tz files, there should not be much in the way to use them and have your own tz available with the glibc stuff ?
0
Do You Know the 4 Main Threat Actor Types?

Do you know the main threat actor types? Most attackers fall into one of four categories, each with their own favored tactics, techniques, and procedures.

 
LVL 37

Author Comment

by:TommySzalapski
ID: 39625859
Hmm... still not quite awake yet? :)
I'm not trying to add new timezones, I'm trying to get the ones that are already in glibc and put them into our structures.

Anyway, I tried to find what I needed in time.h and other files but had no success.
I suppose I could parse the binary files which would be a bit easier, but I'd prefer library calls.
0
 
LVL 13

Assisted Solution

by:frankhelk
frankhelk earned 250 total points
ID: 39628309
Hmmm ... OK. Weird thing, that brain of mine.

Nevertheless, I've browsed a bit thru http://www.gnu.org/software/libc/manual/pdf/libc.pdf

Seems it deals with timezones here and there, and it's possibly more informative than time.h ... but I presume you've read that, too ...

Just a tought - while glibc incorporates  the info of the IANA files, their data needs to be included into the sources of glibc in some way. Maybe a peek into these sources might lead to some kind of back door to that data, or you could possibly insert a custom extension into that code somewhere to achive that goal ?
0
 
LVL 37

Accepted Solution

by:
TommySzalapski earned 0 total points
ID: 39628404
You are correct. I already read the info in the pdf and there's nothing particularly useful.
Peeking into the glibc code isn't a bad idea, I'm guessing they get the data from where I just found it.
I've done some searching and here is what I've found so far.

There are binary files (super easy to 'parse') stored here: /usr/share/zoneinfo
You can get the most recent text files here: http://www.iana.org/time-zones
And compile them into the binary format using zic on the command line (time zone compiler that ships with Linux).

I think we'll end up going that route. I do appreciate your time trying to help me figure this out.
0
 
LVL 37

Author Comment

by:TommySzalapski
ID: 39628414
The IANA also publishes a library that you can link with to get the data. I haven't really messed with it, but you can get to it here: http://www.iana.org/time-zones
0
 
LVL 37

Author Closing Comment

by:TommySzalapski
ID: 39638304
I think I found the best solution for my needs. As a side note, boost's timezone stuff is a bit more fleshed out than the time.h stuff. Anyone reading this in the future might want to look there: http://www.boost.org/doc/libs/1_54_0/doc/html/date_time.html
0

Featured Post

IT, Stop Being Called Into Every Meeting

Highfive is so simple that setting up every meeting room takes just minutes and every employee will be able to start or join a call from any room with ease. Never be called into a meeting just to get it started again. This is how video conferencing should work!

Join & Write a Comment

A short article about a problem I had getting the GPS LocationListener working.
Although it can be difficult to imagine, someday your child will have a career of his or her own. He or she will likely start a family, buy a home and start having their own children. So, while being a kid is still extremely important, it’s also …
The goal of the video will be to teach the user the difference and consequence of passing data by value vs passing data by reference in C++. An example of passing data by value as well as an example of passing data by reference will be be given. Bot…
The viewer will learn additional member functions of the vector class. Specifically, the capacity and swap member functions will be introduced.

708 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

12 Experts available now in Live!

Get 1:1 Help Now