wrobel

Parsing dates - Any papers/books?

Dear Sir,
   I need to write a program to parse dates in many
different formats, e.g.
    11th Century
    circa 1565
    1900-1910
    12-may-1998:14-june-1998
    Eleventh century
    Early 1300's
    etc
    Are there any journals/papers/books that discuss
    ways of analysing such text?
    Yours
     Allan
    cairs-support@cairs.co.uk  
ppeck

hi allan,

i suppose you want to extract these dates from files containing text in natural language.
parsing natural language is, however, a rather complicated thing to do. but if you can
narrow your date formats down to a few dozen, it should be possible to utilize
the lex/yacc pair of utilities used to construct compilers for computer languages.
lex and yacc are unix tools and documentation is widely available. give it a try.
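As a rough illustration of the lexing half of that approach, here is a minimal sketch in Python rather than lex itself. The token names (ORDINAL, CIRCA, QUALIFIER and so on) and the `tokenize` helper are hypothetical, chosen only to cover the example formats in the question; a real lex specification would look different.

```python
import re

MONTHS = ("january february march april may june july august "
          "september october november december").split()
# Accept full month names and three-letter abbreviations (an assumption).
MONTH_PATTERN = "|".join(f"{m}|{m[:3]}" for m in MONTHS)

# Hypothetical token set. Unlike lex, which prefers the longest match,
# Python's alternation takes the first alternative that matches, so the
# more specific rules are listed before the general ones.
TOKEN_SPEC = [
    ("ORDINAL",   r"\d+(?:st|nd|rd|th)\b"),
    ("NUMBER",    r"\d+"),
    ("CIRCA",     r"circa\b"),
    ("CENTURY",   r"century\b"),
    ("QUALIFIER", r"(?:early|mid|late)\b"),
    ("MONTH",     rf"(?:{MONTH_PATTERN})\b"),
    ("PLURAL",    r"'s"),
    ("SEP",       r"[-:]"),
    ("WORD",      r"[a-z]+"),
    ("SKIP",      r"\s+"),
    ("MISMATCH",  r"."),
]
MASTER = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC),
    re.IGNORECASE,
)


def tokenize(text):
    """Return a list of (kind, lowercased text) pairs for a date phrase."""
    tokens = []
    for match in MASTER.finditer(text):
        kind = match.lastgroup
        if kind == "SKIP":
            continue  # whitespace carries no meaning here
        if kind == "MISMATCH":
            raise ValueError(f"unexpected character {match.group()!r}")
        tokens.append((kind, match.group().lower()))
    return tokens


for phrase in ["11th Century", "Early 1300's", "12-may-1998:14-june-1998"]:
    print(phrase, "->", tokenize(phrase))
```

The token stream would then be handed to a yacc-style grammar, which is where the different date formats are actually told apart.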

if you need more, you should consider using a grammar used for natural language
processing. unfortunately, i cannot point you to any information in this area.

patrick

wrobel

ASKER

Dear Patrick,
     Thanks for replying. I know lex/yacc. What I'm really
looking for is help with the intellectual effort of working
out a BNF (formal grammar) that can pick out dates from text.
Failing that, some papers discussing the subject would be
useful.
     Yours
     Allan
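
For concreteness, one possible shape for such a grammar is sketched below as a small recursive-descent parser in Python. It is only an illustration covering the six examples from the question, not an answer taken from this thread. The grammar, the names `parse_years`, `CIRCA_MARGIN` and `THIRDS`, and the interpretation rules are all assumptions: "circa" is taken as plus or minus 5 years, "early/mid/late" as thirds of a hundred-year span, the 11th century as 1001 to 1100, and every result is reduced to an inclusive range of years (days and months are recognised but discarded).

```python
import re

# Hypothetical grammar (an assumption for illustration):
#   expr   ::= period [ ("-" | ":") period ]
#   period ::= "circa" NUMBER
#            | QUALIFIER NUMBER "'s"
#            | ORDWORD "century"
#            | NUMBER SUFFIX "century"
#            | NUMBER "-" MONTH "-" NUMBER
#            | NUMBER
ORDWORDS = ("first second third fourth fifth sixth seventh eighth ninth tenth "
            "eleventh twelfth thirteenth fourteenth fifteenth sixteenth "
            "seventeenth eighteenth nineteenth twentieth").split()
FULL_MONTHS = ("january february march april may june july august "
               "september october november december").split()
MONTHS = set(FULL_MONTHS) | {m[:3] for m in FULL_MONTHS}
SUFFIXES = ("st", "nd", "rd", "th")
CIRCA_MARGIN = 5  # assumed tolerance for "circa", in years
THIRDS = {"early": (0, 33), "mid": (34, 66), "late": (67, 99)}  # assumed split


def century(n):
    """Inclusive year range of the n-th century, e.g. 11 -> (1001, 1100)."""
    return (n - 1) * 100 + 1, n * 100


class Parser:
    def __init__(self, text):
        # Crude tokenizer: characters matching no alternative are skipped.
        self.toks = re.findall(r"\d+|[a-z]+|'s|[-:]", text.lower())
        self.pos = 0

    def peek(self, offset=0):
        i = self.pos + offset
        return self.toks[i] if i < len(self.toks) else None

    def take(self):
        tok = self.peek()
        if tok is None:
            raise ValueError("unexpected end of input")
        self.pos += 1
        return tok

    def expect(self, wanted):
        got = self.take()
        if got != wanted:
            raise ValueError(f"expected {wanted!r}, got {got!r}")

    def number(self):
        tok = self.take()
        if not tok.isdigit():
            raise ValueError(f"expected a number, got {tok!r}")
        return int(tok)

    def expr(self):
        first, last = self.period()
        if self.peek() in ("-", ":"):  # a range: keep the end of the second period
            self.take()
            last = self.period()[1]
        if self.peek() is not None:
            raise ValueError(f"unexpected token {self.peek()!r}")
        return first, last

    def period(self):
        tok = self.take()
        if tok == "circa":
            year = self.number()
            return year - CIRCA_MARGIN, year + CIRCA_MARGIN
        if tok in THIRDS:
            start = self.number()
            self.expect("'s")
            low, high = THIRDS[tok]
            return start + low, start + high
        if tok in ORDWORDS:
            self.expect("century")
            return century(ORDWORDS.index(tok) + 1)
        if tok.isdigit():
            n = int(tok)
            if self.peek() in SUFFIXES:  # "11th century"
                self.take()
                self.expect("century")
                return century(n)
            if self.peek() == "-" and self.peek(1) in MONTHS:  # "12-may-1998"
                self.take()  # "-"
                self.take()  # month name, discarded in this sketch
                self.expect("-")
                year = self.number()
                return year, year
            return n, n  # a bare year
        raise ValueError(f"unexpected token {tok!r}")


def parse_years(text):
    """Parse one date phrase into an inclusive (first_year, last_year) pair."""
    return Parser(text).expr()


for phrase in ["11th Century", "circa 1565", "1900-1910",
               "12-may-1998:14-june-1998", "Eleventh century", "Early 1300's"]:
    print(f"{phrase!r} -> {parse_years(phrase)}")
```

The hard part is less the BNF than deciding what each phrase should mean numerically; keeping those decisions in named constants, as above, makes them easy to revisit.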
ASKER CERTIFIED SOLUTION
hougaard
