wrobel
asked on
Parsing dates - Any papers/books?
Dear Sir,
I need to write a program to parse dates in many
various formats: e.g.
11th Century
circa 1565
1900-1910
12-may-1998:14-june-1998
Eleventh century
Early 1300's
etc
Are there any journals/papers/books that discuss
ways of analysing such text?
Yours
Allan
cairs-support@cairs.co.uk
I need to write a program to parse dates in many
various formats: e.g.
11th Century
circa 1565
1900-1910
12-may-1998:14-june-1998
Eleventh century
Early 1300's
etc
Are there any journals/papers/books that discuss
ways of analysing such text?
Yours
Allan
cairs-support@cairs.co.uk
ASKER
Dear Patrick,
Thanks for replying. I know lex/yacc. But it's the
intellectual effort of working out a BNF (formal grammar)
that can pick out dates from text, that's really what I'm
looking for. Failing that some papers discussing the subject
would come in useful.
Yours
Allan
Thanks for replying. I know lex/yacc. But it's the
intellectual effort of working out a BNF (formal grammar)
that can pick out dates from text, that's really what I'm
looking for. Failing that some papers discussing the subject
would come in useful.
Yours
Allan
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
i suppose you want to extract these dates from files containing text in natural language.
parsing natural language is, however, a rather complicated thing to do. but if you can
narrow your date formats down to a few dozen, it should be possible to utilize
the lex/yacc pair of utilities used to construct compilers for computer languages.
lex and yacc are unix tools and documentation is widely available. give it a try.
if you need more, you should consider using a grammar used for natural language
processing. unfortunately, i cannot point you to any information in this area.
patrick