Solved

discover sentences

Posted on 2001-07-16
2
641 Views
Last Modified: 2006-11-17
Hello.

I have some English text which I want to process.
I seem to be having difficulties splitting the text into sentences.

Can someone here suggest a code for that?
0
Comment
Question by:huitema
  • 2
2 Comments
 
LVL 8

Expert Comment

by:shlomoy
ID: 6288655
Check out a perl module I wrote and is available in CPAN:
http://search.cpan.org/search?dist=Lingua-EN-Sentence

The module's name is Lingua::EN::Sentence

Note that there is another module in CPAN trying to do the same (but in my opinion fails in many places where mine doesn't) - and its name is Text::Sentence
0
 
LVL 8

Accepted Solution

by:
shlomoy earned 300 total points
ID: 6288657
SYNOPSIS

        use Lingua::EN::Sentence qw( get_sentences add_acronyms );


        add_acronyms('lt','gen');               ## adding support for 'Lt. Gen.'
        my $sentences=get_sentences($text);     ## Get the sentences.
        foreach my $sentence (@$sentences) {
                ## do something with $sentence
        }

0

Featured Post

How your wiki can always stay up-to-date

Quip doubles as a “living” wiki and a project management tool that evolves with your organization. As you finish projects in Quip, the work remains, easily accessible to all team members, new and old.
- Increase transparency
- Onboard new hires faster
- Access from mobile/offline

Join & Write a Comment

On Microsoft Windows, if  when you click or type the name of a .pl file, you get an error "is not recognized as an internal or external command, operable program or batch file", then this means you do not have the .pl file extension associated with …
In the distant past (last year) I hacked together a little toy that would allow a couple of Manager types to query, preview, and extract data from a number of MongoDB instances, to their tool of choice: Excel (http://dilbert.com/strips/comic/2007-08…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…
Get a first impression of how PRTG looks and learn how it works.   This video is a short introduction to PRTG, as an initial overview or as a quick start for new PRTG users.

743 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

12 Experts available now in Live!

Get 1:1 Help Now