Solved

Regular expression to locate words ending in a particular format.

Posted on 2013-11-10
18
374 Views
Last Modified: 2013-11-11
Hello Experts,
I need to locate all the words ending with the letters "an" in a text file using notepad++. Can you please suggest some regular expressions for the same?

Thanks.
0
Comment
Question by:sukhoi35
  • 5
  • 5
  • 4
  • +3
18 Comments
 
LVL 48

Accepted Solution

by:
Rgonzo1971 earned 50 total points
ID: 39637148
Hi,

pls try

\w*an\b

Open in new window


regards
0
 

Author Comment

by:sukhoi35
ID: 39637162
Hey, thanks for the response.
But this didn't work.

Screenshot
0
 
LVL 30

Expert Comment

by:Marco Gasi
ID: 39637166
I used

(an)$
0
 

Author Comment

by:sukhoi35
ID: 39637173
Hi marqusG,
Your expression is only locating the last words in every line ending with the letter "an"

For example:

Swan Duck Right Man
Ocean Sea Fish Fan

If I use the expression (an)$ on the above lines, only the words "man" and "fan" are located. Observe that even Swan and Ocean end with "an', but they are not located.
0
 
LVL 30

Assisted Solution

by:Marco Gasi
Marco Gasi earned 150 total points
ID: 39637179
You're right:

(an)\b
0
 

Author Comment

by:sukhoi35
ID: 39637184
Even (an)\b isn't working.

Screen capture
0
 
LVL 30

Expert Comment

by:Marco Gasi
ID: 39637188
Don't knowHere it works
0
 
LVL 10

Expert Comment

by:mark_harris231
ID: 39637282
Try .*?an[\s|\r|\n]
0
 
LVL 35

Assisted Solution

by:Terry Woods
Terry Woods earned 300 total points
ID: 39637461
@mark_harris231, your pattern will fail to match words ending in an which don't have a space character after them. A full stop after the word would cause it to fail.

This worked for me, with the Match case option off:
[a-z]*an(?![a-z])

The (?!...) part of the pattern is called a negative lookahead, which in this case means don't match if a letter is found immediately after "an"

Change the [a-z] to \w if you want to treat the underscore character as a word character. Not that the word "can't" will be found; if that's a problem, I'll adjust the pattern for you.
0
Enabling OSINT in Activity Based Intelligence

Activity based intelligence (ABI) requires access to all available sources of data. Recorded Future allows analysts to observe structured data on the open, deep, and dark web.

 
LVL 9

Expert Comment

by:Derek Jensen
ID: 39639726
Looking for words ending in 'an':

(\w+?an)[^a-z]

Open in new window

0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 39639765
@bigdogdman, that pattern requires a character after the word. For real data, it would probably be fine as a I think the only case it would miss would be a word at the very end of the file, however the negative lookahead works fine in notepad++ and covers all cases.
0
 
LVL 9

Expert Comment

by:Derek Jensen
ID: 39639853
@Terry,

Actually, it doesn't. Granted, I haven't tried it in NotePad++, but unless it has a really sucky Regex engine, [^a-z] should match $ or \n or \r...

Again, untested, so...could still be wrong. :-)
0
 
LVL 30

Expert Comment

by:Marco Gasi
ID: 39639872
Sorry, I'm anovice of regex: can you explain why in my pc works what I shown in post ID 39637188

I don't want points but only understand why (an)\b works in my Notepad++ on Win7 but not for sukhoi35: I really don't understand this...
0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 39639875
I tested the negative lookahead in notepad++ and it was fine...

I've never encountered any regex engine that matches an end-of-line or end-of-file (which aren't a character of any sort) with a character set such as [^a-z], but you're correct it will match a \n or \r character, or a literal dollar sign character. Notepad++ works as expected. I'll keep an open mind for other languages though, as some regex engines have some pretty unexpected behaviour!

Note that Notepad++ is a free download, for anyone who hasn't used it before.
0
 
LVL 35

Assisted Solution

by:Terry Woods
Terry Woods earned 300 total points
ID: 39639924
@marqusG, doing some more testing in Notepad++ (I have version 6.4.2), your original pattern worked fine for me:
\w*an\b

Open in new window

as did the pattern:
(an)\b

Open in new window

though the round brackets are unnecessary.

The only possible faults with it are unlikely to be a problem in real data:
\w will also match an underscore character or number
\b matches a boundary between a word character (\w) and a non-word character. Again, because \w matches an underscore or a number, there may be odd cases missed, although it may not be a problem
eg
ocean1 would not be picked up
ocean_ would not be picked up
With pattern \w*an\b a value of 2345_2345an can be matched.

@sukhoi35, perhaps you'd like to retry using pattern \w*an\b ? It's simpler than the negative lookahead that I suggested.
0
 
LVL 30

Expert Comment

by:Marco Gasi
ID: 39639974
Thanks for your answer, Terry: unfortunately, I can't give you points ;)
0
 

Author Closing Comment

by:sukhoi35
ID: 39640660
Thanks!
0
 

Author Comment

by:sukhoi35
ID: 39640664
Hi Guys,
By the way, upgrading to notepad++ v6.5.1 from the existing v5.9.8 helped me use some of the solutions posted earlier. Looks like there were problems with the earlier versions which got fixed recently. Thanks all for your time!
0

Featured Post

How to run any project with ease

Manage projects of all sizes how you want. Great for personal to-do lists, project milestones, team priorities and launch plans.
- Combine task lists, docs, spreadsheets, and chat in one
- View and edit from mobile/offline
- Cut down on emails

Join & Write a Comment

Suggested Solutions

Title # Comments Views Activity
CKEditor - will not function 7 36
radio button value 3 47
HTML Lists 3 29
Urgent need help ASAP With CSS 5 25
Showing your events from Google Calendar in Google Maps Why? I travel all week and I thought it would be ideal if staff in office knew where I was based on my calendar. (OK real reason: my son wanted to see where I would be working, and I thoug…
Jaspersoft Studio is a plugin for Eclipse that lets you create reports from a datasource.  In this article, we'll go over creating a report from a default template and setting up a datasource that connects to your database.
In this tutorial viewers will learn how to embed videos in a webpage using HTML5. Ensure your DOCTYPE declaration is set to HTML5: "<!DOCTYPE html>": Use the <video> tag to insert a video. Define the src as the URL of your video; this is similar to …
The viewer will learn how to synchronize PHP projects with a remote server in NetBeans IDE 8.0 for Windows.

743 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

15 Experts available now in Live!

Get 1:1 Help Now