Solved

Java replaceAll regex

Posted on 2009-04-02
7
1,588 Views
Last Modified: 2013-11-23

 Hi Experts,
 
    I want to replace the following string embeded in an input line
   AFunctionName[startPosition , endPosition]  
   with
   SUBSTR(AFunctionName, startPosition , endPosition)
 
   match requirements:
   1.  space in AFunctionName[startPosition , endPosition],  before and after comma ","  is option;
   2. All names, AFunctionName, startPosition, endPosition are made of  word characters

   I wrote a method to parse it doesn't work, any help is appreciated.

0
Comment
Question by:SarahDeng
  • 4
  • 3
7 Comments
 
LVL 40

Accepted Solution

by:
Richard Quadling earned 300 total points
ID: 24055081
Try this ...
StringBuffer resultString = new StringBuffer();
try {
	Pattern regex = Pattern.compile("(\\w+)\\[(\\w+) , (\\w+)\\]", Pattern.CASE_INSENSITIVE | Pattern.UNICODE_CASE | Pattern.MULTILINE);
	Matcher regexMatcher = regex.matcher(subjectString);
	while (regexMatcher.find()) {
		try {
			// You can vary the replacement text for each match on-the-fly
			regexMatcher.appendReplacement(resultString, "SUBSTR($1, $2, $3)");
		} catch (IllegalStateException ex) {
			// appendReplacement() called without a prior successful call to find()
		} catch (IllegalArgumentException ex) {
			// Syntax error in the replacement text (unescaped $ signs?)
		} catch (IndexOutOfBoundsException ex) {
			// Non-existent backreference used the replacement text
		} 
	}
	regexMatcher.appendTail(resultString);
} catch (PatternSyntaxException ex) {
	// Syntax error in the regular expression
}

Open in new window

0
 

Author Closing Comment

by:SarahDeng
ID: 31566024
Can't expect better and quicker than this!
0
 

Author Comment

by:SarahDeng
ID: 24055341
Hi RQuadling,
Oops! there is one exception.
It doesn't work when there is no space before and after the ","
example:
 No space in AFunctionName[startPosition,endPosition],  before and after comma ","
I change the expression to the following, it doesn't work for me.  Could you help again? Thanks!
 "(\\w+)\\[(\\w+)(\\s*),(\\s*) (\\w+)\\]",
0
Are your AD admin tools letting you down?

Managing Active Directory can get complicated.  Often, the native tools for managing AD are just not up to the task.  The largest Active Directory installations in the world have relied on one tool to manage their day-to-day administration tasks: Hyena. Start your trial today.

 
LVL 40

Expert Comment

by:Richard Quadling
ID: 24055392
Try this ...
StringBuffer resultString = new StringBuffer();
try {
	Pattern regex = Pattern.compile("(\\w+)\\[(\\w+)\\s+?,\\s+?(\\w+)\\]", Pattern.CASE_INSENSITIVE | Pattern.UNICODE_CASE | Pattern.MULTILINE);
	Matcher regexMatcher = regex.matcher(subjectString);
	while (regexMatcher.find()) {
		try {
			// You can vary the replacement text for each match on-the-fly
			regexMatcher.appendReplacement(resultString, "SUBSTR($1, $2, $3)");
		} catch (IllegalStateException ex) {
			// appendReplacement() called without a prior successful call to find()
		} catch (IllegalArgumentException ex) {
			// Syntax error in the replacement text (unescaped $ signs?)
		} catch (IndexOutOfBoundsException ex) {
			// Non-existent backreference used the replacement text
		} 
	}
	regexMatcher.appendTail(resultString);
} catch (PatternSyntaxException ex) {
	// Syntax error in the regular expression
}

Open in new window

0
 

Author Comment

by:SarahDeng
ID: 24055491
I tried... It doesn't work with the case that no space before and after the ","
Like this.
string_variable[startPosition,endPostion]

0
 

Author Comment

by:SarahDeng
ID: 24055529
Changed the expression to
"(\\w+)\\[(\\w+)\\s*,\\s*(\\w+)\\]"
it works.
Can you explain why?
0
 
LVL 40

Expert Comment

by:Richard Quadling
ID: 24057754
Ah Yes.

\s+\s*

Options: case insensitive; ^ and $ match at line breaks

Match a single character that is a whitespace character (spaces, tabs, line breaks, etc.) «\s+?»
   Between one and unlimited times, as few times as possible, expanding as needed (lazy) «+?»

Match a single character that is a whitespace character (spaces, tabs, line breaks, etc.) «\s*»
   Between zero and unlimited times, as many times as possible, giving back as needed (greedy) «*»


I was assuming at least 1 space. Sorry.
0

Featured Post

Netscaler Common Configuration How To guides

If you use NetScaler you will want to see these guides. The NetScaler How To Guides show administrators how to get NetScaler up and configured by providing instructions for common scenarios and some not so common ones.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
map related example 6 47
windows explorer path to command prompt 5 44
servlet example 11 40
tomcat administrtor 12 46
I have been reconstructing a PHP-based application that has grown into a full blown interface system over the last ten years by a developer that has now gone into business for himself building websites. I am not incredibly fond of writing PHP code o…
Basic understanding on "OO- Object Orientation" is needed for designing a logical solution to solve a problem. Basic OOAD is a prerequisite for a coder to ensure that they follow the basic design of OO. This would help developers to understand the b…
This tutorial covers a practical example of lazy loading technique and early loading technique in a Singleton Design Pattern.
This tutorial explains how to use the VisualVM tool for the Java platform application. This video goes into detail on the Threads, Sampler, and Profiler tabs.

770 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question