Solved

Regex to capture text up to (but not including) a semicolon, and to split on semicolons

Posted on 2010-11-16
Last Modified: 2012-05-10
I have a line of text

blah blah blah;blah blah blah

I'm trying to write a regex that captures everything up to, but not including, the semicolon.

With a second regex, or one that replaces the first, I need to split a string that is separated by semicolons.

Thanks in advance.
Question by:kblackwel
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150009
$line = preg_replace("/;.*/","",$line);
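For anyone not on PHP, here is a minimal sketch of the same replace approach in Python. Python's re module is only an assumed stand-in for whatever regex engine is actually in use; the sample string is the one from the question.

```python
import re

line = "blah blah blah;blah blah blah"

# Delete the first semicolon and everything after it.
# "." does not match a newline by default, so this acts on one line.
first_part = re.sub(r";.*", "", line)

print(first_part)  # blah blah blah
```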
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150028
To split a string separated by semicolons, use:

$my_array = preg_split("/;/",$string);
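A rough Python equivalent of the split, again assuming Python's re module purely for illustration (the variable names are made up):

```python
import re

text = "blah blah blah;blah blah blah"

# Split on every semicolon; the semicolons themselves are discarded.
fields = re.split(r";", text)

print(fields)  # ['blah blah blah', 'blah blah blah']
```

An empty field between two adjacent semicolons comes back as an empty string rather than being dropped.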
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150035
Ah, I made an assumption you're using PHP - which language are you using?
 

Author Comment

by:kblackwel
ID: 34150163
It's for an ETL tool, so it has to be a plain regex.

 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150190
Ok, well the patterns I've provided are valid, but the first one requires use of a replace command rather than a capture.

To capture text before the first ; character, use pattern:

^(.*?);
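The lazy quantifier matters once there is more than one semicolon. A small sketch in Python (assumed here only as a convenient engine to test with; the sample string is hypothetical) comparing the lazy and greedy forms:

```python
import re

line = "first part;second part;third part"

# Lazy ".*?" stops at the first semicolon.
lazy = re.search(r"^(.*?);", line)
# Greedy ".*" runs on to the last semicolon.
greedy = re.search(r"^(.*);", line)

print(lazy.group(1))    # first part
print(greedy.group(1))  # first part;second part
```

Note that the pattern finds nothing at all when the line contains no semicolon.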
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150218
Does the string have a fixed (or limited) number of fields? Depending on your regex tool, you might be able to use the following to split a string:

(?:([^;]*);)*

or, with a limited number of fields, something like this:

([^;]*);([^;]*);([^;]*);([^;]*);([^;]*)
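How these two patterns behave does depend on the engine. As one data point, here is a sketch using Python's re module (an assumption for illustration, with a hypothetical five-field row): the fixed pattern exposes every field as its own group, while the repeated group retains only its last repetition.

```python
import re

row = "a;b;c;d;e"  # hypothetical row with exactly five fields

# Fixed number of fields: one capture group per field.
fixed = re.fullmatch(r"([^;]*);([^;]*);([^;]*);([^;]*);([^;]*)", row)
print(fixed.groups())  # ('a', 'b', 'c', 'd', 'e')

# Repeated group: Python keeps only the last successful repetition,
# and the final field is not matched because no semicolon follows it.
repeated = re.match(r"(?:([^;]*);)*", row)
print(repeated.group(0))  # a;b;c;d;
print(repeated.group(1))  # d
```

Some engines (.NET, for example) keep every repetition of a repeated group, so it is worth testing in the actual tool.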
 

Author Comment

by:kblackwel
ID: 34150535
Thank you,

For my needs, all I needed was

([^;]*)

That matched blah blah blah up to the semicolon.

The field I'm trying to parse would only have 2 semicolons in it max.

Any thoughts on matching the second blah blah blah, the part past the first semicolon?

Again, thanks for the first solution.

Basically, I'm pulling a string out of a DB table. My ETL program doesn't give me access to split or anything like that, but regex is available, so I'm trying to use it to parse an address table row that is delimited by semicolons.
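With at most two semicolons, one option is a separate anchored pattern per field position. A minimal sketch, assuming Python's re module as a stand-in for the ETL tool's engine; the address row and the per-position patterns are hypothetical illustrations, not something posted in this thread:

```python
import re

# Hypothetical address row with the maximum of two semicolons.
row = "12 Main St;Springfield;12345"

# One anchored pattern per field position; group 1 holds the field text.
patterns = {
    1: r"^([^;]*)",
    2: r"^[^;]*;([^;]*)",
    3: r"^[^;]*;[^;]*;([^;]*)",
}

fields = {}
for position, pattern in patterns.items():
    match = re.search(pattern, row)
    # Rows with fewer semicolons have no match for the later positions.
    fields[position] = match.group(1) if match else None

print(fields)  # {1: '12 Main St', 2: 'Springfield', 3: '12345'}
```

Each pattern skips the earlier fields with `[^;]*;` and captures only the field at its own position, so no split function is needed.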
 

Author Comment

by:kblackwel
ID: 34150542
Actually from the semicolon to end of line would be fine too.
 
LVL 35

Accepted Solution

by:Terry Woods (earned 125 total points)
ID: 34150610
To capture from just after the first semicolon to the end of the line:

;(.*)
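A quick check of this pattern in Python (assumed only as a test engine), using the sample line from the question plus a hypothetical three-field string:

```python
import re

line = "blah blah blah;blah blah blah"

match = re.search(r";(.*)", line)
rest = match.group(1) if match else None
print(rest)  # blah blah blah

# With two semicolons, the capture keeps the second semicolon and what follows.
longer = re.search(r";(.*)", "a;b;c")
print(longer.group(1))  # b;c
```

Group 1 holds the text after the first semicolon; the semicolon itself is matched but not captured.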
