kblackwel
asked on
regex expression for up to but not including and split
I have a line of text
blah blah blah;blah blah blah
I'm trying to formulate a regex expression to capture everything up to the semicolon.
On a second regex or to replace the earlier, I basically need to split up a string separated by semicolons.
Thanks in advance.
blah blah blah;blah blah blah
I'm trying to formulate a regex expression to capture everything up to the semicolon.
On a second regex or to replace the earlier, I basically need to split up a string separated by semicolons.
Thanks in advance.
$line = preg_replace("/;.*/","",$l ine);
To split a string separated by semicolons, use:
$my_array = preg_split("/;/",$string);
$my_array = preg_split("/;/",$string);
Ah, I made an assumption you're using PHP - which language are you using?
ASKER
It's for a ETL, so it has to be regex
Ok, well the patterns I've provided are valid, but the first one requires use of a replace command rather than a capture.
To capture text before the first ; character, use pattern:
^(.*?);
To capture text before the first ; character, use pattern:
^(.*?);
Does the string have a fixed (or limited) number of fields? Depending on your regex tool, you might be able to use the following to split a string:
(?:([^;]*);)*
or, with a limited number of fields, something like this:
([^;]*);([^;]*);([^;]*);([ ^;]*);([^; ]*)
(?:([^;]*);)*
or, with a limited number of fields, something like this:
([^;]*);([^;]*);([^;]*);([
ASKER
Thank you,
For my needs, all I needed was
([^;]*)
That matched blah blah blah up to the semicolon.
The field I'm trying to parse would only have 2 semicolons in it max.
Any thoughts on matching the second blah blah blah past the first semicolon.
Again, thanks on the first solution.
Basically I'm pulling a string out of a DB table. In my etl program, I don't have access to split or anything like that. But regex is available. Trying to use that to parse an address table row that is delimited by semicolons.
For my needs, all I needed was
([^;]*)
That matched blah blah blah up to the semicolon.
The field I'm trying to parse would only have 2 semicolons in it max.
Any thoughts on matching the second blah blah blah past the first semicolon.
Again, thanks on the first solution.
Basically I'm pulling a string out of a DB table. In my etl program, I don't have access to split or anything like that. But regex is available. Trying to use that to parse an address table row that is delimited by semicolons.
ASKER
Actually from the semicolon to end of line would be fine too.
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.