Solved

regex expression for up to but not including and split

Posted on 2010-11-16
9
436 Views
Last Modified: 2012-05-10
I have a line of text

blah blah blah;blah blah blah

I'm trying to formulate a regex expression to capture everything up to the semicolon.

On a second regex or to replace the earlier, I basically need to split up a string separated by semicolons.

Thanks in advance.
0
Comment
Question by:kblackwel
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 6
  • 3
9 Comments
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150009
$line = preg_replace("/;.*/","",$line);
0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150028
To split a string separated by semicolons, use:

$my_array = preg_split("/;/",$string);
0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150035
Ah, I made an assumption you're using PHP - which language are you using?
0
Get 15 Days FREE Full-Featured Trial

Benefit from a mission critical IT monitoring with Monitis Premium or get it FREE for your entry level monitoring needs.
-Over 200,000 users
-More than 300,000 websites monitored
-Used in 197 countries
-Recommended by 98% of users

 

Author Comment

by:kblackwel
ID: 34150163
It's for a ETL, so it has to be regex
0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150190
Ok, well the patterns I've provided are valid, but the first one requires use of a replace command rather than a capture.

To capture text before the first ; character, use pattern:

^(.*?);
0
 
LVL 35

Expert Comment

by:Terry Woods
ID: 34150218
Does the string have a fixed (or limited) number of fields? Depending on your regex tool, you might be able to use the following to split a string:

(?:([^;]*);)*

or, with a limited number of fields, something like this:

([^;]*);([^;]*);([^;]*);([^;]*);([^;]*)
0
 

Author Comment

by:kblackwel
ID: 34150535
Thank you,

For my needs, all I needed was

([^;]*)

That matched blah blah blah up to the semicolon.

The field I'm trying to parse would only have 2 semicolons in it max.

Any thoughts on matching the second blah blah blah past the first semicolon.

Again, thanks on the first solution.

Basically I'm pulling a string out of a DB table. In my etl program, I don't have access to split or anything like that. But regex is available. Trying to use that to parse an address table row that is delimited by semicolons.
0
 

Author Comment

by:kblackwel
ID: 34150542
Actually from the semicolon to end of line would be fine too.
0
 
LVL 35

Accepted Solution

by:
Terry Woods earned 125 total points
ID: 34150610
From the first semi-colon to the end of the line:

;(.*)

0

Featured Post

The Ultimate Checklist to Optimize Your Website

Websites are getting bigger and complicated by the day. Video, images, custom fonts are all great for showcasing your product/service. But the price to pay in terms of reduced page load times and ultimately, decreased sales, can lead to some difficult decisions about what to cut.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

When we want to run, execute or repeat a statement multiple times, a loop is necessary. This article covers the two types of loops in Python: the while loop and the for loop.
Whether you’re a college noob or a soon-to-be pro, these tips are sure to help you in your journey to becoming a programming ninja and stand out from the crowd.
The viewer will learn additional member functions of the vector class. Specifically, the capacity and swap member functions will be introduced.
This video will show you how to get GIT to work in Eclipse.   It will walk you through how to install the EGit plugin in eclipse and how to checkout an existing repository.

734 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question