?
Solved

Use regEx to get substrings

Posted on 2003-03-27
5
Medium Priority
?
222 Views
Last Modified: 2010-04-06
I am trying to extract certain values from a long text file (actually postscript). It is mostly unstructured so I need a flexible parsing method. I think I'm looking for the RegEx counterpart to a SQL Like valuestartswith%valueendswith (or valuestartswith*valueendswith). That is, I know what the match should start with and I know what it ends with but I don't know what's in between.

I put some sample data below. The initial goal is just to retrieve the text between the "/Type" and "end" for each block of data. Alternatively, it would be fine to strip out the other text. It would be even better if I could extract an array containing Type,Q,D,A for each block.

My preferred language would be .Net but I'm not closed to other options.

Here is some sample data
/Type Text
/Q (dept)
/D (sales)
/A (1)
end
/random text
/more random text
/still more random text
/Type Date
/Q (birthdate)
/A (Jan-1-1968)
end
/other stuff
/more other stuff
/Type Text
/Q  (title)
/A (Asst. Marketing VP)
end
0
Comment
Question by:jovball
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
5 Comments
 
LVL 14

Accepted Solution

by:
avner earned 152 total points
ID: 8216489
Here is a sample test case with the regexp :

<html>
<head>
<title>about:blank</title>
<script language="javascript1.2">
<!-- copyright(c) avcoh@yahoo.com
function testRE()
{
var s = document.getElementById("aa").innerText;

var sStart = "/D";
var sEnd = "end";
var oRE = new RegExp("("+sStart+")([^\\t])*?("+sEnd+")","g");
alert(s.match(oRE))
}
-->
</script>
<style>

</style>
</head>
<body onload="testRE()">
<textarea id="aa" cols="20" rows="20">
/Type Text
/Q (dept)
/D (sales)
/A (1)
end
/random text
/more random text
/still more random text
/Type Date
/Q (birthdate)
/A (Jan-1-1968)
end
/other stuff
/more other stuff
/Type Text
/Q  (title)
/A (Asst. Marketing VP)
end
</textarea>
</body>
</html>
0
 
LVL 7

Assisted Solution

by:markhoy
markhoy earned 148 total points
ID: 8226533
0
 
LVL 14

Expert Comment

by:avner
ID: 8426378
jovball, do you need any additional help or can this question be closed ?
0
 
LVL 53

Expert Comment

by:COBOLdinosaur
ID: 9115289
This question has been classified abandoned. I will make a recommendation to the
moderators on its resolution in a week or two. I appreciate any comments
that would help me to make a recommendation.

<note>
Unless it is clear to me that the question has been answered I will recommend delete.  It is possible that a Grade less than A will be given if no expert makes a case for an A grade. It is assumed that any participant not responding to this request is no longer interested in its final disposition.
</note>

If the user does not know how to close the question, the options are here:
http://www.experts-exchange.com/help/closing.jsp


Cd&

0
 
LVL 6

Expert Comment

by:Programming_Gal
ID: 9656067
No comment has been added lately, so it's time to clean up this TA.
I will leave a recommendation in the Cleanup topic area that this question is:

Split between avner and markhoy

Please leave any comments here within the next seven days.

PLEASE DO NOT ACCEPT THIS COMMENT AS AN ANSWER!

Programming_Gal
EE Cleanup Volunteer
0

Featured Post

Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article covers the basics of the Sass, which is a CSS extension language. You will learn about variables, mixins, and nesting.
SASS allows you to treat your CSS code in a more OOP way. Let's have a look on how you can structure your code in order for it to be easily maintained and reused.
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…
This tutorial will teach you the core code needed to finalize the addition of a watermark to your image. The viewer will use a small PHP class to learn and create a watermark.
Suggested Courses

770 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question