Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people, just like you, are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions

Split one file and make them into two

Posted on 2016-08-29
Last Modified: 2016-09-04
//expression to be found in file name
$find = '.5010.';

//directory name
//we will store renamed files here
$dirname = '5010';
    mkdir($dirname, 0777);

//read all files from a directory
//skip directories
$directory_with_files = './';
$dh  = opendir($directory_with_files);
$files = array();
while (false !== ($filename = readdir($dh)))
    if(in_array($filename, array('.', '..')) || is_dir($filename))

    $files[] = $filename;

//iterate collected files
foreach($files as $file)
    //check if file name is matching $find
    if(stripos($file, $find) !== false)

        //open file
        $handle = fopen($file, "r");
        if ($handle)
            //read file, line by line
            while (($line = fgets($handle)) !== false)

                //find REF line
                $refid = 'REF*2U*'; 
                if(stripos($line, $refid) !== false) 
                    //glue refernce numbers 
                    //check if reference number is not empty 

                    $refnumber = str_replace(array($refid, '~'), array('', ''), $line);
                if($refnumber != '') 
                $refnumber = '_'. $refnumber .'_'; 

                $filerenamed = str_replace($find, $refnumber, $file); 
                    copy($file, $dirname . '/' . $filerenamed); 

                echo $refnumber . "\n"; 

            //close file

Open in new window

I have the code above, to read my files that contain ".5010." and replace it with the characters after REF*2U*. However in some files there are more than one REF*2U*, is there a code that would split them through a line called "N1*PR*" and output 2 files each with its own REF*2U* character?
Question by:Jasmine Ikhreishi
LVL 109

Expert Comment

by:Ray Paseur
ID: 41774785
This is where we could use the SSCCE.  We do not really need to see code that isn't working.  All we need to see are the data sets.  If you can show us the test data with the contents of before  and after  files, we can show you the code to bridge the gap.
LVL 55

Expert Comment

by:Julian Hansen
ID: 41774880
Can you give some examples of the input and output. What does a sample file look like, what is its name what output does the script produce currently.
What does the file look like for the case you want to cater for - what does the output from that look like?
LVL 109

Expert Comment

by:Ray Paseur
ID: 41774889
Now that I've had a look at your other question and its file (at least that's what I think I'm looking at) I wonder if it might not be a good idea to step back from this implementation and take a higher-level look at the problem.  I have been in software development for years and I have never seen a design pattern that looks quite like this.  Maybe if you tell us about the task in plain, non-technical language, we can suggest a more mainstream solution.  Questions that come to mind are: Where does this data come from?  How does it get used?  If we can understand that we can probably give you a much simpler design!
Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

LVL 43

Expert Comment

by:zephyr_hex (Megan)
ID: 41774908
This looks like EDI data.  REF and N1 are both EDI segment identifiers.  And 5010 is an EDI format specification.

So my guess is that the OP is attempting to parse EDI transactions.

There's a pretty decent python based parser I've used for this called Bots, although I'm not sure if the OP is looking for an open source solution that's not based in PHP.

Parsing EDI data is quite tricky because of the various permutations that can occur.  Accounting for more than on REF segment is just the tip of the iceberg.

Author Comment

by:Jasmine Ikhreishi
ID: 41776158
I re-asked it in a better way in another question, please take a look!

Accepted Solution

Jasmine Ikhreishi earned 0 total points
ID: 41776173
LVL 109

Expert Comment

by:Ray Paseur
ID: 41776215
I'll take a look at your other question, but in the future, you can still use a single question -- just provide the additional information that we ask for, and we can continue the discussion "under one roof."

Author Closing Comment

by:Jasmine Ikhreishi
ID: 41783533
I re-asked it in a better manner

Featured Post

Free Tool: ZipGrep

ZipGrep is a utility that can list and search zip (.war, .ear, .jar, etc) archives for text patterns, without the need to extract the archive's contents.

One of a set of tools we're offering as a way to say thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Popularity Can Be Measured Sometimes we deal with questions of popularity, and we need a way to collect opinions from our clients.  This article shows a simple teaching example of how we might elect a favorite color by letting our clients vote for …
Introduction HTML checkboxes provide the perfect way for a web developer to receive client input when the client's options might be none, one or many.  But the PHP code for processing the checkboxes can be confusing at first.  What if a checkbox is…
Learn how to match and substitute tagged data using PHP regular expressions. Demonstrated on Windows 7, but also applies to other operating systems. Demonstrated technique applies to PHP (all versions) and Firefox, but very similar techniques will w…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…

861 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question