Split one file and make them into two

Posted on 2016-08-29
Last Modified: 2016-09-04
//expression to be found in file name
$find = '.5010.';

//directory name
//we will store renamed files here
$dirname = '5010';
    mkdir($dirname, 0777);

//read all files from a directory
//skip directories
$directory_with_files = './';
$dh  = opendir($directory_with_files);
$files = array();
while (false !== ($filename = readdir($dh)))
    if(in_array($filename, array('.', '..')) || is_dir($filename))

    $files[] = $filename;

//iterate collected files
foreach($files as $file)
    //check if file name is matching $find
    if(stripos($file, $find) !== false)

        //open file
        $handle = fopen($file, "r");
        if ($handle)
            //read file, line by line
            while (($line = fgets($handle)) !== false)

                //find REF line
                $refid = 'REF*2U*'; 
                if(stripos($line, $refid) !== false) 
                    //glue refernce numbers 
                    //check if reference number is not empty 

                    $refnumber = str_replace(array($refid, '~'), array('', ''), $line);
                if($refnumber != '') 
                $refnumber = '_'. $refnumber .'_'; 

                $filerenamed = str_replace($find, $refnumber, $file); 
                    copy($file, $dirname . '/' . $filerenamed); 

                echo $refnumber . "\n"; 

            //close file

Open in new window

I have the code above, to read my files that contain ".5010." and replace it with the characters after REF*2U*. However in some files there are more than one REF*2U*, is there a code that would split them through a line called "N1*PR*" and output 2 files each with its own REF*2U* character?
Question by:Jasmine Ikhreishi
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
LVL 110

Expert Comment

by:Ray Paseur
ID: 41774785
This is where we could use the SSCCE.  We do not really need to see code that isn't working.  All we need to see are the data sets.  If you can show us the test data with the contents of before  and after  files, we can show you the code to bridge the gap.
LVL 58

Expert Comment

by:Julian Hansen
ID: 41774880
Can you give some examples of the input and output. What does a sample file look like, what is its name what output does the script produce currently.
What does the file look like for the case you want to cater for - what does the output from that look like?
LVL 110

Expert Comment

by:Ray Paseur
ID: 41774889
Now that I've had a look at your other question and its file (at least that's what I think I'm looking at) I wonder if it might not be a good idea to step back from this implementation and take a higher-level look at the problem.  I have been in software development for years and I have never seen a design pattern that looks quite like this.  Maybe if you tell us about the task in plain, non-technical language, we can suggest a more mainstream solution.  Questions that come to mind are: Where does this data come from?  How does it get used?  If we can understand that we can probably give you a much simpler design!
Creating Instructional Tutorials  

For Any Use & On Any Platform

Contextual Guidance at the moment of need helps your employees/users adopt software o& achieve even the most complex tasks instantly. Boost knowledge retention, software adoption & employee engagement with easy solution.

LVL 43

Expert Comment

by:zephyr_hex (Megan)
ID: 41774908
This looks like EDI data.  REF and N1 are both EDI segment identifiers.  And 5010 is an EDI format specification.

So my guess is that the OP is attempting to parse EDI transactions.

There's a pretty decent python based parser I've used for this called Bots, although I'm not sure if the OP is looking for an open source solution that's not based in PHP.

Parsing EDI data is quite tricky because of the various permutations that can occur.  Accounting for more than on REF segment is just the tip of the iceberg.

Author Comment

by:Jasmine Ikhreishi
ID: 41776158
I re-asked it in a better way in another question, please take a look!

Accepted Solution

Jasmine Ikhreishi earned 0 total points
ID: 41776173
LVL 110

Expert Comment

by:Ray Paseur
ID: 41776215
I'll take a look at your other question, but in the future, you can still use a single question -- just provide the additional information that we ask for, and we can continue the discussion "under one roof."

Author Closing Comment

by:Jasmine Ikhreishi
ID: 41783533
I re-asked it in a better manner

Featured Post

Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article discusses four methods for overlaying images in a container on a web page
3 proven steps to speed up Magento powered sites. The article focus is on optimizing time to first byte (TTFB), full page caching and configuring server for optimal performance.
Learn how to match and substitute tagged data using PHP regular expressions. Demonstrated on Windows 7, but also applies to other operating systems. Demonstrated technique applies to PHP (all versions) and Firefox, but very similar techniques will w…
The viewer will learn how to create and use a small PHP class to apply a watermark to an image. This video shows the viewer the setup for the PHP watermark as well as important coding language. Continue to Part 2 to learn the core code used in creat…

624 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question