Split one file and make them into two

Posted on 2016-08-29
Last Modified: 2016-09-04
//expression to be found in file name
$find = '.5010.';

//directory name
//we will store renamed files here
$dirname = '5010';
    mkdir($dirname, 0777);

//read all files from a directory
//skip directories
$directory_with_files = './';
$dh  = opendir($directory_with_files);
$files = array();
while (false !== ($filename = readdir($dh)))
    if(in_array($filename, array('.', '..')) || is_dir($filename))

    $files[] = $filename;

//iterate collected files
foreach($files as $file)
    //check if file name is matching $find
    if(stripos($file, $find) !== false)

        //open file
        $handle = fopen($file, "r");
        if ($handle)
            //read file, line by line
            while (($line = fgets($handle)) !== false)

                //find REF line
                $refid = 'REF*2U*'; 
                if(stripos($line, $refid) !== false) 
                    //glue refernce numbers 
                    //check if reference number is not empty 

                    $refnumber = str_replace(array($refid, '~'), array('', ''), $line);
                if($refnumber != '') 
                $refnumber = '_'. $refnumber .'_'; 

                $filerenamed = str_replace($find, $refnumber, $file); 
                    copy($file, $dirname . '/' . $filerenamed); 

                echo $refnumber . "\n"; 

            //close file

Open in new window

I have the code above, to read my files that contain ".5010." and replace it with the characters after REF*2U*. However in some files there are more than one REF*2U*, is there a code that would split them through a line called "N1*PR*" and output 2 files each with its own REF*2U* character?
Question by:Jasmine Ikhreishi
LVL 108

Expert Comment

by:Ray Paseur
ID: 41774785
This is where we could use the SSCCE.  We do not really need to see code that isn't working.  All we need to see are the data sets.  If you can show us the test data with the contents of before  and after  files, we can show you the code to bridge the gap.
LVL 51

Expert Comment

by:Julian Hansen
ID: 41774880
Can you give some examples of the input and output. What does a sample file look like, what is its name what output does the script produce currently.
What does the file look like for the case you want to cater for - what does the output from that look like?
LVL 108

Expert Comment

by:Ray Paseur
ID: 41774889
Now that I've had a look at your other question and its file (at least that's what I think I'm looking at) I wonder if it might not be a good idea to step back from this implementation and take a higher-level look at the problem.  I have been in software development for years and I have never seen a design pattern that looks quite like this.  Maybe if you tell us about the task in plain, non-technical language, we can suggest a more mainstream solution.  Questions that come to mind are: Where does this data come from?  How does it get used?  If we can understand that we can probably give you a much simpler design!
LVL 42

Expert Comment

ID: 41774908
This looks like EDI data.  REF and N1 are both EDI segment identifiers.  And 5010 is an EDI format specification.

So my guess is that the OP is attempting to parse EDI transactions.

There's a pretty decent python based parser I've used for this called Bots, although I'm not sure if the OP is looking for an open source solution that's not based in PHP.

Parsing EDI data is quite tricky because of the various permutations that can occur.  Accounting for more than on REF segment is just the tip of the iceberg.
Better Security Awareness With Threat Intelligence

See how one of the leading financial services organizations uses Recorded Future as part of a holistic threat intelligence program to promote security awareness and proactively and efficiently identify threats.


Author Comment

by:Jasmine Ikhreishi
ID: 41776158
I re-asked it in a better way in another question, please take a look!

Accepted Solution

Jasmine Ikhreishi earned 0 total points
ID: 41776173
LVL 108

Expert Comment

by:Ray Paseur
ID: 41776215
I'll take a look at your other question, but in the future, you can still use a single question -- just provide the additional information that we ask for, and we can continue the discussion "under one roof."

Author Closing Comment

by:Jasmine Ikhreishi
ID: 41783533
I re-asked it in a better manner

Featured Post

How your wiki can always stay up-to-date

Quip doubles as a “living” wiki and a project management tool that evolves with your organization. As you finish projects in Quip, the work remains, easily accessible to all team members, new and old.
- Increase transparency
- Onboard new hires faster
- Access from mobile/offline

Join & Write a Comment

Suggested Solutions

This article will explain how to display the first page of your Microsoft Word documents (e.g. .doc, .docx, etc...) as images in a web page programatically. I have scoured the web on a way to do this unsuccessfully. The goal is to produce something …
I imagine that there are some, like me, who require a way of getting currency exchange rates for implementation in web project from time to time, so I thought I would share a solution that I have developed for this purpose. It turns out that Yaho…
The viewer will learn how to create and use a small PHP class to apply a watermark to an image. This video shows the viewer the setup for the PHP watermark as well as important coding language. Continue to Part 2 to learn the core code used in creat…
This tutorial will teach you the core code needed to finalize the addition of a watermark to your image. The viewer will use a small PHP class to learn and create a watermark.

759 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

19 Experts available now in Live!

Get 1:1 Help Now