Solved

php sort for dups in two files

Posted on 2003-11-02
5
246 Views
Last Modified: 2007-12-19
OK here's the problem I have two files block.txt and edit.txt
they contain domain entries on per line (.aol.com)
I would like to merge the twp file together but not have any dups. Both files have different amounts a entries (2400 and 420). Here's what I have so far:

set_time_limit(50);

$MAX=99999;
$free=0;
$empty="empty\n";

$array = @File("$type");
$total = count($array);

$free=$total;

// Time to open the entry file

$filename="Edit2.txt";
$fp=fopen("$filename","r")or die("Can not open file");

// Read the file into an array

if ($fp) {  $array = explode("\n", fread($fp, filesize($filename))); }

// Close the file

fclose($fp);

// Echo the second entry in the file

echo "sencond entery in new file: $array[1]<br>\n";


// Count the number of entries in the file

$entry=count($array);


// Echo the number of entries.

echo "entery count total: $entry<br>\n";



// Open the blocking file

$fp=fopen("block","r")or die("can not open the block file");
if($fp) { $arrayb = explode("\n", fread($fp, filesize(block))); }
fclose($fp);

echo "first entery in block file: $arrayb[0]<br>\n";
$blocking=count($arrayb);
echo "total blocking: $blocking\n<br>";


for($k = 0 ; $k < $blocking ; $k++) {
      //$d=$k;
      //if(trim($array[$k] != $arrayb[$d])) {
      for($d = 0; $d < $entry ; $d++) {      
                  if(trim($array[$k] != $arrayb[$d])) {
            
                        //$outputfile .= "$array[$k]\n";
                  $t=1;
                        echo "file ok $k add:$array[$k] and block:$arrayb[$d]<br>";
                  }
            if($t == 1){
                  $outputfile .= "$array[$k]\n";
                  $t=0;
            }
            else {$t=0;}
            
            
            
      }
      //else { $d++; }
}

echo $outputfile;

I know it's no right and right now I only need it to print a web page I can change it to files later.

TIA
0
Comment
Question by:jscart
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
5 Comments
 
LVL 33

Expert Comment

by:snoyes_jw
ID: 9667226
You could read everything from both files into an array, then use array_unique() to delete all the duplicates, and write the results back out to a file.
0
 
LVL 11

Accepted Solution

by:
shmert earned 125 total points
ID: 9667592
Just an implementation of snoyes' post:

$array = array_unique(array_merge(file('block.txt'), file('edit.txt')));

// Note: the line breaks will still be there for each element, so you may want to iterate through and trim() each entry.
// This should be a very speedy way to do it, though.

foreach($array AS $key=>$value) {
    $array[$key] = rtrim($value);
}
0
 
LVL 3

Expert Comment

by:wide_awake
ID: 9669047
If you're running in a unix environment, you can use the "sort" command to do it for you.

$array = explode("\n", `cat block.txt > tmp.txt; cat edit.txt >> tmp.txt; sort -u tmp.txt`);

-mark

0
 
LVL 3

Expert Comment

by:wide_awake
ID: 9669056
even easier:

$array = explode("\n", `sort -u block.txt edit.txt`);
0
 
LVL 1

Author Comment

by:jscart
ID: 9687215
THanks for the suggestions I'll start trying them and then score.
0

Featured Post

On Demand Webinar - Networking for the Cloud Era

This webinar discusses:
-Common barriers companies experience when moving to the cloud
-How SD-WAN changes the way we look at networks
-Best practices customers should employ moving forward with cloud migration
-What happens behind the scenes of SteelConnect’s one-click button

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Things That Drive Us Nuts Have you noticed the use of the reCaptcha feature at EE and other web sites?  It wants you to read and retype something that looks like this.Insanity!  It's not EE's fault - that's just the way reCaptcha works.  But it is …
Many old projects have bad code, but the budget doesn't exist to rewrite the codebase. You can update this code to be safer by introducing contemporary input validation, sanitation, and safer database queries.
The viewer will learn how to dynamically set the form action using jQuery.
The viewer will learn how to look for a specific file type in a local or remote server directory using PHP.

749 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question