Solved

php sort for dups in two files

Posted on 2003-11-02
Last Modified: 2007-12-19
OK, here's the problem: I have two files, block.txt and edit.txt.
They contain domain entries, one per line (e.g. .aol.com).
I would like to merge the two files together without any dups. The files have different numbers of entries (2400 and 420). Here's what I have so far:

set_time_limit(50);

$MAX=99999;
$free=0;
$empty="empty\n";

$array = @File("$type");
$total = count($array);

$free=$total;

// Time to open the entry file

$filename="Edit2.txt";
$fp=fopen("$filename","r")or die("Can not open file");

// Read the file into an array

if ($fp) {  $array = explode("\n", fread($fp, filesize($filename))); }

// Close the file

fclose($fp);

// Echo the second entry in the file

echo "second entry in new file: $array[1]<br>\n";


// Count the number of entries in the file

$entry=count($array);


// Echo the number of entries.

echo "entry count total: $entry<br>\n";



// Open the blocking file

$fp=fopen("block","r")or die("can not open the block file");
if($fp) { $arrayb = explode("\n", fread($fp, filesize("block"))); }
fclose($fp);

echo "first entry in block file: $arrayb[0]<br>\n";
$blocking=count($arrayb);
echo "total blocking: $blocking\n<br>";


for($k = 0 ; $k < $blocking ; $k++) {
      //$d=$k;
      //if(trim($array[$k] != $arrayb[$d])) {
      for($d = 0; $d < $entry ; $d++) {      
                  if(trim($array[$k] != $arrayb[$d])) {
            
                        //$outputfile .= "$array[$k]\n";
                  $t=1;
                        echo "file ok $k add:$array[$k] and block:$arrayb[$d]<br>";
                  }
            if($t == 1){
                  $outputfile .= "$array[$k]\n";
                  $t=0;
            }
            else {$t=0;}
            
            
            
      }
      //else { $d++; }
}

echo $outputfile;

I know it's not right. For now I only need it to print to a web page; I can change it to write files later.

TIA
Question by:jscart
5 Comments
 
LVL 33

Expert Comment

by:snoyes_jw
ID: 9667226
You could read everything from both files into an array, then use array_unique() to delete all the duplicates, and write the results back out to a file.
 
LVL 11

Accepted Solution

by:
shmert earned 125 total points
ID: 9667592
Just an implementation of snoyes' post:

$array = array_unique(array_merge(file('block.txt'), file('edit.txt')));

// Note: the line breaks will still be there for each element, so you may want to iterate through and trim() each entry.
// This should be a very speedy way to do it, though.

foreach($array AS $key=>$value) {
    $array[$key] = rtrim($value);
}
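
One caveat with deduplicating before trimming: array_unique() compares the raw lines, so an entry whose line ending differs (for example the last line of a file with no trailing newline, or a file saved with Windows line endings) would survive as a duplicate. A minimal sketch that trims first, assuming one entry per line as described in the question; the helper name merge_unique_lines is made up for illustration:

```php
<?php
// Hypothetical helper: merge several line-based files, dropping duplicates.
function merge_unique_lines(array $paths): array
{
    $lines = [];
    foreach ($paths as $path) {
        // FILE_IGNORE_NEW_LINES strips the trailing newline from each element.
        $contents = file($path, FILE_IGNORE_NEW_LINES);
        if ($contents === false) {
            throw new RuntimeException("Cannot read $path");
        }
        $lines = array_merge($lines, $contents);
    }
    // Trim before comparing so a stray "\r" or space does not hide a duplicate.
    $lines = array_map('trim', $lines);
    // Drop blank lines.
    $lines = array_filter($lines, function ($line) {
        return $line !== '';
    });
    // Remove duplicates (first occurrence wins) and reindex from 0.
    return array_values(array_unique($lines));
}
```

Calling merge_unique_lines(['block.txt', 'edit.txt']) should return the merged list in file order, and implode("\n", ...) plus file_put_contents() could then write it back out. The output file name would be your choice.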
 
LVL 3

Expert Comment

by:wide_awake
ID: 9669047
If you're running in a unix environment, you can use the "sort" command to do it for you.

$array = explode("\n", `cat block.txt > tmp.txt; cat edit.txt >> tmp.txt; sort -u tmp.txt`);

-mark

 
LVL 3

Expert Comment

by:wide_awake
ID: 9669056
even easier:

$array = explode("\n", `sort -u block.txt edit.txt`);
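
One thing to watch with this approach: the output of sort ends with a newline, so explode() leaves an empty string as the last element. A sketch that drops it and quotes the file names for the shell, assuming a Unix-like host where shell_exec() is enabled and sort is on the PATH; the function name sort_unique_lines is made up for illustration:

```php
<?php
// Hypothetical wrapper around `sort -u`; Unix-like systems only.
function sort_unique_lines(array $paths): array
{
    // Quote each file name so spaces or shell metacharacters are safe.
    $args = implode(' ', array_map('escapeshellarg', $paths));
    // "--" marks the end of options, so a name starting with "-" is still a file.
    $output = shell_exec("sort -u -- $args");
    if (!is_string($output)) {
        // shell_exec() does not return a string on failure or on empty output.
        throw new RuntimeException('sort failed, printed nothing, or shell_exec() is disabled');
    }
    // Remove the final newline so explode() yields no empty last element.
    $output = rtrim($output, "\n");
    return $output === '' ? [] : explode("\n", $output);
}
```

Unlike array_unique(), this returns the entries sorted, and the exact order depends on the locale the command runs under. Lines are compared as-is, so entries that differ only in trailing whitespace or a "\r" still count as different.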
 
LVL 1

Author Comment

by:jscart
ID: 9687215
Thanks for the suggestions. I'll start trying them and then score.
