Solved

File upload in cgi

Posted on 2004-04-09
11
305 Views
Last Modified: 2012-05-04
Hi,
I have a web page that uploads a file
<INPUT TYPE="file" NAME="uploaded_file" VALUE="starting value" SIZE=50 MAXLENGTH=80 >

Now, in my cgi, I am trying to retrieve the contents of the file like this

$filename = $query->param('uploaded_file');
printf $output "Files contents are ";
while (<$filename>) { printf $output "$_"; }

For text files, the contents do get printed. But for word docs, I am expecting an output like

#######################
Mime-Version: 1.0
Content-Type: multipart/mixed;
        boundary="=====================_108983900==_"

--=====================_108983900==_
Content-Type: text/plain; charset="us-ascii"; format=flowed


--=====================_108983900==_
Content-Type: application/msword; name="database.doc";
 x-mac-type="42494E41"; x-mac-creator="4D535744"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="database.doc"

0M8R4KGxGuEAAAAAAAAAAAAAAAAAAAAAPgADAP7/CQAGAAAAAAAAAAAAAAABAAAAXQAAAAAAAAAA
EAAAPwAAAAEAAAD+////AAAAAF4AAAD/////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////

#######################

But all I get is
###############
 
 
 <TY&¨ÿÿÿÿÿÿÿÿÿ
 <
 <TY&¨ÿÿÿÿÿÿÿÿÿ
 
 <
 &
 &
 
 <TY&¨ÿÿÿÿÿÿÿÿÿ
###############

Can you tell me what is wrong and how I can read through a word doc and get the binary information?
Thanks.
0
Comment
Question by:makam_75
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 6
  • 4
11 Comments
 
LVL 10

Expert Comment

by:rj2
ID: 10796024
Try to add statements below before you open the file.
binmode(STDOUT);
binmode(FILE);

The boundary and content-type stuff will not get printed when you print the content of the file.
That is part of the http headers, not part of the Word file.
0
 

Author Comment

by:makam_75
ID: 10806678
Tried like this, no luck yet...

$filename = $query->param('uploaded_file');
binmode(STDOUT);
binmode($filename);
printf $output "Files contents are ";
while (<$filename>) { printf $output "$_"; }
printf $output "----------------------------------";

Also, I notice that the line ---- is not being printed. (The program execution seem to stop at the while loop above.) If anyone can suggest any pointers, I really appreciate it.
Thankyou.
0
 
LVL 10

Expert Comment

by:rj2
ID: 10807260
Try like this.

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
      print $buffer;
}
      

But why are you trying to print the contents of a Word file, a binary file, to the browser?
0
Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 

Author Comment

by:makam_75
ID: 10807642
Thank you, but it does not work either.
Here is my code

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
printf $output "Contents are $buffer";
}
printf $output "----------------------------------";

And the output I get when I upload a word doc is
Contents are Contents are Contents are Contents are ----------------------------------

I am creating a web page to accept candidates resumes. Our recuriting application can parse word docs if I pass the binary contents of the file.  Thanks for your help.
0
 
LVL 10

Accepted Solution

by:
rj2 earned 120 total points
ID: 10807843
You have not the same code as I posted.
Use print for this, not printf.
And what is $output?
Try to remove it so code becomes as below

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
     print $buffer;
}
0
 

Author Comment

by:makam_75
ID: 10808215
But I want to print to a file, not on the web page.
I had $output = "data.txt";

Anyway I did try
print "Contents are $buffer";
I got "Contents are Contents are Contents are Contents are " printed on my web page.
Thanks.
0
 

Author Comment

by:makam_75
ID: 10808622
BTW, my form information is

$html_method = "POST";
$action="submit3.cgi";
$encoding="multipart/form-data";
print $query->start_multipart_form($method,$action,$encoding);

0
 
LVL 10

Assisted Solution

by:rj2
rj2 earned 120 total points
ID: 10808887
Ok, try like this then.
Please post your entire script if you still have problems.

my( $buffer,$bytesread);
my $filename = $query->param('uploaded_file');
if(open(OUTFILE, ">$output")) {
      binmode(OUTPUT);
      while ($bytesread = read($filename, $buffer, 16384)) {
           print OUTFILE $buffer;
      }
      close(OUTFILE);
}
0
 

Author Comment

by:makam_75
ID: 10826682
Thank you, I was out sick yesterday, so could not respond to your message.

On doing more research, what I found is:
My $output was mail, meaning I was making the server email the data. like this.
   $output = "MAIL";
   open (MAIL, "| /usr/lib/sendmail -oi -t");
   print $output $buffer;

When I changed the $output to a.txt, it works just fine. a.txt does have all the binary info from the word doc in the way i expected.  like this.
open (OUTFILE,">a.txt");
print OUTFILE $buffer;

So looks like when the info get transferred to sendmail, it somehow get changed?
(BTW I am viewing the email through a vi editor, so it is not an email client issue. )

Our recruiting application expects email with the binary content of word doc. so looks like i am stuck.
0
 

Author Comment

by:makam_75
ID: 10828394
So rj2, you have given the solution for my intial problem, so all the points belong to you.
Can someone tell me how to Accept rj2's answer? I cant find that button anymore .. :(
0

Featured Post

[Webinar] Code, Load, and Grow

Managing multiple websites, servers, applications, and security on a daily basis? Join us for a webinar on May 25th to learn how to simplify administration and management of virtual hosts for IT admins, create a secure environment, and deploy code more effectively and frequently.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

I've just discovered very important differences between Windows an Unix formats in Perl,at least 5.xx.. MOST IMPORTANT: Use Unix file format while saving Your script. otherwise it will have ^M s or smth likely weird in the EOL, Then DO NOT use m…
I have been pestered over the years to produce and distribute regular data extracts, and often the request have explicitly requested the data be emailed as an Excel attachement; specifically Excel, as it appears: CSV files confuse (no Red or Green h…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…

740 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question