Solved

File upload in cgi

Posted on 2004-04-09
11
300 Views
Last Modified: 2012-05-04
Hi,
I have a web page that uploads a file
<INPUT TYPE="file" NAME="uploaded_file" VALUE="starting value" SIZE=50 MAXLENGTH=80 >

Now, in my cgi, I am trying to retrieve the contents of the file like this

$filename = $query->param('uploaded_file');
printf $output "Files contents are ";
while (<$filename>) { printf $output "$_"; }

For text files, the contents do get printed. But for word docs, I am expecting an output like

#######################
Mime-Version: 1.0
Content-Type: multipart/mixed;
        boundary="=====================_108983900==_"

--=====================_108983900==_
Content-Type: text/plain; charset="us-ascii"; format=flowed


--=====================_108983900==_
Content-Type: application/msword; name="database.doc";
 x-mac-type="42494E41"; x-mac-creator="4D535744"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="database.doc"

0M8R4KGxGuEAAAAAAAAAAAAAAAAAAAAAPgADAP7/CQAGAAAAAAAAAAAAAAABAAAAXQAAAAAAAAAA
EAAAPwAAAAEAAAD+////AAAAAF4AAAD/////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////

#######################

But all I get is
###############
 
 
 <TY&¨ÿÿÿÿÿÿÿÿÿ
 <
 <TY&¨ÿÿÿÿÿÿÿÿÿ
 
 <
 &
 &
 
 <TY&¨ÿÿÿÿÿÿÿÿÿ
###############

Can you tell me what is wrong and how I can read through a word doc and get the binary information?
Thanks.
0
Comment
Question by:makam_75
  • 6
  • 4
11 Comments
 
LVL 10

Expert Comment

by:rj2
ID: 10796024
Try to add statements below before you open the file.
binmode(STDOUT);
binmode(FILE);

The boundary and content-type stuff will not get printed when you print the content of the file.
That is part of the http headers, not part of the Word file.
0
 

Author Comment

by:makam_75
ID: 10806678
Tried like this, no luck yet...

$filename = $query->param('uploaded_file');
binmode(STDOUT);
binmode($filename);
printf $output "Files contents are ";
while (<$filename>) { printf $output "$_"; }
printf $output "----------------------------------";

Also, I notice that the line ---- is not being printed. (The program execution seem to stop at the while loop above.) If anyone can suggest any pointers, I really appreciate it.
Thankyou.
0
 
LVL 10

Expert Comment

by:rj2
ID: 10807260
Try like this.

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
      print $buffer;
}
      

But why are you trying to print the contents of a Word file, a binary file, to the browser?
0
 

Author Comment

by:makam_75
ID: 10807642
Thank you, but it does not work either.
Here is my code

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
printf $output "Contents are $buffer";
}
printf $output "----------------------------------";

And the output I get when I upload a word doc is
Contents are Contents are Contents are Contents are ----------------------------------

I am creating a web page to accept candidates resumes. Our recuriting application can parse word docs if I pass the binary contents of the file.  Thanks for your help.
0
 
LVL 10

Accepted Solution

by:
rj2 earned 120 total points
ID: 10807843
You have not the same code as I posted.
Use print for this, not printf.
And what is $output?
Try to remove it so code becomes as below

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
     print $buffer;
}
0
Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

 

Author Comment

by:makam_75
ID: 10808215
But I want to print to a file, not on the web page.
I had $output = "data.txt";

Anyway I did try
print "Contents are $buffer";
I got "Contents are Contents are Contents are Contents are " printed on my web page.
Thanks.
0
 

Author Comment

by:makam_75
ID: 10808622
BTW, my form information is

$html_method = "POST";
$action="submit3.cgi";
$encoding="multipart/form-data";
print $query->start_multipart_form($method,$action,$encoding);

0
 
LVL 10

Assisted Solution

by:rj2
rj2 earned 120 total points
ID: 10808887
Ok, try like this then.
Please post your entire script if you still have problems.

my( $buffer,$bytesread);
my $filename = $query->param('uploaded_file');
if(open(OUTFILE, ">$output")) {
      binmode(OUTPUT);
      while ($bytesread = read($filename, $buffer, 16384)) {
           print OUTFILE $buffer;
      }
      close(OUTFILE);
}
0
 

Author Comment

by:makam_75
ID: 10826682
Thank you, I was out sick yesterday, so could not respond to your message.

On doing more research, what I found is:
My $output was mail, meaning I was making the server email the data. like this.
   $output = "MAIL";
   open (MAIL, "| /usr/lib/sendmail -oi -t");
   print $output $buffer;

When I changed the $output to a.txt, it works just fine. a.txt does have all the binary info from the word doc in the way i expected.  like this.
open (OUTFILE,">a.txt");
print OUTFILE $buffer;

So looks like when the info get transferred to sendmail, it somehow get changed?
(BTW I am viewing the email through a vi editor, so it is not an email client issue. )

Our recruiting application expects email with the binary content of word doc. so looks like i am stuck.
0
 

Author Comment

by:makam_75
ID: 10828394
So rj2, you have given the solution for my intial problem, so all the points belong to you.
Can someone tell me how to Accept rj2's answer? I cant find that button anymore .. :(
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Email validation in proper way is  very important validation required in any web pages. This code is self explainable except that Regular Expression which I used for pattern matching. I originally published as a thread on my website : http://www…
Checking the Alert Log in AWS RDS Oracle can be a pain through their user interface.  I made a script to download the Alert Log, look for errors, and email me the trace files.  In this article I'll describe what I did and share my script.
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…
Both in life and business – not all partnerships are created equal. As the demand for cloud services increases, so do the number of self-proclaimed cloud partners. Asking the right questions up front in the partnership, will enable both parties …

911 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

23 Experts available now in Live!

Get 1:1 Help Now