Solved

File upload in cgi

Posted on 2004-04-09
11
299 Views
Last Modified: 2012-05-04
Hi,
I have a web page that uploads a file
<INPUT TYPE="file" NAME="uploaded_file" VALUE="starting value" SIZE=50 MAXLENGTH=80 >

Now, in my cgi, I am trying to retrieve the contents of the file like this

$filename = $query->param('uploaded_file');
printf $output "Files contents are ";
while (<$filename>) { printf $output "$_"; }

For text files, the contents do get printed. But for word docs, I am expecting an output like

#######################
Mime-Version: 1.0
Content-Type: multipart/mixed;
        boundary="=====================_108983900==_"

--=====================_108983900==_
Content-Type: text/plain; charset="us-ascii"; format=flowed


--=====================_108983900==_
Content-Type: application/msword; name="database.doc";
 x-mac-type="42494E41"; x-mac-creator="4D535744"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="database.doc"

0M8R4KGxGuEAAAAAAAAAAAAAAAAAAAAAPgADAP7/CQAGAAAAAAAAAAAAAAABAAAAXQAAAAAAAAAA
EAAAPwAAAAEAAAD+////AAAAAF4AAAD/////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////
////////////////////////////////////////////////////////////////////////////

#######################

But all I get is
###############
 
 
 <TY&¨ÿÿÿÿÿÿÿÿÿ
 <
 <TY&¨ÿÿÿÿÿÿÿÿÿ
 
 <
 &
 &
 
 <TY&¨ÿÿÿÿÿÿÿÿÿ
###############

Can you tell me what is wrong and how I can read through a word doc and get the binary information?
Thanks.
0
Comment
Question by:makam_75
  • 6
  • 4
11 Comments
 
LVL 10

Expert Comment

by:rj2
ID: 10796024
Try to add statements below before you open the file.
binmode(STDOUT);
binmode(FILE);

The boundary and content-type stuff will not get printed when you print the content of the file.
That is part of the http headers, not part of the Word file.
0
 

Author Comment

by:makam_75
ID: 10806678
Tried like this, no luck yet...

$filename = $query->param('uploaded_file');
binmode(STDOUT);
binmode($filename);
printf $output "Files contents are ";
while (<$filename>) { printf $output "$_"; }
printf $output "----------------------------------";

Also, I notice that the line ---- is not being printed. (The program execution seem to stop at the while loop above.) If anyone can suggest any pointers, I really appreciate it.
Thankyou.
0
 
LVL 10

Expert Comment

by:rj2
ID: 10807260
Try like this.

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
      print $buffer;
}
      

But why are you trying to print the contents of a Word file, a binary file, to the browser?
0
 

Author Comment

by:makam_75
ID: 10807642
Thank you, but it does not work either.
Here is my code

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
printf $output "Contents are $buffer";
}
printf $output "----------------------------------";

And the output I get when I upload a word doc is
Contents are Contents are Contents are Contents are ----------------------------------

I am creating a web page to accept candidates resumes. Our recuriting application can parse word docs if I pass the binary contents of the file.  Thanks for your help.
0
 
LVL 10

Accepted Solution

by:
rj2 earned 120 total points
ID: 10807843
You have not the same code as I posted.
Use print for this, not printf.
And what is $output?
Try to remove it so code becomes as below

$filename = $query->param('uploaded_file');
binmode(STDOUT);
while (my $bytesread = read($filename, my $buffer, 16384)) {
     print $buffer;
}
0
How to improve team productivity

Quip adds documents, spreadsheets, and tasklists to your Slack experience
- Elevate ideas to Quip docs
- Share Quip docs in Slack
- Get notified of changes to your docs
- Available on iOS/Android/Desktop/Web
- Online/Offline

 

Author Comment

by:makam_75
ID: 10808215
But I want to print to a file, not on the web page.
I had $output = "data.txt";

Anyway I did try
print "Contents are $buffer";
I got "Contents are Contents are Contents are Contents are " printed on my web page.
Thanks.
0
 

Author Comment

by:makam_75
ID: 10808622
BTW, my form information is

$html_method = "POST";
$action="submit3.cgi";
$encoding="multipart/form-data";
print $query->start_multipart_form($method,$action,$encoding);

0
 
LVL 10

Assisted Solution

by:rj2
rj2 earned 120 total points
ID: 10808887
Ok, try like this then.
Please post your entire script if you still have problems.

my( $buffer,$bytesread);
my $filename = $query->param('uploaded_file');
if(open(OUTFILE, ">$output")) {
      binmode(OUTPUT);
      while ($bytesread = read($filename, $buffer, 16384)) {
           print OUTFILE $buffer;
      }
      close(OUTFILE);
}
0
 

Author Comment

by:makam_75
ID: 10826682
Thank you, I was out sick yesterday, so could not respond to your message.

On doing more research, what I found is:
My $output was mail, meaning I was making the server email the data. like this.
   $output = "MAIL";
   open (MAIL, "| /usr/lib/sendmail -oi -t");
   print $output $buffer;

When I changed the $output to a.txt, it works just fine. a.txt does have all the binary info from the word doc in the way i expected.  like this.
open (OUTFILE,">a.txt");
print OUTFILE $buffer;

So looks like when the info get transferred to sendmail, it somehow get changed?
(BTW I am viewing the email through a vi editor, so it is not an email client issue. )

Our recruiting application expects email with the binary content of word doc. so looks like i am stuck.
0
 

Author Comment

by:makam_75
ID: 10828394
So rj2, you have given the solution for my intial problem, so all the points belong to you.
Can someone tell me how to Accept rj2's answer? I cant find that button anymore .. :(
0

Featured Post

Top 6 Sources for Identifying Threat Actor TTPs

Understanding your enemy is essential. These six sources will help you identify the most popular threat actor tactics, techniques, and procedures (TTPs).

Join & Write a Comment

I have been pestered over the years to produce and distribute regular data extracts, and often the request have explicitly requested the data be emailed as an Excel attachement; specifically Excel, as it appears: CSV files confuse (no Red or Green h…
In the distant past (last year) I hacked together a little toy that would allow a couple of Manager types to query, preview, and extract data from a number of MongoDB instances, to their tool of choice: Excel (http://dilbert.com/strips/comic/2007-08…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…
Illustrator's Shape Builder tool will let you combine shapes visually and interactively. This video shows the Mac version, but the tool works the same way in Windows. To follow along with this video, you can draw your own shapes or download the file…

743 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

12 Experts available now in Live!

Get 1:1 Help Now