Solved

Decoding a Base64 barcode in Java?

Posted on 2004-03-23
1
708 Views
Last Modified: 2008-02-01
Our vendor is supposedly going to send us the contents of a barcode in Base64.  We had before agreed upon Hexadecimal, but alas, things have changed.  These are 2-D barcodes to be precise, PDF417 and Data Matrix.  I am unfortunately unsure of exactly how Base64 works and how I coud go about determining the conversion of Base64 to Hexadecimal so that I can use our existing hexadecimal conversion code to parse the fields out of the barcode and store the human-readable fields in our database.

Has anyone used a Base 64 conversion utility or have links to sites where I can read about it and perhaps see some coding  examples?  I am not even sure if I can take the whole Base 64 string or perhaps a specific quadrant and convert it to Hex or Binary or whatever and then read from there, so i am at the beginning of my research somewhat. Please Help!
0
Comment
Question by:atlvandyguy
1 Comment
 
LVL 27

Accepted Solution

by:
BigRat earned 500 total points
ID: 10658430
In Base64 the characters A-Z,a-z,0-9,+,/ represent 6 bit values (0-63) (A=0,/=63). The input string (8-bit values) is taken 6-bits at a time and converted to one of the above characters. If the output string of characters is not a multiple of 6, equal signs (=) are used to pad out. Normally used for e-mail messages, the strings are often then sliced into max 78 characters and separated by CR/LFs to make "lines". These "lines" are then quite often prefixed by tabs or spaces to make the "text" look nice.

On converting back, tabs,spaces and CR/LF characters should be ignored. Each character is checked to be A-Z,a-z,0-9,+ or / (or =) and converted into a six-bit value. These values are "appended" to make the original string.


FUNCTION fromBase64(VAR Data   : data_rep_type;
                    VAR Result : data_rep_type) : Boolean;
LABEL 1,2,3,4;
VAR
   ch    : Char;
   i,c   : Integer;
   l     : 0..maxint;
   t     : Boolean;
   Tmp   : Byte;
   val   : Byte;
   which : 0..3;
  BEGIN
   fromBase64:=false;
   l:=StringLength(Data);
   NewString(Result,(l*3) DIV 4);
   i:=1; which:=0; c:=0; t:=false;
   WHILE i<=l DO
      BEGIN
      ch:=IndexString(Data,i);
      IF ch IN [%0D%,%0A%,%09%,' '] THEN GOTO 2;
      IF ch='=' THEN
         BEGIN
         c:=c+1;
         IF c>2 THEN
            BEGIN
            error('too many = at end of base64 string');
            GOTO 1;
            END;
         Val:=0;
         t:=true;
         GOTO 4;
         END;
      IF t THEN
         BEGIN
         error('illegal character at end of base64 string');
         GOTO 1;
         END;
      IF ch IN ['A'..'Z'] THEN
         val:=Ord(ch)-Ord('A')
      ELSE IF ch IN ['a'..'z'] THEN
         val:=Ord(ch)-Ord('a')+26
      ELSE IF ch IN ['0'..'9'] THEN
         val:=Ord(ch)-Ord('0')+52
      ELSE IF ch='+' THEN
         val:=62
      ELSE IF ch='/' THEN
         val:=63
      ELSE
         BEGIN
         error('illegal character in base64 string');
         GOTO 1;
         END;
4:    CASE which OF
      0 : (* output empty, move in all 6 bits *)
         Tmp:=Val*4;
      1 : (* Output has 6 bits, needs two more bits *)
         BEGIN
         CharAppend(Result,Chr(Tmp+(Val DIV 16)));
         Tmp:=(Val MOD 16)*16;
         END;
      2 : (* output has 4 bits, needs 4 more *)
         BEGIN
         CharAppend(Result,Chr(Tmp+(Val DIV 4)));
         Tmp:=(Val MOD 4)*64;
         END;
      3 : (* Output has 2 bits, needs 6 more *)
         CharAppend(Result,Chr(Tmp+Val));
      END;
      (* step on case *)
      which:=(which+1) MOD 4;
2:    i:=i+1;
      END;
3: IF which<>0 THEN
      BEGIN
      error('illegal termination of base64 string');
      GOTO 1;
      END;
   (* then remove any padded bytes  *)
   IF c>0 THEN
      BEGIN
      l:=StringLength(Result)-c;
      SliceString(Result,1,l);
      END;
   fromBase64:=true;
1:END;

The above is an old Pascal routine. The type data_rep_type is a variant type which supports infinitely long strings. In fact the length is determined by the input length *8 DIV 6.

In base64 each character (except CR and LF and white space) represents six bits of the output string. Since not every input string can be converted to an exact number of output characters (eg: two bytes makes 16 bits = 2*6 bits + 4 bits left) equal signs (=) are used to fill to the end. My routine checks all these cases, and errors appropiately.
0

Featured Post

Optimizing Cloud Backup for Low Bandwidth

With cloud storage prices going down a growing number of SMBs start to use it for backup storage. Unfortunately, business data volume rarely fits the average Internet speed. This article provides an overview of main Internet speed challenges and reveals backup best practices.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

This is about my first experience with programming Arduino.
If you’re thinking to yourself “That description sounds a lot like two people doing the work that one could accomplish,” you’re not alone.
With the power of JIRA, there's an unlimited number of ways you can customize it, use it and benefit from it. With that in mind, there's bound to be things that I wasn't able to cover in this course. With this summary we'll look at some places to go…
In this seventh video of the Xpdf series, we discuss and demonstrate the PDFfonts utility, which lists all the fonts used in a PDF file. It does this via a command line interface, making it suitable for use in programs, scripts, batch files — any pl…

809 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question