How to find a character in a file (which line and which column) which causes sun.io.MalformedException thrown

I have a piece of code inside try block like the following:

BufferedReader reader = new BufferedReader (new InputStreamReader(new FileInputStream(pFile), "UTF-8"));

text = reader.reader.readLine();  line 1

while(text != null)
{
      text = reader.readLine();
}


//notes: text variable has been defined and pFile is passed from function call.

When I run the piece of code, it always throws sun.io.MalformedException at line 1.

It is very possible that it exists invalid character in pFile.  Is there any way to change the piece code to find which line and which column the invalid character is located so that I can just correct it in pFile.

I did the following changes to find which line but didn't work (this change will not find which column the invalid character is located). It always print linenumber is 1. It sounds that the whole file has only one line after creating reader object using UTF-8

int linenumber = 1;
text = reader.readerLine();

while(text != null)
{
      linenumber++;
      text = reader.readLine();
}

Thanks for the help
cplusplus030999Asked:
Who is Participating?
 
ADSLMarkConnect With a Mentor Commented:
Ok, i understand.

I made my own utf-8 file with some japanese/chinese characters. I dragged it into a hex editor and i saw 0x0D 0x0A (which is carriage return and new line) in there, so i copied the code above and tested if i received multiple lines or just one and i received more lines. So i guess the error (which i obvious cannot reproduce since i do not have the file) seems to be on line 1.

I think the file is not accessible for outsiders?

Mark
0
 
enachemcCommented:
since MalformedException does not exist,  you are probably reffering to MalformedURLException. This would be caused by the value of pFile variable. Which is the real value of the variable.
0
 
cplusplus030999Author Commented:
sun.io.MalformedException is a famous exception. Like I said, the exception is thrown from line 1. pFile is File object without any problem. I have tried other good file and it works
0
Cloud Class® Course: CompTIA Cloud+

The CompTIA Cloud+ Basic training course will teach you about cloud concepts and models, data storage, networking, and network infrastructure.

 
ADSLMarkCommented:
It would be nice to see the value of pFile and also the precise error message.

Mark
0
 
cplusplus030999Author Commented:
pFile = new File("C\:\\temp\\real_file.html");

C: is c drive, temp is folder and real_file.html is file name. They will exist.

sun.io.MalformedException is just like NullPointerException. If you want to get precise error message, you will be disppointed since it is null.

0
 
ADSLMarkCommented:
C\: ?? shouldn't that be C:\

Mark
0
 
ADSLMarkCommented:
I mean:

pFile = new File("C:\\temp\\real_file.html");

: does not need escaping does it?

Mark

0
 
ADSLMarkCommented:
Well i could have posted this all in one post but you should use the pathSeperator from the File class:

http://java.sun.com/j2se/1.4.2/docs/api/java/io/File.html

static String pathSeparator
          The system-dependent path-separator character, represented as a string for convenience.
static char pathSeparatorChar
          The system-dependent path-separator character.
static String separator
          The system-dependent default name-separator character, represented as a string for convenience.
static char separatorChar
          The system-dependent default name-separator character.

Mark
0
 
cplusplus030999Author Commented:
ADSLMark,


pFile = new File("C:\\temp\\real_file.html");
and pFile = new File("C\:\\temp\\real_file.html") both work. Like I said, the file name is not an issue. I have the folder name in property file. It works for other files. Please don't take too much time on this. pFile is absolutely valid File object. Thanks.
0
 
cplusplus030999Author Commented:
Since BufferedStream allows UTF-8 transferred but why readLine() can not reacognize that. Obviously readLine is only reading one line. If BufferedStream is UTF-8, it should know that
0
 
CEHJConnect With a Mentor Commented:
Use a http://java.sun.com/j2se/1.5.0/docs/api/java/nio/charset/CharsetDecoder.html

It will allow you to trap encoding exceptions in detail
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.