Link to home
Start Free TrialLog in
Avatar of kacor
kacorFlag for Hungary

asked on

Edit html using excel macro

Hi experts,

I have to edit urgent html text files with macro. I tried to import html file into excel but the text file is broken into parts according cell size. The view is the same but I have to add the cells content to have the original line.
How to solve this job?  Any ideas appreciated.

wbr kacor
Avatar of dlmille
dlmille
Flag of United States of America image

I would read the text file using old fashioned methods.

Here's an example that will read your input into Excel, more "properly".

Option Explicit
Sub readHTMLTxtFile()
Dim wkb As Workbook
Dim wks As Worksheet
Dim wksOut As Worksheet
Dim dialogfile As FileDialog
Dim fName As String
Dim wholeLine As String
Dim i As Long

    Set wkb = ThisWorkbook
    Set wks = wkb.Worksheets("Control Panel")
    
    On Error Resume Next
    Set wksOut = wkb.Worksheets("Output")
    If Err.Number <> 0 Then
        Set wksOut = wkb.Worksheets.Add(after:=wkb.Worksheets("Control Panel"))
        wksOut.Name = "Output"
    End If
    On Error GoTo 0
    
    wksOut.Cells.Clear 'clear old output, if exists
    
    'prompt for txt file to read
    
    Set dialogfile = Application.FileDialog(msoFileDialogFilePicker)
    With dialogfile
        .Filters.Clear
        .Filters.Add "Text Files (*.txt)", "*.txt", 1 'change extension to .html if that's your extension
        .AllowMultiSelect = False
        .InitialView = msoFileDialogViewDetails
        .InitialFileName = ThisWorkbook.Path & "\"
        .Title = "Select TEXT File for HTML import"
        .Show
    End With
    If dialogfile.SelectedItems.Count > 0 Then
        fName = dialogfile.SelectedItems(1)
    Else
        fName = ""
    End If
    
    If fName <> "" Then 'read the file in
        Open fName For Input As #1
        Do While Not EOF(1) 'check for end of file
            Line Input #1, wholeLine
            'do something with it
            wksOut.Range("A1").Offset(i, 0).Value = wholeLine
            i = i + 1
        Loop
        wksOut.Columns("A").AutoFit
        Close #1
    End If

End Sub

Open in new window


See attached demonstration workbook.

Cheers,

Dave
readHTMLTxt-r1.xls
ASKER CERTIFIED SOLUTION
Avatar of dlmille
dlmille
Flag of United States of America image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of kacor

ASKER

Hi Dave,
thanks for the quick answer. Now I'm on the way to return home, and I'll test your suggestion asap.
I've not tested but I see a problem: they can occur some to long lines even up to 1 k or more. Would be this problem?

wbr kacor
In Excel 2003, a cell will hold 32767 characters, display 1024, but all is editable in the formula bar.

You asked to have the ability to read it all in without breaking lines.  Its now up to you to determine how you want to use the data in Excel.

Dave
Avatar of kacor

ASKER

thanks Dave!

kacor
You might try using Notepad++ also as an alternative.  I believe you can google and download for free.

Cheers,

Dave
Avatar of kacor

ASKER

Hi Dave,
your macro is excellent. The only needed change was - as you wrote - in line 30 the .txt extension changed to .htm

thanks again for this support!!

wbr Janos
You're welcome.  The second post has both the HTML and TXT options.

Cheers,

Dave