Edit HTML using an Excel macro

Hi experts,

I urgently need to edit HTML text files with a macro. I tried importing an HTML file into Excel, but each line of the file is broken into parts according to the cell size. The view looks the same, but I have to join the cells' contents to get the original line back.
How can I solve this? Any ideas appreciated.

wbr kacor
kacor asked:
dlmille commented:
I would read the text file using old-fashioned file I/O.

Here's an example that reads your input into Excel more "properly": one complete line of the file per cell.

Option Explicit
Sub readHTMLTxtFile()
Dim wkb As Workbook
Dim wks As Worksheet
Dim wksOut As Worksheet
Dim dialogfile As FileDialog
Dim fName As String
Dim wholeLine As String
Dim i As Long

    Set wkb = ThisWorkbook
    Set wks = wkb.Worksheets("Control Panel")
    
    On Error Resume Next
    Set wksOut = wkb.Worksheets("Output")
    If Err.Number <> 0 Then
        Set wksOut = wkb.Worksheets.Add(after:=wkb.Worksheets("Control Panel"))
        wksOut.Name = "Output"
    End If
    On Error GoTo 0
    
    wksOut.Cells.Clear 'clear old output, if exists
    
    'prompt for txt file to read
    
    Set dialogfile = Application.FileDialog(msoFileDialogFilePicker)
    With dialogfile
        .Filters.Clear
        .Filters.Add "Text Files (*.txt)", "*.txt", 1 'change extension to .html if that's your extension
        .AllowMultiSelect = False
        .InitialView = msoFileDialogViewDetails
        .InitialFileName = ThisWorkbook.Path & "\"
        .Title = "Select TEXT File for HTML import"
        .Show
    End With
    If dialogfile.SelectedItems.Count > 0 Then
        fName = dialogfile.SelectedItems(1)
    Else
        fName = ""
    End If
    
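    If fName <> "" Then 'read the file in
        Dim fNum As Integer
        fNum = FreeFile 'next free file number, rather than a hard-coded #1
        Open fName For Input As #fNum
        Do While Not EOF(fNum) 'check for end of file
            Line Input #fNum, wholeLine 'reads up to the next line break
            'write each line to its own row in column A
            wksOut.Range("A1").Offset(i, 0).Value = wholeLine
            i = i + 1
        Loop
        Close #fNum
        wksOut.Columns("A").AutoFit
    End If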

End Sub
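
One caveat with the loop above, offered as a side note rather than as part of the posted answer: Line Input # ends a line at a carriage return (CR) or a CR+LF pair, so an HTML file saved with LF-only (Unix-style) line endings may arrive as one very long line. A minimal sketch of a workaround, assuming the file uses a single-byte (ANSI) encoding; ReadAllLines is a hypothetical helper name, not something in the attached workbook:

```vba
Option Explicit

' Hypothetical helper: returns the lines of a text file as a zero-based
' array of strings, accepting CRLF, CR-only, or LF-only line endings.
' Assumes a single-byte (ANSI) file; other encodings are not handled.
Function ReadAllLines(ByVal fName As String) As Variant
    Dim fNum As Integer
    Dim content As String

    fNum = FreeFile
    Open fName For Binary Access Read As #fNum
    content = Space$(LOF(fNum))   ' buffer sized to the whole file
    Get #fNum, , content          ' read the file in one call
    Close #fNum

    ' Normalise every line-ending style to LF, then split on LF.
    content = Replace(content, vbCrLf, vbLf)
    content = Replace(content, vbCr, vbLf)
    ReadAllLines = Split(content, vbLf)
End Function
```

Each element of the returned array could then be written to the Output sheet in place of the Line Input loop. A file that ends with a line break yields one empty final element.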


See attached demonstration workbook.

Cheers,

Dave
readHTMLTxt-r1.xls
dlmille commented:
Modified to let you read HTML files or TXT files (note that the filter in the dialog lets you switch from HTML to TXT). I thought you'd get use out of it with plain TXT files as well, and you can add more filters in the same way. The data goes into column A of the Output sheet, one line per row, as it is read in.
Option Explicit
Sub readHTMLTxtFile()
Dim wkb As Workbook
Dim wks As Worksheet
Dim wksOut As Worksheet
Dim dialogfile As FileDialog
Dim fName As String
Dim wholeLine As String
Dim i As Long

    Set wkb = ThisWorkbook
    Set wks = wkb.Worksheets("Control Panel")
    
    On Error Resume Next
    Set wksOut = wkb.Worksheets("Output")
    If Err.Number <> 0 Then
        Set wksOut = wkb.Worksheets.Add(after:=wkb.Worksheets("Control Panel"))
        wksOut.Name = "Output"
    End If
    On Error GoTo 0
    
    wksOut.Cells.Clear 'clear old output, if exists
    
    'prompt for the HTML or text file to read
    
    Set dialogfile = Application.FileDialog(msoFileDialogFilePicker)
    With dialogfile
        .Filters.Clear
        .Filters.Add "HTML Files (*.html)", "*.html", 1 'default filter
        .Filters.Add "Text Files (*.txt)", "*.txt", 2 'second choice in the dialog's filter list
        .AllowMultiSelect = False
        .InitialView = msoFileDialogViewDetails
        .InitialFileName = ThisWorkbook.Path & "\"
        .Title = "Select TEXT File for HTML import"
        .Show
    End With
    If dialogfile.SelectedItems.Count > 0 Then
        fName = dialogfile.SelectedItems(1)
    Else
        fName = ""
    End If
    
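    If fName <> "" Then 'read the file in
        Dim fNum As Integer
        fNum = FreeFile 'next free file number, rather than a hard-coded #1
        Open fName For Input As #fNum
        Do While Not EOF(fNum) 'check for end of file
            Line Input #fNum, wholeLine 'reads up to the next line break
            'write each line to its own row in column A
            wksOut.Range("A1").Offset(i, 0).Value = wholeLine
            i = i + 1
        Loop
        Close #fNum
        wksOut.Columns("A").AutoFit
    End If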

End Sub


See attached demonstration workbook.

Dave
readHTMLTxt-r2.xls
kacor (author) commented:
Hi Dave,
Thanks for the quick answer. I'm on my way home now, and I'll test your suggestion ASAP.
I haven't tested it yet, but I see a possible problem: some lines can be very long, 1 k characters or more. Would that be a problem?

wbr kacor
dlmille commented:
In Excel 2003, a cell holds up to 32,767 characters but displays only the first 1,024; the full text is still editable in the formula bar.
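
By that figure, lines of around 1 k characters fit in a single cell; only a line longer than the cell limit would need to be split. A minimal sketch of such a split, assuming the 32767-character limit quoted above; SplitForCells is a hypothetical helper, not something in the attached workbook:

```vba
Option Explicit

' Hypothetical helper: cuts a long line into pieces that each fit in one
' cell. The default of 32767 is the per-cell limit quoted above.
Function SplitForCells(ByVal wholeLine As String, _
                       Optional ByVal maxChars As Long = 32767) As Collection
    Dim parts As New Collection
    Dim pos As Long

    If maxChars < 1 Then Err.Raise 5 ' invalid argument; avoids an endless loop

    pos = 1
    Do While pos <= Len(wholeLine)
        parts.Add Mid$(wholeLine, pos, maxChars) ' Mid$ stops at the end of the string
        pos = pos + maxChars
    Loop

    Set SplitForCells = parts
End Function
```

In the import loop, the pieces could go into columns A, B, C and so on of the same row, so that joining the cells of a row in order gives back the original line. An empty line yields an empty collection.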

You asked to be able to read it all in without breaking lines. It's now up to you to decide how you want to use the data in Excel.
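
If the goal is to edit the HTML and save it again, one option is to write column A back out to a file once the edits are done. A minimal sketch, assuming the lines sit in column A of the Output sheet as in the import macro; WriteColumnToFile is a hypothetical helper, and the output path is whatever the caller supplies:

```vba
Option Explicit

' Hypothetical helper: writes column A of the given sheet to a text file,
' one row per line. Print # ends each line with CR+LF.
Sub WriteColumnToFile(ByVal wks As Worksheet, ByVal fName As String)
    Dim fNum As Integer
    Dim lastRow As Long
    Dim r As Long

    ' Last used row in column A (blank rows after it are not written).
    lastRow = wks.Cells(wks.Rows.Count, "A").End(xlUp).Row

    fNum = FreeFile
    Open fName For Output As #fNum ' overwrites an existing file
    For r = 1 To lastRow
        Print #fNum, CStr(wks.Cells(r, "A").Value)
    Next r
    Close #fNum
End Sub
```

It could be called as WriteColumnToFile ThisWorkbook.Worksheets("Output"), followed by a path of your choosing. Note that a line which looks like a number, a date, or a formula may have been converted by Excel when it was stored in the cell, so it is worth comparing the output with the original file before relying on it.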

Dave
kacor (author) commented:
thanks Dave!

kacor
dlmille commented:
You might also try Notepad++ as an alternative. I believe it's a free download that you can find with a quick search.

Cheers,

Dave
kacor (author) commented:
Hi Dave,
Your macro is excellent. The only change needed was, as you wrote, in line 30: the .txt extension changed to .htm.
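
For files that may end in either .htm or .html, a single dialog filter can list both patterns, separated by a semicolon. A minimal sketch; AddImportFilters and PickImportFile are hypothetical helper names, not something in the attached workbook:

```vba
Option Explicit

' Hypothetical helper: shows .htm and .html files together in the picker,
' with plain text files as a second choice.
Sub AddImportFilters(ByVal fd As FileDialog)
    With fd.Filters
        .Clear
        .Add "HTML Files (*.htm; *.html)", "*.htm; *.html", 1
        .Add "Text Files (*.txt)", "*.txt", 2
    End With
End Sub

' Returns the chosen path, or "" if the dialog was cancelled.
Function PickImportFile() As String
    Dim fd As FileDialog

    Set fd = Application.FileDialog(msoFileDialogFilePicker)
    AddImportFilters fd
    fd.AllowMultiSelect = False
    ' Show returns -1 when the user confirms and 0 when they cancel.
    If fd.Show = -1 Then PickImportFile = fd.SelectedItems(1)
End Function
```

With this in place, the extension in the macro would not need editing when switching between .htm and .html files.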

thanks again for this support!!

wbr Janos
dlmille commented:
You're welcome.  The second post has both the HTML and TXT options.

Cheers,

Dave