Want to win a PS4? Go Premium and enter to win our High-Tech Treats giveaway. Enter to Win


trim the first 2 characters from each line in a file

Posted on 2010-11-10
Medium Priority
Last Modified: 2012-05-10
I have 4 types of files I receive which are outputs from a source I have no control over. Each file has in common that there are a consistent 2-4 characters in front of each line (in the 1st column for XLS docs). I want a scripted way to remove these. Can you suggest a method?
Thank you!
Question by:johndarby
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 4
  • 3
  • 2
  • +2
LVL 24

Accepted Solution

Tracy earned 1000 total points
ID: 34106244
Assuming headers, this will remove the first 2 characters from every cell in column A.

Option Explicit

Sub Remove2Chars()

    Dim i As Long
    Dim lastRow As Long
    lastRow = Range("A" & Rows.Count).End(xlUp).Row
    For i = 2 To lastRow
        Cells(i, 1).Value = Right(Cells(i, 1).Value, Len(Cells(i, 1).Value) - 2)
    Next i

End Sub

Open in new window

LVL 24

Expert Comment

ID: 34106265
To use a formula approach, you could put this in B2 and drag down to remove the first two characters in A2 and down:


Also, you say between 2-4 characters.  What is the logic behind removing 2 or 4?

Assisted Solution

fredniel earned 1000 total points
ID: 34106281
Can you post an example... or be more specific.

to figure out if there will be some delimitators or something.
How can we differentiate each type of file?

if your are trying just to remove the first column:


Problems using Powershell and Active Directory?

Managing Active Directory does not always have to be complicated.  If you are spending more time trying instead of doing, then it's time to look at something else. For nearly 20 years, AD admins around the world have used one tool for day-to-day AD management: Hyena. Discover why


Author Comment

ID: 34106495
Thanks folks! I don't have an example for you atm, but the extra characters seem to be random ASCII which precede the contents of each line in the docs. Usually it is 2 characters, sometimes 4 and occasionally it will be 5-9 characters, including one or more spaces, between the "garbage" characters.

The docs come from text dumps (they call them reports) from an MVS app called Universe, where one of the conversions that happens is EBCDIC-->ASCII. It's a black box to me.

Expert Comment

ID: 34106551
i think that we really need you to prepare an example...
there must be something to figure out how many characters to delete...

How can we identify those "garbage" characters...

there is something missing in the formulation of this problem.


Author Comment

ID: 34106885
I can get you an example...I just need to strip customer-specific data, first since this has payroll info in it. :)

Expert Comment

ID: 34106927
Agree with that. We are waiting for you then.
LVL 13

Expert Comment

ID: 34110730
Here is a code to select an Excel file and replace string bits from the 1st column. Note, multi-select Excel file is possible as well as the string bits (for cleaning) and Column Index (data column) is configurable.

To run the code in the attached file, press CNTRL + SHIFT + M
Option Explicit
Private Const sJunk As String = "#$%!"
Dim sJunkFormula As String

Public Sub CleanFileData()
Dim oOut, oWB As Workbook, nCtr As Integer
On Error Resume Next
oOut = Application.GetOpenFilename("Excel Files, *.xl*;*.xls;*.xlt", 2, "Select the files to clean", , True)
If Not IsArray(oOut) Then GoTo ErrCancel
sJunkFormula = ""
For nCtr = 1 To UBound(oOut)
    Set oWB = Application.Workbooks.Open(oOut(nCtr))
    CleanRange oWB.Sheets(1).Range("A:A")
    Set oWB = Nothing
Exit Sub
MsgBox "No File was selected.", vbCritical
End Sub

Public Sub CleanRange(oRange As Range)
Dim nCtr As Integer, oCell As Range
Dim m_sJunkFormula As String

If sJunkFormula = "" Then
    sJunkFormula = """~|~"""
    For nCtr = 1 To Len(sJunk)
        sJunkFormula = "SUBSTITUTE(" & sJunkFormula & ",""" & Mid(sJunk, nCtr, 1) & ""","""")"
End If
Set oRange = Application.Intersect(oRange, oRange.Worksheet.UsedRange)
oRange.NumberFormat = "General"
'oCell.Formula = "=" & Replace(sJunkFormula, "~|~", Replace(oRange.Cells(1).Address, "$", ""))
For Each oCell In oRange.Cells
    m_sJunkFormula = "=" & Replace(sJunkFormula, "~|~", oCell.Value)
    oCell.Value = Application.Evaluate(m_sJunkFormula)
End Sub

Open in new window

LVL 35

Expert Comment

ID: 34111930
If you can open these files in Excel and all you want to do is remove the first 2 characters of each line you could use Data>Text to columns... with fixed width.
You would only need one break line after the characters you don't need and you can chose not to import that column on the 3rd step.
If it isn't going to be the first 2 characters each time then you could create code that does the text to columns for however many characters it is.
Here's the code for 2 characters.
    Selection.TextToColumns Destination:=Range("A1"), DataType:=xlFixedWidth, _
        FieldInfo:=Array(Array(0, 9), Array(2, 1)), TrailingMinusNumbers:=True
The Array... part breaks down to this.
Array(0,9) - 1st column, do not import
Array(2,1) -2nd column, import starting at character 2
Note the character position starts at 0.
So for 3 characters you would change Array(2,1) to Array(3,1), for 4 Array(4,1)... and so on.
This would work if you can open the file in Excel and determine the no of characters to ignore.

Author Comment

ID: 34112970
Guys, I am having a hard time getting you a sample file. The problem lies with how much data (SSN, names, addresses, policy number...) is sensitive. I think I just have to be thnkful for the help you've given already. Thank you so much!

Author Closing Comment

ID: 34112985
Thank you again!

Featured Post

Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Access developers frequently have requirements to interact with Excel (import from or output to) in their applications.  You might be able to accomplish this with the TransferSpreadsheet and OutputTo methods, but in this series of articles I will di…
Auditing domain password hashes is a commonly overlooked but critical requirement to ensuring secure passwords practices are followed. Methods exist to extract hashes directly for a live domain however this article describes a process to extract u…
This Micro Tutorial will demonstrate how to use longer labels with horizontal bar charts instead of the vertical column chart.
Excel styles will make formatting consistent and let you apply and change formatting faster. In this tutorial, you'll learn how to use Excel's built-in styles, how to modify styles, and how to create your own. You'll also learn how to use your custo…

636 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question