Searching & Renaming PDF Files

Searching & Renaming PDF Files

Bill you have provided some code in the past about searching and renaming PDF files...

Option Explicit

' Define files and folders to work with (NO TRAILING BACKSLASH)
Const cBaseDir = "b:\ee\ee29100342\files\[[DATE]]"
Const cControlFile = "b:\ee\ee29100342\list.csv"
Const cDelim = ","

' Text file I/O constants
Const ForReading = 1
Const ForWriting = 2
Const ForAppending = 8

' Declare global variables
Dim objFSO, strBaseDir, strControlFile, strDate
Dim arrControl, strControl, arrFields, i
Dim strOldName, strNewName, strOldPath, strNewPath

' Create file system object
Set objFSO = WScript.CreateObject("Scripting.FileSystemObject")

' Get date in YYYYMMDD format
strDate = Year(Now) & Right("0" & Month(Now), 2) & Right("0" & Day(Now), 2)

' Resolve file paths
strBaseDir = objFSO.GetAbsolutePathname(Replace(cBaseDir, "[[DATE]]", strDate, 1, -1, vbTextCompare))
strControlFile = objFSO.GetAbsolutePathname(cControlFile)

' Make sure base folder exists
If Not objFSO.FolderExists(strBaseDir) Then
    Wscript.Echo Now & " ERROR: Base folder does not exist - " & strBaseDir
    Wscript.Quit
End If

' Make sure control file exists
If Not objFSO.FileExists(strControlFile) Then
    Wscript.Echo Now & " ERROR: List file does not exist - " & strControlFile
    Wscript.Quit
End If

' Load csv file into array
With objFSO.OpenTextFile(strControlFile, ForReading)
    arrControl = Split(.ReadAll, vbCrLf)
    .Close
End With

' Loop over each control record and process as needed (skip header row)
For i = 1 To UBound(arrControl)

    ' Split line at delimiter, gather fields
    arrFields = CSVParse(arrControl(i))
    If UBound(arrFields) = 1 Then
        strOldName = arrFields(0) & ".pdf"
        strNewName = arrFields(0) & "_" & CleanFilename(arrFields(1)) & ".pdf"
    
        ' Build full paths for old and new file names
        strOldPath = strBaseDir & "\" & strOldName
        strNewPath = strBaseDir & "\" & strNewName
    
        ' If old file exists, and new name is not used, rename.  Otherwise display warning.
        If objFSO.FileExists(strOldPath) Then
            If Not objFSO.FileExists(strNewPath) Then
                objFSO.MoveFile strOldPath, strNewPath
            Else
                Wscript.Echo Now & " WARNING: New file already exists - " & strNewName
            End If
        Else
            Wscript.Echo Now & " WARNING: Old file does not exist - " & strOldName
        End If
        End If
Next

' Function to parse comma delimited line and return array of field values.
Function CSVParse(ByVal strLine)
    Dim arrFields
    Dim blnIgnore
    Dim intFieldCount
    Dim intCursor
    Dim intStart
    Dim strChar
    Dim strValue

    Const QUOTE = """"
    Const QUOTE2 = """"""

    ' Check for empty string and return empty array.
    If (Len(Trim(strLine)) = 0) then
        CSVParse = Array()
        Exit Function
    End If

    ' Initialize.
    blnIgnore = False
    intFieldCount = 0
    intStart = 1
    arrFields = Array()

    ' Add "," to delimit the last field.
    strLine = strLine & ","

    ' Walk the string.
    For intCursor = 1 To Len(strLine)
        ' Get a character.
        strChar = Mid(strLine, intCursor, 1)
        Select Case strChar
            Case QUOTE
                ' Toggle the ignore flag.
                blnIgnore = Not blnIgnore
            Case ","
                If Not blnIgnore Then
                    ' Add element to the array.
                    ReDim Preserve arrFields(intFieldCount)
                    ' Makes sure the "field" has a non-zero length.
                    If (intCursor - intStart > 0) Then
                        ' Extract the field value.
                        strValue = Mid(strLine, intStart, _
                            intCursor - intStart)
                        ' If it's a quoted string, use Mid to
                        ' remove outer quotes and replace inner
                        ' doubled quotes with single.
                        If (Left(strValue, 1) = QUOTE) Then
                            arrFields(intFieldCount) = _
                                Replace(Mid(strValue, 2, _
                                Len(strValue) - 2), QUOTE2, QUOTE)
                        Else
                            arrFields(intFieldCount) = strValue
                        End If
                    Else
                        ' An empty field is an empty array element.
                        arrFields(intFieldCount) = Empty
                    End If
                    ' increment for next field.
                    intFieldCount = intFieldCount + 1
                    intStart = intCursor + 1
                End If
        End Select
    Next
    ' Return the array.
    CSVParse = arrFields
End Function

Function CleanFilename(strName)
    Dim arrIllegal, strIllegal, i
    
    arrIllegal = Array("<",">",":","""","/","\","|","?","*")
    CleanFilename = Trim(strName)
    For Each strIllegal In arrIllegal
        CleanFilename = Replace(CleanFilename, strIllegal, "_")
    Next
    For i = 0 To 31
        CleanFilename = Replace(CleanFilename, Chr(i), "_")
    Next
End Function

Open in new window


I want to use the same bit of code again in an other scenario - This time the control file has slighted changed - see attached

This time i want to add the site details and date/time to the file name - removing any characters
Nick CollinsAsked:
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

x
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Bill PrewIT / Software Engineering ConsultantCommented:
I didn't see a control file attached?


»bp
Nick CollinsAuthor Commented:
Bill PrewIT / Software Engineering ConsultantCommented:
Based on the sample control file you provided, please provide examples of old and new file names for each line in the CSV control file.


»bp
Python 3 Fundamentals

This course will teach participants about installing and configuring Python, syntax, importing, statements, types, strings, booleans, files, lists, tuples, comprehensions, functions, and classes.

Nick CollinsAuthor Commented:
Old File Name

SC_000001_1245367.pdf

New File Name

SC_000001_1234567_Persimmon Homes_Phase 1 Kings Copse_16th October 2018.pdf
Bill PrewIT / Software Engineering ConsultantCommented:
Old File Name: SC_000001_1245367.pdf
New File Name: SC_000001_1234567_Persimmon Homes_Phase 1 Kings Copse_16th October 2018.pdf

I don't see any of those values in the sample CSV you posted earlier.  I would like a sample of the control file, and the corresponding renaming that should occur.  Just trying to save time and errors by being very clear on how the values in the CSV are used in the renaming, including spaces, underscores, special characters etc.  So show some comprehensive examples that touch on those please.

"Completed On:","Client / Site / Job Number","Internal Site Visit Number:"
"15 October 2018 02:06 PM","Site One Ltd | Bristol | Job Number: 1893","SV_000002_1067414"
"15 October 2018 02:21 PM","Mi Test Ltd | Farm Bristol | Job Number: 1871","SV_000003_1067414"

Open in new window



»bp
Nick CollinsAuthor Commented:
The file will be named "SV_000002_1067414.pdf"

Control file shows the following:

"15 October 2018 02:06 PM","Site One Ltd | Bristol | Job Number: 1893","SV_000002_1067414"

The file will be renamed as follows"SV_000002_1067414_Site One Ltd_Bristol_15_October_2018.pdf"

In the renaming process you ignore the Job Number:1893
Bill PrewIT / Software Engineering ConsultantCommented:
Give this a try, seems to do what you want in testing here...

Option Explicit

' Define files and folders to work with (NO TRAILING BACKSLASH)
Const cBaseDir = "b:\ee\ee29122035\files\[[DATE]]"
Const cControlFile = "b:\ee\ee29122035\list.csv"
Const cDelim = ","

' Text file I/O constants
Const ForReading = 1
Const ForWriting = 2
Const ForAppending = 8

' Declare global variables
Dim objFSO, strBaseDir, strControlFile, strDate
Dim arrControl, strControl, arrFields, i, arrTemp
Dim strOldName, strNewName, strOldPath, strNewPath

' Create file system object
Set objFSO = WScript.CreateObject("Scripting.FileSystemObject")

' Get date in YYYYMMDD format
strDate = Year(Now) & Right("0" & Month(Now), 2) & Right("0" & Day(Now), 2)

' Resolve file paths
strBaseDir = objFSO.GetAbsolutePathname(Replace(cBaseDir, "[[DATE]]", strDate, 1, -1, vbTextCompare))
strControlFile = objFSO.GetAbsolutePathname(cControlFile)

' Make sure base folder exists
If Not objFSO.FolderExists(strBaseDir) Then
    Wscript.Echo Now & " ERROR: Base folder does not exist - " & strBaseDir
    Wscript.Quit
End If

' Make sure control file exists
If Not objFSO.FileExists(strControlFile) Then
    Wscript.Echo Now & " ERROR: List file does not exist - " & strControlFile
    Wscript.Quit
End If

' Load csv file into array
With objFSO.OpenTextFile(strControlFile, ForReading)
    arrControl = Split(.ReadAll, vbCrLf)
    .Close
End With

' Loop over each control record and process as needed (skip header row)
For i = 1 To UBound(arrControl)

    ' Split line at delimiter, gather fields
    arrFields = CSVParse(arrControl(i))
    If UBound(arrFields) = 2 Then
        ' Get old file name based on control file info
        strOldName = arrFields(2) & ".pdf"

        ' Build new name from control file pieces
        arrTemp = Split(arrFields(1), "|")
        strNewName = arrFields(2) & "_" & Replace(Trim(arrTemp(0)), " ", "_") & "_" & Replace(Trim(arrTemp(1)), " ", "_")
        arrTemp = Split(arrFields(0), " ")
        strNewName = strNewName & "_" & arrTemp(0) & "_" & arrTemp(1) & "_" & arrTemp(2) & ".pdf"
        strNewName = CleanFilename(strNewName)
    
        ' Build full paths for old and new file names
        strOldPath = strBaseDir & "\" & strOldName
        strNewPath = strBaseDir & "\" & strNewName
    
        ' If old file exists, and new name is not used, rename.  Otherwise display warning.
        If objFSO.FileExists(strOldPath) Then
            If Not objFSO.FileExists(strNewPath) Then
                objFSO.MoveFile strOldPath, strNewPath
            Else
                Wscript.Echo Now & " WARNING: New file already exists - " & strNewName
            End If
        Else
            Wscript.Echo Now & " WARNING: Old file does not exist - " & strOldName
        End If
    End If
Next

' Function to parse comma delimited line and return array of field values.
Function CSVParse(ByVal strLine)
    Dim arrFields
    Dim blnIgnore
    Dim intFieldCount
    Dim intCursor
    Dim intStart
    Dim strChar
    Dim strValue

    Const QUOTE = """"
    Const QUOTE2 = """"""

    ' Check for empty string and return empty array.
    If (Len(Trim(strLine)) = 0) then
        CSVParse = Array()
        Exit Function
    End If

    ' Initialize.
    blnIgnore = False
    intFieldCount = 0
    intStart = 1
    arrFields = Array()

    ' Add "," to delimit the last field.
    strLine = strLine & ","

    ' Walk the string.
    For intCursor = 1 To Len(strLine)
        ' Get a character.
        strChar = Mid(strLine, intCursor, 1)
        Select Case strChar
            Case QUOTE
                ' Toggle the ignore flag.
                blnIgnore = Not blnIgnore
            Case ","
                If Not blnIgnore Then
                    ' Add element to the array.
                    ReDim Preserve arrFields(intFieldCount)
                    ' Makes sure the "field" has a non-zero length.
                    If (intCursor - intStart > 0) Then
                        ' Extract the field value.
                        strValue = Mid(strLine, intStart, _
                            intCursor - intStart)
                        ' If it's a quoted string, use Mid to
                        ' remove outer quotes and replace inner
                        ' doubled quotes with single.
                        If (Left(strValue, 1) = QUOTE) Then
                            arrFields(intFieldCount) = _
                                Replace(Mid(strValue, 2, _
                                Len(strValue) - 2), QUOTE2, QUOTE)
                        Else
                            arrFields(intFieldCount) = strValue
                        End If
                    Else
                        ' An empty field is an empty array element.
                        arrFields(intFieldCount) = Empty
                    End If
                    ' increment for next field.
                    intFieldCount = intFieldCount + 1
                    intStart = intCursor + 1
                End If
        End Select
    Next
    ' Return the array.
    CSVParse = arrFields
End Function

Function CleanFilename(strName)
    Dim arrIllegal, strIllegal, i
    
    arrIllegal = Array("<",">",":","""","/","\","|","?","*")
    CleanFilename = Trim(strName)
    For Each strIllegal In arrIllegal
        CleanFilename = Replace(CleanFilename, strIllegal, "_")
    Next
    For i = 0 To 31
        CleanFilename = Replace(CleanFilename, Chr(i), "_")
    Next
End Function

Open in new window


»bp

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
VB Script

From novice to tech pro — start learning today.