hankknight asked:

ASP.NET/VB/REGEX: Leading Stuff Zapped

This code does not work correctly when special characters such as colons or hyphens come before the matching phone number: the characters leading up to the number are dropped from the output.

Imports System.Web
Partial Class test
    Inherits System.Web.UI.Page
    Protected Function REGEX(text As String) As String
        Dim input As String = ""

        input =
            "This works 423-555-1212" & vbCrlf &
            "This works (423) 555-1212" & vbCrlf &
            "This works 4235551212" & vbCrlf &
            "" & vbCrlf &
            "This looses the hyphen - 4235551212" & vbCrlf &
            "This losses the space before the phone number: 423-555-1212" & vbCrlf &
            "This losses the space before the phone number: (423) 555-1212" & vbCrlf &
            "This losses the space before the phone number: 4235551212" & vbCrlf &
            "(This losses the space before the phone number) 423-555-1212" & vbCrlf &
            "This losses the space before the phone number; 423-555-1212"

        ' Phone Numbers
        input = RegularExpressions.Regex.Replace(input, "(\s?)(?!<\d[^a-z\d\n]{0,2})1?[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)[^a-z\n\d,]*(\d)(?![a-z\d\n]{0,2}\d)", "$1$2$3$4.$5$6$7.$8$9$10$11")

        Return input

    End Function
    Protected Sub _Index_Load(ByVal sender As Object, ByVal e As System.EventArgs) Handles me.load
        out.Text = REGEX(Request.Form("c"))
    End Sub
 
End Class

kaufmed
What is supposed to happen, or what should the data look like after the function is run?
ASKER

This:
This works 423-555-1212
This works (423) 555-1212
This works 4235551212
This looses the hyphen - 4235551212
This losses the space before the phone number: 423-555-1212
This losses the space before the phone number: (423) 555-1212
This losses the space before the phone number: 4235551212
(This losses the space before the phone number) 423-555-1212
This losses the space before the phone number; 423-555-1212

Becomes this:
This works 423.555.1212
This works 423.555.1212
This works 423.555.1212
This looses the hyphen 423.555.1212
This losses the space before the phone number423.555.1212
This losses the space before the phone number423.555.1212
This losses the space before the phone number423.555.1212
(This losses the space before the phone number423.555.1212
This losses the space before the phone number423.555.1212

But it should become this:
This works 423.555.1212
This works 423.555.1212
This works 423.555.1212
This looses the hyphen - 423.555.1212
This losses the space before the phone number: 423.555.1212
This losses the space before the phone number: 423.555.1212
This losses the space before the phone number: 423.555.1212
(This losses the space before the phone number) 423.555.1212
This losses the space before the phone number; 423.555.1212

Notice how certain characters, including spaces, hyphens, and colons, are lost if there is a special character before the phone number.
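
A possible explanation for the lost characters, sketched with a reduced pattern: in the original expression the leading `(\s?)` is optional and the `[^a-z\n\d,]*` run that follows it sits outside every capture group, so whatever that run consumes never makes it into the replacement. The pattern below is a shortened, hypothetical stand-in (three digits instead of ten, the lookarounds and the optional `1` omitted), and `LeadingCharsDemo` is an illustrative name only.

```vb
Imports System
Imports System.Text.RegularExpressions

Module LeadingCharsDemo
    Sub Main()
        ' Reduced stand-in for the original pattern (an assumption made for
        ' illustration): an optional captured space, an uncaptured run of
        ' characters that are not lowercase letters, digits, newlines, or
        ' commas, then three captured digits.
        Dim pattern As String = "(\s?)[^a-z\n\d,]*(\d)(\d)(\d)"

        ' The match starts at ":", so group 1 is empty and ": " is consumed
        ' by the uncaptured run and dropped.
        Console.WriteLine(Regex.Replace("number: 423", pattern, "$1[$2$3$4]"))
        ' prints "number[423]"

        ' The match starts at the space before "-", so group 1 keeps that
        ' space, while "- " is consumed by the uncaptured run and dropped.
        Console.WriteLine(Regex.Replace("hyphen - 423", pattern, "$1[$2$3$4]"))
        ' prints "hyphen [423]"
    End Sub
End Module
```

This mirrors the reported output: the colon cases lose both the colon and the space, and the hyphen case keeps one space but loses the hyphen.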

kaufmed

Your pattern seems overly complicated. Can we try this instead:

...

' Phone Numbers
input = RegularExpressions.Regex.Replace(input, "\b(?:(?<=\d{3})-(?=\d{4})|(?<=\d{3})-(?=\d{3}))\b", ".")  ' Replace delimited phone numbers
input = RegularExpressions.Regex.Replace(input, "\b(\d{3})(\d{3})(\d{4})\b", "$1.$2.$3")  ' Replace non-delimited phone numbers

...

ASKER

Your pattern does not work with this input:
Phone Number: (423) 555-1212

It should become:
Phone Number: 423.555.1212
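
One way to meet this requirement is sketched below. It is not the accepted answer, whose text is not shown in this thread, and it is narrower than the original pattern: it assumes a 10-digit number with optional parentheses around the area code and at most one space, dot, or hyphen between groups, and it does not handle a leading `1`. Because the pattern starts at the number itself, any text before it is left alone. `PhoneFormatDemo` and `FormatPhones` are hypothetical names.

```vb
Imports System
Imports System.Text.RegularExpressions

Module PhoneFormatDemo
    ' Hypothetical helper: rewrites 10-digit numbers as 423.555.1212.
    Function FormatPhones(input As String) As String
        ' (?<!\d) and (?!\d) keep the match from starting or ending inside
        ' a longer run of digits.
        ' \(? and \)? consume optional parentheses around the area code.
        ' [ .-]? allows at most one space, dot, or hyphen between groups.
        Dim pattern As String = "(?<!\d)\(?(\d{3})\)?[ .-]?(\d{3})[ .-]?(\d{4})(?!\d)"
        Return Regex.Replace(input, pattern, "$1.$2.$3")
    End Function

    Sub Main()
        Console.WriteLine(FormatPhones("Phone Number: (423) 555-1212"))
        ' prints "Phone Number: 423.555.1212"
        Console.WriteLine(FormatPhones("This looses the hyphen - 4235551212"))
        ' prints "This looses the hyphen - 423.555.1212"
        Console.WriteLine(FormatPhones("(In parentheses) 423-555-1212"))
        ' prints "(In parentheses) 423.555.1212"
    End Sub
End Module
```

Nothing before the first digit or opening parenthesis is part of the match, so there is no leading group to restore in the replacement string.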

ASKER CERTIFIED SOLUTION
kaufmed