return only Digits 0-9 from a String
Note: you've only solved half the problem here.
For US phone numbers entered "in the wild", you may have:
- Phone numbers with or without the "1" prefix
- Phone numbers with or without the area code
- Phone numbers with extension numbers (if you blindly remove all non-digits, you'll miss the "x" or "Ext." or whatever also on the line).
- Possibly, numbers encoded with mnemonic letters (800-BUY-THIS or whatever)
You'll need to add some smarts to your code to conform the resulting list of digits to a single standard that you actually search against in your database.
Some simple things you could do to fix this:
Before the RegEx removal of non-digits, see if there's an "x" in the string. If there is, chop everything off after it (will handle most versions of writing an extension number).
For any number with 10+ digits beginning with a "1", chop off the 1. It's not part of the area code, US area codes start in the 2xx range.
For any number still exceeding 10 digits, assume the remainder is an extension of some sort, and chop it off.
Do your database search using an "ends-with" pattern search (SELECT * FROM mytable WHERE phonenumber LIKE 'blah%'). This will handle sitations (although with the possibility of error) where the area code is not provided, but your database has the number with the area code.
In .NET, you could extract just the digits from the string. Using Linq like this:
string justNumbers = new String(text.Where(Char.IsDigit).ToArray());
Don't forget to include using System.Linq
As an alternative to the main .Net
solution, adapted from a similar question's answer:
string justNumbers = string.Concat(text.Where(char.IsDigit));
I don't know if VBScript has some kind of a "regular expression replace" function, but if it does, then you could do something like this pseudocode:
reg_replace(/\D+/g, '', your_string)
I don't know VBScript so I can't give you the exact code but this would remove anything that is not a number.
EDIT: Make sure to have the global flag (the "g" at the end of the regexp), otherwise it will only match the first non-number in your string.