Solved

tokenizing strings in sql

Posted on 2006-06-15
3
438 Views
Last Modified: 2008-03-10
Hi,

I have data in an 'Address' column which needs to be migrated to new table whose
columns are now 'Address1', 'Address2', 'Address3' and 'Address4'.

could anyone help me write an sql query that can cut up a string into four component strings.
The cut token would be a new line character with any whitespace character(including other newline) to the left or right.

ie
Address =

"Jon smith
Melbourne University
Australia"

would be cut up into
address1="Jon Smith"
address2="Melbourne Unversity"
address3="australia"
address4=<null>

ie
Address =
"Henry Smith

Economics Faculty


Melbourne University
Victoria
Australia.
Earth."

would be cup into
Address1="Henry Smith"
address2="Economics Faculty"
Address3="Melbourne University"
Address4=
"Victoria
Australia.
Earth"

Any help on this would be much appreciated
0
Comment
Question by:santuon
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
3 Comments
 
LVL 2

Accepted Solution

by:
ewahner earned 500 total points
ID: 16910449
First you want to replace the data with something a little easier to parse like this and then run it through this function:

CREATE FUNCTION dbo.fnArray ( @Str varchar(1000), @Delim varchar(1) = ' ' )
returns  @tmpTable table ( arrValue varchar(25))
as
begin
   declare @pos integer
   declare @lastpos integer
   declare @arrdata varchar(1000)
   declare @data varchar(25)
   
   set @arrdata = replace(replace(replace(replace(upper(@Str),@Delim,'|'),'-','|'),'/','|'),'\','|')
   set @arrdata = @arrdata + '|'
   set @lastpos = 1
   set @pos = 0
   set @pos = charindex('|', @arrdata)
   while @pos <= len(@arrdata) and @pos <> 0
   begin
      set @data = substring(@arrdata, @lastpos, (@pos - @lastpos))
      if rtrim(ltrim(@data)) > ''
      begin
         if not exists( select top 1 arrValue from @tmpTable where arrValue = @data )
         begin  
            insert into @tmpTable ( arrValue ) values ( @data )
         end
      end
      set @lastpos = @pos + 1
      set @pos = charindex('|', @arrdata, @lastpos)
   end
   return
end

declare @str varchar(255)
set @str = 'Henry Smith
Economics Faculty
Melbourne University
Victoria
Australia.
Earth'

set @str = replace(replace(@str,char(13),'|'),char(10),'')
select * from dbo.fnArray(@str,'|')
0
 
LVL 6

Expert Comment

by:ksbhat
ID: 16910622
0
 
LVL 2

Expert Comment

by:ewahner
ID: 17030199
Mine is more efficient as it uses the charindex to move to the next delimiter, whereas the devx one loops thru each character...very inefficient.  Also you can take out the if and just do the insert:

--        if not exists( select top 1 arrValue from @tmpTable where arrValue = @data )
--        begin  
            insert into @tmpTable ( arrValue ) values ( @data )
--        end

If you don't care about non-unique values.
0

Featured Post

Complete VMware vSphere® ESX(i) & Hyper-V Backup

Capture your entire system, including the host, with patented disk imaging integrated with VMware VADP / Microsoft VSS and RCT. RTOs is as low as 15 seconds with Acronis Active Restore™. You can enjoy unlimited P2V/V2V migrations from any source (even from a different hypervisor)

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

International Data Corporation (IDC) prognosticates that before the current the year gets over disbursing on IT framework products to be sent in cloud environs will be $37.1B.
Ever needed a SQL 2008 Database replicated/mirrored/log shipped on another server but you can't take the downtime inflicted by initial snapshot or disconnect while T-logs are restored or mirror applied? You can use SQL Server Initialize from Backup…
Via a live example, show how to shrink a transaction log file down to a reasonable size.
Viewers will learn how to use the UPDATE and DELETE statements to change or remove existing data from their tables. Make a table: Update a specific column given a specific row using the UPDATE statement: Remove a set of values using the DELETE s…

739 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question