Solved

Merge two CSV Files Powershell

Posted on 2010-09-17
3
1,208 Views
Last Modified: 2012-05-10
Hi All,

I have the below script but am having some issues, i have two csv files

prods.csv:
partcode, manuf, desc, cost, rrp

stock.csv:
partcode, qty

i need to merge the qty column with the corolating partcode, the file stock.csv has more items in it than the prods.csv and neither files have a header in them so i need to match the partcodes and then on a match put the qty in the prods.csv or a new csv.

my current script only checks to make sure that the line number is the same which isnt good enough as the stock.csv has more lines in it.


Remove-Item E:\MergedData.txt

$prods = Get-Content "E:\prods.csv"

Write-Host "Ateam Product file read"

$data = Get-Content "E:\Stock.csv"

Write-Host "Both files read"

Clear-Host

[int] $i = 0
foreach($prod in $prods)
{
# split the current line from the genes.txt
# file into an array of 2 items using
# the tab character as the delimiter
$oneLineProds = $prod.Split(",")

# assign each of the 2 pieces of gene info
# to a variable

# $geneNumber would be "gene1" in first line
$prodNumber = $oneLineProds[0]

# $geneName would be "kinse_a" in first line
$prodName = $oneLineProds[1]
$prodDesc = $oneLineProds[2]
$prodCost = $oneLineProds[3]
$prodRRP = $oneLineProds[4]

# $data[$i] retrieves the correct line number from
# the data.txt file.
# split the current line from the data.txt
# file into an array of 2 elements using
# the tab character as the delimiter
$oneLineData = $data[$i].Split(",")

# $geneDataValue would be "+8.6" in first line
$prodStock = $oneLineData[1]

# build a variable that will look like this for the
# first line in both files:
# "gene1,kinse_a,+8.6"
$lineToWrite = "$prodNumber, $prodName, $prodDesc, $prodCost, $prodRRP, $prodStock"

# now write that variable out to a file
$lineToWrite | out-file E:\MergedData.txt -append

# increment $i so that we always retrieve the
# correct line number from the data.txt file
$i++

# for performance testing - comment out for production
if( ($i % 100) -eq 0 )
{
$i
}
}

Open in new window

0
Comment
Question by:sophieb1984
  • 2
3 Comments
 
LVL 17

Accepted Solution

by:
Learnctx earned 500 total points
ID: 33706897
This will take the 2 CSV files look for a match between the prods.csv and the stock.csv for the partcode. If a match is found they're put into a merged csv. If there is no match it is discarded. It inserts a header into the CSV files to make them easier to work with but that can be taken away and you can just work with the array as you have been.

Unfortunately I am short of time today so cannot provide a cleaner script.
$prodheader = "partcode,manuf,desc,cost,rrp"
$stockheader = "partcode,qty"
$combinedheader = "$($prodheader),qty"

$arrProd = @()
$arrProd += $prodheader
gc "E\prods.csv" | % { $arrProd+=$_ }
Set-Content "E:\prods.csv" $arrProd
$arrProd = $null

$arrStock = @()
$arrStock += $stockheader
gc "E:\stock.csv" | % { $arrStock+=$_}
Set-Content "E:\stock.csv" $arrStock
$arrStock = $null

$arrProd = Import-Csv "E:\prods.csv"
$arrStock = Import-Csv "E:\stock.csv"

$hashProd = @{}
$arrProd | % {
	$hashProd.Add($_.partcode,$("$($_.manuf),$($_.desc),$($_.cost),$($_.rrp)"))
}

$hashStock = @{}
$arrStock | % {
	$hashStock.Add($_.partcode,$_.qty)
}

$hashStock.Keys | % {
	if ($hashProd.ContainsKey($_)){
		$hashProd.Item($_) += ","+$hashStock.Item($_)
	}
}

New-Item -ItemType File -Path "E:\combined.csv"
Add-content "E:\cojavascript:void(0);mbined.csv" $combinedheader
Add-Content "E:\combined.csv" @($hashProd.Keys | % { "$($_),$($hashprod.item($_))" })

Open in new window

0
 
LVL 1

Author Comment

by:sophieb1984
ID: 33715103
The script works fine, but when one of the products cant find a stock line for it the column isn't created as you will know this is not good for CSV as it cant tell where the column ends / finishes I am getting the following result:

partcode,manuf,desc,cost,rrp
partcode,manuf,desc,cost,rrp,qty
partcode,manuf,desc,cost,rrp,qty
partcode,manuf,desc,cost,rrp
partcode,manuf,desc,cost,rrp,qty
partcode,manuf,desc,cost,rrp


i need it to do this:

partcode,manuf,desc,cost,rrp,
partcode,manuf,desc,cost,rrp,qty
partcode,manuf,desc,cost,rrp,qty
partcode,manuf,desc,cost,rrp,
partcode,manuf,desc,cost,rrp,qty
partcode,manuf,desc,cost,rrp,


simply add a comma to the columns that dont have a qty of stock in order to make it look like the
0
 
LVL 1

Author Comment

by:sophieb1984
ID: 33723575
Hi Sorry to be a pain i have got the script to work now but if the script is run more than once on a file then it will add the header again and again when the script is run, is it possible to add an if statement that will not add the header if it already exists.
Remove-Item "e:\csv\csv_suppliers\ateam\MergedData.csv"
$prodheader = "partcode,manuf,desc,cost,std"
$stockheader = "partcode,qty"
$combinedheader = "$($prodheader),qty"

$arrProd = @()
$arrProd += $prodheader
gc "e:\csv\csv_suppliers\ateam\Pricelist for Fms.csv" | % { $arrProd+=$_ }

Set-Content "e:\csv\csv_suppliers\ateam\Pricelist for Fms.csv" $arrProd
$arrProd = $null

$arrStock = @()
$arrStock += $stockheader
gc "e:\csv\csv_suppliers\ateam\stock feed.csv" | % { $arrStock+=$_}
Set-Content "e:\csv\csv_suppliers\ateam\stock feed.csv" $arrStock
$arrStock = $null

$arrProd = Import-Csv "e:\csv\csv_suppliers\ateam\Pricelist for fms.csv"
$arrStock = Import-Csv "e:\csv\csv_suppliers\ateam\stock feed.csv"

$hashProd = @{}
$arrProd | % {
	$hashProd.Add($_.partcode,$("$($_.manuf),$($_.desc),$($_.cost),$($_.std),"))
}

$hashStock = @{}
$arrStock | % {
	$hashStock.Add($_.partcode,$("$($_.qty)"))
}

$hashStock.Keys | % {
	if ($hashProd.ContainsKey($_)){
		$hashProd.Item($_) +=""+$hashStock.Item($_)
	}
}

New-Item -ItemType File -Path "e:\csv\csv_suppliers\ateam\MergedData.csv"
Add-content "e:\csv\csv_suppliers\ateam\MergedData.csv" $combinedheader
Add-Content "e:\csv\csv_suppliers\ateam\MergedData.csv" @($hashProd.Keys | % { "$($_),$($hashprod.item($_))" })

Open in new window

0

Featured Post

Control application downtime with dependency maps

Visualize the interdependencies between application components better with Applications Manager's automated application discovery and dependency mapping feature. Resolve performance issues faster by quickly isolating problematic components.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In this article we will get to know that how can we recover deleted data if it happens accidently. We really can recover deleted rows if we know the time when data is deleted by using the transaction log.
This article will help you understand what HashTables are and how to use them in PowerShell.
Along with being a a promotional video for my three-day Annielytics Dashboard Seminor, this Micro Tutorial is an intro to Google Analytics API data.
Sending a Secure fax is easy with eFax Corporate (http://www.enterprise.efax.com). First, just open a new email message. In the To field, type your recipient's fax number @efaxsend.com. You can even send a secure international fax — just include t…

910 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

21 Experts available now in Live!

Get 1:1 Help Now