Solved

XML Formatting

Posted on 2008-10-09
11
381 Views
Last Modified: 2013-11-18
Is there any way to add line breaks to an XML file after each element using XSLT or C#?

The problem is I have multiple files (several hundred thousand) that were all exported into XML as a single line. For the input program to accept them, they have to include line breaks to be more properly formatted. Any ideas on how to accomplish this?


Before
 

<document xmlns="http://www.ademero.com/XmlSchemas/ContentCentral/XmlCaptureDescriptorV1.1" xmlns:addData="http://www.abbyy.com/FlexiCapture/Schemas/Export/AdditionalFormData.xsd" xmlns:form="http://www.abbyy.com/FlexiCapture/Schemas/Export/FormData.xsd"><electronicFile>Images\AIRCO-W8371_1.pdf</electronicFile><destination><catalog documentType="default">Accounts Payable</catalog></destination><fields><field name="Invoice">W8371</field><field name="Vendor">AIRCO INC</field><field name="Date">12/18/98</field></fields></document>
 
 

AFTER
 

<document xmlns="http://www.ademero.com/XmlSchemas/ContentCentral/XmlCaptureDescriptorV1.1" xmlns:addData="http://www.abbyy.com/FlexiCapture/Schemas/Export/AdditionalFormData.xsd" xmlns:form="http://www.abbyy.com/FlexiCapture/Schemas/Export/FormData.xsd">

  <electronicFile>Images\AIRCO-W8371_1.pdf</electronicFile>

  <destination>

    <catalog documentType="default">Accounts Payable</catalog>

  </destination>

  <fields>

    <field name="Invoice">W8371</field>

    <field name="Vendor">AIRCO INC</field>

    <field name="Date">12/18/98</field>

  </fields>

</document>

Open in new window

0
Comment
Question by:developernetwork
11 Comments
 
LVL 18

Expert Comment

by:zc2
Comment Utility
Try to use an identity transformation, it should output with line breaks, especially if you use <xsl:output> with the indent="yes" attribute
<?xml version="1.0"?>

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

	<xsl:output method="xml" indent="yes" encoding="iso-8859-1"/>
 

	<xsl:template match="@*|node()">

	  <xsl:copy>

	    <xsl:apply-templates select="@*|node()"/>

	  </xsl:copy>

	</xsl:template>	

</xsl:stylesheet>

Open in new window

0
 

Author Comment

by:developernetwork
Comment Utility
I'm a little confused.

Here, let me show you the XSLT stylesheet we're already using.
I have the indent="yes" in there and it fails to do that.
<?xml version="1.0" encoding="utf-8"?>

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

	<xsl:output method="xml" indent="yes" media-type="text/xml" omit-xml-declaration="no" version="1.0"/>

	<xsl:template match="/" xmlns:addData="http://www.abbyy.com/FlexiCapture/Schemas/Export/AdditionalFormData.xsd"

	xmlns:form="http://www.abbyy.com/FlexiCapture/Schemas/Export/FormData.xsd">

		<document xmlns="http://www.ademero.com/XmlSchemas/ContentCentral/XmlCaptureDescriptorV1.1">

			<electronicFile>

				<xsl:value-of select="form:Documents/*/@addData:ImagePath" />

			</electronicFile>

			<destination>

				<catalog documentType="default">Accounts Payable</catalog>

			</destination>

			<fields>

				<field name="Invoice">

					<xsl:value-of select="form:Documents/*/_Document/_Invoice" />

				</field>

				<field name="Vendor">

					<xsl:value-of select="form:Documents/*/_Document/_Vendor" />

				</field>

				<field name="Date">

					<xsl:value-of select="form:Documents/*/_Document/_Date" />

				</field>

			</fields>

		</document>

	</xsl:template>

</xsl:stylesheet>

Open in new window

0
 
LVL 18

Expert Comment

by:zc2
Comment Utility
May I ask you what the xsl processor are you using?
I tried on MSXML v3 and v6, it does obey the indent attribute.
0
 

Author Comment

by:developernetwork
Comment Utility
I'm using a simple Xsl.Transform command in Microsoft C# version 3.5
0
 

Author Comment

by:developernetwork
Comment Utility
I'm not familiar with how to use MSXML v6, I've just written a little program in C# that does the transform.
0
Enabling OSINT in Activity Based Intelligence

Activity based intelligence (ABI) requires access to all available sources of data. Recorded Future allows analysts to observe structured data on the open, deep, and dark web.

 

Author Comment

by:developernetwork
Comment Utility
Did you make changes to the XSLT that I posted to make it do that? I'm trying again and I'm just not able to get it to format the document. I'm extremely new to working with XML, and they've put me on a very short deadline, so I haven't had the chance to read a lot of what I would have if I had been granted more time on this project.
0
 
LVL 18

Expert Comment

by:zc2
Comment Utility
You don't need a complex xslt to just reformat the xml with the same structure it already has.
All you need is a copy xslt which I have provided you in the first comment.

I tried to use the xsl from a C# program, and you right it does ignore the indent="yes" attribute.

Try to use the copy xsl with the following command line utility, it uses MSXML v3 as far as I know.
http://www.microsoft.com/downloads/details.aspx?FamilyID=2fb55371-c94e-4373-b0e9-db4816552e41&DisplayLang=en
0
 

Author Comment

by:developernetwork
Comment Utility
Well, this isn't the original document. The XSLT transforms it from a different document.

The reason I use the C# tool is because I have to execute this on a couple hundred thousand documents.
0
 
LVL 10

Assisted Solution

by:Hans Langer
Hans Langer earned 200 total points
Comment Utility
Just load the XML on a XML object and then save it or call his outerxml property. Both method ident the data.

xml.xmlDocument oXML = new xml.xmldocument;
oXML.loadxml(sXML)


oXML.outterXML

or

oXML.save("c:\myfile.xml")
0
 
LVL 23

Accepted Solution

by:
Tiggerito earned 300 total points
Comment Utility
This might help...
xmlNode = the xml data
 

StringWriter stringWriter = new StringWriter();

XmlTextWriter xmlWriter = new XmlTextWriter(stringWriter);

xmlWriter.Formatting = Formatting.Indented;

xmlWriter.Indentation = 4;

xmlWriter.IndentChar = ' ';

xmlWriter.QuoteChar = '"';
 

xmlNode.WriteTo(xmlWriter);
 

return stringWriter.ToString();

Open in new window

0
 

Author Comment

by:developernetwork
Comment Utility
Thanks, using all of this I was able to design a working solution. I appreciate it!
0

Featured Post

How your wiki can always stay up-to-date

Quip doubles as a “living” wiki and a project management tool that evolves with your organization. As you finish projects in Quip, the work remains, easily accessible to all team members, new and old.
- Increase transparency
- Onboard new hires faster
- Access from mobile/offline

Join & Write a Comment

Browsers only know CSS so your awesome SASS code needs to be translated into normal CSS. Here I'll try to explain what you should aim for in order to take full advantage of SASS.
Exception Handling is in the core of any application that is able to dignify its name. In this article, I'll guide you through the process of writing a DRY (Don't Repeat Yourself) Exception Handling mechanism, using Aspect Oriented Programming.
Viewers will learn about basic arrays, how to declare them, and how to use them. Introduction and definition: Declare an array and cover the syntax of declaring them: Initialize every index in the created array: Example/Features of a basic arr…
The viewer will receive an overview of the basics of CSS showing inline styles. In the head tags set up your style tags: (CODE) Reference the nav tag and set your properties.: (CODE) Set the reference for the UL element and styles for it to ensu…

763 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

10 Experts available now in Live!

Get 1:1 Help Now