I have the following structure from a legacy system. It is NOT XML. Basically it has a start and end structure and it varies from the name: Group1, Group2, Grouo3, etc.
Inside each of those structures there is a finite set of properties, none of them have a closing piece. For instance <DESC> can have many lines but NO closing matching </DESC>. The contents of DESC, as the other fields, end when another field starts.
Hence, my question is: What is the best way to parse it using Regex since each structure starts with a different name?
Attached a sample file with that.
<title>This is the first of the documents related to the site
<desc>This documents describes the land and surroundings
of the location
<title>This is the second of the documents related to the beach site
<desc>The beach house as it sits and its architectural characteristics
according to the author.