I have a text file that contains the following line of text. Everything has a starting tag. I am looking for a way to get the values from the date, year and agency. One problem I have encountered is that some of the values will have html tags as well. I have not had much experince with regular expressions. Any help is appreciated.
"<PRESOL> <DATE>0622 <YEAR>99 <AGENCY>General Services Administration <OFFICE>Public Buildings Service (PBS) <LOCATION>Spokane Customer Services Center (10PM3) <ZIP>99201-1075 <CLASSCOD>Z <OFFADD>General Services Administration, Public Buildings Service (PBS), Spokane Customer Services Center (10PM3), 920 West Riverside Avenue, Room 120, U. S. Courthouse, Spokane, WA 99201-1075 <SUBJECT>EXTERIOR PAINTING, FB/USPO, SPOKANE, WASHINGTON <SOLNBR>10PM3XX990138 <RESPDATE>081199 <CONTACT>Cheryl O'Donnell, Contract Specialist, Phone (509) 353-2457, Fax (509) 353-2359, Email email@example.com - Eva Hutchison, Procurement Technician, Phone (509) 353-2457, Fax (509) 353-2359, Email firstname.lastname@example.org <DESC>Contractor shall furnish all labor, materials and equipment to paint all previously painted workwork and exterior metal on the FB/USPO, 904 West Riverside Avenue, Spokane, Washington. Building is five  stories. Repair/replace missing, loose, cracked or defective caulking and glazing compound from glass, frames and trim of exterior windows. All old paint contains lead. Sic Code 1721. All responsible sources may submit a quotation which, if timely received, may be considered by the Government. This procurement is set aside for small business concerns. Price range $100,000 - $250,000. Please fax requests for solicitations to 509-353-2359. <LINK> <URL>http://www.fbo.gov/spg/GSA/PBS/10PM3/10PM3XX990138/listing.html
<DESC>Link to FedBizOpps document. <EMAIL> <ADDRESS>cheryl.odonnell@g
sa.gov <DESC>Cheryl O'Donnell </PRESOL>"