Convert ASCII file with several rows of data that are inconsistent to a column format in Excel

I have an ASCII file that was generated in Cobol.  Trying to convert the rows of data into column format in Excel 2013.  The problem is that there are different number of rows for each record.  Some records may be 15 lines while another may have 20.
dman19691Asked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

5teveoCommented:
Can you submit a sample file with some records to review?

Are there any patterns? Record row tags / headings?

You will need to append multi-row records into 1 row for excel which will require a macro - in my opinion.
0
dman19691Author Commented:
5Teveo,

I will have to check in see if the data is confidential.  But I was able to put every record that started with an 01 and changed it to a unique identifier of XX.  So that the start of a new record what be identified with XX.  I don't know if that makes sense?
0
5teveoCommented:
Just make a pattern similar to what you need so enable a better sample script... that's my thought
0
Introducing Cloud Class® training courses

Tech changes fast. You can learn faster. That’s why we’re bringing professional training courses to Experts Exchange. With a subscription, you can access all the Cloud Class® courses to expand your education, prep for certifications, and get top-notch instructions.

dman19691Author Commented:
Here is a sample of the data.  The first two positions in the lines of data represent a record.  For example 01, 02, 03 up to 15 represent one record.   I will attach a sample of the data.
sample.txt
0
5teveoCommented:
Upon quick review... it appears a data definition may be needed...
I can guess row types 01-?? thru next 01 are all associated with 1 record. I can also see multiple record rows concatenation for a text type description but it appears some other array type work is occurring within record.
Do you have a data definition describing how data is defined in variable length record in cobol? that's may be what you need to get better answer/script?

A vb script to join records will not help you unless script knows how to handle each type.

make sense?
0
dman19691Author Commented:
0
aikimarkCommented:
since you have very different layouts for these various record types, please do a manual parsing of the data and post that workbook.
0
5teveoCommented:
yup - that's it....

Well that's a lot analysis for me right now for free (or only points). We can pull records all records into 1 row but you would still have a mess unless you walk thru column definitions by record type from Data definition for better excel alignment.
0
aikimarkCommented:
This is not a trivial question.  Look at some of the work researchers in this field had to do:
http://assets.conferencespot.org/fileserver/file/42340/filename/2vdsq5.pdf


Given the size of the task, I might suggest looking for pre-parsed versions of this data, such as
http://archive.today/bomAY
http://www.capcog.org/documents/airquality/reports/2013/Task_3.1-2012_and_2018_Emissions_Modeling_for_CAPCOG_Region_and_Milam_Counties_2013-12-02.pdf

==============
I suggest you look for some software or professional help on this problem.
* Sourceforge projects:
    http://cobol-dde.sourceforge.net/
    http://sourceforge.net/projects/record-editor/
    http://record-editor.sourceforge.net/Cobol.html

* github (python) project: https://github.com/bpeterso2000/pycobol

* It wouldn't surprise me if there weren't some other open source applications that can interpret file content, based on COBOL layout.

* A (pricey) software solution example:  http://www.cobolproducts.com/datafile/pricing.html
* MicroFocus -- a COBOL developer tool
* If you already have Oracle in place: http://www.oracle.com/technetwork/testcontent/flat-file-cobol-copybook-sources-100800.html

* Contact environmental or open data groups to help you.
* Look at the Excel, VB, or COBOL top experts to see if they have enabled their Hire Me button.
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
dman19691Author Commented:
Thanks 5teveo and aikimark.  I figured this would not be easy since the data is in Cobol and has several lines, but you pointed me in the right direction.  Thanks again.
0
aikimarkCommented:
The better formats for this data would be:
* mark-up (XML, JSON, YAML)
* relational database with separate tables for each kind of record (01-15)
* NoSQL database (most likely JSON documents)

If you want to save this in Excel, you should have separate tabs for each kind of record (01-15)

Note: When parsing the data, the primary key of the 01 (or 02) record must be added to the 03-15 records as a foreign key, allowing you to join the tables for your queries.
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Microsoft Excel

From novice to tech pro — start learning today.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.