Solved

Convert ASCII file with several rows of data that are inconsistent to a column format in Excel

Posted on 2014-07-28
Last Modified: 2014-07-29
I have an ASCII file that was generated in COBOL. I am trying to convert the rows of data into column format in Excel 2013. The problem is that each record spans a different number of rows: some records may be 15 lines while others may have 20.
0
Question by:dman19691
 
LVL 8

Expert Comment

by:5teveo
ID: 40224350
Can you submit a sample file with some records to review?

Are there any patterns? Record row tags / headings?

In my opinion, you will need to append the multi-row records into 1 row for Excel, which will require a macro.
0
 

Author Comment

by:dman19691
ID: 40224438
5Teveo,

I will have to check to see if the data is confidential.  But I was able to take every line that started with an 01 and change it to a unique identifier of XX, so that the start of a new record would be identified with XX.  I don't know if that makes sense?
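
The idea of treating each 01 line as the start of a new record can be sketched in a few lines of Python, without rewriting 01 to XX first. This is only a sketch: the sample lines are made up (the real file's contents are not shown in this thread), and `group_records` is a hypothetical helper name.

```python
from typing import Iterable, List


def group_records(lines: Iterable[str], start_code: str = "01") -> List[List[str]]:
    """Split a flat sequence of lines into records.

    A line whose first two characters equal start_code opens a new record;
    every following line is appended to that record until the next one.
    """
    records: List[List[str]] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # skip blank lines
        # Lines seen before the first start_code line still get a record.
        if line[:2] == start_code or not records:
            records.append([])
        records[-1].append(line)
    return records


# Hypothetical sample data, invented for illustration only.
sample = [
    "01ACME PLANT      TX",
    "02STACK A",
    "03NOX  12.5",
    "01BETA WORKS      OK",
    "02STACK B",
]
records = group_records(sample)
print(len(records))               # 2
print([len(r) for r in records])  # [3, 2]
```

With a real file, `group_records(open(path))` would work the same way, since a file object yields its lines one at a time.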
0
 
LVL 8

Expert Comment

by:5teveo
ID: 40224634
Just make up a sample with a pattern similar to your real data, to enable a better sample script... that's my thought
0
 

Author Comment

by:dman19691
ID: 40224845
Here is a sample of the data.  The first two positions in each line of data identify the line type within a record.  For example, lines 01, 02, 03 up to 15 together represent one record.   I will attach a sample of the data.
sample.txt
0
 
LVL 8

Expert Comment

by:5teveo
ID: 40224904
Upon quick review... it appears a data definition may be needed...
My guess is that row types 01-?? through the next 01 are all associated with 1 record. I can also see multiple rows being concatenated for a text-type description, but it appears some other array-type work is occurring within the record.
Do you have a data definition describing how the data is laid out in the variable-length record in COBOL? That may be what you need to get a better answer/script.

A VB script to join records will not help you unless the script knows how to handle each row type.

Make sense?
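
To illustrate what "knowing how to handle each type" could look like, here is a minimal Python sketch driven by a layout table. The `LAYOUTS` offsets and field names are invented for illustration; the real ones would have to come from the COBOL data definition, which has not been posted here.

```python
from typing import Dict, List, Tuple

# Hypothetical layouts: for each row type, a list of (field name, start, end)
# character slices. Real offsets must come from the COBOL data definition.
LAYOUTS: Dict[str, List[Tuple[str, int, int]]] = {
    "01": [("name", 2, 18), ("state", 18, 20)],
    "03": [("pollutant", 2, 7), ("amount", 7, 13)],
}


def parse_line(line: str) -> Dict[str, str]:
    """Slice one fixed-width line into named fields based on its row type."""
    code = line[:2]
    fields = {"type": code}
    layout = LAYOUTS.get(code)
    if layout is None:
        # Unknown row type: keep the text unparsed rather than guessing.
        fields["raw"] = line[2:]
        return fields
    for name, start, end in layout:
        fields[name] = line[start:end].strip()
    return fields


# Made-up lines, padded to match the hypothetical layouts above.
header = "01" + "ACME PLANT".ljust(16) + "TX"
detail = "03" + "NOX".ljust(5) + "12.5".rjust(6)
print(parse_line(header))
print(parse_line(detail))
print(parse_line("02STACK A"))
```

Row types without a layout entry are passed through as raw text, so nothing is silently dropped while the definitions are still being worked out.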
0
 

Author Comment

by:dman19691
ID: 40225024
0
 
LVL 45

Expert Comment

by:aikimark
ID: 40225029
Since you have very different layouts for these various record types, please do a manual parsing of the data and post that workbook.
0
 
LVL 8

Assisted Solution

by:5teveo
5teveo earned 250 total points
ID: 40225063
yup - that's it....

Well, that's a lot of analysis for me right now for free (or only points). We can pull all the rows of a record into 1 row, but you would still have a mess unless you walk through the column definitions by record type from the data definition for better Excel alignment.
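
As a rough Python sketch of pulling each record into 1 row: the column names (row type plus occurrence number) and the sample records below are made up for illustration. It also shows why the result is still messy, because records with fewer rows leave empty cells and nothing inside each row has been split into fields.

```python
import csv
import io
from collections import Counter
from typing import Dict, List


def flatten(record: List[str]) -> Dict[str, str]:
    """Turn one multi-line record into one wide row.

    Each column is named by row type plus occurrence number, e.g. 03_2
    for the second 03 line of the record.
    """
    seen: Counter = Counter()
    row: Dict[str, str] = {}
    for line in record:
        code = line[:2]
        seen[code] += 1
        row[f"{code}_{seen[code]}"] = line[2:].strip()
    return row


# Hypothetical records, already grouped so that each starts with an 01 line.
records = [
    ["01ACME PLANT", "02STACK A", "03NOX 12.5", "03SO2 3.1"],
    ["01BETA WORKS", "02STACK B"],
]
rows = [flatten(r) for r in records]

# Union of all column names, ordered by row type and then occurrence number.
columns = sorted({key for row in rows for key in row},
                 key=lambda k: (k[:2], int(k[3:])))

out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=columns, restval="")
writer.writeheader()
writer.writerows(rows)
print(out.getvalue())
```

Writing to a real `.csv` file instead of `io.StringIO` would give something Excel can open directly.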
0
 
LVL 45

Accepted Solution

by:aikimark
aikimark earned 250 total points
ID: 40225280
This is not a trivial question.  Look at some of the work researchers in this field had to do:
http://assets.conferencespot.org/fileserver/file/42340/filename/2vdsq5.pdf


Given the size of the task, I might suggest looking for pre-parsed versions of this data, such as
http://archive.today/bomAY
http://www.capcog.org/documents/airquality/reports/2013/Task_3.1-2012_and_2018_Emissions_Modeling_for_CAPCOG_Region_and_Milam_Counties_2013-12-02.pdf

==============
I suggest you look for some software or professional help on this problem.
* Sourceforge projects:
    http://cobol-dde.sourceforge.net/
    http://sourceforge.net/projects/record-editor/
    http://record-editor.sourceforge.net/Cobol.html

* github (python) project: https://github.com/bpeterso2000/pycobol

* It wouldn't surprise me if there were other open source applications that can interpret file content based on a COBOL layout.

* A (pricey) software solution example:  http://www.cobolproducts.com/datafile/pricing.html
* MicroFocus -- a COBOL developer tool
* If you already have Oracle in place: http://www.oracle.com/technetwork/testcontent/flat-file-cobol-copybook-sources-100800.html

* Contact environmental or open data groups to help you.
* Look at the Excel, VB, or COBOL top experts to see if they have enabled their Hire Me button.
0
 

Author Closing Comment

by:dman19691
ID: 40227418
Thanks 5teveo and aikimark.  I figured this would not be easy since the data is in Cobol and has several lines, but you pointed me in the right direction.  Thanks again.
0
 
LVL 45

Expert Comment

by:aikimark
ID: 40227618
The better formats for this data would be:
* mark-up (XML, JSON, YAML)
* relational database with separate tables for each kind of record (01-15)
* NoSQL database (most likely JSON documents)

If you want to save this in Excel, you should have separate tabs for each kind of record (01-15)

Note: When parsing the data, the primary key of the 01 (or 02) record must be added to the 03-15 records as a foreign key, allowing you to join the tables for your queries.
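
A minimal Python sketch of that note, under stated assumptions: the sample lines are invented, and a running record number stands in for the real primary key, which would be taken from fields of the 01 (or 02) record once the layout is known.

```python
from collections import defaultdict
from typing import Dict, List


def split_into_tables(lines: List[str]) -> Dict[str, List[dict]]:
    """Build one table (list of rows) per row type.

    Every row carries record_id, the key of the 01 line it belongs to,
    so the tables can be joined back together later.
    """
    tables: Dict[str, List[dict]] = defaultdict(list)
    record_id = 0  # lines seen before the first 01 line keep record_id 0
    for line in lines:
        code = line[:2]
        if code == "01":
            record_id += 1  # stand-in primary key (assumption)
        tables[code].append({"record_id": record_id, "data": line[2:].strip()})
    return dict(tables)


# Hypothetical sample data, invented for illustration only.
sample = [
    "01ACME PLANT",
    "02STACK A",
    "03NOX 12.5",
    "01BETA WORKS",
    "03SO2 3.1",
]
tables = split_into_tables(sample)
for code, rows in sorted(tables.items()):
    print(code, rows)
```

Each entry in `tables` could then be written to its own worksheet, CSV file, or database table, with `record_id` as the join column.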
0
