Avatar of enthuguy
enthuguy
Flag for Australia asked on

Python to produce nice tabular output

HI,

I'm trying to modify below python script o produce nice tabular output. (Right now its not in a readable format)

Thanks in advance

Script source:
https://github.com/hjacobs/aws-cost-and-usage-report/blob/master/aws-cost-and-usage-report.py

Current output
./aws-cost-and-usage-report.py
TimePeriod	LinkedAccount	Service                                 	Amount	Unit	Estimated
2019-11-08 	 21212121212121	AWS CloudTrail 	 	 	                       0.153943 	 USD 	 False
2019-11-08 	 21212121212121	AWS Config 	 	 	                          9.213 	 USD 	 False
2019-11-08 	 21212121212121	AWS Direct Connect 	 	 	                   0.2797877163 	 USD 	 False
2019-11-08 	 21212121212121	AWS Key Management Service 	 	 	                   1.4141780112 	 USD 	 False
2019-11-08 	 21212121212121	AWS Lambda 	 	 	                   0.0804225759 	 USD 	 False
2019-11-08 	 21212121212121	Amazon DynamoDB 	 	 	                   0.3836161225 	 USD 	 False
2019-11-08 	 21212121212121	Amazon EC2 Container Registry (ECR) 	 	 	                   0.0783308328 	 USD 	 False
2019-11-08 	 21212121212121	Amazon EC2 Container Service 	 	 	                              0 	 USD 	 False
2019-11-08 	 21212121212121	EC2 - Other 	 	 	                   6.8639388761 	 USD 	 False
2019-11-08 	 21212121212121	Amazon Elastic Compute Cloud - Compute 	 	 	                  73.1890902202 	 USD 	 False
2019-11-08 	 21212121212121	Amazon Elastic File System 	 	 	                   2.2110942898 	 USD 	 False
2019-11-08 	 21212121212121	Amazon Elastic Load Balancing 	 	 	                   4.8388505022 	 USD 	 False
2019-11-08 	 21212121212121	Amazon GuardDuty 	 	 	                    0.761623977 	 USD 	 False
2019-11-08 	 21212121212121	Amazon Relational Database Service 	 	 	                  21.2797291955 	 USD 	 False

Open in new window

* aws-cliShell ScriptingPythonAWS

Avatar of undefined
Last Comment
enthuguy

8/22/2022 - Mon
noci

How about using format...  (https://www.programiz.com/python-programming/methods/string/format )
Then use for heading & data:

...
template="{10} {14} {40} {16.10} {6} {10}"
...
print(template.format("TimePeriod", "LinkedAccount", "Service", "Amount", "Unit", "Estimated"))
...
   print(template.format(result_by_time['TimePeriod']['Start'], group['Keys']['LINKED_ACCOUNT'],  group['Keys']['SERVICE'], amount, unit,  result_by_time['Estimated']))
...

Open in new window

You still need the subfields for the keys to be corrected i guessed them from the source.

The format might need a bit of adjustment. I guessed the widths from the above example.
enthuguy

ASKER
Thanks a lot Noci, will update you
enthuguy

ASKER
Sorry, this is what I updated. since i'm not familiar with python


#!/usr/bin/env python3

import argparse
import boto3
import datetime

parser = argparse.ArgumentParser()
parser.add_argument('--days', type=int, default=30)
args = parser.parse_args()


now = datetime.datetime.utcnow()
start = (now - datetime.timedelta(days=args.days)).strftime('%Y-%m-%d')
end = now.strftime('%Y-%m-%d')
template="{10} {14} {40} {16.10} {6} {10}"

cd = boto3.client('ce', 'ap-southeast-2')

results = []

token = None
while True:
    if token:
        kwargs = {'NextPageToken': token}
    else:
        kwargs = {}
    data = cd.get_cost_and_usage(TimePeriod={'Start': start, 'End':  end}, Granularity='DAILY', Metrics=['UnblendedCost'], GroupBy=[{'Type': 'DIMENSION', 'Key': 'LINKED_ACCOUNT'}, {'Type': 'DIMENSION', 'Key': 'SERVICE'}], **kwargs)
    results += data['ResultsByTime']
    token = data.get('NextPageToken')
    if not token:
        break

# print('\t'.join(['TimePeriod', 'LinkedAccount', 'Service'.ljust(40, ' '), 'Amount', 'Unit', 'Estimated']))
print(template.format("TimePeriod", "LinkedAccount", "Service", "Amount", "Unit", "Estimated")

for result_by_time in results:
    for group in result_by_time['Groups']:
        amount = group['Metrics']['UnblendedCost']['Amount']
        unit = group['Metrics']['UnblendedCost']['Unit']
        # comp = join(group['Keys'])
        # rep_start_time = result_by_time['TimePeriod']['Start']
        # print(rep_start_time.ljust(10, ' '), '\t', '\t' .join(group['Keys']), '\t', '\t', '\t', amount.rjust(30, ' '), '\t', unit, '\t', result_by_time['Estimated'])
        print(template.format(result_by_time['TimePeriod']['Start'], group['Keys'],  amount, '\t', unit, '\t', result_by_time['Estimated'])

Open in new window



Error

./aws-cost-and-usage-report.py --days=7
Traceback (most recent call last):
  File "./aws-cost-and-usage-report.py", line 34, in <module>
    print(template.format("TimePeriod", "LinkedAccount", "Service", "Amount", "Unit", "Estimated"))
IndexError: tuple index out of range

Open in new window

Your help has saved me hundreds of hours of internet surfing.
fblack61
noci

there are missing ) with the print statements.  And i removed the '\t' from the 2nd string.
(I copied a wrong example earlier...)
(add : in the formats type field for float, and difference between header & data template)
#!/usr/bin/env python3

import argparse
import boto3
import datetime

parser = argparse.ArgumentParser()
parser.add_argument('--days', type=int, default=30)
args = parser.parse_args()


now = datetime.datetime.utcnow()
start = (now - datetime.timedelta(days=args.days)).strftime('%Y-%m-%d')
end = now.strftime('%Y-%m-%d')
dtemplate="{:10} {:14} {:40} {:16.10f} {:6} {:10}"
htemplate="{:10} {:14} {:40} {:16} {:6} {:10}"

cd = boto3.client('ce', 'ap-southeast-2')

results = []

token = None
while True:
    if token:
        kwargs = {'NextPageToken': token}
    else:
        kwargs = {}
    data = cd.get_cost_and_usage(TimePeriod={'Start': start, 'End':  end}, Granularity='DAILY', Metrics=['UnblendedCost'], GroupBy=[{'Type': 'DIMENSION', 'Key': 'LINKED_ACCOUNT'}, {'Type': 'DIMENSION', 'Key': 'SERVICE'}], **kwargs)
    results += data['ResultsByTime']
    token = data.get('NextPageToken')
    if not token:
        break

# print('\t'.join(['TimePeriod', 'LinkedAccount', 'Service'.ljust(40, ' '), 'Amount', 'Unit', 'Estimated']))
print(htemplate.format("TimePeriod", "LinkedAccount", "Service", "Amount", "Unit", "Estimated"))

for result_by_time in results:
    for group in result_by_time['Groups']:
        amount = group['Metrics']['UnblendedCost']['Amount']
        unit = group['Metrics']['UnblendedCost']['Unit']
        # comp = join(group['Keys'])
        # rep_start_time = result_by_time['TimePeriod']['Start']
        # print(rep_start_time.ljust(10, ' '), '\t', '\t' .join(group['Keys']), '\t', '\t', '\t', amount.rjust(30, ' '), '\t', unit, '\t', result_by_time['Estimated'])
        print(dtemplate.format(result_by_time['TimePeriod']['Start'], group['Keys']['LINKED_ACCOUNT'],  group['Keys']['SERVICE'], amount, unit,  result_by_time['Estimated']))

Open in new window

Juan Carlos

Suhas .

Hi,
You can using one of the modules:

https://pypi.org/project/tabulate/

or

https://pypi.org/project/tabulate/
ex: print(tabulate(table, headers, tablefmt="pipe"))

or

https://pypi.org/project/beautifultable/
⚡ FREE TRIAL OFFER
Try out a week of full access for free.
Find out why thousands trust the EE community with their toughest problems.
enthuguy

ASKER
Hi noci, sorry again.

kindly help pls

./aws-cost-and-usage-report.py
TimePeriod LinkedAccount  Service                                  Amount           Unit   Estimated
Traceback (most recent call last):
  File "./aws-cost-and-usage-report.py", line 44, in <module>
    print(dtemplate.format(result_by_time['TimePeriod']['Start'], group['Keys']['LINKED_ACCOUNT'],  group['Keys']['SERVICE'], amount, unit,  result_by_time['Estimated']))
TypeError: list indices must be integers or slices, not str
enthuguy

ASKER
Thanks Juan and Suhas,
I tried these as a standalone scripts it works fine.

My challenge is how to incorporate this to my script :)
noci

The group['Keys]['LINKED_ACCOUNT] & group['Keys']['SERVICE'] probably are not strings. This might need to be changed to:

 group['Keys][0] & group['Keys'][1]

print(dtemplate.format(result_by_time['TimePeriod']['Start'], group['Keys'][0],  group['Keys'][1], amount, unit,  result_by_time['Estimated']))

Open in new window


I have no access to AWS so i cannot verify the code.
Experts Exchange is like having an extremely knowledgeable team sitting and waiting for your call. Couldn't do my job half as well as I do without it!
James Murphy
enthuguy

ASKER
its ok, I can understand....thx for your help

may be we are getting close :)


./aws-cost-and-usage-report.py
TimePeriod LinkedAccount  Service                                  Amount           Unit   Estimated
Traceback (most recent call last):
  File "./aws-cost-and-usage-report.py", line 45, in <module>
    print(dtemplate.format(result_by_time['TimePeriod']['Start'], group['Keys'][0],  group['Keys'][1], amount, unit,  result_by_time['Estimated']))
ValueError: Unknown format code 'f' for object of type 'str'

Open in new window

ASKER CERTIFIED SOLUTION
noci

THIS SOLUTION ONLY AVAILABLE TO MEMBERS.
View this solution by signing up for a free trial.
Members can start a 7-Day free trial and enjoy unlimited access to the platform.
See Pricing Options
Start Free Trial
GET A PERSONALIZED SOLUTION
Ask your own question & get feedback from real experts
Find out why thousands trust the EE community with their toughest problems.
enthuguy

ASKER
Thanks a lot Noci, you are simply great!

Even though you did not have the aws env...you helped me. :)