Solved

FREE tool for compressing source files

Posted on 2008-11-01
Last Modified: 2013-11-19
Hi,
I need a free tool that removes line breaks and whitespace from my source files (XHTML, JavaScript and CSS) so that the files get smaller.
I see that many sites use this technique.

Is there such a tool or program, or do I have to write it on my own?
Thanks!
Question by:mine_23
 
LVL 13

Expert Comment

by:brundo
ID: 22860700
Hi,

As far as I remember, this is called compression, and I think I've seen such files in Joomla (or was it Moodle?). Now you can search on your own, too :-))

Try with:

 
LVL 1

Author Comment

by:mine_23
ID: 22862209
Cool links. Thanks!
But what about XHTML files?
 
LVL 13

Expert Comment

by:brundo
ID: 22865297
New day, new ideas :-))

Are you maybe on some Unix? (in your question you wrote you can write your own program... maybe you are...)

If yes, try the following from the command line:

tr -s '\r\n' '\t' <input_file.input_file_type | sed 's/\t/ /g' | sed 's/  */ /g' >output_file.input_and_output_file_type.css

(the whole command goes on one line; change only the file names)

It first replaces all the line breaks (a file uploaded from Windows ends each line with a carriage return and a line feed), then replaces all the tabs with blanks and, finally, collapses every run of blanks into a single blank. This works for any text file type, although JavaScript that relies on line breaks (// comments, omitted semicolons) can break once everything is joined into one line.

If you like this approach, the long line can be saved in a file and executed as a script.
 
LVL 1

Author Comment

by:mine_23
ID: 22872013
Unfortunately, I am not using Unix.
I am using Windows XP.

Note that XHTML is 100% valid XML, so this might help!
 
LVL 13

Expert Comment

by:brundo
ID: 22881489
I am now in RegExp mood (whatever it means ;-)) ).

Here is a page with JavaScript: paste your code into the text area, press the button, and see how much smaller it gets. It "eats" all the line breaks, tabs, and double blanks.

<html>

<head>
<meta http-equiv="Content-Language" content="hr">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<title>Eat them!</title>

<script type="text/javascript">
function eatThem (){
    var str = document.getElementById("original").value;
    document.getElementById("before").value = str.length;
    // turn every line break and tab into a blank
    str = str.replace(/[\n\r\t]/g, " ");

    // first pass: every pair of blanks becomes one blank
    str = str.replace(/\ \ /g, " ");
    document.getElementById("original").value = str;
    document.getElementById("after").value = str.length;
    // repeat until no double blanks are left
    while (str.search(/\ \ /) != -1) {
        str = str.replace(/\ \ /g, " ");
        document.getElementById("original").value = str;
        document.getElementById("after").value = str.length;
    }
}
</script>

</head>

<body>

<form method="post" action="">
    <p><textarea rows="15" id="original" cols="88"></textarea></p>
    <!-- p><textarea rows="15" id="chowen" cols="88"></textarea></p -->
    <p>Before: <input type="text" id="before" size="20">, after: <input type="text" id="after" size="20"> characters</p>
    <p><input type="button" value="Eat them!" onclick="eatThem();"></p>
</form>

</body>

</html>
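
The loop in the page matters because one pass of the double-blank replacement only halves each run of blanks, so long runs need several passes. A minimal sketch of that behaviour, outside the page (the function name collapseByLooping and the pass counter are made up for illustration and are not part of the posted code):

```javascript
// Collapse runs of blanks the same way the page does, and count the passes.
function collapseByLooping(str) {
    var passes = 0;
    while (str.search(/  /) != -1) {      // any double blank left?
        str = str.replace(/  /g, " ");    // each pair of blanks becomes one blank
        passes++;
    }
    return { text: str, passes: passes };
}

// Eight blanks between "a" and "b": 8 -> 4 -> 2 -> 1, so three passes are needed.
var result = collapseByLooping("a        b");
console.log(result.text, result.passes);
```

A pattern that matches the whole run at once, such as / {2,}/g, would do the same job in a single pass.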


 
LVL 1

Author Comment

by:mine_23
ID: 22891133
OK.
But this approach is poor:

1) It doesn't remove comments.
2) It also removes tabs and double blanks from attribute values (between double quotation marks).

I found a similar tool here http://www.peterbe.com/plog/blogitem-040406-1/compressor but it suffers from the same problem.
 
LVL 13

Accepted Solution

by:
brundo earned 100 total points
ID: 22893567
1) Comments are removed by the script in the attached version (although you didn't mention comments in your first question).

2) It removes only double blanks and tabs from attribute values (and I can't remember any situation where it would be necessary to have double blanks or tabs in attribute values).

<html>

<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<title>Eat them!</title>

<script type="text/javascript">
function eatThem (){
    var str = document.getElementById("original").value;
    document.getElementById("before").value = str.length;
    str = str.replace(/[\n\r\t]/g, " ");           // line breaks and tabs become blanks
    str = str.replace(/<!--[\s\S]*?-->/g, " ");    // comments, including ones that contain hyphens
    while (str.search(/\ \ /) != -1) {             // repeat until no double blanks are left
        str = str.replace(/\ \ /g, " ");
    }
    document.getElementById("original").value = str;
    document.getElementById("after").value = str.length;
}
</script>

</head>

<body>

<form method="post" action="">
    <p><textarea rows="15" id="original" cols="88"></textarea></p>
    <p>Before: <input type="text" id="before" size="20">, after: <input type="text" id="after" size="20"> characters</p>
    <p><input type="button" value="Eat them!" name="B1" onclick="eatThem();"></p>
</form>

</body>

</html>
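
The script still collapses blanks and tabs inside attribute values, which was the second objection raised in this thread. One possible way around it, sketched under assumptions that are not from the thread (the function name compactMarkup is hypothetical, attribute values are assumed to be wrapped in double quotes, and comments are assumed not to occur inside attribute values), is to split the text on double-quoted strings and compress only the pieces between them:

```javascript
// Sketch: compress whitespace everywhere except inside double-quoted strings.
function compactMarkup(source) {
    // Drop comments first so that quotes inside them cannot confuse the split.
    var withoutComments = source.replace(/<!--[\s\S]*?-->/g, "");
    // Splitting on a capturing group keeps the quoted strings in the result:
    // even indexes hold text outside quotes, odd indexes hold the quoted strings.
    var parts = withoutComments.split(/("[^"]*")/);
    for (var i = 0; i < parts.length; i += 2) {
        parts[i] = parts[i].replace(/\s+/g, " ");   // any whitespace run becomes one blank
    }
    return parts.join("");
}

console.log(compactMarkup('<p title="a  b">x   y</p>'));
```

For the input shown, the two blanks inside title survive while the run between x and y shrinks to one blank, giving <p title="a  b">x y</p>. Single-quoted attribute values and inline scripts are not handled by this sketch.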

 
LVL 1

Author Comment

by:mine_23
ID: 22900181
Yes, I did not mention comments. You are right!
Well, sometimes I use double blanks in ASP.NET pages, but it is not that important.
Many thanks!
 
LVL 1

Author Closing Comment

by:mine_23
ID: 31512396
Thank you for the code.
Sorry for asking about removing comments so late!
 
LVL 13

Expert Comment

by:brundo
ID: 22900433
Thanks. Now you can play around with this code... join lines 11 and 12, then make 13 and 14 more effective :-))

You can start with http://www.regular-expressions.info/quickstart.html, but keep one eye on the table of the JavaScript dialect of regular expressions at http://www.javascriptkit.com/javatutors/redev2.shtml. It is interesting, especially when you know that many editors support regular expressions in their find-and-replace functions.
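
One way the exercise above might come out, as a sketch rather than the intended answer: the two replacements are joined with an alternation, and the loop is replaced by a pattern that matches a whole run of blanks at once. The function name eatText is made up so the logic can run without the page, and the comment pattern uses [\s\S]*? so that comments containing hyphens also match; both are assumptions.

```javascript
// Sketch of the suggested exercise, working on a plain string instead of the textarea.
function eatText(str) {
    // Lines 11 and 12 joined: a line break, a tab, or a whole comment becomes one blank.
    str = str.replace(/[\n\r\t]|<!--[\s\S]*?-->/g, " ");
    // Lines 13 and 14 without the loop: a run of two or more blanks becomes one blank.
    return str.replace(/ {2,}/g, " ");
}

console.log(eatText("a\r\n\tb <!-- x-y -->  c"));
```

Because {2,} is greedy, each run of blanks is consumed in one match, so a single call to replace is enough.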