Solved

I need a shell script to find html files, identify a word in the file and replace dummy text with that word

Posted on 2007-04-04
16
798 Views
Last Modified: 2012-05-05
I have several paralell directories, each with a file named mes.html
In each of these files is one of several standard words, like pure, planet or factor.  After each standard word is a number, like 12 or 08.
Based on which word is in the mes.html file, I need to replace the word thumb.jpg with a the standard word in the file, like pure 08.jpg.  Even better would be no space in the replacement text.

0
Comment
Question by:kailee
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 6
  • 5
  • 2
  • +3
16 Comments
 
LVL 14

Expert Comment

by:ygoutham
ID: 18855789
some more clarity is required.  

you have mes.html in too many directories.  this mes.html has words pure08, planet03, factor04 etc happening inside them.

you want pure08 to be changed to pure08.jpg and the following

pure08  ->  pure08.jpg
planet03 -> planet03.gif

and so on...

is this right??

if so, yes it is doable
0
 

Author Comment

by:kailee
ID: 18856016
not exactly.
mes.html is a standard file in each directory.
in mes.html, are words like 'pure' followed by a number, like '08'
so you might find pure 08, planet 03 or factor 04.
also, in each mes.html is the word 'thumb'.
depending on which word (not number) is found, thumb needs to be replaced by that word and its number.

So, I think case - select is needed as part of the script; one for each word.

0
 
LVL 84

Accepted Solution

by:
ozo earned 250 total points
ID: 18856097
find . -name mes.html | xargs perl -i -pe '($w,$n)=($1,$2) if /\b(pure|planet|factor)\s*(\d+); s/(thumb)/$w$n/g if $w'
0
Migrating Your Company's PCs

To keep pace with competitors, businesses must keep employees productive, and that means providing them with the latest technology. This document provides the tips and tricks you need to help you migrate an outdated PC fleet to new desktops, laptops, and tablets.

 
LVL 6

Assisted Solution

by:_iskywalker_
_iskywalker_ earned 250 total points
ID: 18856493
a less cryptografic:

list="pure planet factor"
list1=`find . -name mes.html`

for i in $list; do
for j in $list1; do
#check if the work is in this file, hope this is not by a break line separeted
k=`grep $i $j|head -n 1`

pass=0

for m in $k; do
if [ $m == $i ] ; then
pass=1;
l=$m;
fi;
if [ $pass -eq 1 ]; then
n=$m;
break;
done;
sed -e 's/thumb/$l $n/g' $j

done;
done;
0
 

Author Comment

by:kailee
ID: 18861324
can I use something other than a space in the list?  There are cases where some of the standard words are actually two words, like 'flower power'.
0
 
LVL 84

Expert Comment

by:ozo
ID: 18861338
in that case, would you want to replace 'thumb' with 'flower power'?
0
 

Author Comment

by:kailee
ID: 18861649
welll.....
I was going to do a search and replace after this process, to keep this one simple.
so, actually, each standard search word (or combo) is replaced by a pair of characters.
So, flower power 12 will become fp12.  mutual ecology 05 will beccome me05.

0
 
LVL 84

Expert Comment

by:ozo
ID: 18861682
then would pure 08 become p08?
0
 

Author Comment

by:kailee
ID: 18862153
yes.  However, I could make it easier by adding a dummy word after the single word ones.  like pure dummy 08, world dummy 13, etc.
0
 
LVL 84

Expert Comment

by:ozo
ID: 18862429
can there be more than one standard word(s) in the file?
would they always appear before the corresponding thumbs?
0
 
LVL 6

Expert Comment

by:_iskywalker_
ID: 18862984
you could reaplace power by "" and the replace flower by flower power.
0
 

Author Comment

by:kailee
ID: 18864778
Ozo.  No, there is only one entry per file.  Shal I set them to be one word or two words?
0
 
LVL 84

Expert Comment

by:ozo
ID: 18961934
perl -i -pe '($w,$x,$n)=($1,$2,$3) if /\b(?=(.).*?\b(\w))(?:flower power|word dummy)\s*(\d+)/; s/(thumb)/$w$x$n/g if $w'
0
 

Author Comment

by:kailee
ID: 19139096
Sorry, I was out of town for awhile.  I'm back at looking at this and will try these solutions
0
 
LVL 16

Expert Comment

by:Hanno P.S.
ID: 21170286
No comment has been added to this question in more than 21 days, so it is now classified as abandoned.

I will leave the following recommendation for this question in the Cleanup Zone:
SPLIT POINTS - between _iskywalker_ {http:#18856493} and ozo{http:#18856097}

Any objections should be posted here in the next 4 days. After that time, the question will be closed.
JustUNIX, Experts Exchange Cleanup Volunteer
0
 
LVL 1

Expert Comment

by:Computer101
ID: 21198277
Forced accept.

Computer101
EE Admin
0

Featured Post

Use Case: Protecting a Hybrid Cloud Infrastructure

Microsoft Azure is rapidly becoming the norm in dynamic IT environments. This document describes the challenges that organizations face when protecting data in a hybrid cloud IT environment and presents a use case to demonstrate how Acronis Backup protects all data.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

SSH (Secure Shell) - Tips and Tricks As you all know SSH(Secure Shell) is a network protocol, which we use to access/transfer files securely between two networked devices. SSH was actually designed as a replacement for insecure protocols that sen…
Join Greg Farro and Ethan Banks from Packet Pushers (http://packetpushers.net/podcast/podcasts/pq-show-93-smart-network-monitoring-paessler-sponsored/) and Greg Ross from Paessler (https://www.paessler.com/prtg) for a discussion about smart network …
Connecting to an Amazon Linux EC2 Instance from Windows Using PuTTY.
How to Install VMware Tools in Red Hat Enterprise Linux 6.4 (RHEL 6.4) Step-by-Step Tutorial

735 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question