Solved

Finding Files using shell/perl

Posted on 2012-03-26
10
790 Views
Last Modified: 2012-11-19
Hi,

 can some one please let me know how to find .text files by recursively reading folders and sub folders from a path.

Issue here is , in that folders I have two type of txt files. For example (`7249184.txt',  '7419841_0001.txt'), but I need only to find .txt  

Thanks,
0
Comment
Question by:new_perl_user
  • 4
  • 3
  • 2
  • +1
10 Comments
 
LVL 48

Expert Comment

by:Tintin
ID: 37768266
Issue here is , in that folders I have two type of txt files. For example (`7249184.txt',  '7419841_0001.txt'), but I need only to find .txt  

That sentence doesn't make sense.  Both your examples are .txt files.

To find .txt files, do

find /some/path -type f -name "*.txt"

Open in new window

0
 
LVL 13

Expert Comment

by:Carl Bohman
ID: 37768308
Find all .txt files in the current directory:
perl -MFile::Find -e 'find(sub{/\.txt$/ && print $_,"\n";}, @ARGV)' .

Open in new window

Modifying the regex allows you to search for any other files by name.  In your case, I think this regex may do what you need:
perl -MFile::Find -e 'find(sub{/^[^_]+\.txt$/ && print $_,"\n";}, @ARGV)' .

Open in new window

0
 

Author Comment

by:new_perl_user
ID: 37768357
Hi Bounsy,

I tried to run the perl -MFile::Find -e 'find(sub{/^[^_]+\.txt$/ && print $_,"\n";}, @ARGV)'  from command line and it throwed out an error.


invalid top directory at /usr/lib/perl5/5.8.8/File/Find.pm line 592.
0
PRTG Network Monitor: Intuitive Network Monitoring

Network Monitoring is essential to ensure that computer systems and network devices are running. Use PRTG to monitor LANs, servers, websites, applications and devices, bandwidth, virtual environments, remote systems, IoT, and many more. PRTG is easy to set up & use.

 
LVL 13

Expert Comment

by:Carl Bohman
ID: 37768444
The command included a period at the end.  This was to tell it to search starting from the current directory.  If you want to be more explicit, just list the directories you want to search.
perl -MFile::Find -e 'find(sub{/^[^_]+\.txt$/ && print "$File::Find::name\n";}, @ARGV)' /dir1 /dir2 /sub/dir3

Open in new window

Also note that I fixed the print statement in the above command, since the original version didn't show the path to the file, just the file name.
0
 

Author Comment

by:new_perl_user
ID: 37768593
Hi,

Thank you so much it worked. If possible can you please help me to extend the above command.

After finding the file  can we move that file to a location "/usr/HOME/DATA".
0
 
LVL 13

Accepted Solution

by:
Carl Bohman earned 500 total points
ID: 37768845
Untested, but something like this should work:
perl -MFile::Find -MFile::Copy -e 'find(sub{/^[^_]+\.txt$/ && move($File::Find::name, "/usr/HOME/DATA");}, @ARGV)' /dir1 /dir2 /sub/dir3

Open in new window

Note that you need to make sure that the directory /usr/HOME/DATA exists before running this command or you won't get the results you're looking for.
0
 
LVL 48

Expert Comment

by:Tintin
ID: 37768958
or easier to do

find .  -type f -name "*.txt" | xargs -i mv {} /usr/HOME/DATA

Open in new window

0
 
LVL 13

Expert Comment

by:Carl Bohman
ID: 37771768
@Tintin: That does work for the simple case of all .txt files, but not for the case that new_perl_user asked for which is for only some .txt files.  You would need to make the -name option more complicated or add an addiitonal grep command (likely using a regex) in order to only get the files that new_perl_user was interested in.  My solution is obviously more complicated (not necessarily a good thing), but has the advantage of being able to accept any arbitrarily-complex regex for the file name.  In general, I definitely agree that simple is better and prefer simple solutions when they are capable of handling the requirements.
0
 
LVL 48

Expert Comment

by:Tintin
ID: 37774145
Ah, so you successfully managed to interpret that when new_perl_user said

For example (`7249184.txt',  '7419841_0001.txt'), but I need only to find .txt  

they really meant:

I want to match numeric .txt files only, ie: no underscores.

In that case, a regex is the way to go.

With GNU find, you can do:

find . -type f -regex ".*/[0-9]+.txt"

Open in new window

0
 

Expert Comment

by:ashsysad
ID: 38614248
Good one.  I just came to know that we can use RegEx with find command.

Thanks
0

Featured Post

Problems using Powershell and Active Directory?

Managing Active Directory does not always have to be complicated.  If you are spending more time trying instead of doing, then it's time to look at something else. For nearly 20 years, AD admins around the world have used one tool for day-to-day AD management: Hyena. Discover why

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

On Microsoft Windows, if  when you click or type the name of a .pl file, you get an error "is not recognized as an internal or external command, operable program or batch file", then this means you do not have the .pl file extension associated with …
Over the years I've spent many an hour playing on hardened, DMZ'd servers, with only a sub-set of the usual GNU toy's to keep me company; frequently I've needed to save and send log or data extracts from these server back to my PC, or to others, and…
Learn several ways to interact with files and get file information from the bash shell. ls lists the contents of a directory: Using the -a flag displays hidden files: Using the -l flag formats the output in a long list: The file command gives us mor…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…

810 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question