Separate based on file names


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Separate based on file names
# 1  
Old 02-04-2008
Separate based on file names

Hello experts,
This might prove to be a stupid question to some of you, but I have tried to tackle it in different ways. Being new to shell scripting, I am requesting your help in coming up with an elegant solution. I am using Korn shell.

We have a directory with file names with the pattern:
abc_123
abc_456
xyz_789

I have to compare them against a prefix file to check whether the names behind the underscore are valid and consolidate them in different directories.

How do I do that?
1. Put them in different files and use diff/comm to compare. This requires multiple passes - since the consolidation is more than what I can highlight here.
2. Do something like:
FileList=$(ls -r $SOURCE_DIR) # list & sort
PrefixList=$(cat $PREFIX_FILE | sort -r)
for sFile in $sFileList
do
iDelimit=$(expr index $sFile _)
(( iDelimit -= 1 ))
sFileCmp=$(expr substr $sFile 1 $iDelimit)
for sPrefix in $sPrefixList
do
echo $sFileCmp compared with $sPrefix
if [ $sFileCmp -gt sPrefix ] # will not work, symbolic
then
continue
elif [ $sFileCmp -eq sPrefix ]
then
blPrefixValid=1
else
blPrefixValid=0
break
fi
done

3. Get valid prefixes and run through the directory each time to process files


Any help appreciated.


Thanks.
# 2  
Old 02-04-2008
Quote:
Originally Posted by prashk15
Hello experts,
This might prove to be a stupid question to some of you, but I have tried to tackle it in different ways. Being new to shell scripting, I am requesting your help in coming up with an elegant solution. I am using Korn shell.

We have a directory with file names with the pattern:
abc_123
abc_456
xyz_789

I have to compare them against a prefix file to check whether the names behind the underscore are valid and consolidate them in different directories.

How do I do that?
1. Put them in different files and use diff/comm to compare. This requires multiple passes - since the consolidation is more than what I can highlight here.
2. Do something like:
FileList=$(ls -r $SOURCE_DIR) # list & sort
PrefixList=$(cat $PREFIX_FILE | sort -r)
for sFile in $sFileList
do
iDelimit=$(expr index $sFile _)
(( iDelimit -= 1 ))
sFileCmp=$(expr substr $sFile 1 $iDelimit)
for sPrefix in $sPrefixList
do
echo $sFileCmp compared with $sPrefix
if [ $sFileCmp -gt sPrefix ] # will not work, symbolic
then
continue
elif [ $sFileCmp -eq sPrefix ]
then
blPrefixValid=1
else
blPrefixValid=0
break
fi
done

3. Get valid prefixes and run through the directory each time to process files


Any help appreciated.


Thanks.
What is the criteria to separate the files?

Thanks
Nagarajan G
# 3  
Old 02-04-2008
Thanks for your reply.

The prefix is the criteria. If the file has valid prefix (that can be found in the prefix list) - that goes in one directory, if it is a duplicate it goes in the other and so on.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Sort a text file based on names in square brackets

Hi all, I have a text file similar to this: Text More text Etc Stuff That Is Needed Etc Etc This contains over 70 entries and each entry has several lines of text below the name in square brackets. (5 Replies)
Discussion started by: Scally
5 Replies

2. Programming

C++ separate code based on the few changes

Hi, I am working in Visual studio 2008 in which i have written the code in c++,qml,qt.Its a simulator application. I would like to create a simulator with certain changes. In order to do that i have modified the code with few changes based on the preprocessor condition. #define... (4 Replies)
Discussion started by: SA_Palani
4 Replies

3. UNIX for Beginners Questions & Answers

Consternation of multiple file names based on naming pattern

Hi, I have the following reports that get generated every 1 hour and this is my requirement: 1. 5 reports get generated every hour with the names "Report.Dddmmyy.Thhmiss.CTLR" "Report.Dddmmyy.Thhmiss.ACCD" "Report.Dddmmyy.Thhmiss.BCCD" "Report.Dddmmyy.Thhmiss.CCCD"... (1 Reply)
Discussion started by: Jesshelle David
1 Replies

4. UNIX for Dummies Questions & Answers

Allocating names to folders based on a file

Hi everyone, I have a problem and I would be gratful if you can help. I have set of folders with files in them. e.g. data1, data2, data3 and I have a json file with info ... looking like this I want to rename my files to replace the data with their gender to some processing and back to... (8 Replies)
Discussion started by: A-V
8 Replies

5. Shell Programming and Scripting

Sort html based on .jar, .war file names and still keep text within three groups.

Output from zipdiff GNU EAR comparison tool produces output in html divided into three sections "Added, Removed, Changed". I want the output to be sorted by jar or war file. <html> <body> <table> <tr> <td class="diffs" colspan="2">Added </td> </tr> <tr><td> <ul>... (5 Replies)
Discussion started by: kchinnam
5 Replies

6. Shell Programming and Scripting

How to separate based on delimiter?

Hi, Variable=MKT1,MKT2,MKT3 and so on i am trying to seperate MKT1,MKT2,MKT3 and store each in a variable. the values in variable1 may vary. I am using bash (8 Replies)
Discussion started by: arghadeep adity
8 Replies

7. UNIX for Dummies Questions & Answers

using sed delete a line from csv file based on specific data in two separate fields

Hello, :wall: I have a 12 column csv file. I wish to delete the entire line if column 7 = hello and column 12 = goodbye. I have tried everything that I can find in all of my ref books. I know this does not work /^*,*,*,*,*,*,"hello",*,*,*,*,"goodbye"/d Any ideas? Thanks Please... (2 Replies)
Discussion started by: Chris Eagleson
2 Replies

8. Shell Programming and Scripting

Sorting file based on names

Hi I have some files in directory and the names of files are like jnhld_15233_2010-11-23 jnhld_15233_2007-10-01 jnhld_15233_2001-05-04 jnhld_15233_2011-11-11 jnhld_15233_2005-06-07 jnhld_15233_2000-04-01 ..etc How can i sort these files based on the date in the file name so that ... (4 Replies)
Discussion started by: morbid_angel
4 Replies

9. Shell Programming and Scripting

How to split a data file into separate files with the file names depending upon a column's value?

Hi, I have a data file xyz.dat similar to the one given below, 2345|98|809||x|969|0 2345|98|809||y|0|537 2345|97|809||x|544|0 2345|97|809||y|0|651 9685|98|809||x|321|0 9685|98|809||y|0|357 9685|98|709||x|687|0 9685|98|709||y|0|234 2315|98|809||x|564|0 2315|98|809||y|0|537... (2 Replies)
Discussion started by: nithins007
2 Replies

10. UNIX for Dummies Questions & Answers

File names based off of wildcards

Hello all- First post, so just to forewarn you: I know enough about Perl and the Terminal to get myself into trouble, not quite enough to always get out. I'd like to know if it is possible to, from the command prompt, use a wild-card to declare the names of files for input and then use the... (1 Reply)
Discussion started by: tcquad
1 Replies
Login or Register to Ask a Question