How to delete all lines with less than 32 characters from a textfile?

02-26-2010

Quote:

Originally Posted by alister

Did you test it with filenames containing IFS characters? Assuming a default IFS value, your option handling, sed invocations, and cat statement will all barf if a filename contains whitespace.

Code:

for opt in $*

should be

Code:

for opt in "$@"

All instances of $3 need to be double-quoted.

Alister

Okay, I'll change that then.

EDIT: I fixed the script. It now looks like this:

Code:

#!/bin/bash
# Deletes lines whose length falls within a given range from a file.
# Writing the result to the file is optional.

write="no";
for opt in "$@"
do
	case "$opt" in
		-w ) write="yes";
		     shift;;
		-* ) shift;;
		*  );;
	esac
done
least=$1;
great=$2;
shift;
shift;
filname="$*";
if [ "$write" = "yes" ]
then
	sed -e "/^.\{$least,$great\}$/d" "$filname" > tempfile.txt;
	cat tempfile.txt > "$filname";
	rm tempfile.txt;
else
	sed -e "/^.\{$least,$great\}$/d" "$filname";
fi
unset write

All I did was put quotation marks around the filename. I was also able to fix another script that I wrote a while back which was having the same problem.
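To illustrate why the quotes matter, here is a minimal sketch rather than the script itself. It assumes bash, `mktemp`, and a sed that accepts the `\{m,n\}` interval syntax; the helper name `delete_by_length` and the file name `my file.txt` are made up for the example.

```shell
#!/bin/bash
# Hypothetical helper mirroring the sed call in the script above:
# print FILE without the lines whose length is between LEAST and GREAT.
delete_by_length() {
    local least=$1 great=$2 filname=$3
    # "$filname" is quoted so a name containing spaces stays one argument.
    sed -e "/^.\{$least,$great\}$/d" "$filname"
}

workdir=$(mktemp -d)
filname="$workdir/my file.txt"      # hypothetical name containing a space
printf '%s\n' 'short' 'a line that is comfortably longer than 31 characters' > "$filname"

# Drops every line with fewer than 32 characters; only the long line is printed.
delete_by_length 0 31 "$filname"

rm -r "$workdir"
```

Without the quotes around `$filname`, sed would receive the two halves of the name as separate file arguments and fail to find either one.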

Actually I've found I don't have to use "$@". It works whether I use that or $*. The quotation marks were the only problem.

Last edited by Ultrix; 02-26-2010 at 01:10 PM..

Ultrix

02-27-2010

Quote:

Originally Posted by Ultrix

Actually I've found I don't have to use "$@". It works whether I use that or $*. The quotation marks were the only problem.

If you aren't using "$@" in that situation, then your script has a bug. There is no doubt about it.

Using $@ without quotes, or $* with or without quotes, will not expand to each individual command line argument (positional parameter, in sh man page lingo). If unquoted, $@ and $* behave identically: they expand to a list of words, one per positional parameter, and then (this is the problem) each word is split according to the current setting of IFS. After splitting, the words may no longer correspond to the positional parameters. If you quote $*, you end up with one word containing all your positional parameters, joined by the first character of IFS, regardless of how many parameters there are.
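The dependence on IFS can be checked with a non-default value. This is only a sketch, assuming bash; `count_args` and `demo_ifs` are hypothetical helper names, and IFS is changed locally inside the function so the rest of the shell is unaffected.

```shell
#!/bin/bash
# Hypothetical helper: prints how many words it received.
count_args() { printf '%s\n' "$#"; }

demo_ifs() {
    local IFS=:                 # split on colons instead of whitespace
    set -- 'a:b' 'c d'          # two positional parameters
    local joined="$*"           # "$*" joins with the first character of IFS
    printf 'unquoted $@: %s\n' "$(count_args $@)"
    printf 'unquoted $*: %s\n' "$(count_args $*)"
    printf 'quoted "$*": %s (%s)\n' "$(count_args "$*")" "$joined"
    printf 'quoted "$@": %s\n' "$(count_args "$@")"
}

demo_ifs
```

With IFS set to a colon, the space in 'c d' no longer splits anything, but the colon in 'a:b' does, so the unquoted forms still report the wrong number of words. Only "$@" reports two.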

In that for loop, "$@" is the only correct option. If you don't believe me, try $* or $@ with a file name containing a space (assuming default IFS value) followed by one of your program's valid options, such as "infile -w". Even if the -w option isn't passed, such a filename will trigger it because "infile -w" will be split into two words, "infile" and "-w". That would be a bug. I realize that's a contrived and unlikely filename, but the point is that the option handling is behaving erroneously.
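The failure described above can be reproduced in isolation. The following is a reduced sketch of the option loop, not the original script; it assumes bash and a default IFS, and the function names `scan_unquoted` and `scan_quoted` are invented for the example.

```shell
#!/bin/bash
# Hypothetical reduction of the option loop: report whether -w was "seen".

scan_unquoted() {
    local write=no opt
    for opt in $*; do           # buggy: each argument is re-split on IFS
        case "$opt" in
            -w ) write=yes ;;
        esac
    done
    printf '%s\n' "$write"
}

scan_quoted() {
    local write=no opt
    for opt in "$@"; do         # one word per positional parameter
        case "$opt" in
            -w ) write=yes ;;
        esac
    done
    printf '%s\n' "$write"
}

# No -w option is passed; the file name merely ends in " -w".
scan_unquoted 1 31 'infile -w'  # prints "yes": the option is triggered by the file name
scan_quoted   1 31 'infile -w'  # prints "no"
```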

If you don't see it, read the sh man page carefully, with particular emphasis on the special parameters $@ and $*, quoting, and word splitting.

Here's some example code:

Code:

$ cat o.sh 
#!/bin/bash

printf '==================================================\n'
printf '$@: INCORRECT: Word splitting after $@ expansion yields 3 words.\n'
for opt in $@
do
        case "$opt" in
                *  )echo "$opt";;
        esac
done

printf '==================================================\n'
printf '$*: INCORRECT: Word splitting after $* expansion yields 3 words.\n'
for opt in $*
do
        case "$opt" in
                *  )echo "$opt";;
        esac
done

printf '==================================================\n'
printf '"$*": INCORRECT: Always expands to one word, regardless of $# positional parameter count.\n'
for opt in "$*"
do
        case "$opt" in
                *  )echo "$opt";;
        esac
done

printf '==================================================\n'
printf '"$@": CORRECT: Expands to one word per positional parameter without subsequent word splitting.\n'
for opt in "$@"
do
        case "$opt" in
                *  )echo "$opt";;
        esac
done

# Let's call the script with two positional parameters.
# Only "$@" will expand to the two correct words, while the others result in 1 or 3.

$ ./o.sh -v 'input -w'
==================================================
$@: INCORRECT: Word splitting after $@ expansion yields 3 words.
-v
input
-w
==================================================
$*: INCORRECT: Word splitting after $* expansion yields 3 words.
-v
input
-w
==================================================
"$*": INCORRECT: Always expands to one word, regardless of $# positional parameter count.
-v input -w
==================================================
"$@": CORRECT: Expands to one word per positional parameter without subsequent word splitting.
-v
input -w

I hope this helped.

Regards,
Alister

Last edited by alister; 02-27-2010 at 10:27 AM..

alister
