Find records with specific characters in 2 nd field Post: 303022352

Sponsored Content

Top Forums UNIX for Beginners Questions & Answers Find records with specific characters in 2 nd field Post 303022352 by ashwin3086 on Thursday 30th of August 2018 01:16:37 PM

08-30-2018

Registered User

Find records with specific characters in 2 nd field

Hi ,
I have a requirement to read a file ( 5 fields , ~ delimited) and find the records which contain anything other than Alphabets, Numbers , comma ,space and dot . ie a-z and A-Z and 0-9 and . and " " and , in 2nd field. Once I do that i would want the result to have field1|<flag>

flag can be Y or N .

N - If 2nd field doesnt have anything other above mentioned characters.
Else Y .

I am able to achieve this using below code by reading line by line . Please note second field is "address".

Code:

#!/bin/ksh
 rm -f ca_sc_flag.txt 
while read rec
do

cust_id=`echo $rec | cut -d'~' -f1`
addr=`echo $rec | cut -d'~' -f2`


addr_rem=`echo ${addr}|tr -d 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789,. '`

if [ -z "${addr_rem}" ]; then
  
    sc='N'
    echo "$cust_id|$sc" >> ca_sc_flag.txt   

else
         sc='Y'
     echo "$cust_id|$sc" >> ca_sc_flag.txt   

fi
done < ca.txt

The issue is it is very ineffective and takes almost 30 mins for 100K records. Can I improve it by using better logic. May be by avoiding reading line by line.

Moderator's Comments:

Please use CODE tags as required by forum rules!

Last edited by RudiC; 08-30-2018 at 06:40 PM.. Reason: Changed HTML to CODE tags.

ashwin3086

View Public Profile for ashwin3086

Find all posts by ashwin3086

10 More Discussions You Might Find Interesting

1. HP-UX

extract field of characters after a specific pattern - using UNIX shell script

Hello, Below is my input file's content ( in HP-UX platform ): ABCD120672-B21 1 ABCD142257-002 1 ABCD142257-003 1 ABCD142257-006 1 From the above, I just want to get the field of 13 characters that comes after 'ABCD' i.e '120672-B21'... . Could...

2. Shell Programming and Scripting

count characters in specific records

I have a text file which represents a http flow like this: HTTP/1.1 200 OK Date: Fri, 23 Jan 2009 17:16:24 GMT Server: Apache Last-Modified: Fri, 23 Jan 2009 17:08:03 GMT Accept-Ranges: bytes Cache-Control: max-age=540 Expires: Fri, 23 Jan 2009 17:21:31 GMT Vary: Accept-Encoding ...

3. Shell Programming and Scripting

[Solved] Find Specific records from file and add totals into variables

Hi Eveyone, I am working on one shell script to find the specific records from data file and add the totals into variables and print them. you can find the sample data file below for more clarification. Sample Data File: PXSTYL00__20090803USA CHCART00__20090803IND...

4. UNIX for Dummies Questions & Answers

Grep specific records from a file of records that are separated by an empty line

Hi everyone. I am a newbie to Linux stuff. I have this kind of problem which couldn't solve alone. I have a text file with records separated by empty lines like this: ID: 20 Name: X Age: 19 ID: 21 Name: Z ID: 22 Email: xxx@yahoo.com Name: Y Age: 19 I want to grep records that...

5. Shell Programming and Scripting

[Solved] Counting specific characters within each field

Hello, I have a file like following: ALB_13554 1 1 1 ALB_13554 1 2 1 ALB_18544 2 0 2 ALB_18544 1 0 1 This is a sample of my file, my real file has 441845 number of fields. What I want to do is to calculate the number of 1 and 2 in each column using AWK, so, the output file looks like...

6. Shell Programming and Scripting

Can't figure out how to find specific characters in specific columns

I am trying to find a specific set of characters in a long file. I only want to find the characters in column 265 for 4 bytes. Is there a search for that? I tried cut but couldn't get it to work. Ex. I want to find '9999' in column 265 for 4 bytes. If it is in there, I want it to print...

7. Shell Programming and Scripting

Find and replace with 0 for characters in a specific position

Need command for position based replace: I need a command to replace with 0 for characters in the positions 11 to 20 to all the lines starts with 6 in a file. For example the file ABC.txt has: abcdefghijklmnopqrstuvwxyz 6abcdefghijklmnopqrstuvwxyz abcdefghijklmnopqrstuvwxyz...

8. UNIX for Dummies Questions & Answers

How to replace and remove few junk characters from a specific field?

I would like to remove all characters starting with "%" and ending with ")" in the 4th field - please help!! 1412007819.864 /device/services/heartbeatxx 204 0.547%!i(int=0) 0.434 0.112 1412007819.866 /device/services/heartbeatxx 204 0.547%!i(int=1) 0.423 0.123...

9. Shell Programming and Scripting

Find the difference in specific field

Hi All, Seeking for your assistance to get the difference of field1 and field2 and output all the records if there's a difference. please see below scenario. file1.txt 250|UPTREND FASHION DESIGN,CORP.|2016-04-04 09:36:13.991257 74|MAINSTREAM BUSINESS INC.|2016-04-04 09:36:13.991257...

10. Shell Programming and Scripting

Calling specific characters from a find variable

I'm trying to do something like this: find . -name blablabla -exec ln -s ./"{:53:14} blablabla" \; The idea is find blablabla and create a symbolic link to it using part of it's path and then it's name, "blablabla." I just don't know if I can call characters out of a find variable. ...

LEARN ABOUT NETBSD

cut

CUT(1)							    BSD General Commands Manual 						    CUT(1)

NAME

     cut -- select portions of each line of a file

SYNOPSIS

     cut -b list [-n] [file ...]
     cut -c list [file ...]
     cut -f list [-d delim] [-s] [file ...]

DESCRIPTION

     The cut utility selects portions of each line (as specified by list) from each file and writes them to the standard output.  If the file
     argument is a single dash ('-') or no file arguments were specified, lines are read from the standard input.  The items specified by list can
     be in terms of column position or in terms of fields delimited by a special character.  Column numbering starts from 1.

     list is a comma or whitespace separated set of increasing numbers and/or number ranges.  Number ranges consist of a number, a dash (-), and a
     second number and select the fields or columns from the first number to the second, inclusive.  Numbers or number ranges may be preceded by a
     dash, which selects all fields or columns from 1 to the first number.  Numbers or number ranges may be followed by a dash, which selects all
     fields or columns from the last number to the end of the line.  Numbers and number ranges may be repeated, overlapping, and in any order.	It
     is not an error to select fields or columns not present in the input line.

     The options are as follows:

     -b list	 The list specifies byte positions.

     -c list	 The list specifies character positions.

     -d string	 Use the first character of string as the field delimiter character.  The default is the <TAB> character.

     -f list	 The list specifies fields, separated by the field delimiter character.  The selected fields are output, separated by the field
		 delimiter character.

     -n 	 Do not split multi-byte characters.

     -s 	 Suppresses lines with no field delimiter characters.  Unless specified, lines with no delimiters are passed through unmodified.

EXIT STATUS

     cut exits 0 on success, 1 if an error occurred.

SEE ALSO

     paste(1)

STANDARDS

     The cut utility conforms to IEEE Std 1003.2-1992 (``POSIX.2'').

BSD
								 December 21, 2008							       BSD

10 More Discussions You Might Find Interesting

1. HP-UX

extract field of characters after a specific pattern - using UNIX shell script

Discussion started by: jansat

2. Shell Programming and Scripting

count characters in specific records

Discussion started by: littleboyblu

3. Shell Programming and Scripting

[Solved] Find Specific records from file and add totals into variables

Discussion started by: veeru

4. UNIX for Dummies Questions & Answers

Grep specific records from a file of records that are separated by an empty line

Discussion started by: Atrisa

5. Shell Programming and Scripting

[Solved] Counting specific characters within each field

Discussion started by: Homa

6. Shell Programming and Scripting

Can't figure out how to find specific characters in specific columns

Discussion started by: Drenhead

7. Shell Programming and Scripting

Find and replace with 0 for characters in a specific position

Discussion started by: thangabalu