Sponsored Content
Top Forums UNIX for Beginners Questions & Answers awk regex woth text in each line Post 303046352 by RudiC on Friday 1st of May 2020 03:25:52 PM
Old 05-01-2020
Try
Code:
awk '
                                                {printf "%s\t", $0
                                                }
                                                
/^00-0000-[0-9]+-[Ff]?e?[Mm]ale-BBB_xxx/        {printf "%s\n", "Good"
                                                 next
                                                }
/^[0-9]+-[Tt][Ee][Ss][Tt]/                      {printf "%s\n", "Test"
                                                 next
                                                }
                                                {printf "%s\n", "Bad"
                                                }
' file
00-0000-1234-Male-BBB_xxx.txt        Good
11-1111_xxx.txt        Bad
00-0000-12345-female-BBB_xxx.txt        Good
000000-Test_xxx.txt        Test

EDIT: Or, even shorter,
Code:
awk '
        {V = "Bad"
         if (/^00-0000-[0-9]+-[Ff]?e?[Mm]ale-BBB_xxx/)  V = "Good"
         else if (/^[0-9]+-[Tt][Ee][Ss][Tt]/)           V = "Test"
         printf "%s\t%s\n", $0, V
        }
 ' file

EDIT: Or, even shorter, if you like "conditional assignments / expressions", and one-liners:

Code:
awk '{print $0 "\t" ((/^00-0000-[0-9]+-[Ff]?e?[Mm]ale-BBB_xxx/) ? "Good" : (/^[0-9]+-[Tt][Ee][Ss][Tt]/) ? "Test" : "Bad")}' file


Last edited by RudiC; 05-02-2020 at 02:30 AM..
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

mail command woth Postfix

Hi, maybe someone could help me. This is the thing. I usually use the command "mail" on my scripts to send me info about the linux box, something like: df -k | mail -s "disk usage" lalala@lalala.com. This works fine in all the linuxes that sendmail is installed, but there is one with Postfix... (1 Reply)
Discussion started by: piltrafa
1 Replies

2. Shell Programming and Scripting

sed: delete regex line and next line if blank

Hi, I want to write a sed script which from batiato: batiato/giubbe: pip_b.2.txt pip_b.3.txt pip_b.3mmm.txt bennato: bennato/peterpan: 123.txt consoli: pip_a.12.txt daniele: (2 Replies)
Discussion started by: one71
2 Replies

3. UNIX for Dummies Questions & Answers

awk - display from line number to regex

Hi. Is there a way in awk to show all lines between a line number and the next line containing a particular regex? We can do these, of course: awk '/regex1/,/regex2/' filename awk 'FNR > X && FNR < Y' filename But can they be combined? Thanks. (3 Replies)
Discussion started by: treesloth
3 Replies

4. Shell Programming and Scripting

Selecting a part of the text (regex pattern, awk, sed)

Hello, let's start by giving you guys a few examples of the text: "READ /TEXT123/ABC123" "READ /TEXT123/ABC123/" "READ TEXT123/ABC123" "READ TEXT123/ABC123/" "READ TEXT123/TEXT456/ABC123" "READ /TEXT123/TEXT456/ABC123" "READ /TEXT123/TEXT456/ABC123/" TEXT and ABC can be and I... (5 Replies)
Discussion started by: TehOne
5 Replies

5. Shell Programming and Scripting

awk regex- include text

Hi I am trying to filter some data using awk. I have a statement- awk 'BEGIN { FS = "\n" ; RS = "" } { if ( $6 = "City: " ) { print "City: Unknown" } else { print $6 } }'` The $6 values are City: London City: Madrid City: City: Tokyo This expression seems to catch all the lines... (4 Replies)
Discussion started by: jamie_123
4 Replies

6. UNIX for Dummies Questions & Answers

How to specify beginning-of-line/end-of-line characters inside a regex range

How can I specify special meaning characters like ^ or $ inside a regex range. e.g Suppose I want to search for a string that either starts with '|' character or begins with start-of-line character. I tried the following but it does not work: sed 's/\(\)/<do something here>/g' file1 ... (3 Replies)
Discussion started by: jawsnnn
3 Replies

7. UNIX for Dummies Questions & Answers

read regex from ID file, print regex and line below from source file

I have a file of protein sequences with headers (my source file). Based on a list of IDs (which are included in some of the headers), I'd like to print out only the specified sequences, with only the ID as header. In other words, I'd like to search source.txt for the terms in IDs.txt, and print... (3 Replies)
Discussion started by: pathunkathunk
3 Replies

8. Shell Programming and Scripting

Using regex's from file1, print line and line after matches in file2

Good day, I have a list of regular expressions in file1. For each match in file2, print the containing line and the line after. file1: file2: Output: I can match a regex and print the line and line after awk '{lines = $0} /Macrosiphum_rosae/ {print lines ; print lines } ' ... (1 Reply)
Discussion started by: pathunkathunk
1 Replies

9. Shell Programming and Scripting

(n)awk: print regex search output lines in one line

Hello. I have been looking high and low for the solution for this. I seems there should be a simple answer, but alas. I have a big xml file, and I need to extract certain information from specific items. The information I need can be found between a specific set of tags. let's call them... (2 Replies)
Discussion started by: Tobias-Reiper
2 Replies

10. Shell Programming and Scripting

awk to add text to each line of matching id

The awk below executes as expected if the id in $4 (like in f) is unique. However most of my data is like f1 where the same id can appear multiple times. I think that is the reason why the awk is not working as expected. I added a comment on the line that I can not change without causing the script... (6 Replies)
Discussion started by: cmccabe
6 Replies
LICENSECHECK(1)                                                                                                                    LICENSECHECK(1)

NAME
licensecheck - simple license checker for source files SYNOPSIS
licensecheck --help|--version licensecheck [--no-conf] [--verbose] [--copyright] [-l|--lines=N] [-i|--ignore=regex] [-c|--check=regex] [-m|--machine] [-r|--recursive] list of files and directories to check DESCRIPTION
licensecheck attempts to determine the license that applies to each file passed to it, by searching the start of the file for text belonging to various licenses. If any of the arguments passed are directories, licensecheck will add the files contained within to the list of files to process. OPTIONS
--verbose, --no-verbose Specify whether to output the text being processed from each file before the corresponding license information. Default is to be quiet. -l=N, --lines=N Specify the number of lines of each file's header which should be parsed for license information. (Default is 60). -i=regex, --ignore=regex When processing the list of files and directories, the regular expression specified by this option will be used to indicate those which should not be considered (e.g. backup files, VCS metadata). -r, --recursive Specify that the contents of directories should be added recursively. -c=regex, --check=regex Specify a pattern against which filenames will be matched in order to decide which files to check the license of. The default includes common source files. --copyright Also display copyright text found within the file -m, --machine Display the information in a machine readable way, i.e. in the form <file><tab><license>[<tab><copyright>] so that it can be easily sorted and/or filtered, e.g. with the awk and sort commands. Note that using the --verbose option will kill the readability. --no-conf, --noconf Do not read any configuration files. This can only be used as the first option given on the command-line. CONFIGURATION VARIABLES
The two configuration files /etc/devscripts.conf and ~/.devscripts are sourced by a shell in that order to set configuration variables. Command line options can be used to override configuration file settings. Environment variable settings are ignored for this purpose. The currently recognised variables are: LICENSECHECK_VERBOSE If this is set to yes, then it is the same as the --verbose command line parameter being used. The default is no. LICENSECHECK_PARSELINES If this is set to a positive number then the specified number of lines at the start of each file will be read whilst attempting to determine the license(s) in use. This is equivalent to the --lines command line option. LICENSE
This code is copyright by Adam D. Barratt <adam@adam-barratt.org.uk>, all rights reserved; based on a script of the same name from the KDE SDK, which is copyright by <dfaure@kde.org>. This program comes with ABSOLUTELY NO WARRANTY. You are free to redistribute this code under the terms of the GNU General Public License, version 2 or later. AUTHOR
Adam D. Barratt <adam@adam-barratt.org.uk> Debian Utilities 2013-12-23 LICENSECHECK(1)
All times are GMT -4. The time now is 04:58 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy