Sponsored Content
Top Forums Shell Programming and Scripting Handle special characters in awk -F Post 302942299 by cjcox on Monday 27th of April 2015 10:31:55 AM
Old 04-27-2015
For maximum portability assume FS to be something simple. However, since the world has gone Linux (gawk)... realize that FS can be a single character or if not, then it's a regex. So... you want:

Code:
echo '/a/b/c+/d' | awk -F'/c[+]/d$' '{print $1}'

Which returns:

Code:
/a/b

 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk/sed with special characters

i have this script that searches for a pattern. However it fails if the pattern includes some special characters. So far, it fails with the following strings: 1. -Cr 2. $Mj 3. H'412 would a sed or awk be more effective? i don't want the users to put the (\) during the search (they... (5 Replies)
Discussion started by: apalex
5 Replies

2. Shell Programming and Scripting

Handling special characters using awk

Hi all, How do I extract a value without special characters? I need to extract the value of %Used from below and if its greater than 80, need to send a notification. I am doing this right now..Its giving 17%..Is there a way to extract the value and assign it to a variable in one step? df |grep... (3 Replies)
Discussion started by: sam_78_nyc
3 Replies

3. Shell Programming and Scripting

awk search pattern with special characters passed from CL

I'm very new to awk and sed and I've been struggling with this for a while. I'm trying to search a file for a string with special characters and this string is a command line argument to a simple script. ./myscript "searchpattern" file #!/bin/sh awk "/$1/" $2 > dupelistfilter.txt sed... (6 Replies)
Discussion started by: cue
6 Replies

4. Shell Programming and Scripting

awk print $1 escape all special characters

I'm using awk '{print $1}' and it works most of the time to print the contents of a mysql query loop, but occationally I get a field with some special character in it, is there a way to tell awk to ignore all special characters between my FS? I have >186K records, so building a list of ALL special... (6 Replies)
Discussion started by: unclecameron
6 Replies

5. Shell Programming and Scripting

awk loop: display special characters

Hi everybody; I have a code and this fetches data from first.txt,modify it and outputs it to second.txt file. l awk 'NR>1 {print "l ./gcsw "$1" lt all lset Data="$2" Value "$3}' /home/gcsw/first.txt > /home/gcsw/second.txt this outputs as: l ./gcsw 123 lt all lset Data=456 Value 789 ... (1 Reply)
Discussion started by: gc_sw
1 Replies

6. Shell Programming and Scripting

Sed or awk : pattern selection based on special characters

Hello All, I am here again scratching my head on pattern selection with special characters. I have a large file having around 200 entries and i have to select a single line based on a pattern. I am able to do that: Code: cat mytest.txt | awk -F: '/myregex/ { print $2}' ... (6 Replies)
Discussion started by: usha rao
6 Replies

7. UNIX for Dummies Questions & Answers

awk for removing special characters and extra commas

Hi, I have a .csv file which as empty lines with comma and some special characters in 3rd column as below. Source data 1,2,3,4,%#,6 ,,,,,, 1,2,3,4,5,6 Target Data 1,2,3,4,5,6I need to remove blank lines and special charcters I am trying to get this using the below awk awk -F","... (2 Replies)
Discussion started by: shruthidwh
2 Replies

8. Shell Programming and Scripting

awk match shell variable that contains special characters?

How to match a shell variable that contains parenthesis (and other special characters like "!") file.txt contains: Charles Dickens Matthew Lewis (writer) name="Matthew Lewis (writer)"; awk -v na="$name" ' $0 ~ na' file.txt Ideally this would match $name in file.txt (in this... (3 Replies)
Discussion started by: Mid Ocean
3 Replies

9. Shell Programming and Scripting

awk conditions failing (special characters?)

This is really frustrating because I can't figure it out. I'm running a health check script. One of the items I'm checking is the amount of memory on a server. I use the free command, which outputs something like this (excerpt) Mem: 100 100 100 100 Swap: 100 100 100 100 In my debugging... (5 Replies)
Discussion started by: JustaDude
5 Replies

10. Shell Programming and Scripting

Awk: split column if special characters

Hi, I've data like these: Gene1,Gene2 snp1 Gene3 snp2 Gene4 snp3 I'd like to split line if comma and then print remaining information for the respective gene. My code: awk '{ if($1 ~ /,/){ n = split($0, t, ",") (7 Replies)
Discussion started by: genome
7 Replies
multibyte(3C)															     multibyte(3C)

NAME
mblen(), mbtowc(), mbstowcs(), wctomb(), wcstombs() - multibyte characters and strings conversions SYNOPSIS
DESCRIPTION
A multibyte character is composed of one or more bytes that represent a "whole" character in a character encoding. A wide character (type of is composed of a fixed number of bytes whose code value can represent any character in a character encoding. Determine the number of bytes in the multibyte character pointed to by s. Equivalent to: If s is a null pointer, mblen returns a nonzero or zero value, depending on whether the multibyte character encodings do or do not have state-dependent encodings, respectively. Since no character encodings currently supported by HP-UX are state-depen- dent, zero is always returned in this case. However, for maximum portability to other systems, application programs should not depend on this. If s is not a null pointer, mblen returns the number of bytes in the multibyte character if the next n or fewer bytes form a valid multibyte character, or return -1 if they do not form a valid multibyte character. If s points to the null character, mblen returns 0. Determine the number of bytes in the multibyte character pointed to by s, determine the code for the value of type corresponding to that multibyte character, then store the code in the object pointed to by pwc. The value of the code corresponding to the null character is zero. At most n characters are examined, starting at the character pointed to by s. If s is a null pointer, returns a non-zero or zero value, depending on whether the multibyte character encodings do or do not have state-dependent encodings, respectively. Since no character encodings currently supported by HP-UX are state-dependent, zero is always returned in this case. However, for maximum portability to other systems, application programs should not depend on this. If s is not a null pointer, returns the number of bytes in the converted multibyte character if the next n or fewer bytes form a valid multibyte character, or -1 if they do not form a valid multibyte character. If s points to the null character, returns 0. The value returned is never greater than n or the value of the macro. Determine the number of bytes needed to represent the multibyte character corresponding to the code whose value is wchar and store the multibyte character representation in the array object pointed to by s. At most characters are stored. If s is a null pointer, returns a nonzero or zero value, depending on whether the multibyte character encodings do or do not have state-dependent encodings, respectively. Since no character encodings currently supported by HP-UX are state-dependent, zero is always returned in this case. However, for maximum portability to other systems, application programs should not depend on this. If s is not a null pointer, returns the number of bytes in the multibyte character corresponding to the value of wchar, or -1 if the value of wchar does not correspond to a valid multibyte character. The value returned is never greater than the value of the macro. Convert a sequence of multibyte characters from the array pointed to by s into a sequence of corresponding codes and store these codes into the array pointed to by pwcs, stopping after either n codes or a code with value zero (a converted null character) is stored. Each multibyte character is converted as if by a call to No more than n elements are modified in the array pointed to by pwcs. If an invalid multibyte character is encountered, returns (size_t)-1. Otherwise, returns the number of array elements modi- fied, not including a terminating zero code, if any. The array is not null- or zero-terminated if the value returned is n. If pwcs is a null pointer, returns the number of elements required for the wide-character-code array. Convert a sequence of codes corresponding to multibyte characters from the array pointed to by pwcs into a sequence of multibyte characters and store them into the array pointed to by s, stopping if a multibyte character exceeds the limit of n total bytes or if a null character is stored. Each code is converted as if by a call to No more than n bytes are modified in the array pointed to by s. If a code is encountered that does not correspond to a valid multibyte character, returns (size_t)-1. Otherwise, returns the number of bytes modified, not including a terminating null character, if any. The array is not null- or zero-terminated if the value returned is n. If s is a null pointer, returns the number of bytes required for the character array. EXTERNAL INFLUENCES
Locale The category determines the behavior of the multibyte character and string functions. ERRORS
and may fail and is set if the following condition is encountered: [EILSEQ] An invalid multibyte sequence or wide character code was found. WARNINGS
With the exception of ASCII characters, the code values of wide characters (type of are specific to the effective locale specified by the environment variable. These values may not be compatible with values obtained by specifying other locales that are supported now, or which may be supported in the future. It is recommended that wide character constants and wide string literals (see the not be used, and that wide character code values not be stored in files or devices because future standards may dictate changes in the code value assignments of the wide characters. However, wide character constants and wide string literals corresponding to the characters of the ASCII code set can be safely used since their values are guaranteed to be the same as their ASCII code set values. AUTHOR
The multibyte functions in this entry were developed by OSF and HP. SEE ALSO
setlocale(3C), wctype(3C), thread_safety(5), glossary(9). STANDARDS CONFORMANCE
multibyte(3C)
All times are GMT -4. The time now is 03:44 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy