Sponsored Content
Top Forums Shell Programming and Scripting Regex issue with \s in character class. Post 303042635 by jim mcnamara on Wednesday 1st of January 2020 11:07:58 PM
Old 01-02-2020
It is a shorthand character class that mostly expands to [ \f\t\n\v] -- it is shorthand for a character class, like \d and others

Explained here:.

Regexp Tutorial - Shorthand Character Classes has this for \s:

Quote:
\s stands for “whitespace character”¯. Again, which characters this actually includes, depends on the regex flavor. In all flavors discussed in this tutorial, it includes [ \t\r\n\f]. That is: \s matches a space, a tab, a line break, or a form feed. Most flavors also include the vertical tab, with Perl (prior to version 5.18) and PCRE (prior to version 8.34) being notable exceptions. In flavors that support Unicode, \s normally includes all characters from the Unicode “separator”¯ category. Java and PCRE are exceptions once again. But JavaScript does match all Unicode whitespace with \s.
This User Gave Thanks to jim mcnamara For This Post:
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk and POSIX character class

can anyone tell me why this doesn't work? I've been trying to play with character classes and I seem to be missing something here..! echo "./comparecdna.summary" | awk '/^compare+]summary$/' # returns nothing echo "./compare_cdna.summary" | awk '/^compare_+]summary$/' # returns nothing echo... (5 Replies)
Discussion started by: anthalamus
5 Replies

2. Shell Programming and Scripting

regex to find font class

So, I need to find the instances of a certain font and remove it....so far in my testing I am using the find command with regex to find a font I want to pull out. However, I seem to be slightly stuck, and I am sure the beard stroking Unix geniuses here can help me. My example code: find... (7 Replies)
Discussion started by: tlarkin
7 Replies

3. Shell Programming and Scripting

perl regex issue

Hi, I find it really strange while writing a simple regex to match and print the matched string, dibyajyo@fwtest:~ #perl -e '$x = "root@rashmi>"; print "matched string:$1\n" if ($x =~ /(root@rashmi)/);' matched string:root dibyajyo@fwtest:~ #perl -e '$x = "root@rashmi>"; print... (1 Reply)
Discussion started by: rrd1986
1 Replies

4. Shell Programming and Scripting

Regex escape special character in AWK if statement

I am having issues escaping special characters in my AWK script as follows: for id in `cat file` do grep $id in file2 | awk '\ BEGIN {var=""} \ { if ( /stringwith+'|'+'50'chars/ ) { echo "do this" } else if ( /anotherString/ ) { echo "do that" } else { ... (4 Replies)
Discussion started by: purebc
4 Replies

5. Shell Programming and Scripting

Regex:search/replace but not for escaped character

Hi Input: - -- --- ---- aa-bb-cc aa--bb--cc aa---bb---cc aa----bb----cc Output: . - -. -- aa.bb.cc (7 Replies)
Discussion started by: chitech
7 Replies

6. UNIX for Advanced & Expert Users

Get pointer for existing device class (struct class) in Linux kernel module

Hi all! I am trying to register a device in an existing device class, but I am having trouble getting the pointer to an existing class. I can create a class in a module, get the pointer to it and then use it to register the device with: *cl = class_create(THIS_MODULE, className);... (0 Replies)
Discussion started by: hdaniel@ualg.pt
0 Replies

7. Shell Programming and Scripting

Regex space character

Hi, I have following regex condition, however it does not work with different logs having same visible string.I believe it is because of some difference with space character, is it possible to make it work everywhere. Can someone suggest a better string? /BIND dn=" uid=/ Thanks. (8 Replies)
Discussion started by: susankoperna1
8 Replies

8. Programming

Size of Derived class, upon virtual base class inheritance

I have the two class definition as follows. class A { public: int a; }; class B : virtual public A{ }; The size of class A is shown as 4, and size of class B is shown as 16. Why is this effect ?. (2 Replies)
Discussion started by: techmonk
2 Replies

9. Shell Programming and Scripting

Match string against character class in bash

Hello, I want to check whether string has only numeric characters. The following code doesn't work for me #!/usr/local/bin/bash if ]]; then echo "true" else echo "False" fi # ./yyy '346' False # ./yyy 'aaa' False I'm searching for solution using character classes, not regex.... (5 Replies)
Discussion started by: urello
5 Replies

10. Programming

C++ : Base class member function not accessible from derived class

Hello All, I am a learner in C++. I was testing my inheritance knowledge with following piece of code. #include <iostream> using namespace std; class base { public : void display() { cout << "In base display()" << endl; } void display(int k) {... (2 Replies)
Discussion started by: anand.shah
2 Replies
wctype(3)						     Library Functions Manual							 wctype(3)

NAME
wctype - Gets a handle to classify wide characters LIBRARY
Standard C Library (libc) SYNOPSIS
#include <wctype.h> wctype_t wctype( const char *property); The following prototype for wctype() does not conform to current industry standards and is supported only for backward compatibility: #include <wchar.h> wctype_t wctype( char *property); STANDARDS
Interfaces documented on this reference page conform to industry standards as follows: wctype(): XSH5.0 Refer to the standards(5) reference page for more information about industry standards and associated tags. PARAMETERS
Points to a string that identifies a character class, or property. DESCRIPTION
The wctype() function obtains a handle for valid property names for wide characters, as defined in the current locale. The handle can then be used as the wc_prop parameter in a subsequent call to the iswctype() function to check whether a wide character has this property. (See the iswctype(3) reference page.) Values returned by the wctype() function are valid until the setlocale() function modifies the value of the LC_CTYPE environment variable. The property parameter can have a value equal to one of the basic character classes or to any other character class that is defined for a locale. The characters included in each class are specified in the locale definition file (see the localedef(1) and locale(4) reference pages for more information). The basic character classes are as follows: A combination of the classes alpha and digit. Alphabetic (letter) characters. This class automatically includes characters in the upper and lower classes. Blank char- acters, such as the space or tab character. Control characters. This class does not include characters in the alpha or print class. Numeric digit characters. This class includes the following characters: 0 1 2 3 4 5 6 7 8 9 Graphic characters for printing. This class does not include characters in the space or cntrl class but includes all characters in the alpha, digit, and punct classes. Lowercase characters. This class does not include characters in the cntrl, digit, punct, or space class. At a minimum, the lower class includes the 26 lowercase US-ASCII characters. These are: a b c d e f g h i j k l m n o p q r s t u v w x y z Print characters. This class includes characters in the graph and space classes but does not include characters in the cntrl class. Punctuation characters. This class does not include characters in the space, alpha, digit, or cntrl class. Space characters. At a minimum, this class includes the space, form-feed, newline, carriage-return, tab, and vertical-tab characters. Uppercase characters. At a minimum, this class includes the 26 uppercase US ASCII characters. These are: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Hexadecimal characters. This class includes the following characters: 0 1 2 3 4 5 6 7 8 9 A B C D E F a b c d e f For the list of supplemental properties that can be defined in Unicode locales, see the locale(4) reference page. On Tru64 UNIX systems, Unicode character properties are defined only in locales whose names end in .UTF-8. RETURN VALUES
On successful completion, the wctype() function returns a value of type wctype_t, which is a handle for valid property names in the current locale. If the property parameter specifies a character class that is not valid for the current locale, the function returns the value 0 cast to wctype_t. RELATED INFORMATION
Commands: localedef(1) Files: locale(4) Functions: ctype(3), iswctype(3), setlocale(3), towupper(3), wctrans(3) delim off wctype(3)
All times are GMT -4. The time now is 03:57 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy