01-02-2020
It is a shorthand character class that
mostly expands to
[ \f\t\n\v] -- it is shorthand for a character class, like \d and others
Explained here:.
Regexp Tutorial - Shorthand Character Classes has this for
\s:
Quote:
\s stands for “whitespace character”¯. Again, which characters this actually includes, depends on the regex flavor. In all flavors discussed in this tutorial, it includes [ \t\r\n\f]. That is: \s matches a space, a tab, a line break, or a form feed. Most flavors also include the vertical tab, with Perl (prior to version 5.18) and PCRE (prior to version 8.34) being notable exceptions. In flavors that support Unicode, \s normally includes all characters from the Unicode “separator”¯ category. Java and PCRE are exceptions once again. But JavaScript does match all Unicode whitespace with \s.
This User Gave Thanks to jim mcnamara For This Post:
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
can anyone tell me why this doesn't work? I've been trying to play with character classes and I seem to be missing something here..!
echo "./comparecdna.summary" | awk '/^compare+]summary$/' # returns nothing
echo "./compare_cdna.summary" | awk '/^compare_+]summary$/' # returns nothing
echo... (5 Replies)
Discussion started by: anthalamus
5 Replies
2. Shell Programming and Scripting
So, I need to find the instances of a certain font and remove it....so far in my testing I am using the find command with regex to find a font I want to pull out. However, I seem to be slightly stuck, and I am sure the beard stroking Unix geniuses here can help me.
My example code:
find... (7 Replies)
Discussion started by: tlarkin
7 Replies
3. Shell Programming and Scripting
Hi,
I find it really strange while writing a simple regex to match and print the matched string,
dibyajyo@fwtest:~ #perl -e '$x = "root@rashmi>"; print "matched string:$1\n" if ($x =~ /(root@rashmi)/);'
matched string:root
dibyajyo@fwtest:~ #perl -e '$x = "root@rashmi>"; print... (1 Reply)
Discussion started by: rrd1986
1 Replies
4. Shell Programming and Scripting
I am having issues escaping special characters in my AWK script as follows:
for id in `cat file`
do
grep $id in file2 | awk '\
BEGIN {var=""} \
{ if ( /stringwith+'|'+'50'chars/ ) {
echo "do this"
} else if ( /anotherString/ ) {
echo "do that"
} else {
... (4 Replies)
Discussion started by: purebc
4 Replies
5. Shell Programming and Scripting
Hi
Input:
-
--
---
----
aa-bb-cc
aa--bb--cc
aa---bb---cc
aa----bb----cc
Output:
.
-
-.
--
aa.bb.cc (7 Replies)
Discussion started by: chitech
7 Replies
6. UNIX for Advanced & Expert Users
Hi all!
I am trying to register a device in an existing device class, but I am
having trouble getting the pointer to an existing class.
I can create a class in a module, get the pointer to it and then use
it to register the device with:
*cl = class_create(THIS_MODULE, className);... (0 Replies)
Discussion started by: hdaniel@ualg.pt
0 Replies
7. Shell Programming and Scripting
Hi,
I have following regex condition, however it does not work with different logs having same visible string.I believe it is because of some difference with space character, is it possible to make it work everywhere.
Can someone suggest a better string?
/BIND dn=" uid=/
Thanks. (8 Replies)
Discussion started by: susankoperna1
8 Replies
8. Programming
I have the two class definition as follows.
class A { public: int a; };
class B : virtual public A{ };
The size of class A is shown as 4, and size of class B is shown as 16. Why is this effect ?. (2 Replies)
Discussion started by: techmonk
2 Replies
9. Shell Programming and Scripting
Hello,
I want to check whether string has only numeric characters. The following code doesn't work for me
#!/usr/local/bin/bash
if ]]; then
echo "true"
else
echo "False"
fi
# ./yyy '346'
False
# ./yyy 'aaa'
False
I'm searching for solution using character classes, not regex.... (5 Replies)
Discussion started by: urello
5 Replies
10. Programming
Hello All,
I am a learner in C++. I was testing my inheritance knowledge with following piece of code.
#include <iostream>
using namespace std;
class base
{
public :
void display()
{
cout << "In base display()" << endl;
}
void display(int k)
{... (2 Replies)
Discussion started by: anand.shah
2 Replies
wctype(3) Library Functions Manual wctype(3)
NAME
wctype - Gets a handle to classify wide characters
LIBRARY
Standard C Library (libc)
SYNOPSIS
#include <wctype.h>
wctype_t wctype( const char *property);
The following prototype for wctype() does not conform to current industry standards and is supported only for backward compatibility:
#include <wchar.h>
wctype_t wctype(
char *property);
STANDARDS
Interfaces documented on this reference page conform to industry standards as follows:
wctype(): XSH5.0
Refer to the standards(5) reference page for more information about industry standards and associated tags.
PARAMETERS
Points to a string that identifies a character class, or property.
DESCRIPTION
The wctype() function obtains a handle for valid property names for wide characters, as defined in the current locale. The handle can then
be used as the wc_prop parameter in a subsequent call to the iswctype() function to check whether a wide character has this property. (See
the iswctype(3) reference page.) Values returned by the wctype() function are valid until the setlocale() function modifies the value of
the LC_CTYPE environment variable. The property parameter can have a value equal to one of the basic character classes or to any other
character class that is defined for a locale. The characters included in each class are specified in the locale definition file (see the
localedef(1) and locale(4) reference pages for more information). The basic character classes are as follows: A combination of the classes
alpha and digit. Alphabetic (letter) characters. This class automatically includes characters in the upper and lower classes. Blank char-
acters, such as the space or tab character. Control characters. This class does not include characters in the alpha or print class.
Numeric digit characters. This class includes the following characters:
0 1 2 3 4 5 6 7 8 9 Graphic characters for printing. This class does not include characters in the space or cntrl class but includes
all characters in the alpha, digit, and punct classes. Lowercase characters. This class does not include characters in the cntrl,
digit, punct, or space class. At a minimum, the lower class includes the 26 lowercase US-ASCII characters. These are:
a b c d e f g h i j k l m n o p q r s t u v w x y z Print characters. This class includes characters in the graph and space classes
but does not include characters in the cntrl class. Punctuation characters. This class does not include characters in the space,
alpha, digit, or cntrl class. Space characters. At a minimum, this class includes the space, form-feed, newline, carriage-return,
tab, and vertical-tab characters. Uppercase characters. At a minimum, this class includes the 26 uppercase US ASCII characters.
These are:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Hexadecimal characters. This class includes the following characters:
0 1 2 3 4 5 6 7 8 9 A B C D E F a b c d e f
For the list of supplemental properties that can be defined in Unicode locales, see the locale(4) reference page. On Tru64 UNIX systems,
Unicode character properties are defined only in locales whose names end in .UTF-8.
RETURN VALUES
On successful completion, the wctype() function returns a value of type wctype_t, which is a handle for valid property names in the current
locale. If the property parameter specifies a character class that is not valid for the current locale, the function returns the value 0
cast to wctype_t.
RELATED INFORMATION
Commands: localedef(1)
Files: locale(4)
Functions: ctype(3), iswctype(3), setlocale(3), towupper(3), wctrans(3) delim off
wctype(3)