I have been having an encoding problem that I need to solve.
I have an 4-column tab-separated file: I need to remove all of the lines that contain the string 'vis-à-vis'
In this way, if my file contains 4 lines that contain 'vis-à-vis' they will all be filterd.
How can I do this with a one liner grep?
---------- Post updated at 01:18 PM ---------- Previous update was at 01:09 PM ----------
or I need something that removes all non-ascii characters..
i know it's out there, but I cannot remember how to check if a given ascii character string contains all digits or not ... any ideas?
ie...function("123") --> OK
function("NOT_A_NUMBER") --> returns error
thanks!! (2 Replies)
Hi All,
In the HP Unix that i'm using when i initialise a string as Stalled="'30¬G'"
Stalled=$Stalled" '30¬C'", it is taking the character ¬ as a comma. I need to grep for 30¬G 30¬C in a file and take its count. But since this character ¬ is not being understood, the count returns a zero.
The... (2 Replies)
Hello,
Is there any UNIX utility/command/executable that will convert mutlibyte characters to standard single byte ASCII characters in a given file?
and
Is there any UNIX utility/command/executable that will recognize multibyte characters in a given file name?
The typical multibyte... (8 Replies)
Hi gurus,
I have a file in unix with ascii values. I need to convert all the ascii values in the file to ascii characters. File contains nearly 20000 records with ascii values. (10 Replies)
I am having a file(1234.txt) downloaded from windows server (in Ascii format).However when i ftp this file to Unix server and try to work with it..i am unable to do anything.When i try to open the file using vi editor the file opens in the following format ...
@
@
@
@
@
@
@
@... (4 Replies)
Here is my problem.
I have a list of phone numbers that I want to use only the last 4 digits as PINs for something I am working on. I have all the numbers in a file but now I want to be removed all items EXCEPT the last 4 digits.
I have seen sed commands and some grep commands but I am... (10 Replies)
Hi,
I have many text files which contain some non-ASCII characters. I attach the screenshots of one of the files for people to have a look at. The issue is even after issuing the non-ASCII removal commands one of the characters does not go away. The character that goes away is the black one with a... (2 Replies)
I have the following type of 2 column file:
motility -
role -
supplementation -
age b
ancestry b
purity b
recommendation b
serenity b
unease b
carving f
expansion f
I would like to print only certain sections of the file depending on the value of the second column.
For instance,... (6 Replies)
Hi,
I'm writing a BBS telnet program. I'm having issues with it not displaying lower ASCII characters. For example, instead of displaying the "smiley face" character (Ctrl-B), it displays ^B. Is this because i'm using Ncurses? If so, is there any way around this?
Thanks. (3 Replies)
Discussion started by: ignatius
3 Replies
LEARN ABOUT OSX
strvisx
VIS(3) BSD Library Functions Manual VIS(3)NAME
vis -- visually encode characters
LIBRARY
Standard C Library (libc, -lc)
SYNOPSIS
#include <vis.h>
char *
vis(char *dst, int c, int flag, int nextc);
int
strvis(char *dst, const char *src, int flag);
int
strvisx(char *dst, const char *src, size_t len, int flag);
DESCRIPTION
The vis() function copies into dst a string which represents the character c. If c needs no encoding, it is copied in unaltered. The string
is null terminated, and a pointer to the end of the string is returned. The maximum length of any encoding is four characters (not including
the trailing NUL); thus, when encoding a set of characters into a buffer, the size of the buffer should be four times the number of charac-
ters encoded, plus one for the trailing NUL. The flag argument is used for altering the default range of characters considered for encoding
and for altering the visual representation. The additional character, nextc, is only used when selecting the VIS_CSTYLE encoding format
(explained below).
The strvis() and strvisx() functions copy into dst a visual representation of the string src. The strvis() function encodes characters from
src up to the first NUL. The strvisx() function encodes exactly len characters from src (this is useful for encoding a block of data that
may contain NUL's). Both forms NUL terminate dst. The size of dst must be four times the number of characters encoded from src (plus one
for the NUL). Both forms return the number of characters in dst (not including the trailing NUL).
The encoding is a unique, invertible representation composed entirely of graphic characters; it can be decoded back into the original form
using the unvis(3) or strunvis(3) functions.
There are two parameters that can be controlled: the range of characters that are encoded, and the type of representation used. By default,
all non-graphic characters except space, tab, and newline are encoded. (See isgraph(3).) The following flags alter this:
VIS_GLOB Also encode magic characters ('*', '?', '[' and '#') recognized by glob(3).
VIS_SP Also encode space.
VIS_TAB Also encode tab.
VIS_NL Also encode newline.
VIS_WHITE Synonym for VIS_SP | VIS_TAB | VIS_NL.
VIS_SAFE Only encode "unsafe" characters. Unsafe means control characters which may cause common terminals to perform unexpected func-
tions. Currently this form allows space, tab, newline, backspace, bell, and return - in addition to all graphic characters -
unencoded.
There are four forms of encoding. Most forms use the backslash character '' to introduce a special sequence; two backslashes are used to
represent a real backslash. These are the visual formats:
(default) Use an 'M' to represent meta characters (characters with the 8th bit set), and use caret '^' to represent control characters
see (iscntrl(3)). The following formats are used:
^C Represents the control character 'C'. Spans characters '