I have been having an encoding problem that I need to solve.
I have an 4-column tab-separated file: I need to remove all of the lines that contain the string 'vis-à-vis'
In this way, if my file contains 4 lines that contain 'vis-à-vis' they will all be filterd.
How can I do this with a one liner grep?
---------- Post updated at 01:18 PM ---------- Previous update was at 01:09 PM ----------
or I need something that removes all non-ascii characters..
i know it's out there, but I cannot remember how to check if a given ascii character string contains all digits or not ... any ideas?
ie...function("123") --> OK
function("NOT_A_NUMBER") --> returns error
thanks!! (2 Replies)
Hi All,
In the HP Unix that i'm using when i initialise a string as Stalled="'30¬G'"
Stalled=$Stalled" '30¬C'", it is taking the character ¬ as a comma. I need to grep for 30¬G 30¬C in a file and take its count. But since this character ¬ is not being understood, the count returns a zero.
The... (2 Replies)
Hello,
Is there any UNIX utility/command/executable that will convert mutlibyte characters to standard single byte ASCII characters in a given file?
and
Is there any UNIX utility/command/executable that will recognize multibyte characters in a given file name?
The typical multibyte... (8 Replies)
Hi gurus,
I have a file in unix with ascii values. I need to convert all the ascii values in the file to ascii characters. File contains nearly 20000 records with ascii values. (10 Replies)
I am having a file(1234.txt) downloaded from windows server (in Ascii format).However when i ftp this file to Unix server and try to work with it..i am unable to do anything.When i try to open the file using vi editor the file opens in the following format ...
@
@
@
@
@
@
@
@... (4 Replies)
Here is my problem.
I have a list of phone numbers that I want to use only the last 4 digits as PINs for something I am working on. I have all the numbers in a file but now I want to be removed all items EXCEPT the last 4 digits.
I have seen sed commands and some grep commands but I am... (10 Replies)
Hi,
I have many text files which contain some non-ASCII characters. I attach the screenshots of one of the files for people to have a look at. The issue is even after issuing the non-ASCII removal commands one of the characters does not go away. The character that goes away is the black one with a... (2 Replies)
I have the following type of 2 column file:
motility -
role -
supplementation -
age b
ancestry b
purity b
recommendation b
serenity b
unease b
carving f
expansion f
I would like to print only certain sections of the file depending on the value of the second column.
For instance,... (6 Replies)
Hi,
I'm writing a BBS telnet program. I'm having issues with it not displaying lower ASCII characters. For example, instead of displaying the "smiley face" character (Ctrl-B), it displays ^B. Is this because i'm using Ncurses? If so, is there any way around this?
Thanks. (3 Replies)
Discussion started by: ignatius
3 Replies
LEARN ABOUT HPUX
inv
vis(1) General Commands Manual vis(1)NAME
vis, inv - make unprintable and non-ASCII characters in a file visible or invisible
SYNOPSIS
file ...
file ...
DESCRIPTION
reads characters from each file in sequence and writes them to the standard output, converting those that are not printable or not ASCII
into a visible form. inv performs the inverse function, reading printable characters from each file, returning them to non-printable or
non-ASCII form, if appropriate, then writing them to standard output;
Non-printable ASCII characters are represented using C-like escape conventions:
backslash
backspace
escape
form-feed
new-line
carriage return
space
horizontal tab
vertical tab
the character whose
ASCII code is the 3-digit octal number n.
the character whose
ASCII code is the 2-digit hexadecimal number n.
Non-ASCII single- or multi-byte characters are examined one byte at a time. For each byte, if it can be displayed as an ASCII character,
it is treated as if it is an ASCII character; Otherwise, it is represented in the following conventions:
the 8-bit character whose
code value is the 3-digit octal number n.
the 8-bit character whose
code value is the 2-digit hexadecimal number n.
Space, horizontal-tab, and new-line characters can be treated as printable (and therefore passed unaltered to the output) or non-printable
depending on the options selected. Backslash, although printable, is expanded by vis, to a pair of backslashes so that when they are
passed back through inv, they convert back to a single backslash.
If no input file is given, or if the argument is encountered, and inv read from the standard input.
Options
and recognize the following options:
Treat new-line, space, and horizontal tab as non-printable characters.
expands them visibly as and rather than passing them directly to the output. discards these characters, expecting only the
printable expansions. New-line characters are inserted by every 16 bytes so that the output will be in a form that is
usable by most editors.
Make and silent about non-existent files, identical input and output, and write errors. Normally, no input file can be the same
as the output file unless it is a special file.
Treat horizontal-tab and space characters as non-printable
in the same manner that treats them.
Cause output to be unbuffered (byte-by-byte);
normally, output is buffered.
Cause output to be in hexadecimal form rather than the default octal form. Either form is accepted to as input.
EXTERNAL INFLUENCES
Environment Variables
determines the language in which messages are displayed.
International Code Set Support
Single- and multi-byte character code sets are supported.
WARNINGS
Redirecting output to an input file destroys the original data. Therefore, command forms such as
should be avoided unless the source file can be safely discarded.
AUTHOR
was developed by HP.
SEE ALSO cat(1), echo(1), od(1).
vis(1)