Thanks for the reply Robin; you're on the same page as me.
Not being one to sit back and expect it to be written for me I've had a go with that pointer you gave me but I'm getting some strange results from it.
I thought I'd start simple and built up. I was half expecting (excuse the pun) that the below would output half the length of the string and store it in $half then using that combined with an awk sub string I would be able to just out put the first half of the string. The idea being I could store that in a variable do the same for the second half by getting the awk substr to start at $half for $half and I could compare the two. If they match then output if they don't then bin them.
That doesn't give me the output I was expecting. $TEMP_1 is a file name which contains the values one per line as per my previous post.
Hi,
I have 3 lines in a text file that is similar to this (as a result of a diff between 2 files):
35,36d34
< DATA.EVENT.EVENT_ID.s = "3661208"
< DATA.EVENT.EVENT_ID.s = "3661208"
I am trying to get it down to just this:
DATA.EVENT.EVENT_ID.s = "3661208"
How can I do this?... (11 Replies)
Hi,
I have two strings like this in an array:
For example:
@a=("Brain aging is associated with a progressive imbalance between intracellular concentration of Reactive Oxygen Species","Brain aging is associated with a progressive imbalance between intracellular concentration of Reactive... (9 Replies)
My input contains a single word lines.
From each line
data.txt
prjtestBlaBlatestBlaBla
prjthisBlaBlathisBlaBla
prjthatBlaBladpthatBlaBla
prjgoodBlaBladpgoodBlaBla
prjgood1BlaBla123dpgood1BlaBla123
Desired output -->
data_out.txt
prjtestBlaBla
prjthisBlaBla... (8 Replies)
Hi,
do you have awk or sed sommand taht will delete duplicate lines like.
sample:
server1-log1-14
server1-log2-14
superserver-time-2
superserver-log-2
output:
server-log1-14
superserver-time-2
thansk (2 Replies)
Hi Perl users,
I have another problem with text processing in Perl. I have a file below:
Linux Unix Linux Windows SUN
MACOS SUN SUN HP-AUX
I want the result below:
Unix Windows SUN
MACOS HP-AUX
so the duplicate string will be removed and also the keyword of the string on... (2 Replies)
I have a script that builds a database ~30 million lines, ~3.7 GB .cvs file. After multiple optimzations It takes about 62 min to bring in and parse all the files and used to take 10 min to remove duplicates until I was requested to add another column. I am using the highly optimized awk code:
awk... (34 Replies)
Hi,
I have a file which is an extract of jil codes of all autosys jobs in our server.
Sample jil code:
**************************
permission:gx,wx
date_conditions:yes
days_of_week:all
start_times:"05:00"
condition: notrunning(appDev#box#ProductLoad)... (1 Reply)
Hi all
I have a grep written to pull out values; below (in the code snip-it) is an example of the output.
What I'm struggling to do, and looking for assistance on, is identifying the lines that have duplicate strings.
For example 74859915K74859915K in the below is 74859915K repeated twice but... (3 Replies)
Hi All,
I have a requirement where I have to get the duplicate string count and uniq error message. Below is my file:
Rejected - Error on table TableA, column ColA.
Error String 1.
Rejected - Error on table TableA, column ColB.
Error String 2.
Rejected - Error on table TableA, column... (6 Replies)
Discussion started by: Deekhari
6 Replies
LEARN ABOUT OPENSOLARIS
col
col(1) User Commands col(1)NAME
col - reverse line-feeds filter
SYNOPSIS
col [-bfpx]
DESCRIPTION
The col utility reads from the standard input and writes to the standard output. It performs the line overlays implied by reverse line-
feeds, and by forward and reverse half-line-feeds. Unless -x is used, all blank characters in the input will be converted to tab characters
wherever possible. col is particularly useful for filtering multi-column output made with the .rt command of nroff(1) and output resulting
from use of the tbl(1) preprocessor.
The ASCII control characters SO and SI are assumed by col to start and end text in an alternative character set. The character set to which
each input character belongs is remembered, and on output SI and SO characters are generated as appropriate to ensure that each character
is written in the correct character set.
On input, the only control characters accepted are space, backspace, tab, carriage-return and newline characters, SI, SO, VT, reverse line-
feed, forward half-line-feed and reverse half-line-feed. The VT character is an alternative form of full reverse line-feed, included for
compatibility with some earlier programs of this type. The only other characters to be copied to the output are those that are printable.
The ASCII codes for the control functions and line-motion sequences mentioned above are as given in the table below. ESC stands for the
ASCII escape character, with the octal code 033; ESC- means a sequence of two characters, ESC followed by the character x.
reverse line-feed ESC-7
reverse half-line-feed ESC-8
forward half-line-feed ESC-9
vertical-tab (VT) 013
start-of-text (SO) 016
end-of-text (SI) 017
OPTIONS -b Assume that the output device in use is not capable of backspacing. In this case, if two or more characters are to appear in the
same place, only the last one read will be output.
-f Although col accepts half-line motions in its input, it normally does not emit them on output. Instead, text that would appear
between lines is moved to the next lower full-line boundary. This treatment can be suppressed by the -f (fine) option; in this case,
the output from col may contain forward half-line-feeds (ESC-9), but will still never contain either kind of reverse line motion.
-p Normally, col will ignore any escape sequences unknown to it that are found in its input; the -p option may be used to cause col to
output these sequences as regular characters, subject to overprinting from reverse line motions. The use of this option is highly
discouraged unless the user is fully aware of the textual position of the escape sequences.
-x Prevent col from converting blank characters to tab characters on output wherever possible. Tab stops are considered to be at each
column position n such that n modulo 8 equals 1.
ENVIRONMENT VARIABLES
See environ(5) for descriptions of the following environment variables that affect the execution of col: LC_CTYPE, LC_MESSAGES, and
NLSPATH.
EXIT STATUS
The following error values are returned:
0 Successful completion.
>0 An error occurred.
ATTRIBUTES
See attributes(5) for descriptions of the following attributes:
+-----------------------------+-----------------------------+
| ATTRIBUTE TYPE | ATTRIBUTE VALUE |
+-----------------------------+-----------------------------+
|Availability |SUNWesu |
|CSI |enabled |
+-----------------------------+-----------------------------+
SEE ALSO nroff(1), tbl(1), ascii(5), attributes(5), environ(5)NOTES
The input format accepted by col matches the output produced by nroff with either the -T37 or -Tlp options. Use -T37 (and the -f option of
col) if the ultimate disposition of the output of col will be a device that can interpret half-line motions, and -Tlp otherwise.
col cannot back up more than 128 lines or handle more than 800 characters per line.
Local vertical motions that would result in backing up over the first line of the document are ignored. As a result, the first line must
not have any superscripts.
SunOS 5.11 1 Feb 1995 col(1)