04-11-2011
Can you post some lines of the input file, the command you executed, and the output you got?
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi,
Let me explain the problem clearly:
Let the entries in my file be:
lion,tiger,bear
apple,mango,orange,apple,grape
unix,windows,solaris,windows,linux
red,blue,green,yellow
orange,maroon,pink,violet,orange,pink
Can we detect the lines in which one of the words (separated by field... (8 Replies)
Discussion started by: srinivasan_85
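One possible approach to the question above, sketched with awk. It assumes the goal is to print every line in which some comma-separated word appears more than once; the function name dup_words is made up for illustration.

```shell
# Print each input line in which a comma-separated word occurs more than once.
# dup_words is a hypothetical helper name, not taken from the thread.
dup_words() {
  awk -F, '{
    split("", seen)                      # portable way to empty the array for each line
    for (i = 1; i <= NF; i++)
      if (seen[$i]++) { print; break }   # a second sighting of a word: report the line once
  }' "$@"
}
```

Run as `dup_words file`, or pipe the data into `dup_words`. Lines without a repeated word produce no output.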
2. Shell Programming and Scripting
Hi,
How to identify duplicate columns in a row?
Input data: may have 30 columns
9211480750 LK 120070417 920091030
9211480893 AZ 120070607
9205323621 O7 120090914 120090914 1420090914 2020090914 2020090914
9211479568 AZ 120070327 320090730
9211479571 MM 120070326
9211480892 MM 120070324... (3 Replies)
Discussion started by: suresh3566
3. UNIX for Dummies Questions & Answers
Hi all,
I have a tab-delimited file and want to remove identical lines, i.e. all of lines 1, 2 and 4, because their columns are the same as the columns in other lines. Any input is appreciated.
abc gi4597 9997 cgcgtgcg $%^&*()()*
abc gi4597 9997 cgcgtgcg $%^&*()()*
ttt ... (1 Reply)
Discussion started by: dr_sabz
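A minimal sketch for this kind of request, assuming that "remove identical lines" means dropping every copy of a line that occurs more than once and keeping only lines that occur exactly once. The name unique_only is hypothetical. The file is read twice, so it has to be a regular file rather than a pipe.

```shell
# Keep only the lines that occur exactly once in the file given as $1.
# unique_only is a hypothetical helper name, not taken from the thread.
unique_only() {
  # First pass (NR == FNR): count each whole line.
  # Second pass: print the lines whose count is 1.
  awk 'NR == FNR { count[$0]++; next } count[$0] == 1' "$1" "$1"
}
```

Since whole lines are compared, the tab delimiter needs no special handling.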
4. Shell Programming and Scripting
Dear All,
I need to find the difference between each pair of adjacent columns. The file has 'i' columns, so I need $1 difference $2; $2 difference $3; .... and $(i-1) difference $i. I have used the following code:
awk '{ for (i=1; i<NF;... (7 Replies)
Discussion started by: Fredrick
5. Shell Programming and Scripting
hi friends,
my input
chr1 exon 35204 35266 gene_id "GOLGB1"; transcript_id "GOLGB1";
chr1 exon 42357 42473 gene_id "GOLGB1"; transcript_id "GOLGB1";
chr1 exon 45261 45404 gene_id "GOLGB1"; transcript_id "GOLGB1";
chr1 exon 50701 50778 gene_id "GOLGB1"; transcript_id "GOLGB1";... (2 Replies)
Discussion started by: jacobs.smith
6. Shell Programming and Scripting
Hello experts,
I have a requirement where I have to implement two checks on a csv file:
1. Check whether any value in the first column is duplicated; if so, the script should exit.
2. Check that the value in the second column is either "yes" or "no"; if it is anything else... (4 Replies)
Discussion started by: avikaljain
7. Shell Programming and Scripting
Hi,
I have a file with 1M records
ABC 200 400 2.4 5.6
ABC 410 299 12 1.5
XYZ 4 5 6 7
MNO 22 40 30 70
MNO 47 55 80 150
What I want is, for rows that share the same first column, to take the maximum value in each column.
Output:
ABC 410 400 12 5.6
XYZ 4 5 6 7
MNO 47 55 80 150
How can I... (6 Replies)
Discussion started by: Diya123
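A sketch of one way to get the column-wise maximum per key with awk. It assumes whitespace-separated input, numeric values in columns 2 and up, and the same number of columns on every row of a given key. The name max_per_key is made up for illustration. Keys are printed in the order they first appear.

```shell
# For rows sharing the same first column, keep the largest value seen in each column.
# max_per_key is a hypothetical helper name, not taken from the thread.
max_per_key() {
  awk '
    !($1 in nf) {                        # first row for this key: store it as-is
      order[++n] = $1; nf[$1] = NF
      for (i = 2; i <= NF; i++) max[$1, i] = $i
      next
    }
    {                                    # later row: keep the larger value per column
      for (i = 2; i <= nf[$1]; i++)
        if ($i + 0 > max[$1, i] + 0) max[$1, i] = $i
    }
    END {
      for (k = 1; k <= n; k++) {
        key = order[k]; line = key
        for (i = 2; i <= nf[key]; i++) line = line OFS max[key, i]
        print line
      }
    }
  ' "$@"
}
```

Only one row per distinct key is held in memory, so a file with 1M records should not be a problem unless nearly every key is unique.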
8. Shell Programming and Scripting
I have this structure:
col1 col2 col3 col4 col5
27 xxx 38 aaa ttt
2 xxx 38 aaa yyy
1 xxx 38 aaa yyy
I need to collapse lines that are duplicates when column 1 is ignored, adding up their col1 values, so that it will look like this:
col1 col2 col3 col4 col5
27 xxx 38 aaa ttt ... (3 Replies)
Discussion started by: coppuca
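A rough awk sketch for this kind of collapse. It assumes the first line is a header to pass through unchanged, that fields are whitespace-separated, and that two lines count as duplicates when columns 2 onward match. The name collapse_sum is hypothetical.

```shell
# Merge lines whose columns 2..NF are identical, summing column 1.
# collapse_sum is a hypothetical helper name, not taken from the thread.
collapse_sum() {
  awk '
    NR == 1 { print; next }                        # pass the header row through unchanged
    {
      key = $2
      for (i = 3; i <= NF; i++) key = key OFS $i   # key = every column except col1
      if (!(key in sum)) order[++n] = key          # remember first-seen order
      sum[key] += $1
    }
    END { for (k = 1; k <= n; k++) print sum[order[k]], order[k] }
  ' "$@"
}
```

Because the key is rebuilt with OFS, the output is single-space separated even if the input used tabs or runs of spaces.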
9. Shell Programming and Scripting
I have a 13 GB file. It has the following columns:
The 3rd column is basically correlation values. I want to delete the rows in which the first two columns hold the same value:
A B 0.04
B C 0.56
B B 1
A A 1
C D 1
C C 1
Desired output (preferably in .csv format):
A,B,0.04
B,C,0.56
C,D,1... (3 Replies)
Discussion started by: Sanchari
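For a file of that size a single streaming pass is probably the safest route. The sketch below assumes whitespace-separated input and that a row should be dropped when column 1 equals column 2. The name drop_self_pairs is made up for illustration.

```shell
# Drop rows whose first two columns are equal; write the rest comma-separated.
# drop_self_pairs is a hypothetical helper name, not taken from the thread.
drop_self_pairs() {
  # $1 = $1 forces awk to rebuild the record using OFS (a comma here).
  awk -v OFS=, '$1 != $2 { $1 = $1; print }' "$@"
}
```

Nothing is stored between lines, so memory use stays flat whatever the file size. Note that awk compares the two fields numerically when both look like numbers, so "1" and "1.0" would count as equal.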
10. Shell Programming and Scripting
Input
1,ABCD,no
2,system,yes
3,ABCD,yes
4,XYZ,no
5,XYZ,yes
6,pc,no
Code used to find duplicate with regard to 2nd column
awk 'NR == 1 {p=$2; next} p == $2 { print "Line " NR ": $2 is duplicated"} {p=$2}' FS="," ./input.csv
Now is there a wise way to de-duplicate the entire line (remove... (4 Replies)
Discussion started by: deadyetagain
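The command quoted in this post compares each line only with the line before it, so it reports a duplicate only when equal values sit on adjacent lines. A hedged alternative is to remember every 2nd-column value seen so far. The sketch assumes that de-duplicating means keeping the first line for each 2nd-column value; the names report_dups and keep_first are hypothetical.

```shell
# Report every line whose 2nd column has already appeared on an earlier line.
# report_dups and keep_first are hypothetical helper names, not taken from the thread.
report_dups() {
  awk -F, 'seen[$2]++ { print "Line " NR ": " $2 " is duplicated" }' "$@"
}

# Keep only the first line for each distinct 2nd-column value.
keep_first() {
  awk -F, '!seen[$2]++' "$@"
}
```

Both functions work on unsorted input, since the seen array is keyed on the value itself rather than on the previous line.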
VILISTEXTUM(1) General Commands Manual VILISTEXTUM(1)
NAME
vilistextum - html to ascii converter
SYNOPSIS
vilistextum [OPTIONS] [inputfile |-] [outputfile | -]
DESCRIPTION
vilistextum is a html to ascii converter specifically programmed to get the best out of incorrect html.
OPTIONS
inputfile,- resp. outputfile,-
replace inputfile with '-' for reading from standard input, likewise outputfile with '-' for writing to standard output.
-a, --no-alt
don't output anything for IMG tags even if they have an ALT attribute. Implies --no-image.
-c, --convert-tags
some tags will be converted to special characters.
-e, --errorlevel NUMBER
increase level of verbosity for error messages (0: No error messages).
-i, --defimage STRING
IMG tags without alt attribute are output as [STRING].
-l, --links
numbers the links in the document and creates footnotes of each link at the end of the file.
-k, --links-inline
print the links directly after the html tag.
-m, --dont-convert-characters
don't convert the entities from windows1252 (€-Ÿ and their proper entity names)
-n, --no-image
don't output [Image] for IMG tags that have no ALT attribute.
-p, --palm
output text more suitable for reading on a PDA.
-r, --remove-empty-alt
if there is an empty ALT attribute in a IMG tag (eg <IMG href="..." alt="">), don't output '[]'.
-s, --shrink-lines [NUMBER]
if there are more than NUMBER empty lines, output only NUMBER. Default: 1.
-t, --no-title
don't output title.
-w, --width NUMBER
maximum line width.
-h, --help
display this help and exit
-v, --version
output version information and exit
MULTIBYTE OPTIONS (Only available if compiled with multibyte support)
-u, --output-utf-8
instead of the character set of the html document, everything will be output as utf-8.
-x, --translit
use the //TRANSLIT feature of libiconv. Consult the iconv manual for details.
-y, --charset CHARSET
if the HTML document doesn't provide a character set in the meta tags, use CHARSET.
LIMITATIONS
The rendering of tables is not very good.
The handling of OL is incomplete. The program treats it as UL and more than 10 nested lists confuse it.
Text is never justified.
REPORTING BUGS
Please report bugs to <bhaak@gmx.net>.
AUTHOR
Vilistextum was written by Patric Mueller <bhaak@gmx.net> and may be freely distributed under the terms of the GNU General Public License
Version 2. There is ABSOLUTELY NO WARRANTY for this program.
SEE ALSO
iconv(3), lynx(1), links(1), w3m(1)
22 OCT 2006 VILISTEXTUM(1)