I have an issue with the below script
It is taking too long to get a string from in1.text, search for the string in in2.txt and create a new file out1.txt.
Is there any alternative way we can achieve the same faster?
Thanks in advance
Moderator's Comments:
Please use CODE tags rather than ICODE tags when displaying code with more than 1 line or with a single line that might be split due to line length.
Last edited by Don Cragun; 08-15-2014 at 09:22 PM..
Reason: Changed ICODE tags to CODE tags.
As I know:
FNR: The ordinal number of the current record in the current file.
NR: The ordinal number of the current record from the start of input.
I don't understand really differency between NR and FNR. Who can explain it for me? And give me an example.
Thanks (1 Reply)
Hi Guys,
I have two files:
f1:
A B C D E F G H
f2:
A X Y Z
f1 has 48000 lines, and f2 has 68. I have been matching f1 $3 to f2 $1, and getting f3:
A A B C D E F G
I would like f3 too look like this:
A X Y Z A B C D E F G (2 Replies)
cat file1
1 a b c
2 d e f
3 a f r
cat file2
a c e
output should be
1
3
means:
if field 1 of file2 matches filed 2 of file1 then print field 1 of file1
I know that it can be done using awk NR=FNR.
But not able to acheive it.
Thanks in advance. (9 Replies)
awk -F'' 'FNR==NR {a=$2; next} {$1=a} 1' $useralias ${entries} >> ${entries}_2
Hi,
Is there anyway to alter this command so that if it does not find a match it will just leave the line alone instead of replacing what it doesn't find with a blank space? (4 Replies)
This has been asked and answered hundreds of times, but I can't understand the syntax of awk's NR==FNR trick for merging files and printing the correct columns.
Here's my File 1
1 rs8179466 224176 A ADD 1037 1.066 0.1421 0.8065 1.408 0.4468 ... (3 Replies)
Hi
i have file1:
conn=232257 client=16218.19488.218.86:51237 protocol=LDAP
file2:
conn=232257 dn="uid=apple,ou=xxxx,ou=usfgfhfers,dc=example,dc=com"
conn=232370 dn="uid=ball,ou=yyyyyy,ou=usfhfhfhers,dc=example,dc=com"
In the output file it should match first column from above both files... (2 Replies)
Example:
$ cat file1
2
3$ cat file2
1
2
3
4
5
6The following awk script works like a charm, NR==FNR is true for file1, the remainder runs for file2:
awk '
NR==FNR {A; next}
($1 in A)
' file1 file2
2
3Now have an empty file1:
>file1and run the awk script again.
The result is empty... (8 Replies)
Hi everybody!
need some awk-support. i want a line-selective printout of a file.
wat i normally will do with ...
awk ' FNR==8' sample.txt
But now i need the data from line 8, 10 and the following data from line13 to 250 wich is not end of the file. I tried allready to combine it but without... (2 Replies)
To merge mutiple *.tab files as:
file1.tab
rs1 A A
rs2 A A
rs3 C C
rs4 C Cfile2.ind
rs1 T T
rs2 T T
rs3 G G
rs4 G Gand file3.tab
rs1 B B
rs2 B B
rs3 L L
rs4 L LOutput:
file1.tab file2.tab file3.tab
AA TT BB
AA TT BB
CC GG LL
CC GG ... (4 Replies)
Dear All,
I have below two files with me:
file 1:
A|B
E|F
C|D
file 2:
A|X|Y
R|T|I
C|V|N
I want to compare 1st column of each file and than print both columns of file 1 and column 2 and 3 of file 2
Sample required output in regards to above files is below:
A|B|X|Y
C|D|V|N (5 Replies)
Discussion started by: Nebula
5 Replies
LEARN ABOUT SUSE
html2pdbtxt
html2pdbtxt(1) General Commands Manual html2pdbtxt(1)NAME
html2pdbtxt - HTML to Doc Text converter for Palm Pilots
SYNOPSIS
html2pdbtxt [ -bchars ] [ -ttitle ] [ -uURL ] file.html [ file.txt ]
html2pdbtxt -v
DESCRIPTION
html2pdbtxt converts HTML to text suitable for conversion to a Doc(4) file via txt2pdbdoc(1). If no text filename is given, the generated
text is sent to standard output.
HTML Tags
The following HTML tags (and corresponding ending tags) are recognized: ADDRESS, A NAME, BLOCKQUOTE, BR, CENTER, DIV, DL, DT, H1, H2, H3,
H4, H5, H6, OL, OPTION, PRE, P, SELECT, SCRIPT, STYLE, TABLE, TITLE, UL. In all cases, the most ``reasonable'' thing is done given the
constraints of the Doc(4) format which is essentially plain text. ALT attributes (typically found in IMG tags) have their text extracted
and placed between brackets [like this]. All other HTML tags are stripped.
Character Entities
Both HTML character and numeric (decimal and hexadecimal) entity references are converted to their byte value according to the ISO 8859-1
(Latin 1) character set so they appear properly on the Pilot. For example, ``résumé'' becomes ``resume'' with accented letter
'e's.
Document Title
Unless specified with the -t option, the HTML file is scanned for <TITLE> ... </TITLE> tags and, if found, the title is extracted and put
on line 1 of the generated file.
Bookmarks
Bookmarks are placed into the generated file wherever <A NAME="..."> tags are found in the HTML file.
OPTIONS -bchars Specify the character sequence that is to serve as the bookmark indicator. The default is (*). (See the CAVEATS.)
-ttitle Specify the title of the document that is to appear on line 1 of the generated file overriding any title found inside the HTML
file between <TITLE> ... </TITLE> tags.
-uurl Specify the URL the HTML file supposedly came from and put it on the line after the title, if any, in the generated file.
-v Print the version number to standard output and exit.
EXAMPLE
To convert an HTML file to Doc:
html2pdbtxt -u http://www.wonderland.org/ alice.html alice.txt
txt2pdbdoc "`head -1 alice.txt`" alice.txt alice.pdb
CAVEATS
1. Some Doc readers have a ``feature'' whereby, during the scan for bookmarks phase, they recognize the bookmark sequence of characters
anywhere in the text and not just at the beginning of a line.
2. Some Doc readers do not allow the bookmark sequence to contain the > character since they interpret that as the sequence delimiter,
e.g., <->> will be interpreted as the sequence being merely -.
3. Ordered lists (via the OL tag) are treated as unordered lists (like the UL tag) because it would greatly complicate the code since it
would have to be parsed rather than simple substitutions being performed.
SEE ALSO pdbtxt2html(1), txt2pdbdoc(1), doc(4), pdb(4)
International Standards Organization. ``ISO 8859-1: Information Processing -- 8-bit single-byte coded graphic character sets -- Part 1:
Latin alphabet No. 1.'' 1987.
World Wide Web Consortium. ``Character entity references in HTML 4.0.'' HTML 4.0 Specification, http://www.w3.org/
AUTHOR
Paul J. Lucas <pauljlucas@mac.com>
html2pdbtxt January 21, 2005 html2pdbtxt(1)