Hi,
I have one huge record and know that each record in the file is 550 bytes long. How do I parse out individual records from the single huge record.
Thanks, (4 Replies)
Hi Folks,
I need to compare two very huge file ( i.e the files would contain a minimum of 70k records each) using awk or sed. The comparison needs to be done with respect to a 'key'. For example :
File1
**********
1234|TONY|Y75634|20/07/2008
1235|TINA|XCVB56|30/07/2009... (13 Replies)
Dear friends,
I receive the following files into a FTP location on a daily basis
-rw-r----- 1 guest ftp1 5021 Aug 19 09:03 CHECK_TEST_Extracts_20080818210000.zip
-rw-r----- 1 guest ftp1 2437 Aug 20 05:15 CHECK_TEST_Extracts_20080819210000.zip
-rw-r----- 1 guest ... (2 Replies)
Anyone can help for filter the uniq record for below example? Thank you very much
Input file
20090503011111|test|abc
20090503011112|tet1|abc|def
20090503011112|test1|bcd|def
20090503011131|abc|abc
20090503011131|bbc|bcd
20090503011152|bcd|abc
20090503011151|abc|abc... (8 Replies)
Hi folks,
Below is the content of a file 'tmp.dat', and I want to keep the uniq record (key by first column). However, the uniq record should be the last record.
302293022|2|744124889|744124889
302293022|3|744124889|744124889
302293022|4|744124889|744124889
302293022|5|744124889|744124889... (4 Replies)
I was given a data file that I need to split into multiple lines/records based on a key word. The problem is that it is 2.5GB or bigger and everything I try in perl or sed causes a Segmentation fault. Can someone give me some other ideas.
The data is of the form:... (5 Replies)
Hi,
I have the following command in place
nawk -F, '!a++' file > file.uniq
It has been working perfectly as per requirements, by removing duplicates by taking into consideration only first 3 fields. Recently it has started giving below error:
bash-3.2$ nawk -F, '!a++'... (17 Replies)
Hi,
I have a Huge 7 GB file which has around 1 million records, i want to split this file into 4 files to contain around 250k messages each.
Please help me as Split command cannot work here as it might miss tags..
Format of the file is as below
<!--###### ###### START-->... (6 Replies)
input.csv:
Field1,Field2,Field3,Field4,Field4
abc ,123 ,xyz ,000 ,pqr
mno ,123 ,dfr ,111 ,bbb
output:
Field2,Field4
123 ,000
123 ,111
how to fetch the values of Field4 where Field2='123'
I don't want to fetch the values based on column position. Instead want to... (10 Replies)
I was wondering if anyone could explain to me how to split a variable length EBCDIC file into seperate files based on the record key. I have the COBOL layout, and so I need to split the file into 13 different EBCDIC files so that I can run each one through a C++ converter I have, and get the... (11 Replies)
Discussion started by: hanshot1stx
11 Replies
LEARN ABOUT DEBIAN
fullquottel
FULLQUOTTEL(1) User Commands FULLQUOTTEL(1)NAME
fullquottel - tool for detecting full quotes in mails or postings
SYNOPSIS
fullquottel [OPTION...] < infile > outfile
DESCRIPTION
fullquottel - tool for detecting full quotes in mails or postings
-t, --scorethreshold=NUMBER
Threshold for final score that discriminates between Goodquottel and Fullquottel (default: 1.5)
--quotechars=CHARS
Chars used for quoting (default: >|#)
-d, --debug
Debug option (prints single test results; default: off)
--ownsigtest=1|0
Turn on/off ownSigTest (default: on)
--ownsigtestscore=NUMBER
Score for ownSigTest (default: 1.0)
-f, --ownsigtestfile=SIGFILE
Signature file to test against (default: ~/.signature)
--mstest=1|0
Turn on/off msTest (Microsoft attribution lines; default: on)
--mstestscore=NUMBER
Score for msTest (default: 1.0)
--mstestthreshold=NUMBER
Threshold for msTest (at or below which line a Microsoft attribution line ist found; default: 2)
--quotedtoalltest=1|0
Turn on/off quotedToAllTest (ratio quoted lines to all; default: on)
--quotedtoalltestscore=NUMBER
Score for quotedToAllTest (default: 1.0)
--quotedtoalltestfactor=NUMBER
Result = score + ratio * FACTOR (default: 0)
--quotedtoalltestthreshold=NUMBER
Ratio threshold for activating quotedToAllTest (default: 0.5)
--bottomquotedtoalltest=1|0
Turn on/off bottomQuotedToAllTest (ratio quoted lines at bottom to all; default: on)
--bottomquotedtoalltestscore=NUMBER
Score for bottomQuotedToAllTest (default: 1.0)
--bottomquotedtoalltestfactor=NUMBER
Result = score + ratio * FACTOR (default: 0)
--bottomquotedtoalltestthreshold=NUMBER
Ratio threshold for activating bottomQuotedToAllTest (default: 0.5)
--bottomquotedtoquotedtest=1|0
Turn on/off bottomQuotedToQuotedTest (ratio quoted lines at bottom to all quoted lines; default: on)
--bottomquotedtoquotedtestscore=NUMBER
Score for bottomQuotedToQuotedTest (default: 1.0)
--bottomquotedtoquotedtestfactor=NUMBER
Result = score + ratio * FACTOR (default: 0)
--bottomquotedtoquotedtestthreshold=NUMBER
Ratio threshold for activating bottomQuotedToQuotedTest (default: 0.5)
--singlebottomquotetest=1|0
Turn on/off singleBottomQuoteTest (only one quote block, and at the bottom; default: on)
--singlebottomquotetestscore=NUMBER
Score for singleBottomQuoteTest (default: 1.0)
HINT: All long options (without leading --) can be used in ~/.fullquottelrc
-?, --help
Give this help list
--usage
Give a short usage message
-V, --version
Print program version
Mandatory or optional arguments to long options are also mandatory or optional for any corresponding short options.
REPORTING BUGS
Report bugs to <toast@toastfreeware.priv.at>.
SEE ALSO
The full documentation for fullquottel is maintained in HTML created by doxygen and can be found in
/usr/share/doc/fullquottel/html/
fullquottel 0.1.2 April 2007 FULLQUOTTEL(1)