HTML::Quoted(3pm) User Contributed Perl Documentation HTML::Quoted(3pm)
NAME
HTML::Quoted - extract structure of quoted HTML mail message
SYNOPSIS
use HTML::Quoted;
my $html = '...';
my $struct = HTML::Quoted->extract( $html );
DESCRIPTION
Parses and extracts the quotation structure of an HTML message. Its purpose and returned structures are very similar to those of Text::Quoted.
SUPPORTED FORMATS
Various MUAs use quite different approaches for quoting in mails.
Some use the blockquote tag, which is quite easy to parse.
Some wrap text in p tags and add '>' at the beginning of paragraphs.
Things get messier when it's an HTML reply in a plain-text mail thread.
If you find a format that is not supported, file a bug report via rt.cpan.org with as short an example as possible. A test file is even
better. A test file with a patch is best. Non-obvious patches without tests suck.
METHODS
extract
my $struct = HTML::Quoted->extract( $html );
Takes a string with HTML and returns an array reference. Each element in the array is either an array reference or a hash reference. For example:
    [
        { 'raw' => 'Hi,' },
        { 'raw' => '<div><br><div>On date X wrote:<br>' },
        [
            { 'raw' => '<blockquote>' },
            { 'raw' => 'Hello,' },
            { 'raw' => '<div>How are you?</div>' },
            { 'raw' => '</blockquote>' }
        ],
        ...
    ]
Hashes represent a part of the html. The following keys are meaningful at the moment:
o raw - raw HTML
o quoter_raw, quoter - raw and decoded (entities are converted) quoter if block is prefixed with quoting characters
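The returned structure can be walked recursively. The following is a minimal sketch, not part of HTML::Quoted: walk_quoted is a hypothetical helper, the sample data is hand-built to match the example above rather than produced by extract, and it assumes (as the example suggests) that each nested array corresponds to one more level of quoting.

```perl
use strict;
use warnings;

# Hand-built sample shaped like the documented structure; in real use
# this would come from HTML::Quoted->extract($html).
my $struct = [
    { raw => 'Hi,' },
    { raw => '<div><br><div>On date X wrote:<br>' },
    [
        { raw => '<blockquote>' },
        { raw => 'Hello,' },
        { raw => '<div>How are you?</div>' },
        { raw => '</blockquote>' },
    ],
];

# Hypothetical helper (not provided by the module): returns a flat list
# of [depth, raw] pairs. Depth is an assumption: it counts how many
# arrays the hash is nested inside below the top-level one.
sub walk_quoted {
    my ($node, $depth) = @_;
    $depth = 0 unless defined $depth;
    my @out;
    for my $item (@$node) {
        if (ref($item) eq 'ARRAY') {
            # Nested array: descend one level deeper.
            push @out, walk_quoted($item, $depth + 1);
        }
        elsif (ref($item) eq 'HASH') {
            # Hash: keep its raw HTML, if present.
            push @out, [ $depth, $item->{raw} ] if defined $item->{raw};
        }
    }
    return @out;
}

# Print each raw fragment, prefixed with one '> ' per nesting level.
for my $pair (walk_quoted($struct)) {
    my ($depth, $raw) = @$pair;
    print '> ' x $depth, $raw, "\n";
}
```

Flattening to [depth, raw] pairs keeps the traversal separate from presentation, so the same helper could feed a plain-text renderer or a filter that drops quoted parts.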
AUTHOR
Ruslan.Zakirov <ruz@bestpractical.com>
LICENSE
Under the same terms as perl itself.
perl v5.10.1 2011-01-09 HTML::Quoted(3pm)