10-14-2008
Not really.
Running a plain sort again on a terabyte-scale problem won't scale properly, and it isn't needed anyway.
Problems like this, where the computational cost grows with the number of records to be processed, can be handled with the map-reduce approach. That probably means splitting the file into 'n' chunks, processing each chunk separately, and then combining the results from the processed chunks.
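The chunk-and-combine idea could be sketched roughly as below. This is only an illustration under assumptions: the original question is not quoted here, so the sketch assumes the task is counting how often each record (line) occurs, and the names `iter_chunks`, `map_chunk`, `reduce_counts`, and `count_records` are made up for the example. It runs in a single process; in a real map-reduce setup the map step would run on the chunks in parallel.

```python
from collections import Counter
from itertools import islice


def iter_chunks(lines, chunk_size):
    """Yield successive lists of at most chunk_size items from an iterable."""
    it = iter(lines)
    while True:
        chunk = list(islice(it, chunk_size))
        if not chunk:
            return
        yield chunk


def map_chunk(chunk):
    """Map step: count each record within one chunk (no sorting needed)."""
    return Counter(line.rstrip("\n") for line in chunk)


def reduce_counts(partials):
    """Reduce step: combine the per-chunk counts into one total."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def count_records(lines, chunk_size=100_000):
    """Split the input into chunks, count each chunk, then merge the counts."""
    return reduce_counts(map_chunk(c) for c in iter_chunks(lines, chunk_size))
```

With a real file this would be called as `with open(path) as f: counts = count_records(f)`, where `path` is whatever file holds the records. Only one chunk is held in memory at a time, but the combined table of distinct records still has to fit in memory, so this is a sketch of the idea rather than a tuned solution.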