06-08-2005
once again what is a record?
A single line with multiple fields?
A block of lines, one field for line?
Some combo of the above?
A sample file could be helpful......
9 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
To start I have a table that has ticketholders. Each ticket holder has a unique number and each ticket holder is associated to a so called household number. You can have multiple guests w/i a household.
I would like to create 3 flags (form a, for a household that has 1-4 gst) form b 5-8 gsts... (3 Replies)
Discussion started by: sbr262
3 Replies
2. UNIX for Dummies Questions & Answers
Hi,
this might be a basic question...
why is that wc -c counts 1 more per line than what is there.
for example,
> cat dum1.txt
123
12
> wc -c dum1.txt
7 dum1.txt
Thanks,
Sameer. (4 Replies)
Discussion started by: ensameer
4 Replies
3. Shell Programming and Scripting
How do get the counts by excluding header and tailer.
wc -l customer_data*.0826
31 customer_data_1.0826
57 customer_data_2.0826
456 customer_data_3.0826
668 customer_data_4.0826
789 customer_data_5.0826
2344 customer_data_6.0826
13457 customer_data_7.0826... (6 Replies)
Discussion started by: zooby
6 Replies
4. Shell Programming and Scripting
I have a file with multiple entries and I have calculated the percentages. Now I want to know how many of my entries are there between 1-10% 11-20% and so on..
chr1_14401_14450 0.211954217888936
chr1_14451_14500 1.90758796100042
chr1_14501_14550 4.02713013988978
chr1_14551_14600 ... (3 Replies)
Discussion started by: Diya123
3 Replies
5. Shell Programming and Scripting
Hi,
I have a file with 2500 entries. There are many duplicates,triplicates symbols in my file in the first column and the second column has categories(high/medium/low) . I want to have count for the occurances of each category for each unique symbol
ABC high
ABC high
ABC medium
ABC ... (2 Replies)
Discussion started by: Diya123
2 Replies
6. Shell Programming and Scripting
Hi,
I have a file with 4 million rows. Each row has certain number ranging between 1 to 30733090.
What I want is to count the rows between each 1000 intervals.
1-1000 4000
1001-2000 2469
...
...
...
...
last 1000 interval
Thanks, (7 Replies)
Discussion started by: Diya123
7 Replies
7. UNIX for Dummies Questions & Answers
This is very easy , but I`m struggling .. please help modify my script,
I want to count the number of h and n , from the second column group by the first. The second column is binary, can only have h and n.
a h
a h
a n
a n
a h
b h
b h
b h
b h
b h
c n
c h
c h
c h
c h (2 Replies)
Discussion started by: jianp83
2 Replies
8. Shell Programming and Scripting
I can not figure out why there are 56,548 unique entries in test.bed. However, perl and awk see only 56,543 and that # is what my analysis see's as well. What happened to the 5 missing? Thank you :).
The file is attached as well.
cmccabe@DTV-A5211QLM:~/Desktop/NGS/bed/bedtools$wc -l... (2 Replies)
Discussion started by: cmccabe
2 Replies
9. Shell Programming and Scripting
Hi Team,
Am getting the below output but need the count of records to be displayed in same line but currently count alone moves to next line. Please let me know how we can still keep the count in the same line.
######code #####
while read YEAR; do
for i in TEST_*PGYR${YEAR}_${DT}.csv; do... (3 Replies)
Discussion started by: weknowd
3 Replies
JOIN(1) General Commands Manual JOIN(1)
NAME
join - relational database operator
SYNOPSIS
join [ options ] file1 file2
DESCRIPTION
Join forms, on the standard output, a join of the two relations specified by the lines of file1 and file2. If file1 is `-', the standard
input is used.
File1 and file2 must be sorted in increasing ASCII collating sequence on the fields on which they are to be joined, normally the first in
each line.
There is one line in the output for each pair of lines in file1 and file2 that have identical join fields. The output line normally con-
sists of the common field, then the rest of the line from file1, then the rest of the line from file2.
Fields are normally separated by blank, tab or newline. In this case, multiple separators count as one, and leading separators are dis-
carded.
These options are recognized:
-an In addition to the normal output, produce a line for each unpairable line in file n, where n is 1 or 2.
-e s Replace empty output fields by string s.
-jn m Join on the mth field of file n. If n is missing, use the mth field in each file.
-o list
Each output line comprises the fields specified in list, each element of which has the form n.m, where n is a file number and m is a
field number.
-tc Use character c as a separator (tab character). Every appearance of c in a line is significant.
SEE ALSO
sort(1), comm(1), awk(1)
BUGS
With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.
The conventions of join, sort, comm, uniq, look and awk(1) are wildly incongruous.
7th Edition April 29, 1985 JOIN(1)