awk solution to duplicate lines based on column
Post 302862731 by torchij, Friday 11 October 2013, 10:54 AM
Many thanks, both solutions appear to work!

Cheers!
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

duplicate row based on single column

I am a newbie to shell scripting. I have a .csv file with about 1000 rows and 7 columns. Before I insert this data into a table I have to parse it and clean it based on the value of the first column, which is a string of phone-number type. Example below... column 1 ... (2 Replies)
Discussion started by: mitr
2 Replies
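A hedged sketch of one common approach; the thread excerpt is truncated, so the delimiter, the file name data.csv, and the 10-digit rule are all assumptions. The idea is to normalize the phone-number column, then keep only rows that survive the check:

    # Assumes comma-separated input, phone number in column 1, and that
    # "clean" means digits only, exactly 10 of them.
    awk -F',' -v OFS=',' '{
        key = $1
        gsub(/[^0-9]/, "", key)        # strip everything but digits
        if (length(key) == 10) {       # keep plausible phone numbers only
            $1 = key
            print
        }
    }' data.csv > clean.csv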

2. Shell Programming and Scripting

AWK Duplicate lines multiple times based on a calculated value

Hi, I'm trying to create an XML sitemap of our dynamic ecommerce site's SEO-friendly URLs and am trying to create the initial page listing. I have a CSV file that looks like the following and need to duplicate the lines based on a value which needs calculating. ... (2 Replies)
Discussion started by: jamesfx
2 Replies
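The calculation itself is not shown in the excerpt, so the rule below (field 3 holds an item count, one sitemap line per 20 items, file urls.csv) is a placeholder; the duplication loop is the reusable part:

    # Emit each line ceil($3 / 20) times, at least once.
    awk -F',' '{
        pages = int(($3 + 19) / 20)
        if (pages < 1) pages = 1
        for (i = 1; i <= pages; i++) print
    }' urls.csv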

3. Shell Programming and Scripting

awk print non matching lines based on column

My question was not answered on a previous thread, as the code given did not work. I want to print records from file2, comparing columns 1 and 16 in both files: find the rows where column 16 in file 1 does not match column 16 in file 2. Here was the code given for the issue: ~/unix.com$ cat f1... (0 Replies)
Discussion started by: sigh2010
0 Replies
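One plausible reading, with f1 and f2 as in the thread: pair the rows on column 1, then print the file2 rows whose column 16 differs from the matching file1 row. A two-pass awk sketch:

    # Pass 1 (NR==FNR) reads f1: remember column 16 keyed by column 1.
    # Pass 2 reads f2: print rows whose key matches but column 16 differs.
    awk 'NR == FNR { c16[$1] = $16; next }
         ($1 in c16) && c16[$1] != $16' f1 f2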

4. Shell Programming and Scripting

Perl: filtering lines based on duplicate values in a column

Hi, I have a file like this. I need to eliminate lines whose first-column value occurs 10 times. 13 18 1 + chromosome 1, 122638287 AGAGTATGGTCGCGGTTG 13 18 1 + chromosome 1, 128904080 AGAGTATGGTCGCGGTTG 13 18 1 - chromosome 14, 13627938 CAACCGCGACCATACTCT 13 18 1 + chromosome 1,... (5 Replies)
Discussion started by: polsum
5 Replies
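The thread asked for Perl, but the same filter is a short two-pass awk job; "the same value 10 times" is read here as exactly ten occurrences (use count[$1] >= 10 for "ten or more"):

    # First pass counts first-column values; second pass drops the
    # lines whose value occurred exactly 10 times.
    awk 'NR == FNR { count[$1]++; next }
         count[$1] != 10' file file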

5. UNIX for Dummies Questions & Answers

awk to sum column field from duplicate row/lines

Hello, I am new to the Linux environment. I am working on a Linux script which should send an automatic email based on a specific condition in a log file. Below is the sample log file: Name m/c usage abc xxx 10 abc xxx 20 abc xxx 5 xyz ... (6 Replies)
Discussion started by: asjaiswal
6 Replies
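For the summing step (the email condition is not specified in the excerpt), the standard awk aggregation over that log layout would be roughly:

    # Assumes whitespace-separated usage.log with a header line,
    # name in column 1 and usage in column 3.
    awk 'NR > 1 { sum[$1] += $3 }
         END { for (name in sum) print name, sum[name] }' usage.log

The END block is also where a threshold test could decide whether to pipe a message to mail, once the condition is pinned down.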

6. Shell Programming and Scripting

awk to sum a column based on duplicate strings in another column and show split totals

Hi, I have input in a similar format: A_1 2 B_0 4 A_1 1 B_2 5 A_4 1 and am looking to print it in this output format with headers. Can you suggest an awk solution? awk, because I am already doing some pattern matching from a parent file to print column 1 of my input using awk. Thanks! letter number_of_letters... (5 Replies)
Discussion started by: prashob123
5 Replies
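Assuming the grouping key is the letter before the underscore, and reading number_of_letters as the occurrence count (the excerpt does not say), a sketch of the aggregation:

    # Group on the part of field 1 before "_"; total field 2 per group.
    awk '{
        split($1, part, "_")
        sum[part[1]] += $2
        count[part[1]]++
    }
    END {
        print "letter", "number_of_letters", "total"
        for (l in sum) print l, count[l], sum[l]
    }' input.txt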

7. Shell Programming and Scripting

Remove duplicate rows based on one column

Dear members, I need to filter a file based on the 8th column (that is the id); the other columns do not matter. I want just one line per id, removing the duplicate lines based on this id (8th column), and it does not matter which duplicate is removed. Example of my file... (3 Replies)
Discussion started by: clarissab
3 Replies
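Since any duplicate may be discarded, the classic awk dedup idiom keeps the first line seen for each id:

    # seen[$8]++ is 0 (false) the first time an id appears, so only
    # the first line per 8th-column value is printed.
    awk '!seen[$8]++' file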

8. Shell Programming and Scripting

Removing duplicate lines based on first column with pipe delimiter

Hi, I have tried to remove duplicate lines based on the first column with a pipe delimiter, but I am not able to get unique lines. Command: sort -t'|' -nuk1 file.txt Input: 38376KZ|09/25/15|1.057 38376KZ|09/25/15|1.057 02006YB|09/25/15|0.859 12593PS|09/25/15|2.803... (2 Replies)
Discussion started by: parithi06
2 Replies
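The posted command fails because -n compares keys numerically, so alphanumeric keys like 38376KZ are judged by their leading digits only, and -k1 with no end position extends the key to the end of the line. Two ways out:

    sort -t'|' -u -k1,1 file.txt      # text comparison, key limited to field 1
    awk -F'|' '!seen[$1]++' file.txt  # same dedup, original order preserved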

9. Shell Programming and Scripting

Solution for replacement of 4th column with 3rd column in a file using awk/sed preserving delimters

input "A","B","C,D","E","F" "S","T","U,V","W","X" "AA","BB","CC,DD","EEEE","FFF" required output: "A","B","C,D","C,D","F" "S", T","U,V","U,V","X" "AA","BB","CC,DD","CC,DD","FFF" tried using awk but double quotes not preserving for every field. any help to solve this is much... (5 Replies)
Discussion started by: khblts
5 Replies
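One way to keep the embedded commas and quotes intact is gawk's FPAT, which describes what a field looks like rather than what separates fields; this is gawk-specific, so treat it as a sketch:

    # Each field is a double-quoted string (embedded commas included);
    # assigning $4 rebuilds the line joined with OFS.
    gawk 'BEGIN { FPAT = "\"[^\"]*\""; OFS = "," }
          { $4 = $3; print }' input.csv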

10. Shell Programming and Scripting

awk to select lines with maximum value of each record based on column value

Hello, I want to get, for each record separated by an empty line, the row with the maximum value in the 3rd column. Input: A1 chr5D 634 7 82 707 A2 chr5D 637 6 82 713 A3 chr5D 637 5 82 713 A4 chr5D 626 1 82 704... (4 Replies)
Discussion started by: yifangt
4 Replies
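Paragraph mode handles the empty-line separation directly: with RS empty each block is one record, and with FS set to a newline each row is one field, so the per-record maximum on column 3 can be tracked like this:

    # For each blank-line-separated record, print the row whose 3rd
    # column is largest (the first such row wins ties).
    awk -v RS= -v FS='\n' '{
        best = ""
        for (i = 1; i <= NF; i++) {
            split($i, f, " ")
            if (best == "" || f[3] + 0 > top) { top = f[3] + 0; best = $i }
        }
        print best
    }' input.txt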
FITCIRCLE(l)

NAME
       fitcircle - find mean position and pole of best-fit great [or small] circle to points on a sphere

SYNOPSIS
       fitcircle [ xyfile ] -Lnorm [ -H[nrec] ] [ -S ] [ -V ] [ -: ] [ -bi[s][n] ]

DESCRIPTION
       fitcircle reads lon,lat [or lat,lon] values from the first two columns on standard input [or xyfile]. These are converted to cartesian three-vectors on the unit sphere. Then two locations are found: the mean of the input positions, and the pole to the great circle which best fits the input positions. The user may choose one or both of two possible solutions to this problem. The first is called -L1 and the second is called -L2. When the data are closely grouped along a great circle both solutions are similar. If the data have large dispersion, the pole to the great circle will be less well determined than the mean. Compare both solutions as a qualitative check.

       The -L1 solution is so called because it approximates the minimization of the sum of absolute values of cosines of angular distances. This solution finds the mean position as the Fisher average of the data, and the pole position as the Fisher average of the cross-products between the mean and the data. Averaging cross-products gives weight to points in proportion to their distance from the mean, analogous to the "leverage" of distant points in linear regression in the plane.

       The -L2 solution is so called because it approximates the minimization of the sum of squares of cosines of angular distances. It creates a 3 by 3 matrix of sums of squares of components of the data vectors. The eigenvectors of this matrix give the mean and pole locations. This method may be more subject to roundoff errors when there are thousands of data. The pole is given by the eigenvector corresponding to the smallest eigenvalue; it is the least-well represented factor in the data and is not easily estimated by either method.

       -L     Specify the desired norm as 1 or 2, or use -L or -L3 to see both solutions.

OPTIONS
       xyfile ASCII [or binary, see -b] file containing lon,lat [lat,lon] values in the first 2 columns. If no file is specified, fitcircle will read from standard input.

       -H     Input file(s) has Header record(s). Number of header records can be changed by editing your .gmtdefaults file. If used, GMT default is 1 header record.

       -S     Attempt to fit a small circle instead of a great circle. The pole will be constrained to lie on the great circle connecting the pole of the best-fit great circle and the mean location of the data.

       -V     Selects verbose mode, which will send progress reports to stderr [Default runs "silently"].

       -:     Toggles between (longitude,latitude) and (latitude,longitude) input/output. [Default is (longitude,latitude)]. Applies to geographic coordinates only.

       -bi    Selects binary input. Append s for single precision [Default is double]. Append n for the number of columns in the binary file(s). [Default is 2 input columns].

EXAMPLES
       Suppose you have lon,lat,grav data along a twisty ship track in the file ship.xyg. You want to project this data onto a great circle and resample it in distance, in order to filter it or check its spectrum. Try:

              fitcircle ship.xyg -L2
              project ship.xyg -Cox/oy -Tpx/py -S -Fpz | sample1d -S-100 -I1 > output.pg

       Here, ox/oy is the lon/lat of the mean from fitcircle, and px/py is the lon/lat of the pole. The file output.pg has distance, gravity data sampled every 1 km along the great circle which best fits ship.xyg.

SEE ALSO
       gmt(1gmt), project(1gmt), sample1d(1gmt)