04-24-2013
Sort csv file by duplicated column value
hello, I have a large file (about 1gb) that is in a file similar to the following:
Quote:
"Timmy","??","Age 26","1","0"
"Jack","??","Age 21","1","0"
"Troy","??","Age 21","1","0"
"Kim","?","Age 26","1","0"
"Mark","???","Age 24","1","0"
"John","??","Age 27","1","0"
I want to make it so that I can put all the duplicates where column 3 (delimited by the commas) are shown on top. Meaning all people with the same age are listed at the top.
Quote:
"Timmy","??","Age 26","1","0"
"Kim","?","Age 26","1","0"
"Jack","??","Age 21","1","0"
"Troy","??","Age 21","1","0"
"Mark","???","Age 24","1","0"
"John","??","Age 27","1","0"
The command I used was
sort -t, +2 input.csv > output.csv
I assumed that-t would make ',' be the delimiter and "+2" would look at the second column. What am I doing wrong?
9 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi
Just wondering whether or not I can remove duplicated lines without sort
For example, I use the command who, which shows users who are logging on. In some cases, it shows duplicated lines of users who are logging on more than one terminal.
Normally, I would do
who | cut -d" " -f1 |... (6 Replies)
Discussion started by: lalelle
6 Replies
2. UNIX for Dummies Questions & Answers
Hello all,
I've got a strange behaviour of sort and uniq commands: they do not recognise apparently duplicated lines in a file (already sorted). The lines are identical by eye, but they must differ in smth, because when they are put in two files, those have slightly different size.
What can make... (8 Replies)
Discussion started by: roussine
8 Replies
3. Shell Programming and Scripting
Hello people,
I am having problem to sort, sed and zero padding of column in csv file.
7th column only.
Input of csv file:
1,2,3,4,5,6,4/1/2010 12:00 AM,8
1,2,3,4,5,6,3/11/2010 9:39 AM,8
1,2,3,4,5,6,5/12/2011 3:43 PM,8
1,2,3,4,5,6,12/20/2009 7:23 PM,8
Output:... (5 Replies)
Discussion started by: sean1357
5 Replies
4. Shell Programming and Scripting
Dear all,
How can I remove duplicated column in a text file?
Input:
LG10_PM_map_19_LEnd 1000560 G AA AA AA AA AA GG
LG10_PM_map_19_LEnd 1005621 G GG GG GG AA AA GG
LG10_PM_map_19_LEnd 1011214 A AA AA AA AA GG GG
LG10_PM_map_19_LEnd 1011673 T TT TT TT TT CC CC... (1 Reply)
Discussion started by: huiyee1
1 Replies
5. Shell Programming and Scripting
Hi, I am newbie in shell script.
I need your help to solve my problem.
Firstly, I have 2 files of csv and i want to compare of the contents then the output will be written in a new csv file.
File1:
SourceFile,DateTimeOriginal
/home/intannf/foto/IMG_0713.JPG,2015:02:17 11:14:07... (8 Replies)
Discussion started by: refrain
8 Replies
6. Shell Programming and Scripting
Hi,
I have the following output from an Oracle SQL statement and I want to remove duplicated column values.
I know it is possible using Oracle analytical/statistical functions but unfortunately I don't know how to use any of those.
So now, I've gone to PLAN B using awk/sed maybe or any... (5 Replies)
Discussion started by: newbie_01
5 Replies
7. Shell Programming and Scripting
Please help me to get required output for both scenario 1 and scenario 2 and need separate code for both scenario 1 and scenario 2
Scenario 1
i need to do below changes only when column1 is CR and column3 has duplicates rows/values. This inputfile can contain 100 of this duplicated rows of... (1 Reply)
Discussion started by: as7951
1 Replies
8. UNIX for Beginners Questions & Answers
I have to sort the 4th column of an excel/csv file. I tried the following command
sort -u --field-separator=, --numeric-sort -k 2 -n dinesh.csv > test.csv
But, it's not working. Moreover, I have to do the same for more than 30 excel/csv file. So please help me to do the same. (6 Replies)
Discussion started by: dineshkumarsrk
6 Replies
9. UNIX for Beginners Questions & Answers
I have a csv file as shown below,
xop_thy 80 avr_njk 50 str_nyu 60
avr_irt 70 str_nhj 60 avr_ngt 50
str_tgt 80 xop_nmg 50 xop_nth 40
cyv_gty 40 cop_thl 40 vir_tyk 80
vir_plo 20 vir_thk 40 ijk_yuc 70
cop_thy 70 ijk_yuc 80 irt_hgt 80
I need to align/sort the csv file based... (7 Replies)
Discussion started by: dineshkumarsrk
7 Replies
LES(8) Maintenance Commands LES(8)
NAME
les, bus - ATM LAN Emulation service demons
SYNOPSIS
les [-d module] [-m module] [-f configuration_file]
bus [-d module] [-m module] [-f configuration_file]
DESCRIPTION
LE Service consists of three components: LAN Emulation Configuration Server (lecs(8)), LAN Emulation Server (les) and Broadcast and Unknown
Server (bus).
Les performs the control coordination function for the emulated LAN. LE clients register MAC addresses and/or route descriptors they rep-
resent to les, and later query it when they want to resolve MAC addresses/route descriptors into ATM addresses. Other LE control messages
which are to be distributed to every client in ELAN are also sent to les. Les forwards these messages using Control Distribute VCC which it
has set up to every client in ELAN.
Bus handles data sent by clients to broadcast and multicast MAC addresses and some of the data directed to unicast addresses. LE Client has
a possibility to send data directed to some unicast address to the bus before target's ATM address has been resolved and the Data Direct
VCC has been established.
Configuration file example for les and bus:
[main]
memdebug=True
debug=True
[load]
#memdebug=True
#debug=True
[conn]
debug=True
#S1, LE Server's ATM address
#S1=:47:00:23:00:00:00:03:00:00:01:00:02:01:00:20:ea:00:05:aa:00
S1=:47:00:23:00:00:00:03:03:00:01:00:02:01:00:20:ea:00:0a:e9:01
#S2, LAN Type
S2="802.3"
#S3, Maximum Frame Size
S3=1516
#S4, Join Timeout, s
S4=15
#S5, Maximum Frame Age, s
S5=6
#S6, BUS Atm address
S6=:47:00:23:00:00:00:03:03:00:01:00:02:01:00:20:ea:00:0a:e9:02 #viulu
#S6=0,0,170
#ELANNAME="asdf"
The configuration file contains each modules name in brackets followed by variable definitions for that module. The definitions are of form
variable=value, where value can be either an integer, a truth value (True/False), a string enclosed in double quotes ("string") or an ATM
address in hexadecimal format. Variables that can be set are the debug/memdebug for each module and variables S1-S6 as defined in LE speci-
fication.
S1=Address of the LES. This address is used in ATM
signalling.
S2=Type of the emulated LAN. Valid values is "802.3".
S3=Maximum frame size. Valid value is 1516.
S4=Join Timeout. Time in seconds which LES waits for
LE_JOIN_REQUEST before tearing down a connection.
S5=Maximum frame Age. Currently not used.
S6=Address of the BUS. This address is used in ATM signalling.
ELANNAME= Name of the emulated LAN
SIGHUP causes restart of the server. All resources are released and server is started. SIGUSR1 causes the server to dump its internal
state. SIGUSR2 shuts down the server (hopefully) gracefully.
OPTIONS
-d module
Set debugging messages on for a module. "All" sets debugging on for all modules.
-m module
Set memory debugging messages on for a module. "All" sets debugging on for all modules.
-f configuration_file
Use the specified configuration file instead of .lanevars.
FILES
.lanevars configuration file
BUGS
Servers don't establish point-to-multipoint connections to LE clients as the specification states, which means that some LE clients won't
work with these servers.
Supports only IEEE 802.3 / Ethernet type of ELANs.
This manual page is confusing.
AUTHOR
Marko Kiiskila, TUT <carnil@cs.tut.fi>
SEE ALSO
lecs(8), atmsigd(8), zeppelin(8)
Linux Sep 11, 1996 LES(8)