11-01-2009
Deleting a column in multiple files that are comma separated
Hi,
I have a directory that contains say 100 files named sequencially like input_1.25_50_C1.txt
input_1.25_50_C2.txt
input_1.25_50_C3.txt
input_1.25_50_C4.txt
..
..
..
input_1.25_50_C100.txt
an example of the content in each of the file is:
"NAME" "MEM.SHIP"
"cgd1_10" "cgd1_10" 0.938962779873022
"cgd1_1860" "cgd1_1860" 0.728179501334915
"cgd1_2500" "cgd1_2500" 0.751709755780061
"cgd1_2560" "cgd1_2560" 0.8086690400501
"cgd1_2640" "cgd1_2640" 0.750586302165714
"cgd1_2800" "cgd1_2800" 0.954020921349781
"cgd1_290" "cgd1_290" 0.757355995574881
"cgd1_2950" "cgd1_2950" 0.558344549451538
"cgd1_3010" "cgd1_3010" 0.667420188529345
"cgd1_50" "cgd1_50" 0.545172222953814
"cgd2_1000" "cgd2_1000" 0.944926355001402
"cgd2_1690" "cgd2_1690" 0.861817882469458
"cgd3_1820" "cgd3_1820" 0.817150133730118
"cgd3_2150" "cgd3_2150" 0.553855831143878
"cgd4_2220" "cgd4_2220" 0.577452218116135
I want to delete the second & third column of each file, make the file tab delimited and replace the column heading "NAME" in the first column as [Gene]
Is there a way in awk or shell programming to do this at a directory level; without openning each file and editing it.
Please let me know.
Lucky Ali
Last edited by Lucky Ali; 11-01-2009 at 03:55 PM..
10 More Discussions You Might Find Interesting
1. UNIX Desktop Questions & Answers
I would like to copy 2 parts of a csv file from Unix to an XL sheet. However to save time I do not want to format the column ever time I cut and paste into XL(Text2Column). I've used awk -F, '{Print $1, $2....}'. Is there a script or code that can automatically format the csv for XL columns?
... (3 Replies)
Discussion started by: ravzter
3 Replies
2. Shell Programming and Scripting
Dear All,
I have requirement like this:
I have 2 camma seperated files
file1:
1,aaa,bbb,ccc,
2,bbb,ccc,ddd,
3,ccc,ddd,eee,
file2:
1,aaa,bbb,ccc,
2,bbb,ddd,ddd,
3,ccc,ddd,eee,
my requirement is I should get message in the out put like:
There is a difference in 3 rd filed in... (2 Replies)
Discussion started by: mymoto
2 Replies
3. Homework & Coursework Questions
Use and complete the template provided. The entire template must be completed. If you don't, your post may be deleted!
1. The problem statement, all variables and given/known data:
Hi All,
I am new to unix, my requirement is like need to find the files like DATA_FUNCTION* and put those... (1 Reply)
Discussion started by: madsongtel
1 Replies
4. Shell Programming and Scripting
I am in the process of creating a BASH shell scripts for a project at work. So the scenario is as such:
I have a file with each line entry separated by ':'
... (3 Replies)
Discussion started by: metallica1973
3 Replies
5. Shell Programming and Scripting
Hello there,
I have a comma separated csv , and all the text field is wrapped by double quote. Issue is some text field contain comma as well inside double quote. so it is difficult to process.
Input in the csv file is ,
1,234,"abc,12,gh","GH234TY",34
I need output like below,... (8 Replies)
Discussion started by: Uttam Maji
8 Replies
6. UNIX for Dummies Questions & Answers
Hi,
I have a large number of files which are written as csv (comma-separated values).
Does anyone know of simple sed/awk command do achieve this?
Thanks!
---------- Post updated at 10:59 AM ---------- Previous update was at 10:54 AM ----------
Guess I asked this too soon. Found the... (0 Replies)
Discussion started by: lost.identity
0 Replies
7. Shell Programming and Scripting
Hi,
I want to change a file file1.txt:
1234
3456
2345
6789
3456
2333
4444
As, file2.txt in Linux:
'1234','3456','2345','6789','3456','2333','4444'
Could someone please help me. (Single liner sed, awk will be welcome!) (7 Replies)
Discussion started by: wiweq05
7 Replies
8. Shell Programming and Scripting
Hi Everyone,
On my Linux box I have a text file having block of few lines and this block lines separated by one blank line. I would like to format and print these lines in such a way that this entire block of lines will come as single comma separated line & again next block of lines in next... (7 Replies)
Discussion started by: gr8_usk
7 Replies
9. Shell Programming and Scripting
Hi,
I have a list of tables in a file.txt
C_CLAIM
C_HLD
C_PROVIDER
I want the output to be
'C_CLAIM','C_HLD','C_PROVIDER'
Currently I'm usin awk and getting output which is almost correct but still has minor defects
awk -vORS="','" '{ print $1 }' file.txt
The output of... (4 Replies)
Discussion started by: wahi80
4 Replies
10. Shell Programming and Scripting
Hi Experts,
Please support
I have below data in file in comma seperated, but 4th column is containing comma in between numbers, bcz of which when i tried to parse the file the column 6th value(5049641141) is being removed from the file and value(222.82) in column 5 becoming value of column6.
... (3 Replies)
Discussion started by: as7951
3 Replies
LEARN ABOUT CENTOS
gensprep
gensprep(8) ICU 50.1.2 Manual gensprep(8)
NAME
gensprep - compile StringPrep data from files filtered by filterRFC3454.pl
SYNOPSIS
gensprep [ -h, -?, --help ] [ -v, --verbose ] [ -c, --copyright ] [ -s, --sourcedir source ] [ -d, --destdir destination ]
DESCRIPTION
gensprep reads filtered RFC 3454 files and compiles their information into a binary form. The resulting file, <name>.icu, can then be read
directly by ICU, or used by pkgdata(8) for incorporation into a larger archive or library.
The files read by gensprep are described in the FILES section.
OPTIONS
-h, -?, --help
Print help about usage and exit.
-v, --verbose
Display extra informative messages during execution.
-c, --copyright
Include a copyright notice into the binary data.
-s, --sourcedir source
Set the source directory to source. The default source directory is specified by the environment variable ICU_DATA.
-d, --destdir destination
Set the destination directory to destination. The default destination directory is specified by the environment variable ICU_DATA.
ENVIRONMENT
ICU_DATA Specifies the directory containing ICU data. Defaults to /usr/share/icu/50.1.2/. Some tools in ICU depend on the presence of the
trailing slash. It is thus important to make sure that it is present if ICU_DATA is set.
FILES
The following files are read by gensprep and are looked for in the source /misc for rfc3454_*.txt files and in source /unidata for Normal-
izationCorrections.txt.
rfc3453_A_1.txt Contains the list of unassigned codepoints in Unicode version 3.2.0....
rfc3454_B_1.txt Contains the list of code points that are commonly mapped to nothing....
rfc3454_B_2.txt Contains the list of mappings for casefolding of code points when Normalization form NFKC is specified....
rfc3454_C_X.txt Contains the list of code points that are prohibited for IDNA.
NormalizationCorrections.txt
Contains the list of code points whose normalization has changed since Unicode Version 3.2.0.
VERSION
50.1.2
COPYRIGHT
Copyright (C) 2000-2002 IBM, Inc. and others.
SEE ALSO
pkgdata(8)
ICU MANPAGE
18 March 2003 gensprep(8)