Deleting a column in multiple files that are comma separated Post: 302367148

Sponsored Content

Top Forums Shell Programming and Scripting Deleting a column in multiple files that are comma separated Post 302367148 by Lucky Ali on Sunday 1st of November 2009 02:13:02 PM

11-01-2009

Registered User

Deleting a column in multiple files that are comma separated

Hi,

I have a directory that contains say 100 files named sequencially like input_1.25_50_C1.txt
input_1.25_50_C2.txt
input_1.25_50_C3.txt
input_1.25_50_C4.txt
..
..
..
input_1.25_50_C100.txt

an example of the content in each of the file is:

"NAME" "MEM.SHIP"
"cgd1_10" "cgd1_10" 0.938962779873022
"cgd1_1860" "cgd1_1860" 0.728179501334915
"cgd1_2500" "cgd1_2500" 0.751709755780061
"cgd1_2560" "cgd1_2560" 0.8086690400501
"cgd1_2640" "cgd1_2640" 0.750586302165714
"cgd1_2800" "cgd1_2800" 0.954020921349781
"cgd1_290" "cgd1_290" 0.757355995574881
"cgd1_2950" "cgd1_2950" 0.558344549451538
"cgd1_3010" "cgd1_3010" 0.667420188529345
"cgd1_50" "cgd1_50" 0.545172222953814
"cgd2_1000" "cgd2_1000" 0.944926355001402
"cgd2_1690" "cgd2_1690" 0.861817882469458
"cgd3_1820" "cgd3_1820" 0.817150133730118
"cgd3_2150" "cgd3_2150" 0.553855831143878
"cgd4_2220" "cgd4_2220" 0.577452218116135

I want to delete the second & third column of each file, make the file tab delimited and replace the column heading "NAME" in the first column as [Gene]

Is there a way in awk or shell programming to do this at a directory level; without openning each file and editing it.

Please let me know.

Lucky Ali

Last edited by Lucky Ali; 11-01-2009 at 03:55 PM..

Lucky Ali

View Public Profile for Lucky Ali

Find all posts by Lucky Ali

10 More Discussions You Might Find Interesting

1. UNIX Desktop Questions & Answers

Unix Comma Separated to Excel Column

I would like to copy 2 parts of a csv file from Unix to an XL sheet. However to save time I do not want to format the column ever time I cut and paste into XL(Text2Column). I've used awk -F, '{Print $1, $2....}'. Is there a script or code that can automatically format the csv for XL columns? ...

2. Shell Programming and Scripting

Finding difference in two comma separated files in UINX

Dear All, I have requirement like this: I have 2 camma seperated files file1: 1,aaa,bbb,ccc, 2,bbb,ccc,ddd, 3,ccc,ddd,eee, file2: 1,aaa,bbb,ccc, 2,bbb,ddd,ddd, 3,ccc,ddd,eee, my requirement is I should get message in the out put like: There is a difference in 3 rd filed in...

3. Homework & Coursework Questions

Find the files and make them comma separated files

Use and complete the template provided. The entire template must be completed. If you don't, your post may be deleted! 1. The problem statement, all variables and given/known data: Hi All, I am new to unix, my requirement is like need to find the files like DATA_FUNCTION* and put those...

4. Shell Programming and Scripting

Assigning Multiple Comma Separated IP's To A Bash Array

I am in the process of creating a BASH shell scripts for a project at work. So the scenario is as such: I have a file with each line entry separated by ':' ...

5. Shell Programming and Scripting

Need Help - comma inside double quote in comma separated csv,

Hello there, I have a comma separated csv , and all the text field is wrapped by double quote. Issue is some text field contain comma as well inside double quote. so it is difficult to process. Input in the csv file is , 1,234,"abc,12,gh","GH234TY",34 I need output like below,...

6. UNIX for Dummies Questions & Answers

[solved] Comma separated values to space separated

Hi, I have a large number of files which are written as csv (comma-separated values). Does anyone know of simple sed/awk command do achieve this? Thanks! ---------- Post updated at 10:59 AM ---------- Previous update was at 10:54 AM ---------- Guess I asked this too soon. Found the...

7. Shell Programming and Scripting

Make multiple lines into single quoted comma separated Linux

Hi, I want to change a file file1.txt: 1234 3456 2345 6789 3456 2333 4444 As, file2.txt in Linux: '1234','3456','2345','6789','3456','2333','4444' Could someone please help me. (Single liner sed, awk will be welcome!)

8. Shell Programming and Scripting

Combining multiple block of lines in one comma separated line

Hi Everyone, On my Linux box I have a text file having block of few lines and this block lines separated by one blank line. I would like to format and print these lines in such a way that this entire block of lines will come as single comma separated line & again next block of lines in next...

9. Shell Programming and Scripting

Convert column to quote and comma separated row

Hi, I have a list of tables in a file.txt C_CLAIM C_HLD C_PROVIDER I want the output to be 'C_CLAIM','C_HLD','C_PROVIDER' Currently I'm usin awk and getting output which is almost correct but still has minor defects awk -vORS="','" '{ print $1 }' file.txt The output of...

10. Shell Programming and Scripting

awk to parse comma separated field and removing comma in between number and double quotes

Hi Experts, Please support I have below data in file in comma seperated, but 4th column is containing comma in between numbers, bcz of which when i tried to parse the file the column 6th value(5049641141) is being removed from the file and value(222.82) in column 5 becoming value of column6. ...

LEARN ABOUT CENTOS

gensprep

gensprep(8)							 ICU 50.1.2 Manual						       gensprep(8)

NAME

       gensprep - compile StringPrep data from files filtered by filterRFC3454.pl

SYNOPSIS

       gensprep [ -h, -?, --help ] [ -v, --verbose ] [ -c, --copyright ] [ -s, --sourcedir source ] [ -d, --destdir destination ]

DESCRIPTION

       gensprep reads filtered RFC 3454 files and compiles their information into a binary form.  The resulting file, <name>.icu, can then be read
       directly by ICU, or used by pkgdata(8) for incorporation into a larger archive or library.

       The files read by gensprep are described in the FILES section.

OPTIONS

       -h, -?, --help
	      Print help about usage and exit.

       -v, --verbose
	      Display extra informative messages during execution.

       -c, --copyright
	      Include a copyright notice into the binary data.

       -s, --sourcedir source
	      Set the source directory to source.  The default source directory is specified by the environment variable ICU_DATA.

       -d, --destdir destination
	      Set the destination directory to destination.  The default destination directory is specified by the environment variable ICU_DATA.

ENVIRONMENT

       ICU_DATA  Specifies the directory containing ICU data. Defaults to /usr/share/icu/50.1.2/.  Some tools in ICU depend on the presence of the
		 trailing slash. It is thus important to make sure that it is present if ICU_DATA is set.

FILES

       The  following files are read by gensprep and are looked for in the source /misc for rfc3454_*.txt files and in source /unidata for Normal-
       izationCorrections.txt.

       rfc3453_A_1.txt	   Contains the list of unassigned codepoints in Unicode version 3.2.0....

       rfc3454_B_1.txt	   Contains the list of code points that are commonly mapped to nothing....

       rfc3454_B_2.txt	   Contains the list of mappings for casefolding of  code points when Normalization form NFKC is specified....

       rfc3454_C_X.txt	   Contains the list of code points that are prohibited for IDNA.

       NormalizationCorrections.txt
			   Contains the list of code points whose normalization has changed since Unicode Version 3.2.0.

VERSION

       50.1.2

COPYRIGHT

       Copyright (C) 2000-2002 IBM, Inc. and others.

SEE ALSO

       pkgdata(8)

ICU MANPAGE
							   18 March 2003						       gensprep(8)

10 More Discussions You Might Find Interesting

1. UNIX Desktop Questions & Answers

Unix Comma Separated to Excel Column

Discussion started by: ravzter

2. Shell Programming and Scripting

Finding difference in two comma separated files in UINX

Discussion started by: mymoto

3. Homework & Coursework Questions

Find the files and make them comma separated files

Discussion started by: madsongtel

4. Shell Programming and Scripting

Assigning Multiple Comma Separated IP's To A Bash Array

Discussion started by: metallica1973