UNIX - 2 tab delimited files, conditional column extraction
Post 303014977 by Scrutinizer on Sunday 25th of March 2018 02:38:47 AM
Hi, see if this works:
Code:
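awk -F'\t' '
  # First file: remember the range for each key in $1.
  NR==FNR {
    if(!($1 in L))
      L[$1]=$2          # lower bound: $2 from the first line seen for this key
    R[$1]=$3            # upper bound: $3 from the last line seen for this key
    next
  }
  # Second file: print $4 when L <= $2 < R for the key in $1, otherwise "NA".
  {
    print (($2>=L[$1] && $2<R[$1]) ? $4 : "NA")
  }
' file1 file2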
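
To try the command above without the real data, here is a small sketch with made-up input. The column layout is only what the awk code itself implies (file1: key, lower bound, upper bound; file2: key, value to test, an unused column, value to print); the sample rows and the temporary file names are assumptions, not taken from the thread.

```shell
# Hypothetical sample data; the real files from the thread are not shown here.
tmp=$(mktemp -d)

# file1: key <TAB> lower bound <TAB> upper bound
printf 'chr1\t100\t200\nchr2\t50\t80\n' > "$tmp/file1"

# file2: key <TAB> value to test <TAB> unused <TAB> value to print
printf 'chr1\t150\tx\thit\nchr1\t200\tx\tedge\nchr3\t10\tx\tmissing\n' > "$tmp/file2"

result=$(awk -F'\t' '
  NR==FNR { if (!($1 in L)) L[$1]=$2; R[$1]=$3; next }
  { print (($2>=L[$1] && $2<R[$1]) ? $4 : "NA") }
' "$tmp/file1" "$tmp/file2")

printf '%s\n' "$result"
# prints:
# hit
# NA
# NA

rm -rf "$tmp"
```

The second line gives NA because the upper bound is exclusive (200 is not less than 200), and the third because chr3 never appears in file1. If a key occurs on several lines of file1, the range used runs from the first $2 to the last $3 for that key.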

TE(1p)              User Contributed Perl Documentation              TE(1p)

NAME
    te (table editor) - front-end that simplifies editing tab-delimited text tables

SYNOPSIS
    te file1 [ file2 ... ]

DESCRIPTION
    This program makes it easier to edit tab-delimited ASCII tables, such as are used with Interchange (see icdevgroup.org), and can be exported from many popular spreadsheet and database applications.

    It converts tab-delimited ASCII files that have one record per line into temporary files with one field per line, each line beginning with the field name. It then sends each file to your favorite text editor. After you exit your editor, it checks to see if you changed anything in the file, and if so, it converts the data back to the tab-delimited format with one record per line, and replaces the original file.

    The first line of each input file must contain the field names, tab-delimited, that apply for that file.

    Editing is pretty straightforward when you see it in action. The rules are:

    o   Empty lines are ignored.

    o   Comment lines (beginning with "#") are ignored at the beginning of the file, and terminate a record in the middle.

    o   The fields in the first record are used in all subsequent records, and the order in which they are specified are the order in which the columns will be written.

    o   To delete a column, delete its line in the first record.

    o   To change which order the columns are in, re-order the first record.

    o   To add a column, add it to the first record (and as many subsequent records as you wish). Note that if you're using the "extended" option (-e), you can't add new columns, because they can't be distinguished from the extended fields that go into the serialized hash. You'd need to do that in a separate pass.

    o   If you delete a column, you do not need to delete it from every record; any instances of that field in records after the first will be ignored.

    o   Any space left after the field name and colon (like "fieldname:") will be included as part of the field. Any tabs you put in the field data itself will be converted to spaces (as they would corrupt the table otherwise).

    o   A record can be deleted by removing all its fields. A new record can be added by inserting a new block of all fields at a record boundary.

    o   If any errors are encountered, such as non-existent field names or lines that don't follow the prescribed format, processing aborts immediately and the original file is left untouched.

    You can edit several files in succession by naming each on the command line. The editor will be called for each one independently. If you start editing many files and decide you want to stop, add a line "#DONE" anywhere in the temporary file and save it. The current file will be processed and saved, but the rest will be skipped.

    As is customary with many Unix applications, you can set the environment variables VISUAL or EDITOR to point to your favorite text editor. If neither of those is set, my favorite editor, vi(1) is used.

    Options will also be read from environment variable TE_OPTIONS if it is set.

AUTHOR
    Jon Jensen <jon@endpoint.com>

COPYRIGHT
    Copyright (C) 2002-2008 Jon Jensen and others
    Copyright (C) 2001-2002 Red Hat, Inc.

    This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

    This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License at http://www.fsf.org/copyleft/gpl.html for more details.

VERSION
    2008-04-12 02:33:40

CHANGELOG
    2001-04-26. Initial release.

    2001-05-04. Make temporary file in working directory instead of using POSIX::tmpnam. Keep ownership and permissions of original file. Fixed bug that caused fields with number "0" to be output as empty strings.

    2001-05-06. Fix problem with how temporary file names were generated.

    2001-07-31. Check for VISUAL environment variable, and handle editor options there or in EDITOR if given. Take advantage of Digest::MD5's native file reading instead of doing it ourselves.

    2001-10-03. Added gvim support by forcing foreground option -f.

    2002-07-23. Remove line endings whether CR, LF, or CRLF, instead of using running Perl's platform-specific chomp. If preserving a trailing solitary CR in the last field of a line is important, you'll want to change this behavior.

    2002-08-30. Add option -s for starting value support (really only vi).

        te -s os28004 <file>

    Jumps to first occurrence of "os28004" in <file>. Option -i ignores case in the search. (By Mike Heins.)

    2002-09-02. Add option -f to handle files without field names.

    2002-09-03. Add option -n to number rows in comments. Allow setting of persistent options in environment variable TE_OPTIONS.

    2004-06-07. Fixed bug that misinterpreted file as having no data rows when last line of file was empty.

    2005-08-29. Added ability to delete, re-order, or add columns by placing them in the first record.

    2005-11-15. Added support for extended columns containing Perl serialized hashes with the -e option.

    2008-04-11. Added option -o to write output to a file and exit, never invoking an editor. By Greg Sabino Mullane.

perl v5.14.2                       2011-03-09                       TE(1p)