Command line / script option to filter a data set by values of one column


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Command line / script option to filter a data set by values of one column
Prev   Next
# 1  
Old 07-04-2012
Command line / script option to filter a data set by values of one column

Hi all!

I have a data set in this tab separated format : Label, Value1, Value2

An instance is "data.txt" :
0 1 1
-1 2 3
0 2 2

I would like to parse this data set and generate two files, one that has only data with the label 0 and the other with label -1, so my outputs should be, for example :

data0.txt
0 1 1
0 2 2

and data-1.txt
-1 2 3

These are large datasets, and I do not know in advance how many labels there are. Assuming the labels are l1...ln, I would like the outputs stored in data_<label>.txt where <label> is one of l1...ln


Can someone here suggest a quick way to script / command-line this?

Thanks in advance!
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Filter file to remove duplicate values in first column

Hello, I have a script that is generating a tab delimited output file. num Name PCA_A1 PCA_A2 PCA_A3 0 compound_00 -3.5054 -1.1207 -2.4372 1 compound_01 -2.2641 0.4287 -1.6120 3 compound_03 -1.3053 1.8495 ... (3 Replies)
Discussion started by: LMHmedchem
3 Replies

2. Shell Programming and Scripting

UNIX command -Filter rows in fixed width file based on column values

Hi All, I am trying to select the rows in a fixed width file based on values in the columns. I want to select only the rows if column position 3-4 has the value AB I am using cut command to get the column values. Is it possible to check if cut -c3-4 = AB is true then select only that... (2 Replies)
Discussion started by: ashok.k
2 Replies

3. Shell Programming and Scripting

awk to filter out lines containing unique values in a specified column

Hi, I have multiple files that each contain four columns of strings: File1: Code: 123 abc gfh 273 456 ddff jfh 837 789 ghi u4u 395 File2: Code: 123 abc dd fu 456 def 457 nd 891 384 djh 783 I want to compare the strings in Column 1 of File 1 with each other file and Print in... (3 Replies)
Discussion started by: owwow14
3 Replies

4. Linux

Filter a .CSV file based on the 5th column values

I have a .CSV file with the below format: "column 1","column 2","column 3","column 4","column 5","column 6","column 7","column 8","column 9","column 10 "12310","42324564756","a simple string with a , comma","string with or, without commas","string 1","USD","12","70%","08/01/2013",""... (2 Replies)
Discussion started by: dhruuv369
2 Replies

5. Shell Programming and Scripting

Script for extracting data from csv file based on column values.

Hi all, I am new to shell script.I need your help to write a shell script. I need to write a shell script to extract data from a .csv file where columns are ',' separated. The file has 5 columns having values say column 1,column 2.....column 5 as below along with their valuesm.... (3 Replies)
Discussion started by: Vivekit82
3 Replies

6. Shell Programming and Scripting

awk : Filter a set of data to parse header line and last field of multiple same match.

Hi Experts, I have a data with multiple entry , I want to filter PKG= & the last column "00060110" or "00088150" in the output file: ############################################################################################### PKG= P8SDB :: VGS = vgP8SOra vgP8SDB1 vgP8S001... (5 Replies)
Discussion started by: rveri
5 Replies

7. Programming

Reading command line arguments and setting up values if option not provided

I have a C++ program. I read command line arguments, but if the value is not supplied, I default or make a calculation. Let's say I set it to a default value. I can code this in several ways. Here I show three ways. What would be the best way for maintaining this code? The program will get very... (2 Replies)
Discussion started by: kristinu
2 Replies

8. Shell Programming and Scripting

perl script command line option driven script

could someone show me a sample command line option driven script? i want to see an easy way to write one and how i can execute it using command line options such as typing in read.pl -i <id> -c <cmds> -s <start> -e <end> would read out all the commands run by ID . from start time to... (7 Replies)
Discussion started by: kpddong
7 Replies

9. UNIX for Dummies Questions & Answers

What is a menu or command line option driven script?

i'm confused what this means. i was asked to design a menu or command line option driven script that reads out of a DB and displays info such as read_data.pl -u <user> -e <event> which would print commands run by <user>with the <event> in the db. any suggestions? i've been using... (2 Replies)
Discussion started by: kpddong
2 Replies

10. Shell Programming and Scripting

How to change values in certain column only in every line (any script)

Let say in a file I have lines of data like this : 13;2073;461496;15075341;3;001f7d3a;2042063674; 13;2074;461446;15080241;6;001ed33a;2042020154; 13;2075;461401;15085270;6;001f593b;2042054459; 13;2076;461381;15087160;6;001f7483;2042061443; 13;2077;461419;15083419;6;001eca1a;2042017818; I... (3 Replies)
Discussion started by: luna_soleil
3 Replies
Login or Register to Ask a Question
ARD-PARSE-BOARDS(1)					      General Commands Manual					       ARD-PARSE-BOARDS(1)

NAME
ard-parse-boards - Read data from the Arduino boards.txt file USAGE
Dump all the data in the file: $ ard-parse-boards --dump See which boards we know about: $ ard-parse-boards --boards Look for a particular board... $ ard-parse-boards --find uno multiple terms are implicitly ANDed: $ ard-parse-boards --find duemil 328 Dump all the data for a particular board: $ ard-parse-boards atmega328 Extract a particular field: $ ard-parse-boards atmega328 build.f_cpu DESCRIPTION
The Arduino software package ships with a boards.txt file which tells the Arduino IDE details about particular hardware. So when the user says he's got a shiny new Arduino Uno, boards.txt knows that it has a 16MHz ATmega328 on it. It would be nice to access these data from the command line too. In normal operation you simply specify the tag given to the board in the boards.txt file, and optionally a field name. This program then extracts the data to STDOUT. Most boards have names which are quite unwieldy, so we always refer to a board by a tag, not its name. Strictly the tag is the bit before the first dot in the boards.txt key. You can see a list of board tags and names with the "--boards" option. OPTIONS
--boards_txt=[file] Specify the full path to the boards.txt file. The following options all disable the normal 'lookup' operation. --dump Dump the complete database in YAML format. --boards Print a list of the tag and name of every board in the file. --find [query] <query> ... Find matching data. Strictly, return a list of values which match all of the query terms, treating each term as a case-insensitive regexp. For example: --find 328 List data containing 328 (anywhere in the value). --find due List data containing 'due' (e.g. duemilanove). --find 328 due List data containing both 328 and due. BUGS AND LIMITATIONS
There are no known bugs in this application. Please report problems to the author. Patches are welcome. AUTHOR
Martin Oldfield, ex-atelier@mjo.tc Thanks to Mark Sproul who suggested doing something like this to me ages ago. LICENSE AND COPYRIGHT
Copyright (c) 2011, Martin Oldfield. All rights reserved. This file is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MER- CHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. MAY 2012 ARD-PARSE-BOARDS(1)