Remove duplicates in a dataframe (table) keeping all the different cells of just one of the columns Post: 303032091

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Deleting table cells in a script

I'd like to use sed or awk to do this but I'm weak on both along with RE. Looking for a way with sed or awk to count for the 7th table data within a table row and if the condition is met to delete "<td>and everything in between </td>". Since the table header start on a specific line each time, that...

2. Shell Programming and Scripting

Remove duplicates based on the two key columns

Hi All, I needs to fetch unique records based on a keycolumn(ie., first column1) and also I needs to get the records which are having max value on column2 in sorted manner... and duplicates have to store in another output file. Input : Input.txt 1234,0,x 1234,1,y 5678,10,z 9999,10,k...

3. Shell Programming and Scripting

Search based on 1,2,4,5 columns and remove duplicates in the same file.

Hi, I am unable to search the duplicates in a file based on the 1st,2nd,4th,5th columns in a file and also remove the duplicates in the same file. Source filename: Filename.csv "1","ccc","information","5000","temp","concept","new" "1","ddd","information","6000","temp","concept","new"...

4. UNIX for Dummies Questions & Answers

Two files; if cells match then copy over other columns

My current issue is dealing with two space delimited files. The first file has column 1 as the sample ID's, then columns 2 - n as the observations. The second file has column 1 as the sample ID's, column 2 as the mother ID's, column 3 as the father ID's, column 4 as the gender, and column 5...

5. UNIX Desktop Questions & Answers

Using grep to remove cells instead of lines

I would like to use grep to remove certain strings from a text file but I can't use the grep -v option because it removes the whole line that includes the string whereas I just want to remove the string. How do I go about doing that? My input file: Magmas CEU rs12542019 CPNE1 RBM12 CEU...

6. Shell Programming and Scripting

CSV with commas in field values, remove duplicates, cut columns

Hi Description of input file I have: ------------------------- 1) CSV with double quotes for string fields. 2) Some string fields have Comma as part of field value. 3) Have Duplicate lines 4) Have 200 columns/fields 5) File size is more than 10GB Description of output file I need:...

7. Shell Programming and Scripting

Remove Duplicates on multiple Key Columns and get the Latest Record from Date/Time Column

Hi Experts , we have a CDC file where we need to get the latest record of the Key columns Key Columns will be CDC_FLAG and SRC_PMTN_I and fetch the latest record from the CDC_PRCS_TS Can we do it with a single awk command. Please help....

8. Shell Programming and Scripting

Remove duplicates by keeping the order intact

Hello friends, I have a file with duplicate lines. I could eliminate duplicate lines by running sort <file> |uniq >uniq_file and it works fine BUT it changes the order of the entries as it we did "sort". I need to remove duplicates and also need to keep the order/sequence of entries. I...

9. UNIX for Beginners Questions & Answers

Merge cells in all rows of a HTML table dynamically.

Hello All, I have visited many pages in Unix.com and could find out one solution for merging the HTML cells in the 1st row. (Unable to post the complete URL as I should not as per website rules). But, however I try, I couldn't achieve this merging to happen for all other rows of HTML...

10. UNIX for Beginners Questions & Answers

Sort and remove duplicates in directory based on first 5 columns:

I have /tmp dir with filename as: 010020001_S-FOR-Sort-SYEXC_20160229_2212101.marker 010020001_S-FOR-Sort-SYEXC_20160229_2212102.marker 010020001-S-XOR-Sort-SYEXC_20160229_2212104.marker 010020001-S-XOR-Sort-SYEXC_20160229_2212105.marker 010020001_S-ZOR-Sort-SYEXC_20160229_2212106.marker...

LEARN ABOUT PLAN9

grep

GREP(1) 						      General Commands Manual							   GREP(1)

NAME

       grep - search a file for a pattern

SYNOPSIS

       grep [ option ...  ] pattern [ file ...	]

DESCRIPTION

       Grep  searches  the input files (standard input default) for lines (with newlines excluded) that match the pattern, a regular expression as
       defined in regexp(6).  Normally, each line matching the pattern is `selected', and each selected line is copied	to  the  standard  output.
       The options are

       -c     Print only a count of matching lines.
       -h     Do not print file name tags (headers) with output lines.
       -i     Ignore alphabetic case distinctions.  The implementation folds into lower case all letters in the pattern and input before interpre-
	      tation.  Matched lines are printed in their original form.
       -l     (ell) Print the names of files with selected lines; don't print the lines.
       -L     Print the names of files with no selected lines; the converse of -l.
       -n     Mark each printed line with its line number counted in its file.
       -s     Produce no output, but return status.
       -v     Reverse: print lines that do not match the pattern.

       Output lines are tagged by file name when there is more than one input file.  (To force this tagging, include  /dev/null  as  a	file  name
       argument.)

       Care should be taken when using the shell metacharacters $*[^|()= and newline in pattern; it is safest to enclose the entire expression in
       single quotes '...'.

SOURCE

       /sys/src/cmd/grep.c

SEE ALSO

       ed(1), awk(1), sed(1), sam(1), regexp(6)

DIAGNOSTICS

       Exit status is null if any lines are selected, or non-null when no lines are selected or an error occurs.

																	   GREP(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Deleting table cells in a script

Discussion started by: phpfreak

2. Shell Programming and Scripting

Remove duplicates based on the two key columns

Discussion started by: kmsekhar

3. Shell Programming and Scripting

Search based on 1,2,4,5 columns and remove duplicates in the same file.

Discussion started by: onesuri

4. UNIX for Dummies Questions & Answers

Two files; if cells match then copy over other columns

Discussion started by: Renyulb28