04-03-2009
Compare 200,000 of rows in two text files
Friends,
I have two very large plain text files with pipe delimited as below.
Both files are not sorted.
Both files have 200,000 of rows.
FName|LName|Address|HPhNumber
Is perl or shell script feasible for this task?
Thanks,
Prashant
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
This may be the 3rd time I'm posting this question. I'm so new here that I'm not even sure how to post!
I'm trying to compare two files but can't do a line by line comparison so comm and diff are out. I've been told that I would need to use the awk programing language. I've looked up what I... (14 Replies)
Discussion started by: jimmyflip
14 Replies
2. UNIX for Dummies Questions & Answers
Dear UNIX-Community,
can help me doing 2 things in Debian 5.0?
1.) Create 100 folders
Format: ./0/0, ./0/1, ./0/2, ...,./0/9, ./1/0/, ..., ./9/9
2.) Move over 200.000 files into the subdirectories according to their last digits.
12398123.dat -> ./3/2/12398123.dat
48161.dat ->... (2 Replies)
Discussion started by: diadas
2 Replies
3. Shell Programming and Scripting
Hi Team,
Could you please help me on below one ..
etrademail1.txt
etDefaultLogin=pdayanan
mail=poojaaragam.dayanand@exchange.etr.comx
employeeNumber=31567
etDefaultLogin=sudrupa
mail=sudrupa.ayanand@exchange.etr.comx
employeeNumber=318967
etDefaultLogin=gurathi (1 Reply)
Discussion started by: nivas_k2006
1 Replies
4. Shell Programming and Scripting
- I have two files (File 1 and File 2) and the contents of the files are mentioned below.
- I am trying to compare the values of Column1 of File1 with Column1 of File2. If a match is found, print the corresponding value from Column2 of File1 in Column5 of File2.
- I tried to modify and use... (10 Replies)
Discussion started by: Santoshbn
10 Replies
5. UNIX for Dummies Questions & Answers
Hello guys,
I have file1 and file2, two text files containing various lines.
I'm trying to find a way to compare file1 and file2:
If the first 7 characters of a line in file2 match the first 7 characters of a line in file1, then do not do anything.
Print out the lines of file1 (in file3,... (3 Replies)
Discussion started by: bobylapointe
3 Replies
6. Shell Programming and Scripting
i am working on a shell script and need help in the comparing part of it. for e.g. there two text files like this:
file1.txt
name1
name2
name3
file1 has to be comared with file2
defaultfile.txt
name1
name2
name3
name4
and during comparision with defaultfile.txt
if... (2 Replies)
Discussion started by: draghun9
2 Replies
7. Shell Programming and Scripting
Hi,
I have two text files, compare column one in both the files and if it matches then the output should contain the id in column one, the number and the description.
Both the files are sorted. Is there a one liner to get this done, kindly help. Thank you
File 1:
NC_000964 92.33 ... (2 Replies)
Discussion started by: pulikoti
2 Replies
8. UNIX for Dummies Questions & Answers
Hi Gurus,
I have 100 tab-delimited text files each with 21 columns. I want to extract only 2nd and 5th column from each text file. However, the values in both 2bd and 5th column contain duplicate values but the combination of these values in a row are not duplicate. I want to extract only those... (3 Replies)
Discussion started by: Unilearn
3 Replies
9. UNIX for Dummies Questions & Answers
Hi,
I have a process which creates almost 200K files. Each file ranging from 1kb to 5kb. This is almost 2GB of data in all files.
I have a requirement where the business needs only 100MB worth of files.
Is there a way to get files around 100MB (doesn't have to be exactly 100MB) from all the... (2 Replies)
Discussion started by: grep_me
2 Replies
10. Shell Programming and Scripting
Hi all, I'm pretty much a newbie to UNIX. I would appreciate any help with UNIX coding on comparing two large csv files (greater than 10 GB in size), and output a file with matching columns.
I want to compare file1 and file2 by 'id' and 'chain' columns, then extract exact matching rows'... (5 Replies)
Discussion started by: bkane3
5 Replies
LEARN ABOUT MINIX
dh_compress
DH_COMPRESS(1) Debhelper DH_COMPRESS(1)
NAME
dh_compress - compress files and fix symlinks in package build directories
SYNOPSIS
dh_compress [debhelperoptions] [-Xitem] [-A] [file...]
DESCRIPTION
dh_compress is a debhelper program that is responsible for compressing the files in package build directories, and makes sure that any
symlinks that pointed to the files before they were compressed are updated to point to the new files.
By default, dh_compress compresses files that Debian policy mandates should be compressed, namely all files in usr/share/info,
usr/share/man, files in usr/share/doc that are larger than 4k in size, (except the copyright file, .html and other web files, image files,
and files that appear to be already compressed based on their extensions), and all changelog files. Plus PCF fonts underneath
usr/share/fonts/X11/
FILES
debian/package.compress
These files are deprecated.
If this file exists, the default files are not compressed. Instead, the file is ran as a shell script, and all filenames that the shell
script outputs will be compressed. The shell script will be run from inside the package build directory. Note though that using -X is a
much better idea in general; you should only use a debian/package.compress file if you really need to.
OPTIONS
-Xitem, --exclude=item
Exclude files that contain item anywhere in their filename from being compressed. For example, -X.tiff will exclude TIFF files from
compression. You may use this option multiple times to build up a list of things to exclude.
-A, --all
Compress all files specified by command line parameters in ALL packages acted on.
file ...
Add these files to the list of files to compress.
CONFORMS TO
Debian policy, version 3.0
SEE ALSO
debhelper(7)
This program is a part of debhelper.
AUTHOR
Joey Hess <joeyh@debian.org>
11.1.6ubuntu2 2018-05-10 DH_COMPRESS(1)