Generate separate files with similar and dissimilar contents
Hello experts,
I have 2 files 1.txt (10,000 lines of text) and 2.txt (7500 lines of text).
Both files have similar as well as dissimilar entries.
Is there a way(s) where i can perform the following operations :
1. Generate a file which will have all similar lines.
2. Generate a file which will have all dissimilar lines.
On my part, I performed the following command in order to, generate a file which will have all dissimilar lines :
Example of file 1.txt
Example of file 2.txt
Could you please help with both these queries.
Thank you.
Regards,
Haider
Last edited by Scrutinizer; 07-22-2016 at 05:48 AM..
Reason: adding example; [mod] icode tags changed to code tags. Added code tags for data samples
Hi guyz I have been using this script to find similar keys in 2 files and merge the keys along with their values. Therefore it prints similar keys by leaving dissimilar. Any one knows how to print only Dissimilar leaving Similar.
Help would be appreciated.
The script I'm using for similar... (4 Replies)
I got many pair files, which only have small difference, such as more space, or more empty line, and some unreadable characters.
If list by commend "diff", I can see many many difference.
So I'd like to write a script to compare the pair files, if 95% contents are same, I will think they are... (2 Replies)
Hi,
I have a file like this
cpsSystemNotifyTrap='2010/12/14 11:05:31 CST' Manufacturer=IBM ReportingMTMS=n/a ProbNm=26 LparName=n/a FailingEnclosureMTMS=7946-IQL*99G4874 SRC=B3031107 EventText=Problem reported by customer. CallHome=true Calendar
I want to have a output like this... (6 Replies)
I am familiar with scripting, but I am trying to see if there is an easy way to append files from similar files into one file. For example, if there is file1_20121201, file1_20121202, file1_20121203,
file2_20121201, file2_20121202, file2_20121203
I want to be able to combine all the data from... (3 Replies)
Hey guys,
Sorry for the basic question but I have a lot of files that I want to separate into groups based on filenames which I can then cat together. Eg I have:
(a_b_c.txt)
WB34_2_SLA8.txt
WB34_1_SLA8.txt
WB34_1_DB10.txt
WB34_2_DB10.txt
WB34_1_SLA8.txt
WB34_2_SLA8.txt
77_1_SLA8.txt... (1 Reply)
Hello all,
I have a server that is running AIX, running a tool that converts various printstreams (AFP/Metadata) to PDF. This is done using a rexx script and an off the shelf utility.
Each report (there's around 125) uses a certain script file, it's basically a text file.
I am trying... (5 Replies)
Hi,
I have a file1 like this:
ABAT
ABCA1
ABCC1
ABCC5
ABCC8
ABCE1
ABHD2
ABL1
CAMTA1
ACBD3
ACCN1
And I have a second file like this:
chr19 46118590 46119564 MACS_peak_1499 3100.00 chr19 46122009 46148405 CYP2B7P1 -2445
chr1 7430312 7430990... (7 Replies)
Dear Gents,
Please I need your help... I need small script :) to do the following.
I have a thousand of files in a folder produced daily.
I need first to merge all files called. txt (0009.txt, 0010.txt, 0011.txt) and and to output a resume of all information on 2 separate files in csv... (14 Replies)
Dear all,
I have a simple question. I have a file like below (separated by tab):
col1 col2 col3 col4 col5 col6 col7
21 66745 rs1234 21 rs5678 23334 0.89
21 66745 rs2334 21 rs9978 23334 0.89
21 66745 ... (4 Replies)
Hi,
I need to compare the /etc/passwd files from 2 servers, and extract the users that are similar in these two files. I sorted the 2 files based on the user IDs (UID) (3rd column). I first sorted the files using the username (1st column), however when I use comm to compare the files there is no... (1 Reply)
Discussion started by: anaigini45
1 Replies
LEARN ABOUT DEBIAN
pescetti
PESCETTI(1) General Commands Manual PESCETTI(1)NAME
pescetti -- Pseudo-Duplimate Generator
SYNOPSIS
pescetti
DESCRIPTION
This manual page documents briefly the pescetti command.
OPTIONS
Here are a list of the available options and what they do. You must specify exactly one from --demo, --generate or --load.
--help Prints the help text
--demo Demonstration mode. Generates one hand with permutations and the tutorial for how to use them.
--generate=N
Generate N random boards
--load=boards.txt
Load boards+analysis from boards.txt
--load-dds=boards.dds
Load boards from boards.dds in dds format
--load-analysis=tricks.txt
Load analysis from tricks.txt
--permutations=permutations.txt
Generate the permutations and save them to the given file
--curtains=curtains.txt
Save curtain cards to file curtains.txt
--save=boards.txt
Save the boards+analysis to boards.txt
--save-dds=boards.dds
Save the boards to boards.dds in dds format
--save-analysis=tricks.txt
Save the analysis to tricks.txt
--format=html|txt|pdf
Set the output mode to the given format
--title=title
Set the title for the output
--output=hands.txt
Print the hands to hands.txt, rather than to standard output
--stats Generate statistics about the set of boards; included in the hands output
--analyze Run the dds analyzer on the boards and print the resulting numberof tricks (warning SLOW)
--criteria=
A list of criteria to apply to each generated hand to generate specific hand types. The list should be space separated and each
item may be suffixed with a colon and a (fractional) probability value which can be used to weight the criteria.
E.g. --criteria="weaknt:0.8 strongnt:0.5"
Valid criteria are: unbalanced weaknt strongnt twont strongtwo weaktwo three twoclubs 4441 singlesuit twosuits partscore game
slam game-invite slam-invite jumpshift jumpfit splinter bacon weird
--probability=factor
Generate hands matching the criteria with only the given probability. Factor is in the range 0 to 1. On each attempt to generate
a board it is rejected if it doesn't match the criteria with the given probability. A factor of about 0.8 gives roughly half
matching boards
AUTHOR
This manual page was written by Matthew Johnson <debian@matthew.ath.cx>. Permission is granted to copy, distribute and/or modify this docu-
ment under the terms of the GNU General Public License, Version 2 as published by the Free Software Foundation.
On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL.
PESCETTI(1)