Hi, sorry, I'm no coder, so I came here counting on your free time and goodwill to ask for some working code. I'll try to be quick and concise!
I've got a file with 50k lines like this:
The problem is that a similar (but usually not identical) line may appear anywhere in the file, and it needs to be recognized as a duplicate and deleted!
An example of what could be found and should be recognized (and deleted) as a duplicate:
So I guess the algorithm should basically do this:
1. From each line, read only the letters [a-z], [A-Z] and the digits [0-9], disregarding any spacing, special characters or punctuation.
2. Compare the result with every other line (reduced in the same way to a-z, A-Z, 0-9); if the same sequence of letters and digits is found (ignoring spacing, case, special chars...), delete one of the two lines (it doesn't matter which one).
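The two steps above could be sketched roughly like this in Python. This is only a minimal illustration under a few assumptions: the function names `normalize` and `dedupe` are made up for the example, the first of each group of matching lines is the one that is kept, and the whole set of reduced keys fits in memory (which should be fine for 50k lines). Instead of literally comparing every line with every other line, it remembers the reduced form of each line in a set, which gives the same result with a single pass over the file.

```python
import re
import sys


def normalize(line):
    """Reduce a line to its comparison key: ASCII letters and digits only, lowercased."""
    # Strip everything that is not a-z, A-Z or 0-9, then ignore case.
    return re.sub(r"[^A-Za-z0-9]", "", line).lower()


def dedupe(lines):
    """Yield each line whose key has not been seen before, keeping input order."""
    seen = set()
    for line in lines:
        key = normalize(line)
        if key in seen:
            continue  # same letters and digits as an earlier line: drop it
        seen.add(key)
        yield line


if __name__ == "__main__":
    # Reads the file on standard input and writes the de-duplicated lines
    # to standard output, leaving the original file untouched.
    sys.stdout.writelines(dedupe(sys.stdin))
```

Saved under a placeholder name such as `dedupe.py`, it could be run as `python dedupe.py < input.txt > output.txt`. Note that lines containing no letters or digits at all (for example blank lines) all reduce to the same empty key, so only the first of them survives.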