Hello All,
I have two VERY large .csv files that I want to compare values based on substrings. If the lines are unique, then print the line.
For example, if I run a
I get results similar to
I want to compare the ids (string between "_" and ",") and if it's unique, then print the line so my output would be like the following:
Output:
I was thinking I could sue the cut command and delimit on the first "_" but didn't know how to compare all the values up until you reach the first comma.
Any suggestions?
Last edited by Scott; 09-13-2013 at 12:39 PM..
Reason: Code tags for input and output too
I have a datafile that is formatted as fixed.
I know that each line should contain 880 characters.
I want to separate the file into 2 files, one that has lines with 880 characters and the other file with everything else.
Is this possible ? (9 Replies)
Hi All,
I have a very huge file (4GB) which has duplicate lines. I want to delete duplicate lines leaving unique lines. Sort, uniq, awk '!x++' are not working as its running out of buffer space.
I dont know if this works : I want to read each line of the File in a For Loop, and want to... (16 Replies)
hi
my problem is little complicated one. i have 2 files which appear like this
file 1
abbsss:aa:22:34:as akl abc 1234
mkilll:as:ss:23:qs asc abc 0987
mlopii:cd:wq:24:as asd abc 7866
file2
lkoaa:as:24:32:sa alk abc 3245
lkmo:as:34:43:qs qsa abc 0987
kloia:ds:45:56:sa acq abc 7805
i... (5 Replies)
Hello,
I need some sort of way to extract every date contained in a file, and count how many of those dates there are.
Here are the specifics:
The date format I'm looking for is mm/dd/yyyy
I only need to look after line 45 in the file (that's where the data begins)
The columns of... (2 Replies)
Hello guys. I really hope someone will help me with this one..
So, I have to write this script who:
- creates a file home/student/vmdisk of 10 mb
- formats that file to ext3
- mounts that partition to /mnt/partition
- creates a file /mnt/partition/data. In this file, there will... (1 Reply)
Hello to all, happy new year 2013!
May somebody could help me, is about a very similar problem to the problem I've posted here where the member rdrtx1 and bipinajith helped me a lot.
https://www.unix.com/shell-programming-scripting/211147-map-values-blocks-single-line-2.html
It is very... (3 Replies)
file1:
file2:
I need to find matches for any lines in file1 that appear in file2. Desired output is '>' plus the file1 term, followed by the line after the match in file2 (so the title is a little misleading):
This is honestly beyond what I can do without spending the whole night on it, so I'm... (2 Replies)
I would like to print unique lines without sort or unique. Unfortunately the server I am working on does not have sort or unique. I have not been able to contact the administrator of the server to ask him to add it for several weeks. (7 Replies)
For some reason I am having difficulty performing what should be a fairly easy task. I would like to print lines of a file that have a unique value in the first field. For example, I have a large data-set with the following excerpt:
PS003,001 MZMWR/ L-DWD// *
PS003,001... (4 Replies)
Discussion started by: jvoot
4 Replies
LEARN ABOUT DEBIAN
locale::language
Locale::Language(3perl) Perl Programmers Reference Guide Locale::Language(3perl)NAME
Locale::Language - standard codes for language identification
SYNOPSIS
use Locale::Language;
$lang = code2language('en'); # $lang gets 'English'
$code = language2code('French'); # $code gets 'fr'
@codes = all_language_codes();
@names = all_language_names();
DESCRIPTION
The "Locale::Language" module provides access to standard codes used for identifying languages, such as those as defined in ISO 639.
Most of the routines take an optional additional argument which specifies the code set to use. If not specified, the default ISO 639 two-
letter codes will be used.
SUPPORTED CODE SETS
There are several different code sets you can use for identifying languages. The ones currently supported are:
alpha-2
This is the set of two-letter (lowercase) codes from ISO 639, such as 'he' for Hebrew.
This code set is identified with the symbol "LOCALE_LANG_ALPHA_2".
This is the default code set.
alpha-3
This is the set of three-letter (lowercase) bibliographic codes from ISO 639, such as 'heb' for Hebrew.
This code set is identified with the symbol "LOCALE_LANG_ALPHA_3".
term
This is the set of three-letter (lowercase) terminologic codes from ISO 639.
This code set is identified with the symbol "LOCALE_LANG_TERM".
ROUTINES
code2language ( CODE [,CODESET] )
language2code ( NAME [,CODESET] )
language_code2code ( CODE ,CODESET ,CODESET2 )
all_language_codes ( [CODESET] )
all_language_names ( [CODESET] )
Locale::Language::rename_language ( CODE ,NEW_NAME [,CODESET] )
Locale::Language::add_language ( CODE ,NAME [,CODESET] )
Locale::Language::delete_language ( CODE [,CODESET] )
Locale::Language::add_language_alias ( NAME ,NEW_NAME )
Locale::Language::delete_language_alias ( NAME )
Locale::Language::rename_language_code ( CODE ,NEW_CODE [,CODESET] )
Locale::Language::add_language_code_alias ( CODE ,NEW_CODE [,CODESET] )
Locale::Language::delete_language_code_alias ( CODE [,CODESET] )
These routines are all documented in the Locale::Codes man page.
SEE ALSO
Locale::Codes
Locale::Constants
http://www.loc.gov/standards/iso639-2/
Source of the ISO 639 codes.
AUTHOR
See Locale::Codes for full author history.
Currently maintained by Sullivan Beck (sbeck@cpan.org).
COPYRIGHT
Copyright (c) 1997-2001 Canon Research Centre Europe (CRE).
Copyright (c) 2001-2010 Neil Bowers
Copyright (c) 2010-2011 Sullivan Beck
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
perl v5.14.2 2011-09-26 Locale::Language(3perl)