parsing data from a big file using keys from another smaller file Post: 302511411

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Parsing the data in a file

Hi, I have file (FILE.tmp) having contents, FILE.tmp ======== filename=menudata records=0000000000037 ldbname=pinsys timestamp=2005/05/14-18:32:33 I want to parse it bring a new file which will look like, filename records ldbname timestamp...

2. Shell Programming and Scripting

Big data file - sed/grep/awk?

Morning guys. Another day another question. :rolleyes: I am knocking up a script to pull some data from a file. The problem is the file is very big (up to 1 gig in size), so this solution: for results in `grep "^\ ... works, but takes ages (we're talking minutes) to run. The data is held...

3. Shell Programming and Scripting

perl help to split big verilog file into smaller ones for each module

Hi I have a big verilog file with multiple modules. Each module begin with the code word 'module <module-name>(ports,...)' and end with the 'endmodule' keyword. Could you please suggest the best way to split each of these modules into multiple files? Thank you for the help. Example of...

4. Shell Programming and Scripting

How to cut some data from big file

How to cut data from big file my file around 30 gb I tried "head -50022172 filename > newfile.txt ,and tail -5454283 newfile.txt. It's slowy. afer that I tried sed -n '46467831,50022172p' filename > newfile.txt ,also slow Please recommend me , faster command to cut some data from...

5. Shell Programming and Scripting

Helping in parsing subset of text from a big results file

Hi All, I need some help to effectively parse out a subset of results from a big results file. Below is an example of the text file. Each block that I need to parse starts with "reading sequence file 10.codon" (next block starts with another number) and ends with **p-Value(s)**. I have given...

6. Shell Programming and Scripting

Sort a big data file

Hello, I have a big data file (160 MB) full of records with pipe(|) delimited those fields. I`m sorting the file on the first field. I'm trying to sort with "sort" command and it brings me 6 minutes. I have tried with some transformation methods in perl but it results "Out of memory". I was...

7. Shell Programming and Scripting

Segment a big file into smaller ones

Greeting to all. I have big text file that I would like to segment into many smaller files. Each file should be maximum 10 000 lines. The file is called time.txt. after the execution of the file I would like to have. time_01.txt, time_02, txt, ...,time_n.txt Can anybody help. Br.

8. Shell Programming and Scripting

parsing characters and number from a big file with brackets

I have a big file with many brackets () in it from which I need to parse number characters and numbers. Below is an example of my file 14 (((A__0:0.02,B__1:0.3)0:0.04,C__0:0.025)2:0.01),(D__0:0.00978,E__2:0.01031)1:0.00362; 15...

9. Shell Programming and Scripting

Parsing data using keys from one file

I have 2 text files where I need to parse data from file 2 using the data from file 1. Below are my sample files File 1 (tab delimited) 257 350 670 845 725 1025 767 820 ... .... .... file 2 (tab delimited) 220..450 TA AB650 ABCED 520..850 GA AB720 ABCDE 700..1100 TC AB820 ABCDE...

10. Shell Programming and Scripting

Extract data according to keys from filename mentioned in file

Hello experts, I want to join a file with files whosE names are mentioned in one of the columns of the same file. File 1 t1,a,b,file number 1 t1,a,c,file number 1 t2,c,d,file number 2 t2,c,e,file number 2 t2,c,f,file number 2 t2,c,g,file number 2 t3,e,f,file number 3 file number 1...

LEARN ABOUT DEBIAN

search::xapian::enquire

Xapian::Enquire(3pm)					User Contributed Perl Documentation				      Xapian::Enquire(3pm)

NAME

       Search::Xapian::Enquire - Make queries against a database

DESCRIPTION

       This class provides an interface to the information retrieval system for the purpose of searching.

METHODS

       new
       set_query
	   takes either a ready made Search::Xapian::Query or a scalar containing a query, which in that case will be passed to
	   Search::Xapian::Query's constructor, together with any other passed arguments.

       set_query_object <query>
       get_query
       matches <start> <size> [<check_at_least>]
	   Takes the start element, and maximum number of elements (and optionally the minimum number of matches to check), and returns an array
	   tied to Search::Xapian::MSet::Tied.

       get_matching_terms_begin
	   Returns a Search::Xapian::TermIterator, pointing to the start of the stream.

       get_matching_terms_end
	   Returns a Search::Xapian::TermIterator, pointing to the end of the stream.

       set_collapse_key <collapse_key>
       set_docid_order <order>
	   Set the direction in which documents are ordered by document id in the returned MSet.

	   This order only has an effect on documents which would otherwise have equal rank.  For a weighted probabilistic match with no sort
	   value, this means documents with equal weight.  For a boolean match, with no sort value, this means all documents.  And if a sort value
	   is used, this means documents with equal sort value (and also equal weight if ordering on relevance after the sort).

	   order can be ENQ_ASCENDING (the default, docids sort in ascending order), ENQ_DESCENDING (docds sort in descending order), or
	   ENQ_DONT_CARE (docids sort in whatever order is most efficient for the backend.)

	   Note: If you add documents in strict date order, then a boolean search - i.e. set_weighting_scheme(Search::Xapian::BoolWeight->new()) -
	   with set_docid_order(ENQ_DESCENDING) is a very efficient way to perform "sort by date, newest first".

       set_cutoff <percent_cutoff> [<weight_cutoff>]
       set_sort_by_relevance
	   Set the sorting to be by relevance only.  This is the default.

       set_sort_by_value <sort_key> [<ascending>]
	   Set the sorting to be by value only.

	   sort_key - value number to reorder on.  Sorting is with a string compare.  If ascending is true (the default) higher is better; if
	   ascending is false, lower is better.

	   ascending - If true, document values which sort higher by string compare are better.  If false, the sort order is reversed.	(default
	   true)

       set_sort_by_value_then_relevance <sort_key> [<ascending>]
	   Set the sorting to be by value, then by relevance for documents with the same value.

	   sort_key - value number to reorder on.  Sorting is with a string compare.  If ascending is true (the default) higher is better; if
	   ascending is false, lower is better.

	   ascending - If true, document values which sort higher by string compare are better.  If false, the sort order is reversed.	(default
	   true)

       set_sort_by_relevance_then_value <sort_key> [<ascending>]
	   Set the sorting to be by relevance then value.

	   Note that with the default BM25 weighting scheme parameters, non-identical documents will rarely have the same weight, so this setting
	   will give very similar results to set_sort_by_relevance().  It becomes more useful with particular BM25 parameter settings (e.g.
	   BM25Weight(1,0,1,0,0)) or custom weighting schemes.

	   sort_key - value number to reorder on.  Sorting is with a string compare.  If ascending is true (the default) higher is better; if
	   ascending is false, lower is better.

	   ascending - If true, document values which sort higher by string compare are better.  If false, the sort order is reversed.	(default
	   true)

       set_sort_by_key <sorter> [<ascending>]
	   Set the sorting to be by key only.

	   sorter - the functor to use to build the key.

	   ascending - If true, keys which sort higher by string compare are better.  If false, the sort order is reversed.  (default true)

       set_sort_by_key_then_relevance <sorter> [<ascending>]
	   Set the sorting to be by key, then by relevance for documents with the same key.

	   sorter - the functor to use to build the key.

	   ascending - If true, keys which sort higher by string compare are better.  If false, the sort order is reversed.  (default true)

       set_sort_by_relevance_then_key <sorter> [<ascending>]
	   Set the sorting to be by relevance then key.

	   sorter - the functor to use to build the key.

	   ascending - If true, keys which sort higher by string compare are better.  If false, the sort order is reversed.  (default true)

       get_mset
	   Get match set.

       get_eset <maxitems> <rset> [<decider>]
	   Get set of query expansion terms.

       get_description
	   Return a description of this object.

SEE ALSO

       Search::Xapian::Query, Search::Xapian::Database

perl v5.14.2							    2012-05-09						      Xapian::Enquire(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Parsing the data in a file

Discussion started by: Omkumar

2. Shell Programming and Scripting

Big data file - sed/grep/awk?

Discussion started by: dlam

3. Shell Programming and Scripting

perl help to split big verilog file into smaller ones for each module

Discussion started by: return_user

4. Shell Programming and Scripting

How to cut some data from big file

Discussion started by: almanto

5. Shell Programming and Scripting

Helping in parsing subset of text from a big results file

Discussion started by: Lucky Ali

6. Shell Programming and Scripting

Sort a big data file

Discussion started by: rubber08

7. Shell Programming and Scripting

Segment a big file into smaller ones

Discussion started by: flash80

8. Shell Programming and Scripting

parsing characters and number from a big file with brackets

Discussion started by: Lucky Ali

9. Shell Programming and Scripting

Parsing data using keys from one file

Discussion started by: Lucky Ali

10. Shell Programming and Scripting

Extract data according to keys from filename mentioned in file

Discussion started by: ritakadm

LEARN ABOUT DEBIAN

search::xapian::enquire