Sponsored Content
Full Discussion: Extract pattern from text
Top Forums Shell Programming and Scripting Extract pattern from text Post 302969252 by Aia on Sunday 20th of March 2016 02:11:16 PM
Old 03-20-2016
Got Perl?
Code:
perl -0ne 'while(/(D\s8{4}\s4[43])\s(\w+)/g){print "$1 $2\n"}' stinkefisch.input

Code:
D 8888 43 BBBBBBBBBBBBBB
D 8888 44 AAAAA4FFBBBBBB
D 8888 44 CCCCCCCCCCCCCC
D 8888 43 EEEEEEEEEEEEEE
D 8888 43 FFFFFFFFFFFFFFFFF

 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Extract pattern from text line

Gents, from these sample lines: ZUCR.MI ZUCCHI SPA RISP NC 2,5000 6 ott 0,0000 ZV.MI ZIGNAGO VETRO 3,6475 16:36 Up 0,0075 is it possible to get this: ZUCR.MI 2,5000 ZV.MI 3,6475 i.e. the first field, a separator and the first decimal number? (in Europe we... (9 Replies)
Discussion started by: vampirodolce
9 Replies

2. Shell Programming and Scripting

Extract pattern from text line

Hi, the text line looks like this: "test1" " " "test2" "test3" "test4" "10" "test 10 12" "00:05:58" "filename.bin" "3.3MB" "/dir/name" "18459" what's the best way to select any of it? So I can for example get only the time or size and so on. I was trying awk -F""" '{print $N}' but... (3 Replies)
Discussion started by: TehOne
3 Replies

3. Programming

c program to extract text between two delimiters from some text file

needa c program to extract text between two delimiters from some text file. and then storing them in to diffrent variables ? text file like 0: abc.txt ========= aaaaaa|11111111|sssssssssss|333333|ddddddddd|34343454564|asass aaaaaa|11111111|sssssssssss|333333|ddddddddd|34343454564|asass... (7 Replies)
Discussion started by: kukretiabhi13
7 Replies

4. Shell Programming and Scripting

Extract pattern from text line

The text line has the following formats: what.ever.bla.bla.C01G06.BLA.BLA2 what.ever.bla.bla.C11G33.BLA.BLA2 what.ever.bla.bla.01x03.BLA.BLA2 what.ever.bla.bla.03x05.BLA.BLA2 what.ever.bla.bla.Part01.BLA.BLA2 and other similar ones, I need a way to select the "what.ever.bla.bla" part out... (4 Replies)
Discussion started by: TehOne
4 Replies

5. Shell Programming and Scripting

sed: Find start of pattern and extract text to end of line, including the pattern

This is my first post, please be nice. I have tried to google and read different tutorials. The task at hand is: Input file input.txt (example) abc123defhij-E-1234jslo 456ujs-W-abXjklp From this file the task is to grep the -E- and -W- strings that are unique and write a new file... (5 Replies)
Discussion started by: TestTomas
5 Replies

6. Shell Programming and Scripting

extract unique pattern from large text file

Hi All, I am trying to extract data from a large text file , I want to extract lines which contains a five digit number followed by a hyphen , like 12345- , i tried with egrep ,eg : egrep "+" text.txt but which returns all the lines which contains any number of digits followed by hyhen ,... (19 Replies)
Discussion started by: shijujoe
19 Replies

7. Shell Programming and Scripting

Search for a pattern,extract value(s) from next line, extract lines having those extracted value(s)

I have hundreds of files to process. In each file I need to look for a pattern then extract value(s) from next line and then search for value(s) selected from point (2) in the same file at a specific position. HEADER ELECTRON TRANSPORT 18-MAR-98 1A7V TITLE CYTOCHROME... (7 Replies)
Discussion started by: AshwaniSharma09
7 Replies

8. Shell Programming and Scripting

Extract specific line in an html file starting and ending with specific pattern to a text file

Hi This is my first post and I'm just a beginner. So please be nice to me. I have a couple of html files where a pattern beginning with "http://www.site.com" and ending with "/resource.dat" is present on every 241st line. How do I extract this to a new text file? I have tried sed -n 241,241p... (13 Replies)
Discussion started by: dejavo
13 Replies

9. Shell Programming and Scripting

Extract all the sentences from a text file that matches a pattern list

Hi I have a big text file. I want to extract all the sentences that matches at least 70% (seventy percent) of the words from each sentence based on a word list called A. Say the format of the text file is as given below: This is the first sentence which consists of fifteen words... (4 Replies)
Discussion started by: my_Perl
4 Replies

10. Shell Programming and Scripting

Extract whole word preceding a specific character pattern with first occurence of the pattern

Hello. Here is a file contents : declare -Ax NEW_FORCE_IGNORE_ARRAY=(="§" ="§" ="§" ="§" ="§" .................. ="§"Here is a pattern =I want to extract 'NEW_FORCE_IGNORE_ARRAY' which is the whole word before the first occurrence of pattern '=' Is there a better solution than mine :... (3 Replies)
Discussion started by: jcdole
3 Replies
STAG-IR(1p)						User Contributed Perl Documentation					       STAG-IR(1p)

NAME
stag-ir.pl - information retrieval using a simple relational index SYNOPSIS
stag-ir.pl -r person -k social_security_no -d Pg:mydb myrecords.xml stag-ir.pl -d Pg:mydb -q 999-9999-9999 -q 888-8888-8888 DESCRIPTION
Indexes stag nodes (XML Elements) in a simple relational db structure - keyed by ID with an XML Blob as a value Imagine you have a very large file of data, in a stag compatible format such as XML. You want to index all the elements of type person; each person can be uniquely identified by social_security_no, which is a direct subnode of person The first thing to do is to build the index file, which will be stored in the database mydb stag-ir.pl -r person -k social_security_no -d Pg:mydb myrecords.xml You can then use the index "person-idx" to retrieve person nodes by their social security number stag-ir.pl -d Pg:mydb -q 999-9999-9999 > some-person.xml You can export using different stag formats stag-ir.pl -d Pg:mydb -q 999-9999-9999 -w sxpr > some-person.xml You can retrieve multiple nodes (although these need to be rooted to make a valid file) stag-ir.pl -d Pg:mydb -q 999-9999-9999 -q 888-8888-8888 -top personset Or you can use a list of IDs from a file (newline delimited) stag-ir.pl -d Pg:mydb -qf my_ss_nmbrs.txt -top personset ARGUMENTS -d DB_NAME This database will be used for storing the stag nodes The name can be a logical name or DBI locator or DBStag shorthand - see DBIx::DBStag The database must already exist -clear Deletes all data from the relation type (specified with -r) before loading -insertonly Does not check if the ID in the file exists in the db - will always attempt an INSERT (and will fail if ID already exists) This is the fastest way to load data (only one SQL operation per node rather than two) but is only safe if there is no existing data (Default is clobber mode - existing data with same ID will be replaced) -newonly If there is already data in the specified relation in the db, and the XML being loaded specifies an ID that is already in the db, then this node will be ignored (Default is clobber mode - existing data with same ID will be replaced) -transaction_size A commit will be performed every n UPDATEs/COMMITs (and at the end) Default is autocommit note that if you are using -insertonly, and you are using transactions, and the input file contains an ID already in the database, then the transaction will fail because this script will try and insert a duplicate ID -r RELATION-NAME This is the name of the stag node (XML element) that will be stored in the index; for example, with the XML below you may want to use the node name person and the unique key id <person_set> <person> <id>...</id> </person> <person> <id>...</id> </person> ... </person_set> This flag should only be used when you want to store data -k UNIQUE-KEY This node will be used as the unique/primary key for the data This node should be nested directly below the node that is being stored in the index - if it is more that one below, specify a path This flag should only be used when you want to store data -u UNIQUE-KEY Synonym for -k -create If specified, this will create a table for the relation name specified below; you should use this the first time you index a relation -idtype TYPE (optional) This is the SQL datatype for the unique key; it defaults to VARCHAR(255) If you know that your id is an integer, you can specify INTEGER here If your id is always a 8-character field you can do this -idtype 'CHAR(8)' This option only makes sense when combined with the -c option -p PARSER This can be the name of a stag supported format (xml, sxpr, itext) - XML is assumed by default It can also be a module name - this module is used to parse the input file into a stag stream; see Data::Stag::BaseGenerator for details on writing your own parsers/event generators This flag should only be used when you want to store data -q QUERY-ID Fetches the relation/node with unique key value equal to query-id Multiple arguments can be passed by specifying -q multple times This flag should only be used when you want to query data -top NODE-NAME If this is specified in conjunction with -q or -qf then all the query result nodes will be nested inside a node with this name (ie this provides a root for the resulting document tree) -qf QUERY-FILE This is a file of newline-seperated IDs; this is useful for querying the index in batch -keys This will write a list of all primary keys in the index SEE ALSO
Data::Stag For more complex stag to database mapping, see DBIx::DBStag and the scripts stag-db.pl use file DBM indexes stag-storenode.pl is for storing fully normalised stag trees selectall_xml perl v5.12.4 2010-01-21 STAG-IR(1p)
All times are GMT -4. The time now is 07:42 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy