Skip the delimiter with in double quotes and count the number of delimiters during data extract
Hi All,
I'm stuck-up in finding a way to skip the delimiter which come within double quotes using awk or any other better option. can someone please help me out.
I know this is probably much simplier than I am making but I need some help please. I have a data file that contains a value on the first line between double quotes ("00043"). I need to assign the value between the first set quotes to a variable in my perl script for comparison analysis. Also,... (6 Replies)
I have a list of files with names as "FULL_abcd_xyz_timestamp.txt" and "FULL_xx_abcd_xyz_timestamp.txt". I am writing a script with a 'for loop' to take each file, strip the "FULL" and "timestamp" from the file name and do some actions on the contains of the file. So I need to know the number of... (4 Replies)
Hi,
I'm trying to insert double quotes right after the third delimiter in a file. Delimiter is ^Z.
For example:
Input:
Oct ^Z 1234 ^Z John ^Z Hello!"
Desired Output:
Oct ^Z 1234 ^Z John ^Z "Hello!"
Any ideas? (1 Reply)
Hi,
my requirement is to find the count of commas in a string excluding the ones in double quotes.
For example:
If the input string is
abc,xyz.com,lmhgdf,"abc, 401 street","tty,stt",45,23,45
The output should be 7 (7 Replies)
Hello everybody!
I am quit new here and hope you can help me.
Using an awk script I am trying to extract data from several files. The structure of the input files is as follows:
TimeStep parameter1 parameter2 parameter3 parameter4
e.g.
1 X Y Z L
1 D H Z I
1 H Y E W
2 D H G F
2 R... (2 Replies)
Need a little help.
I have just a simple string with a lot double quotes in it. I need to be able to parse through this string, and know how many double quotes I have, and where I am, so I can key off every 9th double quote. For example (coding is not complete):
#!/bin/bash
count=0... (3 Replies)
I would like to know how to replace a space delimiter with a ^_ (\037) delimiter and a double quote delimiter while maintaining the spaces inside the double quotes. The double quote delimiter is only used on text fields.
I'd prefer a one-liner, but could handle a function or script that accepts... (4 Replies)
i want extract where the 2nd column having "3" or "7".
Based on the forums tried like this but it is not working
awk -F"," '$2=3;$2=7 {print}' filename
Source
"1","2","3","4"
"1","3","3","4"
"1","7","3","4"
"1","8","3","4"
"1","2","3","4"
"1","2","3","4"
Output :
... (5 Replies)
What to know the way to count the number of delimiters in each record by ignoring the escape delimiters.
Sample Data:
12345678|ABN\|XYZ MED CHEM PTY. LTD.|C||100.00|22|AB"C\|Corp|"XYZ|CDEF"|
I'm using awk -F'|' '{ print NF-1 }' command to find the number of delimiters. this command... (8 Replies)
Hi All,
I have my data like below
"1","abc,db","hac,aron","4","5"
Now I need to extract 1,2,4th columns
Output should be like
"1",abc,db","4"
Am trying to use cut command but not able to get the results.
Thanks in advance. (4 Replies)
Discussion started by: weknowd
4 Replies
LEARN ABOUT DEBIAN
uri::find::delimited
URI::Find::Delimited(3pm) User Contributed Perl Documentation URI::Find::Delimited(3pm)NAME
URI::Find::Delimited - Find URIs which may be wrapped in enclosing delimiters.
DESCRIPTION
Works like URI::Find, but is prepared for URIs in your text to be wrapped in a pair of delimiters and optionally have a title. This will be
useful for processing text that already has some minimal markup in it, like bulletin board posts or wiki text.
SYNOPSIS
my $finder = URI::Find::Delimited->new;
my $text = "This is a [http://the.earth.li/ titled link].";
$finder->find($text);
print $text;
METHODS
new
my $finder = URI::Find::Delimited->new(
callback => &callback,
delimiter_re => [ '[', ']' ],
ignore_quoted => 1 # defaults to 0
);
All arguments are optional; defaults are provided (see below).
Creates a new URI::Find::Delimited object. This object works similarly to a URI::Find object, but as well as just looking for URIs it
is also aware of the concept of a wrapped, titled URI. These look something like
[http://foo.com/ the foo website]
where:
* "[" is the opening delimiter
* "]" is the closing delimiter
* "http://foo.com/" is the URI
* "the foo website" is the title
* the URI and title are separated by spaces and/or tabs
The URI::Find::Delimited object will extract each of these parts separately and pass them to your callback.
callback
"callback" is a function which is called on each URI found. It is passed five arguments: the opening delimiter (if found), the
closing delimiter (if found), the URI, the title (if found), and any whitespace found between the URI and title.
The return value of the callback will replace the original URI in the text.
If you do not supply your own callback, the object will create a default one which will put your URIs in 'a href' tags using the
URI for the target and the title for the link text. If no title is provided for a URI then the URI itself will be used as the
title. If the delimiters aren't balanced (eg if the opening one is present but no closing one is found) then the URI is treated as
not being wrapped.
Note: the default callback will not remove the delimiters from the text. It should be simple enough to write your own callback to
remove them, based on the one in the source, if that's what you want. In fact there's an example in this distribution, in
"t/delimited.t".
delimiter_re
The "delimiter_re" parameter is optional. If you do supply it then it should be a ref to an array containing two regexes. It
defaults to using single square brackets as the delimiters.
Don't use capturing groupings "( )" in your delimiters or things will break. Use non-capturing "(?: )" instead.
ignore_quoted
If the "ignore_quoted" parameter is supplied and set to a true value, then any URIs immediately preceded with a double-quote char-
acter will not be matched, ie your callback will not be executed for them and they'll be treated just as normal text.
This is kinda lame but it's in here because I need to be able to ignore things like
<img src="http://foo.com/bar.gif">
A better implementation may happen at some point.
SEE ALSO
URI::Find.
AUTHOR
Kake Pugh (kake@earth.li).
COPYRIGHT
Copyright (C) 2003 Kake Pugh. All Rights Reserved.
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
CREDITS
Tim Bagot helped me stop faffing over the name, by pointing out that RFC 2396 Appendix E uses "delimited". Dave Hinton helped me fix the
regex to make it work for delimited URIs with no title. Nick Cleaton helped me make "ignore_quoted" work. Some of the code was taken from
URI::Find.
perl v5.8.8 2008-03-01 URI::Find::Delimited(3pm)