Using Linux Commands on selected text


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Using Linux Commands on selected text
# 1  
Old 10-31-2015
Using Linux Commands on selected text

Hi

I have a XML file as shown below:


Code:
<Text Text_ID="10155645315850165_10155645333075165" From="460350337463650" Created="2014-10-16T17:05:37+0000" use_count="536">This is the first text</Text>
<Text Text_ID="10155645315850165_10155645317025165" From="1626711840908498" Created="2014-10-16T17:01:02+0000" use_count="408">This is the second text</Text>
<Text Text_ID="10155645315850165_10155645320000165" From="1481727095388591" Created="2014-10-16T17:02:04+0000" use_count="1064">This is the third text
If counted 
GOT IT... ����</Text>
<Text Text_ID="10154713369385165_10154714450825165" From="464236763734179" Created="2015-10-30T23:34:47+0000" use_count="1">This is is just a sample text......</Text>
<Text Text_ID="10154713369385165_10154714444345165" From="642181809247720" Created="2015-10-30T23:31:48+0000" use_count="1">This is just another sample text.......</Text>
<Text Text_ID="10154713369385165_10154714441590165" From="320601854777120" Created="2015-10-30T23:31:08+0000" use_count="1">जय जय '</Text>
<Text Text_ID="10154713369385165_10154714441495165" From="882276951835278" Created="2015-10-30T23:31:07+0000" use_count="1">posts</Text>
<Text Text_ID="10154713369385165_10154714430150165" From="481434655355081" Created="2015-10-30T23:28:39+0000" use_count="1">नमज</Text>
<Text Text_ID="10154713369385165_10154714426085165" From="415855878601070" Created="2015-10-30T23:27:48+0000" use_count="1">This is the fourth text........</Text>
<Text Text_ID="10154713369385165_10154714408620165" From="712354448887766" Created="2015-10-30T23:24:10+0000" use_count="1">We love you</Text>
<Text Text_ID="10154713369385165_10154714406055165" From="10202898434142187" Created="2015-10-30T23:23:34+0000" use_count="1">Jor se Bharat Mata ki jai</Text>

I need help if I can use Linux command on the text part, i.e,

Code:
This is the first text
This is the second text
This is the third text
If counted 
GOT IT... ����
This is is just a sample text......
This is just another sample text.......
जय जय '
posts
नमज
This is the fourth text........
We love you
Jor se Bharat Mata ki jai

Is there some way that I can use direct unix command? For example, a unix command to filter text with four (4) or more words in the texts and OUTPUT file should appear as XML file as shown below:

Code:
<Text Text_ID="10155645315850165_10155645333075165" From="460350337463650" Created="2014-10-16T17:05:37+0000" use_count="536">This is the first text</Text>
<Text Text_ID="10155645315850165_10155645317025165" From="1626711840908498" Created="2014-10-16T17:01:02+0000" use_count="408">This is the second text</Text>
<Text Text_ID="10155645315850165_10155645320000165" From="1481727095388591" Created="2014-10-16T17:02:04+0000" use_count="1064">This is the third text
If counted 
GOT IT... ����</Text>
<Text Text_ID="10154713369385165_10154714450825165" From="464236763734179" Created="2015-10-30T23:34:47+0000" use_count="1">This is is just a sample text......</Text>
<Text Text_ID="10154713369385165_10154714444345165" From="642181809247720" Created="2015-10-30T23:31:48+0000" use_count="1">This is just another sample text.......</Text>
<Text Text_ID="10154713369385165_10154714426085165" From="415855878601070" Created="2015-10-30T23:27:48+0000" use_count="1">This is the fourth text........</Text>
<Text Text_ID="10154713369385165_10154714406055165" From="10202898434142187" Created="2015-10-30T23:23:34+0000" use_count="1">Jor se Bharat Mata ki jai</Text>

Thanks in advance.Smilie. I also need help on how to remove duplicate text from this XML file.

Last edited by my_Perl; 11-01-2015 at 01:31 AM.. Reason: Editing
# 2  
Old 10-31-2015
What operating system and shell are you using?

What have you tried?
This User Gave Thanks to Don Cragun For This Post:
# 3  
Old 10-31-2015
I use Ubuntu 14.04 LTS and bash shell. Smilie
# 4  
Old 10-31-2015
Hi my_Perl,

You can try:
Code:
awk 'BEGIN{RS=">\n"} NF>=8{print}' file

Regards

Last edited by cgkmal; 10-31-2015 at 08:36 PM.. Reason: Change NF>8 to NF>=8...
This User Gave Thanks to cgkmal For This Post:
# 5  
Old 10-31-2015
I am getting the following output


Code:
<Text Text_ID="10155645315850165_10155645333075165" From="460350337463650" Created="2014-10-16T17:05:37+0000" use_count="536">This is the first text</Text
<Text Text_ID="10155645315850165_10155645317025165" From="1626711840908498" Created="2014-10-16T17:01:02+0000" use_count="408">This is the second text</Text
<Text Text_ID="10155645315850165_10155645320000165" From="1481727095388591" Created="2014-10-16T17:02:04+0000" use_count="1064">This is the third text
If counted
GOT IT... 👍👍</Text
<Text Text_ID="10154713369385165_10154714450825165" From="464236763734179" Created="2015-10-30T23:34:47+0000" use_count="1">This is is just a sample text......</Text
<Text Text_ID="10154713369385165_10154714444345165" From="642181809247720" Created="2015-10-30T23:31:48+0000" use_count="1">This is just another sample text.......</Text
<Text Text_ID="10154713369385165_10154714406055165" From="10202898434142187" Created="2015-10-30T23:23:34+0000" use_count="1">Jor se Bharat Mata ki jai</Text


The closing of the </Text is missing, i.e., </Text>
# 6  
Old 10-31-2015
You're rigth. Please use instead.

Code:
 awk 'BEGIN{RS=">\n"} NF>=8{print $0">"}' file

Regards
This User Gave Thanks to cgkmal For This Post:
# 7  
Old 10-31-2015
Thanks. How do I remove duplicate text from the XML file ?
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. What is on Your Mind?

JQuery to Add Code Tags to Selected Text

Hey. Someone find or write some jQuery code where we can select text with our mouse and then click or double click the highlighted / selected text and then it will wrap code tags around the highlighted text (in our editors). :) (0 Replies)
Discussion started by: Neo
0 Replies

2. Linux

How to run commands with pipe from text file?

Hello, I have standard loop while read -r info; do command $info done < info in info text file I have multiple commands each on line that I want to execute. When I used them in console they worked, but not with this loop. This is one of the commands in info file: grep... (4 Replies)
Discussion started by: adamlevine
4 Replies

3. Shell Programming and Scripting

Bash to select text and apply it to a selected file in bash

In the bash below I am asking the user for a panel and reading that into bed. Then asking the user for a file and reading that into file1.Is the grep in bold the correct way to apply the selected panel to the file? I am getting a syntax error. Thank you :) ... (4 Replies)
Discussion started by: cmccabe
4 Replies

4. Emergency UNIX and Linux Support

Executing several commands in a text file

I have a file that has about 3000 commands , listed one below the other. I would like to execute them all in one go. Is there a simpler way to do it - like a batch file processing, than executing one line at a time? (3 Replies)
Discussion started by: ggayathri
3 Replies

5. Web Development

How to copy a selected value of list box into a text box in html form?

hi, i have a list box , a text box and a button in a html form. list box displays some values, when a user selects a value from the list box and press the button. the selected value should be copied to the text box value. can any1 give me a html and javascript code to do this facility. ... (1 Reply)
Discussion started by: Little
1 Replies

6. UNIX for Dummies Questions & Answers

Hoe to copy selected strings from file into another text file

Hi Experts, I just want to copy some selected strings from a a file into a new .txt file . I am using below command to find the data now want to copy the search results into another .txt file please help me . find /Path -exec grep -w "filename1|filename1|filename1|" '{}' \;... (2 Replies)
Discussion started by: mumakhij
2 Replies

7. Shell Programming and Scripting

To display the selected part in text file of unix

0400903071220312 20120322 20:21 1TRANTELSTRAFLEXCAB22032012CMP201323930000812201108875802100A003485363 12122011AUS 182644 000C2 8122011 0000 000 1TRANTELSTRAFLEXCAB22032012CMP201323930000812201108875802100A003485363 12122011AUS ... (6 Replies)
Discussion started by: rammm
6 Replies

8. Shell Programming and Scripting

Commands to reorganize a text file

Hi! I am trying to create a script to reorder the contents of a text file. Below is the text file initially, followed by how I would like it reordered: File initially: --- Initial lines with text and/or numbers Initial lines with text and/or numbers Initial lines with text and/or numbers... (11 Replies)
Discussion started by: gwr
11 Replies

9. Shell Programming and Scripting

trying to print selected fields of selected lines by AWK

I am trying to print 1st, 2nd, 13th and 14th fields of a file of line numbers from 29 to 10029. I dont know how to put this in one code. Currently I am removing the selected lines by awk 'NR==29,NR==10029' File1 > File2 and then doing awk '{print $1, $2, $13, $14}' File2 > File3 Can... (3 Replies)
Discussion started by: ananyob
3 Replies
Login or Register to Ask a Question