Remove new line characters from a file


 
# 1  
04-20-2012

I tried using the command below:
Code:
tr -cd "[:print:]" < InputFile.xml > output.txt

This deletes every non-printable character from the file, which takes out the tabs and newlines (spaces are printable, so they stay). The problem is that the complete file then becomes one record. I want to retain one newline character per record, since that is the record delimiter.
The end of a record is consistent: every record finishes with </Success></Employee>
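
For reference, a quick way to see what that tr call keeps and what it drops. This is only a small sketch on made-up sample text, and strip_nonprintable is a hypothetical wrapper name, not something from the original post.

```shell
#!/bin/sh
# strip_nonprintable is a hypothetical wrapper around the tr call above.
strip_nonprintable() {
    tr -cd '[:print:]'
}

# Made-up sample: a double space, a tab and two newlines.
printf 'a  b\tc\nd\n' | strip_nonprintable
# Prints "a  bcd" with no trailing newline: the tab and both newlines are
# deleted (they are not in [:print:]), while the double space is kept.
```

Since every newline is deleted, the record delimiter goes with them, which is why the result is a single long line.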


Input

Code:
26,<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd">
<Employee>
 <Address>
  <Street>1234 64th Ave</Street>
   </Address>
</Employee>,<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>inserted</Success></Employee>
47,<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd">
<Employee>
 <Address>
  <Street>PO BOX 56</Street>
   </Address>
</Employee>,<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>updated</Success></Employee>

Desired output (fields separated by | instead of ,)

Code:
26|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd"><Employee><Address><Street>1234 64th Ave</Street></Address></Employee>|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>inserted</Success></Employee>
47|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd"><Employee><Address><Street>PO BOX 56</Street></Address></Employee>|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>updated</Success></Employee>


Last edited by joeyg; 04-20-2012 at 03:11 PM. Reason: Please wrap data and scripts in CodeTags - easier to read
# 2  
04-20-2012
Hi saini,

Try:
Code:
$ cat infile 
26,<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd">
<Employee>
 <Address>
  <Street>1234 64th Ave</Street>
   </Address>
</Employee>,<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>inserted</Success></Employee>
47,<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd">
<Employee>
 <Address>
  <Street>PO BOX 56</Street>
   </Address>
</Employee>,<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>updated</Success></Employee>
$ perl -pe 'tr/,/|/; chomp unless m|(?i)</Success></Employee>\s*\Z|' infile
26|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd"><Employee> <Address>  <Street>1234 64th Ave</Street>   </Address></Employee>|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>inserted</Success></Employee>
47|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE EmployeeDetails SYSTEM  "EmpDetails.dtd"><Employee> <Address>  <Street>PO BOX 56</Street>   </Address></Employee>|<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Success SYSTEM "Succ.dtd"><Employee><Success>updated</Success></Employee>
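
Note that the one-liner above keeps the indentation that sat between the tags, and its tr/,/|/ turns every comma into a pipe, including any comma inside the data. Below is a minimal sketch of a variant that handles both. It uses awk instead of perl, join_records is a made-up name, and it assumes the only delimiter commas are the first comma of a record and the one between </Employee> and the next <?xml. It also matches the closing tags case-sensitively, unlike the perl version.

```shell
#!/bin/sh
# join_records is a hypothetical helper name, not something from this thread.
# Reads the multi-line records on stdin and writes one record per line.
join_records() {
    awk '
    {
        sub(/^[ \t]+/, "")       # drop leading indentation
        sub(/[ \t\r]+$/, "")     # drop trailing blanks and a stray CR
        buf = buf $0             # append the trimmed line to the current record
    }
    /<\/Success><\/Employee>$/ {
        # Assumption: only two commas are field delimiters, the first comma
        # of the record and the one between the two XML documents.
        sub(/,/, "|", buf)
        sub(/<\/Employee>,<\?xml/, "</Employee>|<?xml", buf)
        print buf
        buf = ""
    }
    END { if (buf != "") print buf }   # flush an unterminated last record
    '
}
```

It would be called as join_records < infile > output.txt. Trimming each line before joining is what removes the whitespace between tags; spaces inside a value such as "1234 64th Ave" are left alone.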
