Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Extracting information from text fields. Post 302120422 by spindoctor on Wednesday 6th of June 2007 11:08:43 AM
Old 06-06-2007
Hi, On the first point, this is the code I was playing with from the command line, based on your suggestion.
awk 'BEGIN { gsub(/\\/,"")}' 03152000.rtf

Did you put the gsub command in there just to get rid of the backslashes? Or is it related to the field extraction process?

Regarding the second point:
Here is the text, copied straight from the .txt file I'm trying to extract information from.

Author(s): Ashley Geddes, Provincial Affairs Writer

Here is the code (entered in the command line) I got from you and it's output.

awk 'Begin{FS=":"} ; /Author/{print $2 }' 03152000.rtf > Author.txt

Ashley

Here is the modified way I wrote the code and its result

awk 'Begin{FS=":"} ; /Author/{print $2,$3 }' 03152000.rtf > Author.txt

Ashley Geddes,

the same problem occurred with the field "Document Types" I had to change around the fields as you wrote them to get the result.

Obviously I'm not that concerned, because it seems to be working. However I am very curious because I'd like to know how the thing works. Thanks for your help!
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Extracting information from a template

I have a template that I usually use to generate stats on an hourly basis for a number of cell sites altogether. I would like to be able to write a script that would go to the template and extract the information for any single site at any time during the day. For example, let's say that my... (4 Replies)
Discussion started by: Ernst
4 Replies

2. Shell Programming and Scripting

extracting fields

Hi, i have a line with several fields (indefinite number of - count varies) separated by colon. Now, i need to pick each field (except the first one) and have it assigned to variable within a loop. In other words, in the first iteration of the loop, the variable must be assigned with 2nd... (2 Replies)
Discussion started by: prvnrk
2 Replies

3. Shell Programming and Scripting

Problems with extracting information

Hi all, <select name="comp" id="comp" style="width:130px;"> <?php $sqlcomp = mysql_query("SELECT * FROM comp"); while ($redcomp = mysql_fetch_array($sqlcomp)) { extract($redcomp); echo "<option value=\"$comp_id\">comp_name</option>"; } ?> ... (0 Replies)
Discussion started by: c0mrade
0 Replies

4. Shell Programming and Scripting

Extracting information from Config files /text processing

Hello All, This is my first post on this forums, which I consider one of the best of its kind. The reason for my post is that I want to export some information form Nagios configuration files to a DB. I know that there are other tools available to do this, like NDO, monarch, etc... But I want to... (3 Replies)
Discussion started by: oconmx
3 Replies

5. Shell Programming and Scripting

extracting information from multiple files

Hello there, I am trying to extract (string) information ( a list words) from 4 files and then put the results into 1 file. Currently I am doing this using grep -f list.txt file1 . and repeat the process for the other 3 files. The reasons i am doing that (a) I do know how to code (b) each file... (4 Replies)
Discussion started by: houkto
4 Replies

6. Shell Programming and Scripting

Extracting relevant information from syslogs.

I need to analyse some syslogs and I want to print out all the lines containing SSH connections to the inside interface of the firewall and ignore lines where the originating port is 22. So basically I want to print all matches after "to inside:" that contains /22 and ignore lines where /22 occur... (2 Replies)
Discussion started by: lewk
2 Replies

7. Shell Programming and Scripting

Problems extracting some information

Hi there! Well, I'm writing a script to obtain certain information about files. Specifically, I want to get the information about those files which last access were in the last 24 hours, so I'm doing something like this: find <directory_name> -atime -1 -printf '%f %a\n' I would also... (4 Replies)
Discussion started by: Skirmish
4 Replies

8. Shell Programming and Scripting

Extracting information using awk

I want to write a script that extracts a value from a line of text. I know it can be done using awk but I've never used awk before so I don't know how to do it. The text is: Mem: 100M Active, 2150K Cache, 500M Buf, 10G Free I want to extract the free memory value to use as a variable. In... (5 Replies)
Discussion started by: millsy5
5 Replies

9. Shell Programming and Scripting

[Solved] Extracting information from DDL's

Dear Experts, I need your help here. I have lot of teradata DDL's as follows, i want to extract field names , field attributes and NOT NULL information from DDL.Could you please help here. Sample DDL: CREATE MULTISET TABLE APS_CALL_IN_PICKUP_CANCELED ,NO FALLBACK , NO BEFORE... (2 Replies)
Discussion started by: srikanth38
2 Replies

10. Shell Programming and Scripting

Extracting information from XML to excel

Hi, I am trying to extract information from a XML file and write it to a excel sheet. I am not sure where to start from. Here is the content from my input XML file. <com.cloudbees.hudson.plugins.folder.properties.FolderProxyGroupContainer plugin="nectar-rbac@4.5"> <groups> ... (4 Replies)
Discussion started by: Sajjadmehdi
4 Replies
TEXTUTIL(1)						    BSD General Commands Manual 					       TEXTUTIL(1)

NAME
textutil -- text utility SYNOPSIS
textutil [command_option] [other_options] file ... DESCRIPTION
textutil can be used to manipulate text files of various formats, using the mechanisms provided by the Cocoa text system. The first argument indicates the operation to perform, one of: -help Show the usage information for the command and exit. This is the default command option if none is specified. -info Display information about the specified files. -convert fmt Convert the specified files to the indicated format and write each one back to the file system. -cat fmt Read the specified files, concatenate them, and write the result out as a single file in the indicated format. fmt is one of: txt, html, rtf, rtfd, doc, docx, wordml, odt, or webarchive There are some additional options for general use: -extension ext Specify an extension to be used for output files (by default, the extension will be determined from the format). -output path Specify the file name to be used for the first output file. -stdin Specify that input should be read from stdin rather than from files. -stdout Specify that the first output file should go to stdout. -encoding IANA_name | NSStringEncoding Specify the encoding to be used for plain text or HTML output files (by default, the output encoding will be UTF-8). NSStringEncoding refers to one of the numeric values recognized by NSString. IANA_name refers to an IANA character set name as understood by CFString. The operation will fail if the file cannot be converted to the specified encoding. -inputencoding IANA_name | NSStringEncoding Force all plain text input files to be interpreted using the specified encoding (by default, a file's encoding will be deter- mined from its BOM). The operation will fail if the file cannot be interpreted using the specified encoding. -format fmt Force all input files to be interpreted using the indicated format (by default, a file's format will be determined from its contents). -font font Specify the name of the font to be used for converting plain to rich text. -fontsize size Specify the size in points of the font to be used for converting plain to rich text. -- Specify that all further arguments are file names. There are some additional options for HTML and WebArchive files: -noload Do not load subsidiary resources. -nostore Do not write out subsidiary resources. -baseurl url Specify a base URL to be used for relative URLs. -timeout t Specify the time in seconds to wait for resources to load. -textsizemultiplier x Specify a numeric factor by which to multiply font sizes. -excludedelements (tag1, tag2, ...) Specify which HTML elements should not be used in generated HTML (the list should be a single argument, and so will usually need to be quoted in a shell context). -prefixspaces n Specify the number of spaces by which to indent nested elements in generated HTML (default is 2). There are some additional options for treating metadata: -strip Do not copy metadata from input files to output files. -title val Specify the title metadata attribute for output files. -author val Specify the author metadata attribute for output files. -subject val Specify the subject metadata attribute for output files. -keywords (val1, val2, ...) Specify the keywords metadata attribute for output files (the list should be a single argument, and so will usually need to be quoted in a shell context). -comment val Specify the comment metadata attribute for output files. -editor val Specify the editor metadata attribute for output files. -company val Specify the company metadata attribute for output files. -creationtime yyyy-mm-ddThh:mm:ssZ Specify the creation time metadata attribute for output files. -modificationtime yyyy-mm-ddThh:mm:ssZ Specify the modification time metadata attribute for output files. EXAMPLES
textutil -info foo.rtf displays information about foo.rtf. textutil -convert html foo.rtf converts foo.rtf into foo.html. textutil -convert rtf -font Times -fontsize 10 foo.txt converts foo.txt into foo.rtf, using Times 10 for the font. textutil -cat html -title "Several Files" -output index.html *.rtf loads all RTF files in the current directory, concatenates their contents, and writes the result out as index.html with the HTML title set to "Several Files". DIAGNOSTICS
The textutil command exits 0 on success, and 1 on failure. CAUTIONS
Some options may require a connection to the window server. HISTORY
The textutil command first appeared in Mac OS X 10.4. macOS September 9, 2004 macOS
All times are GMT -4. The time now is 06:50 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy