I have a file with 14 million lines, and I would like to extract all the unique lines from it into another text file.
For example:
Contents of file1
happy
sad
smile
happy
funny
sad
I want to run a command against file1 that returns only the unique lines (i.e. 1 line for happy... (3 Replies)
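One common way to approach this kind of request is the awk "seen" idiom, which prints a line only the first time it appears. The sketch below is not from the thread: `unique_lines` is a hypothetical helper name, and it assumes the set of distinct lines fits in memory, since awk keeps one array entry per distinct line.

```shell
# Hypothetical helper: print each distinct line of a file once,
# in the order the lines first appear.
unique_lines() {
    # seen[$0]++ evaluates to 0 (false) the first time a line occurs,
    # so !seen[$0]++ is true only for first occurrences.
    awk '!seen[$0]++' "$1"
}
```

Called as `unique_lines file1 > file2`, this would leave happy, sad, smile and funny in file2 for the sample above. If the original order does not matter, `sort -u file1` is the usual alternative.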
I'm attempting to write a script to identify users who have sudo access on a server. I only want to extract the IDs of the sudo users that appear after a unique line of text. The list of sudo users runs to EOF, so the script only needs to start after that unique line. I already have a script to... (1 Reply)
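The "start after a unique line and read to EOF" part can be sketched with a small awk flag. This is only an illustration under assumptions: `lines_after_marker` is a hypothetical name, the marker is matched as literal text anywhere in a line, and the poster's actual marker and file are not shown in the excerpt.

```shell
# Hypothetical helper: print every line that follows the first line
# containing the literal marker text, through end of file.
lines_after_marker() {
    marker=$1
    file=$2
    awk -v m="$marker" '
        found { print }                        # past the marker: print the line
        !found && index($0, m) { found = 1 }   # marker line itself is not printed
    ' "$file"
}
```

Note that `awk -v` interprets backslash escapes in the value, so a marker containing backslashes would need extra care.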
Hello,
I've got this output text:
and I need it to look something like this:
which means that there won't be an absolute path for each directory, just its size and the last word after the last '/' in each line, and I also don't need the last line, '1.7M /tmp'.
Looks like there is a simple... (5 Replies)
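Since the sample output is missing from the excerpt, the following is only a sketch under the assumption that the input is `du -h` style output, i.e. one "SIZE<TAB>PATH" pair per line with the parent directory on the last line. The name `du_basenames` is hypothetical.

```shell
# Hypothetical helper: read "SIZE<TAB>PATH" lines on stdin, print
# "SIZE<TAB>last path component", and drop the final (parent) line.
du_basenames() {
    awk -F '\t' '
        NR > 1 { print prev }        # emit the previous line; the last one is never emitted
        {
            n = split($2, parts, "/")
            prev = $1 "\t" parts[n]
        }
    '
}
```

It would be used at the end of a pipeline, for example `du -h /tmp | du_basenames`. Paths that themselves contain tab characters are not handled.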
Hello. I am sorry if this is a common question but through all my searching, I haven't found an answer which matches what I want to do.
I am looking for a sed command that will parse through a large text file and extract lines that start with specific words (which are repeated throughout the... (4 Replies)
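For the "lines that start with specific words" part, a sed sketch could look like the following. The words `ERROR` and `WARN` in the usage note are placeholders, `lines_starting_with` is a hypothetical name, and the `-E` option (extended regular expressions) is assumed to be available, as it is in GNU and BSD sed.

```shell
# Hypothetical helper: print lines that begin with any of the given words.
# $1 is an alternation such as 'ERROR|WARN' (placeholder words), $2 the file.
lines_starting_with() {
    # -n suppresses default output; p prints only the matching lines.
    sed -n -E "/^($1)/p" "$2"
}
```

For example, `lines_starting_with 'ERROR|WARN' big.log` prints only lines beginning with one of those two words. The words must not contain `/` or regex metacharacters unless they are escaped.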
I can't decide if I should use AWK or Perl. After poring over these forums for hours today, I decided I'd post something and see if I could get some advice.
I've got a text file full of hundreds of events in this format:
Record Number : 1
Records in Seq : ... (3 Replies)
Hi,
I am trying to extract lines from a text file given a text file containing line numbers to be extracted from the first file. How do I go about doing this? Thanks! (1 Reply)
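A typical awk approach reads the line-number file first and then filters the data file against it. The sketch below assumes one line number per line in the first file; `extract_numbered_lines` is a hypothetical name.

```shell
# Hypothetical helper: $1 is a file of line numbers (one per line),
# $2 is the file to extract those lines from.
extract_numbered_lines() {
    awk '
        NR == FNR { want[$1]; next }   # first file: remember each line number
        FNR in want                    # second file: print lines whose number was listed
    ' "$1" "$2"
}
```

Lines come out in the order they occur in the data file, not in the order the numbers were listed.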
Hello,
I have a file ff.txt that looks as follows
*ABNA.txt
356
24
36
112
*AC24.txt
457
458
321
2
ABNA.txt and AC24.txt are the files in the folder named foo1. Based on the numbers in the ff.txt file, I want to extract the lines from the corresponding files in the foo1 folder and... (2 Replies)
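The extraction step described here can be sketched as a shell loop over ff.txt. What should happen to the extracted lines afterwards is cut off in the excerpt, so this sketch simply prints them. `extract_listed_lines` is a hypothetical name, and it assumes every `*NAME` entry names a file inside the given folder.

```shell
# Hypothetical helper: $1 is the list file (e.g. ff.txt), $2 the folder
# holding the named files (e.g. foo1).
extract_listed_lines() {
    list=$1
    dir=$2
    current=
    while IFS= read -r entry || [ -n "$entry" ]; do
        case $entry in
            \**) current=${entry#\*} ;;     # "*NAME" selects the file for the numbers that follow
            '' | *[!0-9]*) ;;               # skip blank and non-numeric entries
            *)
                if [ -n "$current" ]; then
                    sed -n "${entry}p" "$dir/$current"   # print that one line
                fi
                ;;
        esac
    done < "$list"
}
```

With the layout above it would be called as `extract_listed_lines ff.txt foo1`. Each number rereads its file from the start, so this is meant for modest file sizes.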
I would like to print unique lines without sort or uniq. Unfortunately, the server I am working on has neither. For several weeks I have been unable to reach the server's administrator to ask him to add them. (7 Replies)
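If only shell builtins can be relied on, one possible sketch keeps the lines already printed in a single newline-delimited string and checks each new line against it with `case`. The name `unique_lines_sh` is hypothetical, and the approach gets slow on large files because every line is compared against the whole string. Where awk is installed, an associative array is the more usual choice.

```shell
# Hypothetical helper: print each distinct line of a file once, in
# first-seen order, using only POSIX shell builtins.
unique_lines_sh() {
    nl='
'
    seen=$nl
    while IFS= read -r line || [ -n "$line" ]; do
        case $seen in
            *"$nl$line$nl"*) ;;             # already printed: skip
            *)
                printf '%s\n' "$line"
                seen=$seen$line$nl          # remember the line, newline-delimited
                ;;
        esac
    done < "$1"
}
```

Quoting the pattern inside `case` makes the comparison literal, so lines containing `*` or `?` are not treated as wildcards.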
Discussion started by: cokedude
ODT2TXT(1) User Commands ODT2TXT(1)
NAME
odt2txt - a simple converter from OpenDocument Text to plain text
SYNOPSIS
odt2txt [OPTIONS] FILENAME
DESCRIPTION
odt2txt is a command-line tool which extracts the text out of OpenDocument Texts, as produced by OpenOffice.org, KOffice, StarOffice and
others.
odt2txt can also extract text from some file formats similar to OpenDocument Text, such as OpenOffice.org XML (*.sxw), which was used by
OpenOffice.org version 1.x and older StarOffice versions. To a lesser extent, odt2txt may be useful to extract content from OpenDocument
spreadsheets (*.ods) and OpenDocument presentations (*.odp).
The FILENAME argument is mandatory.
OPTIONS
--width=WIDTH
Wrap text lines after WIDTH characters. The default value is 65, which means that any words which would extend beyond column 65 are
moved to a new line.
If WIDTH is set to -1 then no lines will be broken.
--output=FILE
Write output to FILE and not to standard output.
--subst=SUBST
Select which non-ASCII characters shall be replaced by ASCII lookalikes. Valid values for SUBST are all, some and none.
--subst=all Substitute all characters for which substitutions are known.
--subst=some Substitute all characters which the output charset does not contain. This is the default.
--subst=none Substitute no characters.
--encoding=X
Do not try to autodetect the terminal encoding, but convert the document to encoding X unconditionally. To find out which terminal
encoding will be used in automatic mode, use --encoding=show.
--raw Print raw XML
--version
Show version and copyright information
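As an illustration of how the options above combine, here is a minimal sketch of a wrapper function. The name `odt_to_text` and the file names in the usage note are hypothetical; only options listed in this page are used.

```shell
# Hypothetical wrapper: convert an OpenDocument Text file ($1) to
# unwrapped plain text written to $2.
odt_to_text() {
    if ! command -v odt2txt >/dev/null 2>&1; then
        echo "odt2txt is not installed" >&2
        return 127
    fi
    # --width=-1 disables line wrapping; --subst=none leaves non-ASCII
    # characters unsubstituted; --output writes to a file, not stdout.
    odt2txt --width=-1 --subst=none --output="$2" "$1"
}
```

For example, `odt_to_text report.odt report.txt` would write the text of report.odt to report.txt without breaking lines.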
COPYRIGHT
Copyright (C) 2006,2007 Dennis Stosberg <dennis@stosberg.net>
Uses parts of the kunzip library, Copyright 2005,2006 by Michael Kohn
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License, version 2 as
published by the Free Software Foundation
SEE ALSO
Homepage
http://stosberg.net/odt2txt/
odt2txt 0.4 2008-06-23 ODT2TXT(1)