1- I tried to turn text to one array and replace them something like this:
But this code doesn't keep the new line and echo all text in one line and my text is so big about 15 Gb and can't put it in one array.
2- Choose randomly from all occurrences of one word or token in the whole text.
simply can get number of repetition by something like this ( grep "a" | wc -l )
3- I can use python to do this but my text is huge and I want to use bash since it is faster than python. In python I use a set contains (a,b) and in replace function use a random function to choose (a or b) from that set.
4- I use Ubuntu 18.04, sed (GNU sed) 4.4, GNU Awk 4.1.4, API: 1.1 (GNU MPFR 4.0.1, GNU MP 6.1.2)
5- I simplify the problem, the main problem is that: I want to normalize a text corpus for training a tri-gram language model, in the language model, the sequence of words is important. I normalize the numbers to letters so for example, I convert all 30 to thirty but we use often half instead of thirty for reporting hour (e.g 8:30). I want to replace randomly things like this not whole of them.
on my desktop i am using the kde rotating desktop image option. this rotates images randomly every half hour. now, i would like to write an html file which will have an inline frame with some text, maybe system messages, or my friends live journal thati read alot, or unix.com! however, i dont want... (1 Reply)
I have a directory of files that look like filename 001.ext, filename 002.ext, etc. I'd like to rename the files with unique random numbered names, so that the original filenames are stripped and the files are given a new, random number name. I'm not super new to UNIX, but I don't often use it for... (2 Replies)
Hi there!
I am really enjoying working with sed. I am trying to come up with a sed command to replace some occurrences (not all) in the same line, for instance:
I have a command which the output will be:
200.300.400.5 0A 0B 0C 01 02 03
being that the last 6 strings are actually one... (7 Replies)
I have a text (text.txt) and I would like to replace only the first 2 occurrences of a word (but I might need to replace more):
For example, if text is this:
CAR sweet head
hat red yellow
CAR book brown
tiger CAR cow CAR
CAR milk
I would like to replace the word "CAR" with word... (12 Replies)
I want to create a cron job randomly once a day for my site's registration.
The responsible file for registrations is a config file and I need to change the contents
twice on day (on and off)
I know the way for random cron job for example
*/n * * * * /usr/local/bin/php... (6 Replies)
Hi,
I tried to adapt bartus's solution to my problem, without success. I want to replace all the occurences of this:
with:
, where something can contain an arbitrary number of balanced parens and brakets.
Any ideas ?
Best, (1 Reply)
Hi,
(First post, please be gental!)
I have a java app that I am running on unix (centos)
But it keeps dying randomly. The times seem random from anything between 3 hours and 3 days.
I have a cronjob running to restart it when ever it dies but I would rather this happened less often.
... (2 Replies)
Hello,
This is my code:
nb_lignes=`wc -l $1 | cut -d " " -f1`
for i in $(seq $nb_lignes)
do
m=`head $1 -n $i | tail -1`
//command
done
Please how can i change it to get Get 20% of lines in File randomly to apply "command" on each line ? 20% or 40% or 60 % (it's a parameter)
Thank you. (15 Replies)
Hey,
How can i create randomly create time N times.
Suppose i want to create data for a particualr date 5 times...
Mon Jan 19 11:42:50
Mon Jan 19 19:16:40
Mon Jan 19 12:12:33
Mon Jan 19 14:26:27
Mon Jan 19 12:29:53
Mon Jan 19 13:30:31
I want the script to create N times randome... (2 Replies)
Discussion started by: jaituteja
2 Replies
LEARN ABOUT HPUX
unifdef
unifdef(1) General Commands Manual unifdef(1)NAME
unifdef - remove preprocessor lines
SYNOPSIS
sym] sym] sym] sym]] ... [file]
DESCRIPTION
simulates some of the actions of in interpreting C language preprocessor command lines (see cpp(1)). For a valid preprocessor command line
contains as its first character a and one of the following keywords: or The character and its associated keyword must appear on the same
line, but they can be separated by spaces, tabs, and commented text. When appropriate, the portions of code surrounded by and including
the targeted preprocessor directives are removed, and the resultant text is written to the standard output.
Unlike does not insert included files, interpret macros, or strip comment lines. This means, among other things, that and macros occurring
within the input text are not interpreted.
Since is language-independent, it can be used for processing source files for languages other than the C language. For example, can be
used on FORTRAN language source files, provided the C language preprocessor commands are used.
Options
recognizes the following command-line options:
Complement the normal behavior by printing only the rejected lines.
Ignore text delimited by
sym. In other words, text that would otherwise be affected by some action is not touched when found within the context
of a preprocessor command using sym.
Ignore text delimited by
sym.
Replace rejected lines with blank lines
in the text written to the standard output.
Treat the input source as plain text.
C-language comment and quoting constructs are not recognized.
Define symbol
sym.
Cause symbol
sym to be undefined.
RETURN VALUE
The command returns the following exit values:
0 Output is an exact copy of the input.
1 Output is not an exact copy of the input.
2 The command fails. The failure might be due to a premature EOF or to an inappropriate or
EXAMPLES
Assume file contains the following:
The command sequence:
produces the following result in file
WARNINGS
Any symbol name defined in the file must be specified in the command line; otherwise, will ignore the line.
AUTHOR
was developed in the public domain.
SEE ALSO cpp(1).
unifdef(1)