09-06-2008
Easy unix/sed question that I could have done 10 years ago!
Hi all and greetings from Ireland!
I have not used much unix or awk/sed in years and have forgotten a lot.
Easy enough query tho.
I am cleansing/fixing 10,000 postal addresses using global replacements.
I have 2 pipe delimited files , one is basically a spell checker for geographical areas. The second file is actual addresses.
Sample file 1 - 100+ lines (basically a spell checker):
|Irlllland|Ireland|
|Dubblin|Dublin|
|Corrk|Cork|
etc..
Sample file 2 - 10,000+ lines (Addresses to be cleansed):
|10 Main Street Irlllland|
|11 High Road Irlllland|
|1 High Road, Corrk|
The output required is :
|10 Main Street Ireland|
|11 High Road Ireland|
|1 High Road, Cork|
I am very rusty but reckon I need a loop with a global substition in it.
I used to know unix, awk and sed reasonably well but have forgotten the basic syntax.
All helpers there?
8 More Discussions You Might Find Interesting
1. Cybersecurity
Hi,
Could anyone direct me to any sites that have any info on unix attcks or hacks in the last 5 years. This is needed for an assignment. All help would be greatly appreciated.
Thanks:) (6 Replies)
Discussion started by: suzant
6 Replies
2. UNIX for Dummies Questions & Answers
can anyone tell me what exactly the following UNIX notation code does cause I need to do the same in windows?
for x in webapps/sal/*.htm*
do
mv $x $x.bak
sed 's@bob@sal@g' $x.bak > $x
done
Thanks (1 Reply)
Discussion started by: lavaghman
1 Replies
3. UNIX for Dummies Questions & Answers
I am trying to check through all of a certain type of file in all main directories, and find the top 10 that are taking up the most space. How can I do that? I was thinking like du *.file | sort -n | head (1 Reply)
Discussion started by: wallacer
1 Replies
4. Shell Programming and Scripting
I have a file name in this format
ABC_WIRE_TRANS_YYYYMMDD_00.DAT
I need to cut out the _00 out of the file name everytime. It could be _00, _01,_02, etc ....
How do I cut it out to look as follows?
ABC_WIRE_TRANS_YYYYMMDD.DAT (6 Replies)
Discussion started by: lesstjm
6 Replies
5. UNIX for Dummies Questions & Answers
I have a line like:
"Jun 19 12:56:22 routername 45454:"
I want to keep all information except the seconds of the time. I tried:
sed 's/..:..:../..:../g'
but apparently I'm on the wrong track, because although that matches on the time, it replaces it with the literal ..:..
How... (6 Replies)
Discussion started by: earnstaf
6 Replies
6. UNIX for Dummies Questions & Answers
Hi everybody:
Could anybody tell me if I have several files which each one it has this pattern name:
name1.dat name2.dat name3.dat name4.dat name10.dat name11.dat name30.dat
If I would like create one like:
name_total.dat
If I do:
paste name*.dat > name_total.dat (15 Replies)
Discussion started by: tonet
15 Replies
7. UNIX for Dummies Questions & Answers
Hello - I have a folder that contains files from 2003 till 2010. I am trying to figure out a command that would seperate each years file and show me a count?
Even if i can find a command that would give me year by year count, thats good enough too.
Thanks (8 Replies)
Discussion started by: DallasT
8 Replies
8. What is on Your Mind?
From Wed Sep 4 09:35 MDT 1991
Received: from by with SMTP
(16.6/15.5+IOS 3.20) id AA25932; Wed, 4 Sep 91 09:35:27 -0600
Return-Path:
Received: by
(16.6/15.5+IOS 3.20) id AA10424; Wed, 4 Sep 91 09:34:58 -0600
Date: Wed, 4 Sep 91 09:34:58 -0600
From:
Message-Id: <>
To: ... (0 Replies)
Discussion started by: jpezz
0 Replies
LEARN ABOUT DEBIAN
zemberek-server
zemberek-server(8) System Manager's Manual zemberek-server(8)
NAME
zemberek-server - Turkish spell-checker server
SYNOPSIS
zemberek-server [CONFILE]
or
java -jar /usr/share/java/zemberek-server.jar [CONFILE]
DESCRIPTION
zemberek-server is a Java(TM) based ``Turkish'' spell checker daemon which listens a configured port (by default, 10444) for client connec-
tions and responds with the Turkish spell checked output in a server-client fashion. zemberek-server uses Zemberek, a Java(TM) based Turk-
ish NLP library, at its heart.
Certain parts of the servers' behaviour can be controlled in the CONFILE configuration file. For example, to change the listened port one
can use the SERVER_PORT setting in CONFILE.
OPTIONS
There are no options.
NOTES
To make use of the server, a client program is required (see the next section).
FILES
/etc/zemberek-server.conf
configuration file
/var/log/zemberek-server.log
server's log file when running in the background
SEE ALSO
zpspell(1)
the website for further information <https://zemberek.dev.java.net/>
AUTHOR
Zemberek Team:
Ahmet A. Akin <ahmetaa@gmail.com> and Mehmet D. Akin <mdakin@gmail.com>
This manual page was written by Recai Oktas <roktas@debian.org> for the Debian GNU/Linux system, but may be used by others.
Debian/GNU Linux February 2006 zemberek-server(8)