Location: Asia Pacific, Cyberspace, in the Dark Dystopia
Posts: 19,118
Thanks Given: 2,351
Thanked 3,359 Times in 1,878 Posts
Issue with Keyboard or Char Encoding During Migration
There is a minor issue lingering which we currently have no working solution.
For example, see this pagetext in the original (old) forum mysql DB (continued thats to hicksd8 for finding these and for looking into this interesting topic):
From original DB:
Code:
| Hi have two directory with below name in “/opt“
1-Source
2-Destination
In “Source†directory there is a lot’s of files, with extensions (doc, docx , ppt, xls,...).
In “Destination†directory only pdf version of (doc, docx) files that exist in source stored.
Now I want to create script that use “diff†command check “source†and get list of only (doc, docx) files after that look for related pdf file in “Destination†if pdf version of (doc, docx) not exist in “destination†store list of them on a file.
E.g.
1-Source
[CODE]File1.doc
File2.docx
File3.doc
File4.ppt
File5.xls
File6.doc[/CODE]
2-Destination
[CODE]File1.pdf
File3.pdf
[/CODE]
Expected result after run script is:
[CODE]File2.docx
File6.doc[/CODE]
Here is my script
[CODE]diff -r “/opt/source†“/opt/destination“[/CODE]
Any recommendation?
Thanks
UPDATE
Follow below post and work like charm:
[CODE]comm -23 <(find dir1 -type f -exec bash -c 'basename "${0%.*}"' {} \; | sort) <(find dir2 -type f -exec bash -c 'basename "${0%.*}"' {} \; | sort)
test1[/CODE]
[url=https://unix.stackexchange.com/questions/178321/diff-two-directories-but-ignore-the-extensions]filenames - diff two directories, but ignore the extensions - Unix & Linux Stack Exchange[/url]
Mojibake occurs in English most frequently due to misinterpreting and bad-transcoding between Windows-1252, ISO-8859-1, and UTF-8. This module provides a mojibake sequence to original character mapping table, and utility to recover mojibake'd text. Testing has been with English but other Latin based languages, where Windows-1252 is in the wild, should also benefit.
Location: Asia Pacific, Cyberspace, in the Dark Dystopia
Posts: 19,118
Thanks Given: 2,351
Thanked 3,359 Times in 1,878 Posts
FYI existing old mysql dB
Code:
mysql> SELECT count(postid) from post where pagetext like '%“%';
+---------------+
| count(postid) |
+---------------+
| 66 |
+---------------+
1 row in set (1.64 sec)
mysql> SELECT count(postid) from post where pagetext like '%†%';
+---------------+
| count(postid) |
+---------------+
| 14 |
+---------------+
1 row in set (1.68 sec)
mysql> SELECT count(postid) from post where pagetext like '%â€%';
+---------------+
| count(postid) |
+---------------+
| 45 |
+---------------+
1 row in set (1.66 sec)
mysql> SELECT count(postid) from post where pagetext like '%’%';
+---------------+
| count(postid) |
+---------------+
| 165 |
+---------------+
1 row in set (1.63 sec)
mysql> SELECT count(postid) from post where pagetext like '%‘%';
+---------------+
| count(postid) |
+---------------+
| 38 |
+---------------+
1 row in set (1.69 sec)
mysql> SELECT count(postid) from post where pagetext like '%•%';
+---------------+
| count(postid) |
+---------------+
| 4 |
+---------------+
1 row in set (1.70 sec)
mysql> SELECT count(postid) from post where pagetext like '%…%';
+---------------+
| count(postid) |
+---------------+
| 23 |
+---------------+
1 row in set (1.68 sec)
Now that SELECT shows some goodies, maybe UPDATE on main DB ?
Hi all!!
Im using command file -i myfile.xml to validate XML file encoding, but it is just saying regular file . Im expecting / looking an output as UTF8 or ANSI / ASCII
Is there command to display the files encoding?
Thank you! (2 Replies)
Greetings Experts,
We are migrating from AIX to RHEL Linux. I have created a script to verify and report the NULLs and SPACEs in the key columns and duplicates on key combination of "|" delimited set of big files. Following is the code that was successfully running in AIX.
awk -F "|" 'BEGIN {... (5 Replies)
Hi Experts , I want to start migrating our AIX 6.1 to AIX 7.1 . I am planning to use alt_disk_migration . Chris gibson has awesome documentation in the internet. However I am running into an issue with EMC odm filesets . So my current OS is AIX 6.1. and I have this :
lslpp -l | grep EMC
... (7 Replies)
Hello All,
PC: CuBox-i (*i.MX6) Mini-PC
OS: openSUSE 13.1 (Bottle) (armv7hl)
Kernel: 3.14.14-cubox-i
# uname -a
Linux CuBox-HQ 3.14.14-cubox-i #1 SMP Sat Sep 13 03:48:24 UTC 2014 armv7l armv7l armv7l GNU/LinuxSo I've been having this random issue happen on this PC where a few strange... (12 Replies)
Hi All,
We need to move Physical Solaris 10 system to Virtual Solaris 10(p2v). Both the servers having Solaris 10(Generic_147440-25) means physical server which we are going to move is having Solaris 10 and this physical server will be converted as a virtualserver on another physical server... (9 Replies)
I created one file on windows system and is visible as :
TestTable,INSERT,večilnin1ईगल受害者是第,2010-02-02 10:10:10.612447,137277,ईगल受害者是第večilnin!@#$%^&*()_+=-{}]
But when send this file to unix system, the file is visible as :
TestTable,INSERT,žvečilnin1ई-ल -害...是第,2010-02-02 ... (4 Replies)
I am writing a bash shell menu and would like to get a char immediately after a key is pressed. This script does not work but should give you an idea of what I am trying to do....
Thanks for the help
#! /bin/bash
ANSWER=""
echo -en "Choose item...\n"
until
do
$ANSWER = $STDIN
... (2 Replies)