Hello Perderabo,
Many thanks for your help.
I've magaged to find an easier solution to my issue.
There is a program called Fdupes which removed duplicate files. It didn't seem to mind that the file paths/names had difficult characters.
The following fdupes tutuial was useful:
fdupes Tutorial Life at the CLI
I've double checked the results with variations on the following command: find . -type f -exec md5sum {} \; | cut -d " " -f1 | sort | uniq ...... and both results seem to agree.
Just wanted to thank you for the assistance.
Best,
Roki