How to find untagged audio files?


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers How to find untagged audio files?
# 15  
Old 12-20-2010
Quote:
Originally Posted by radoulov
Yes,
awk is GNU awk now.
Try this for track ...:

Code:
find -iname '*.mp3' -exec id3 -Rl {} + |
  awk -F: '/filename: / {
    if (f) print fn
    fn = $2; f = x
    }
    /^(title|a(rtist|lbum)): *(|unknown|track.*) *$/ {
      f++
      }
    END {  
      if (f) print fn
      }' IGNORECASE=1

---------- Post updated at 10:58 PM ---------- Previous update was at 10:55 PM ----------

Sorry for the multi-correction Smilie
The GNU awk version should include always IGNORECASE=1 in order to ignore the case.
Yes that works. But adding the track doesn't work yet because if the tracknumber is empty track isn't in the output. But it detects and displays the right files now.
# 16  
Old 12-20-2010
This should do it (I hope I got the braces right Smilie):
Code:
find -iname '*.mp3' -exec id3 -Rl {} + |
  awk -F: '/filename: / {
    if (f) print fn
    fn = $2; f = x
    }
    /^(((title|a(rtist|lbum)): *(|unknown|track.*))|track:) *$/ {
      f++
      }
    END {  
      if (f) print fn
      }' IGNORECASE=1

Could you post an example output from metaflac at least for two files: one to be printed and another one to be skipped?

---------- Post updated at 11:26 PM ---------- Previous update was at 11:20 PM ----------

Given your metaflac example above, the filename seems to be missing in the output, this means that we should process the files one by one, in order to get the name ...
# 17  
Old 12-20-2010
Quote:
Originally Posted by radoulov
This should do it (I hope I got the braces right Smilie):
Code:
find -iname '*.mp3' -exec id3 -Rl {} + |
  awk -F: '/filename: / {
    if (f) print fn
    fn = $2; f = x
    }
    /^(((title|a(rtist|lbum)): *(|unknown|track.*))|track:) *$/ {
      f++
      }
    END {  
      if (f) print fn
      }' IGNORECASE=1

Could you post an example output from metaflac at least for two files: one to be printed and another one to be skipped?

---------- Post updated at 11:26 PM ---------- Previous update was at 11:20 PM ----------

Given your metaflac example above, the filename seem to be missing in the output, this means that we should process the files one by one, in order to get the name ...
This doesn't work yet. This is the output of a mp3 that should be detected, since the tracknumber isn't filled:
Code:
$ id3 -Rl alex\ roeka\ -\ 1999\ \ -\ wildernis/04\ -\ Alex\ Roeka\ -\ Help\ me\,\ ik\ ben\ echt.mp3 

Filename: alex roeka - 1999  - wildernis/04 - Alex Roeka - Help me, ik ben echt.mp3
Title: Help Me, Ik Ben Echt          
Artist: Alex Roeka                    
Album: Wildernis                     
Year:     
Genre: Other (12)
Comment: Het~Nachtdier

This FLAC is ok:
Code:
$ metaflac --export-tags-to=- 06.\ Diseases\ of\ The\ Band.flac 
ARTIST=Frank Zappa
TITLE=Diseases of The Band
ALBUM=You Can't Do That On Stage Anymore, Vol. 1, Disc 1
DATE=1988
TRACKNUMBER=06
GENRE=Rock
COMMENT=Track 6

These FLAC's are not ok:
Code:
$ metaflac --export-tags-to=- 01.\ The\ Florida\ Airport\ Tape.flac 
ARTIST=Frank Zappa
$ metaflac --export-tags-to=- 02.\ Once\ Upon\ A\ Time.flac 
$ metaflac --export-tags-to=- 03.\ Sofa\ #1.flac 
TITLE=track 1
ARTIST=Frank Zappa
ALBUM=You Can't Do That On Stage Anymore, Vol. 1, Disc 1
DATE=1988
TRACKNUMBER=03
GENRE=Rock
DESCRIPTION=Track 3
COMMENT=Track 3

From track 1 only the artist is filled. Title, album and tracknumber are missing.
From track2 none of the tags is filled, so nothing is returned
Track 3 is titled 'track 1'.
# 18  
Old 12-21-2010
OK,
I suppose all the code should be re-written.
Could you please provide again the list of mandatory tags for mp3 and flac?

For mp3:

Code:
Album
Artist
Title
Track

For flac:

Code:
ALBUM
ARTIST
DESCRIPTION
TITLE


Are those lists correct?
# 19  
Old 12-21-2010
Quote:
Originally Posted by radoulov
OK,
I suppose all the code should be re-written.
Could you please provide again the list of mandatory tags for mp3 and flac?

For mp3:

Code:
Album
Artist
Title
Track

For flac:

Code:
ALBUM
ARTIST
DESCRIPTION
TITLE

Are those lists correct?
Yes. Indeed. If there is no id tag in the mp3 all properties except 'Track' are in the output and filled with spaces. The property 'Track' isn't in the output at all.
But if there is an id3 tag, but an empty one, the property field IS in the output and has value the value '0'.

With FLAC it's more clear. Only non-empty properties are in the output.

So for mp3 the check should be like:
If title is empty(=filled with spaces) or contains unknown or track
or if artist is empty(=filled with spaces) or contains unknown
or if album is empty(=filled with spaces) or contains unknown
or if the value of the is '0' or the property track is not available
then report this file

For flac:
If title is not available or contains unknown or track
or if artist is not avaliable or contains unknown
or if album is not available or contains unknown
or if track is not available
then report this file

If no property at all is filled metaflac returns nothing at all.

Hmm.. that's really a lot of checks.
# 20  
Old 12-21-2010
Given how the requirements evolved, I decided to go for portable solution.
Check this for mp3 files:


Code:
find . -type f -name '*.[Mm][Pp]3' -exec id3 -Rl {} + |
  awk 'BEGIN {
    mp3n        = split("album artist title track",       mp3_tags)
    flacn       = split("album artist title description", flac_tags)
    ignorepatt  = "^ *(unknown|track *[0-9]*)* *$"    
    }
  /^Filename/ {
    if (fn) {
      for (i = 1; i <= mp3n; i++) {
        if (tolower(tags[mp3_tags[i]]) ~ ignorepatt) {
          invalid_tags[mp3_tags[i]] = tags[mp3_tags[i]]
          f || f++
          }
        }
      if (f) {
       print fn, "has missing/invalid tags:"
        for (t in invalid_tags)
          printf "%s -> %s\n", t, invalid_tags[t]
        }      
      }    
    split(x, tags); split(x, invalid_tags)
    fn = $2; f = x
    }
 { 
    tags[tolower($1)] = $2
    }
  END {
      for (i = 1; i <= mp3n; i++) {
        if (tolower(tags[mp3_tags[i]]) ~ ignorepatt) {
          invalid_tags[mp3_tags[i]] = tags[mp3_tags[i]]
          f || f++
          }
      if (f) {
        print fn, "has missing/invalid tags:"
          for (t in invalid_tags)
          printf "%s -> %s\n", t, invalid_tags[t]
          }            
        }  
      }' FS=:

P.S. Ignore the flac part for now.

Last edited by radoulov; 12-21-2010 at 10:52 AM.. Reason: Added the missing END block.
This User Gave Thanks to radoulov For This Post:
# 21  
Old 12-21-2010
Wow. All that in just one tiny line of commands. Smilie

This one does work perfect.
Nice output. Works perfectly on my test dir, so I did a run on my collection... I discovered I've really a lot of retagging to do on mp3. Smilie
I hope my flac collection is in better condition. Smilie

---------- Post updated at 06:20 PM ---------- Previous update was at 06:09 PM ----------

Oh... ik overlooked something in the flac specs you posted. The tags to look for are:
Code:
TITLE
ARTIST
ALBUM
TRACKNUMBER

example output:
Code:
$ metaflac --export-tags-to=- 03.\ Sofa\ #1.flac 
TITLE=track 1
ARTIST=Frank Zappa
ALBUM=You Can't Do That On Stage Anymore, Vol. 1, Disc 1
DATE=1988
TRACKNUMBER=03
GENRE=Rock
DESCRIPTION=Track 3
COMMENT=Track 3

DESCRIPTION is just the same as COMMENT. 'Track 3' is just filled in the comments. Has really nothing to do with the value of TRACKNUMBER.

---------- Post updated at 06:55 PM ---------- Previous update was at 06:20 PM ----------

And here is proof. I just changed the comments.
Code:
$ metaflac --export-tags-to=- 03.\ Sofa\ #1.flac 
TITLE=track 1
ARTIST=Frank Zappa
ALBUM=You Can't Do That On Stage Anymore, Vol. 1, Disc 1
DATE=1988
TRACKNUMBER=03
GENRE=Rock
DESCRIPTION=Radoulov For President
COMMENT=Radoulov For President

 
Login or Register to Ask a Question

Previous Thread | Next Thread

7 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Need a script for automation the convert a lot number audio files to another format

I have a lot number audio files in the MP3 proprietary format, I want to convert them to 'opus' the free and higher quality format, with keep metadata also. My selection command-line programs are SoX (Sound eXchange) for convert MP3 files to 'AIFF' format in order to keep quality and metadata*... (1 Reply)
Discussion started by: temp-usr
1 Replies

2. UNIX for Dummies Questions & Answers

Remove untagged and junk data from an XML

Hi All , I have seen a lot of code samples which suggest how to remove the junk data from and XML , I need a code in unix which removes the junk characters as well as the valid characters those are not in XML tags , for example my XML is as follows : <?xml version="1.0"... (6 Replies)
Discussion started by: IshuGupta
6 Replies

3. Slackware

Problems with audio recording in Audacity 2.0.5. Slackware64 14.1; Intel HD Audio.

I'm trying to record audio using Audacity 2.0.5 installed from SlackBuilds. My system is 64-bit Slackware 14.1 and a sound card is Intel HD Audio. I didn't change my sound system to OSS. (Default sound system in Slackware 14.1 is ALSA, isn't it?) First, I set Internal Microphone slider in KMix... (2 Replies)
Discussion started by: qzxcvbnm
2 Replies

4. Shell Programming and Scripting

Manipulating audio files server side

Hi All, I have next to zero knowledge on what I am about to ask so I will just ask it in plain English :) I am wondering how best to go about manipulating audio files server side. The manipulations required are join files one after the other, eg, audio1 + audio2 + audio3 + audio4 = audio5 ... (0 Replies)
Discussion started by: linuxgoat
0 Replies

5. Shell Programming and Scripting

Script to list files not present in audio.txt file

I am having following folder structure. /root/audios/pop /root/audios/jazz /root/audios/rock Inside those pop, jazz, rock folders there are following files, p1.ul, p2.ul, p3.ul, j1.ul, j2.ul, j3.ul, r1.ul, r2.ul, r3.ul And I have a file named as "audio.txt" in the path /root/audios,... (11 Replies)
Discussion started by: gopikrish81
11 Replies

6. UNIX for Dummies Questions & Answers

Find Audio Files With Specific Bandwidth?

Hi, I would like to write a shell script that will: -search the files of a specific user to find any audio files with a bandwidth iqual or greater than 192 kps - on the results i should see the file name along with all the whole file route and each file's size So I guess i should be using... (1 Reply)
Discussion started by: ubu-user
1 Replies

7. Programming

Playing Audio files in C

Hi All, Looking for an assistance on how to access the speakers of my machine and play audio files using C. Any tutorials will be of great help. Regards, Sayantan. (1 Reply)
Discussion started by: Sayantan
1 Replies
Login or Register to Ask a Question