cdown / mack Goto Github PK

View Code? Open in Web Editor NEW

36.0 3.0 7.0 179 KB

An opinionated, fast music organiser.

License: MIT License

Rust 100.00%

id3 mp3 music music-organizer organizer rename rust rust-lang audio tagging

mack's Introduction

mack |

mack is an opinionated, fast music organiser. It enforces:

Directory layout
File name format
Metadata consistency (e.g., consistent "feat" tagging)
Format consistency (e.g., ID3 version)
...and more!

Examples of fixes

Moving featured artists from the artist tag to the title
Enforcing a consistent "feat" format in title tags
Whitespace normalisation
Renaming files to format "{artist}/{album}/{track} {title}", or another format specified with --fmt

Usage

See --help. An example invocation is:

% mack --dry-run -o Music .
01 Pyramid.mp3: renamed to Music/宇宙コンビニ/染まる音を確認したら/01 Pyramid.mp3
02 8films.mp3: renamed to Music/宇宙コンビニ/染まる音を確認したら/02 8films.mp3
03 tobira.mp3: renamed to Music/宇宙コンビニ/染まる音を確認したら/03 tobira.mp3
04 Compass.mp3: renamed to Music/宇宙コンビニ/染まる音を確認したら/04 Compass.mp3
05 strings.mp3: renamed to Music/宇宙コンビニ/染まる音を確認したら/05 strings.mp3

You can see what would be changed first using --dry-run.

Installation

cargo install mack

Performance

mack has a strong focus on performance. Files which were not updated since the last mack run will not be examined at all. On a sample modern laptop with a mid-spec SSD, this means that we only take 0.005 seconds to run over ~3500 files under most circumstances (0.015 seconds on the very first run).

Configuration

If you don't want a particular file to be touched by mack, add _NO_MACK as a substring anywhere in the comment tag.

mack's People

Contributors

Stargazers

Watchers

Forkers

atul9 staaas lewisbelcher linsallyzhao ralim oleskiewicz nichokas

mack's Issues

Add way to opt out of fixing specific file through tags

Perhaps _NO_MACK anywhere in the comment would do.

Validate performance implications of running each individual track's fixers asynchronously

This needs to be validated to see if this would actually speed things up with a couple of thousand files, but it doesn't sound unreasonable that it might. Possible granularities:

Track (probably about right since we may have may fixers)
Fixer (this is probably going to slow things down since each fixer is very fast)

Allow custom feat format

Add rename tests

Normalise multiple dots to a single one for exfat

Support custom max path len

Investigate using something else (nom?) instead of regexes

There are currently some limitations with regexes, especially that they don't support lookahead/behind assertions in rust-regex. fancy-regex supports these but has no .replace(), which is a pain in the arse since that's most of what we want to do.

Maybe we should just write a parser with nom.

Add more documentation

At the very least, the readme and --help need enhancement when we're coming to 1.0.

"feat" in title should use "and" with oxford comma

Right now we only ensure that whatever is within the "feat" block is enclosed correctly and has the right feat verb, but not that it uses "and" (instead of "&") and that it uses the Oxford comma.

Consider configuring build configs

https://doc.rust-lang.org/cargo/reference/manifest.html

"&" in artist name may need to be split into title tag

This is somewhat difficult and is outside the current provided infrastructure because it requires one to do some checks first to see if this looks like something that should be split out. For example "Simon & Garfunkel" should stay the same, but this should not as "Die Lit" is already attributed to the primary artist:

Artist: Playboi Carti & Nicki Minaj
Album: Die Lit
Track: Poke It Out

should become

Artist: Playboi Carti
Album: Die Lit
Track: Poke It Out (feat. Nicki Minaj)

...because "Die Lit" being in both shows this is really just a feat tag.

Possible other things to look for:

&
and
feat[uring]
vs
with

Move to use audiotags

Apply Musicbrainz-style capitalisation rules

https://wiki.musicbrainz.org/Style/Language/English

This is obviously not foolproof as some of these rules require evaluation of the meaning of the title in context (and it only applies to English), but we can at least do the basics.

This also might be worth extracting to its own crate.

Add tests for each fixer

Using regexes (for example) is fairly logical for many fixers, but it's hard to validate visually that they work properly and don't regress. Adding tests would help avoid that.

It's probably worth also using quickcheck instead of static examples.

Add non-tag based fixers

The most major one is file renaming based on tags, which should be done after tag cleanup. There are others, though, like making sure case insensitive matches for the same artist or artist/album pair have the same capitalisation.

Tidy up main

Right now main is a cavalcade of nested conditions and matches. This should be cleaned up.

Work out how to deduplicate fix_tag_whitespace

I had intended to use a closure for this, but due to there not being any abstractions over taglib's Rust FFI bindings, we end up in a bit of a pickle as we end up having to borrow tags twice if we want to pass both (say) tags.album and tags.set_album to a function.

I'm not yet entirely certain how to overcome this in a DRY way, so it's implemented manually for now. This needs cleanup for sure.

Add travis tests

We can now run with cargo test.

Investigate taglib performance

Right now it seems the vast majority of our runtime is in taglib deref/malloc/free. This makes me think it might be better to just store the tags in Track.

clean_part should look at directories

After 429084d we now use clean_part for each format, but this isn't quite right -- it should be used for each directory we got.

This is a bit complicated since we don't want to sanitise the user string, but.

Add OSX/Windows tests

Looks like taglib is in homebrew, at least.

Move to pure Rust tagging lib

Pros:

The current way of passing around tags is pretty stilted due to the very C-like API provided by the taglib bindings. This would likely allow some better code structure and avoid the stilted .save() handling now.
taglib has some recent C-specific CVEs (although they don't look too worrying for our threat model).

Cons:

Right now it seems there's no generic tagging library for Rust, so we'd likely have to use separate libraries for separate tag types and abstract over them.
Pure Rust tagging libraries seem not super featureful compared to existing C solutions right now, resulting in issues even with fairly "normal" seeming mp3 files (eg. polyfloyd/rust-id3#27)

Add path renaming

We should rename things to artist/album/trackno title.ext. We need to work out how to specify the base dir, though.

Get rid of excess whitespace

This could be:

Trailing whitespace
Leading whitespace
Multiple whitespaces between words

Upload to crates.io

Add dry run mode

Right now we offer no way of showing to the user what we would do before we do it. We also don't actually show what we did even if we did do it.

At its most basic level, this is basically just gating .save(), but that's not very useful by itself. We should also implement something which prints out the before and after for changed tags.

Keeping the "before" around probably requires some shenanigans, since we'll need to extract them from the taglib::Tag object before modification. It would be ideal not to just extract them all, but only clone when we know that we're actually going to make a change. Maybe Cow can help here.

If showing the before turns out to be too much of a burden, we should at least just show the after with the affected filename.

Artist: X feat. Y
Title: Z

Becomes

Artist: X
Title: Z (feat. Y)

Allow providing multiple inputs

Right now you can only pass one dir, but we should allow multiple.