Renaming a bunch of files and directories can be tedious, error-prone work. Command-line tools to perform such tasks are numerous. Perhaps the most classic example was the Perl rename script, which has been available or installable on most Unix-inspired operating systems since the early 1990s.
The core idea of rename
was excellent. The user supplied a snippet of Perl
code as a command-line argument, followed by the original paths. Each original
path was pumped through the code snippet to generate the corresponding new
path. Because Perl was designed to make it easy to manipulate strings with very
little code, users could efficiently rename paths directly on the command line.
Even if you hardly knew Perl but at least understood how to operate its compact
regular-expression substitution syntax, you could become quite adept at bulk
path renaming.
$ rename 's/foo/bar/' *
Unfortunately, the script was a chainsaw – undeniably useful, but able to
inflict devastation after a single false move. As a result, I rarely used
rename
directly for my bulk renaming needs, which were extensive on several
projects I worked on. Instead, I wrote my own Perl script to the job. Its
operation was roughly the same, but it included precautions to help me avoid
disastrous mistakes. The most important were checking that the new paths did
not collide with existing paths on the file system and including an inspection
and confirmation step by default.
The mvs
library is an updated and enhanced version of those ideas, but
implemented in a language I use regularly (Python) rather than one in which
I have become rusty (Perl).
The primary use case envisioned for the library is its executable. In broad
terms, there are two ways to perform bulk renaming with the mvs
command: (1)
the user provides original file paths and a snippet of Python code to perform
the original-to-new computation, or (2) the user provides both original paths
and new paths directly.
Either way, before any renaming occurs, the following checks occur: are the original paths different than their corresponding new paths; do all of the original paths exist; do any new paths already exist; do any new paths collide with each other; and are the parent directories of any new paths missing? If those checks look alright, the proposed renamings are listed for inspection by the user, and renaming occurs only after confirmation.
The script provides command-line options to customize its behavior:
-
Supply input paths in various ways: positional arguments, STDIN, a text data file, or the clipboard.
-
Specify the structure of the input paths data: a flat sequence, two blank-delimited paragraphs, alternating pairs of lines, or delimited rows.
-
Use a snippet of Python code to filter out original paths before renaming, which can be handy if you want to supply paths via a command-line glob pattern but do not want to rename all of them.
-
Specify in advance how the program should respond to the validation problems listed above: skip the item with the problem, rename in spite of the problem (even it that means overwriting other paths), or take remedial action (create a missing parent).
-
Customize the start and skip values for a sequence number that can be used in the renaming code snippet.
-
Request dryrun mode, which executes the filtering, checking, and listing behavior but does not rename anything.
Install the library in the usual way.
$ pip install mvs
Get help and additional details regarding the options summarized above.
$ mvs --help
$ mvs --details
In general terms, the executable has the following usage. Note that the default
structure is flat and that the --rename
option is considered structural
because it implies that the input path data consists solely of original paths.
mvs SOURCE [STRUCTURE] [OTHER]
PATHS : positionals
SOURCE : PATHS | --stdin | --file PATH | --clipboard
STRUCTURE : --flat | --paragraphs | --pairs | --rows | --rename CODE
OTHER : other options
The different input structures can be illustrated with a simple renaming scenario that adds a file extension to the original paths. Note that if the paths were supplied via a source other than positional arguments, each path should be on its own line.
# The default: a flat sequence of paths.
$ mvs a b a.new b.new
$ mvs a b a.new b.new --flat
# Alternating pairs: old, new, etc.
$ mvs a a.new b b.new --pairs
# Paragraphs delimited by at least one blank.
$ mvs a b '' a.new b.new --paragraphs
The same renaming scenario could also be performed via a code snippet. The
snippet will be compiled into a function that receives the original path as the
local variable o
. See the program's help text for additional details about
user-supplied code.
$ mvs a b --rename 'return r"{o}.new"'
The mvs package also supports bulk renaming via a programmatic API. The first
step is to configure a RenamingPlan
. Initialization parameters and their
defaults are as follows.
from mvs import RenamingPlan
plan = RenamingPlan(
# Sequence of paths and their structure.
inputs,
structure = 'flat',
# User-supplied renaming and filtering code (str or callable).
# See mvs --details for additional information.
rename_code = None,
filter_code = None,
# Other parameters related to user-supplied code.
indent = 4,
seq_start = 1,
seq_step = 1,
# Problem controls. For each control mechanism, supply the
# names of the problems to be controlled via the mechanism,
# either as a sequence or space-delimited str.
# See mvs --details for the problem names.
skip = None,
clobber = None,
create = None,
)
plan.rename_paths()
If you do not want to rename paths immediately but do want to prepare everything for renaming, including performing the checks for problems, you can use the library in a more deliberative fashion: first prepare; then check the information provided by the plan; if desired, proceed with renaming; and in the event of unexpected failure, get information about which item led to the exception.
# The library's supported imports.
from mvs import RenamingPlan, MvsError, __version__
# Configure plan.
plan = RenamingPlan(...)
# Prepare for renaming.
plan.prepare()
# All relevant information about the plan and its original-new path pairs.
print(plan.as_dict)
# Whether preparation failed due to problems and what they are.
print(plan.failed)
print(plan.uncontrolled_problems)
# Try to rename.
try:
plan.rename_paths()
except Exception as e:
# The index of the original-new pair that was being renamed
# when the exception occurred. Pairs before that index were
# renamed succesfully; pairs after it were not attempted.
print(plan.tracking_index)
# The offending original-new pair.
print(plan.tracking_rp)