NAME
formail - mail (re)formatter
SYNOPSIS
formail [+skip] [-total] [-bczfrktnedqBY] [-p prefix]
[-D maxlen idcache]
[-x headerfield] [-X headerfield]
[-a headerfield] [-A headerfield]
[-i headerfield] [-I headerfield]
[-u headerfield] [-U headerfield]
[-R oldfield newfield]
[-m minfields] [-s [command [arg ...]]]
DESCRIPTION
formail is a filter that can be used to force mail into
mailbox format, perform `From ' escaping, generate auto-
replying headers, do simple header munging/extracting or
split up a mailbox/digest/articles file. The
mail/mailbox/article contents will be expected on stdin.
If formail is supposed to determine the sender of the mail,
but is unable to find any, it will substitute `foo@bar'.
If formail is started without any command line options, it
will force any mail coming from stdin into mailbox format
and will escape all bogus `From ' lines with a `>'.
OPTIONS
-b Don't escape any bogus mailbox headers (i.e. lines
starting with `From ').
-p prefix
Define a different quotation prefix. If unspecified it
defaults to `>'.
-Y Assume traditional Berkeley mailbox format, ignoring
any Content-Length: fields.
-c Concatenate continued fields in the header. Might be
convenient when postprocessing mail with standard (line
oriented) text utilities.
-z Ensure a space exists between field name and content.
Zap fields which contain only a space. Zap leading and
trailing whitespace on fields extracted with -x.
-f Force formail to simply pass along any non-mailbox
format (i.e. don't generate a `From ' line as the first
line).
-r Generate an auto-reply header. This will normally
throw away all the existing fields (except X-Loop:) in
the original message, fields you wish to preserve need
to be named using the -i option. If you use this
option in conjunction with -k, you can prevent the body
from being `escaped' by also specifying -b.
-k When generating the auto-reply header or when
extracting fields, keep the body as well.
-t Trust the sender to have used a valid return address in
his header. This option will be most useful when
generating auto-reply headers from news articles. If
this option is not turned on, formail tends to favour
machine-generated addresses in the header.
-s The input will be split up into separate mail messages,
and piped into a program one by one (a new program is
started for every part). -s has to be the last option
specified, the first argument following it is expected
to be the name of a program, any other arguments will
be passed along to it. If you omit the program, then
formail will simply concatenate the splitted mails on
stdout again. See FILENO.
-n Tell formail not to wait for every program to finish
before starting the next (causes splits to be processed
in parallel).
-e Do not require empty lines to be preceding the header
of a new message (i.e. the messages could start on
every line).
-d Tell formail that the messages it is supposed to split
need not be in strict mailbox format (i.e. allows you
to split digests/articles or non-standard mailbox
formats). This disables recognition of the Content-
Length: field.
-B Makes formail assume that it is splitting up a BABYL
rmail file.
-m minfields
Allows you to specify the number of consecutive fields
formail needs to find before it decides it found the
start of a new message, it defaults to 2.
-q Tells formail to (still detect but) be quiet about
write errors, duplicate messages and mismatched
Content-Length: fields. This option is on by default,
to make it display the messages use -q-.
-D maxlen idcache
Formail will detect if the Message-ID of the current
message has already been seen using an idcache file of
approximately maxlen size. If not splitting, it will
return success if a duplicate has been found. If
splitting, it will not output duplicate messages. If
used in conjunction with -r, formail will look at the
mail address of the sender instead at the Message-ID.
-x headerfield
Extract the contents of this headerfield from the
header, display it as a single line.
-X headerfield
Same as -x, but also preserves the field name.
-a headerfield
Append a custom headerfield onto the header; but only
if a similar field does not exist yet. If you specify
either one of the field names Message-ID: or Resent-
Message-ID: with no field contents, then formail will
generate a unique message-ID for you.
-A headerfield
Append a custom headerfield onto the header in any
case.
-i headerfield
Same as -A, except that any existing similar fields are
renamed by prepending an ``Old-'' prefix. If
headerfield consists only of a field-name, it will not
be appended.
-I headerfield
Same as -i, except that any existing similar fields are
simply removed. If headerfield consists only of a
field-name, it effectively deletes the field.
-u headerfield
Make the first occurrence of this field unique, and
thus delete all subsequent occurrences of it.
-U headerfield
Make the last occurrence of this field unique, and thus
delete all preceding occurrences of it.
-R oldfield newfield
Renames all occurrences of the fieldname oldfield into
newfield.
+skip
Skip the first skip messages while splitting.
-total
Output at most total messages while splitting.
ENVIRONMENT
FILENO
While splitting, formail assigns the message number
currently being output to this variable. By presetting
FILENO, you can change the initial message number being
used and the width of the zero-padded output. If
FILENO is unset it will default to 000. If FILENO is
non-empty and does not contain a number, FILENO
generation is disabled.
EXAMPLES
To split up a digest one usually uses:
formail +1 -ds >>the_mailbox_of_your_choice
or
formail +1 -ds procmail
To remove all Received: fields from the header:
formail -I Received:
To remove all fields except From: and Subject: from the
header:
formail -k -X From: -X Subject:
To supersede the Reply-To: field in a header you could use:
formail -i "Reply-To: foo@bar"
To convert a non-standard mailbox file into a standard
mailbox file you can use:
formail -ds >new_mailbox
Or, if you have a very tolerant mailer:
formail -a Date: -ds >new_mailbox
To extract the header from a message:
formail -X ""
or
sed -e '/^$/ q'
To extract the body from a message:
formail -I ""
or
sed -e '1,/^$/ d'
SEE ALSO
mail(1), binmail(1), sendmail(8), procmail(1), sed(1), sh(1)
DIAGNOSTICS
Can't fork Too many processes on this machine.
Content-Length: field exceeds actual length by nnn bytes
The Content-Length: field in the
header specified a length that was
longer than the actual body. This
causes this message to absorb a
number of subsequent messages follow-
ing it in the same mailbox.
Couldn't write to stdout
The program that formail was trying
to pipe into didn't accept all the
data formail sent to it; this diag-
nostic can be suppressed by the -q
option.
Duplicate key found: x The Message-ID or sender x in this
message was found in the idcache;
this diagnostic can be suppressed by
the -q option.
Failed to execute "x" Program not in path, or not execut-
able.
File table full Too many open files on this machine.
Invalid field-name: "x"
The specified field-name "x" contains
control characters, or cannot be a
partial field-name for this option.
WARNINGS
You can save yourself and others a lot of grief if you try
to avoid using this autoreply feature on mails coming
through mailinglists. Depending on the format of the incom-
ing mail (which in turn depends on both the original
sender's mail agent and the mailinglist setup) formail could
decide to generate an autoreply header that replies to the
list.
BUGS
When formail has to generate a leading `From ' line it nor-
mally will contain the current date. If formail is given
the option `-a Date:', it will use the date from the `Date:'
field in the header (if present). However, since formail
copies it verbatim, the format will differ from that expect-
ed by most mail readers.
If formail is instructed to delete or rename the leading
`From ' line, it will not automatically regenerate it as
usual. To force formail to regenerate it in this case, in-
clude -a 'From '.
If formail is not called as the first program in a pipe and
it is told to split up the input in several messages, then
formail will not terminate until the program it receives the
input from terminates itself.
MISCELLANEOUS
Formail is eight-bit clean.
When formail has to determine the sender's address, every
RFC 822 conforming mail address is allowed. Formail will
always strip down the address to its minimal form (deleting
excessive comments and whitespace).
The regular expression that is used to find `real' postmarks
is:
"\n\nFrom [\t ]*[^\t\n ]+[\t ]+[^\n\t ]"
If a Content-Length: field is found in a header, formail
will copy the number of specified bytes in the body verbatim
before resuming the regular scanning for message boundaries
(except when splitting digests or Berkeley mailbox format is
assumed).
NOTES
Calling up formail with the -h or -? options will cause it
to display a command-line help page.
SOURCE
This program is part of the procmail mail-processing-package
(v3.10 1994/10/31) available at your nearest USENET
comp.sources.misc archive, or at ftp.informatik.rwth-
aachen.de as pub/packages/procmail/procmail.tar.gz.
MAILINGLIST
There exists a mailinglist for questions relating to any
program in the procmail package:
procmail@informatik.rwth-aachen.de
for submitting questions/answers.
procmail-request@informatik.rwth-aachen.de
for subscription requests.
AUTHOR
Stephen R. van den Berg at RWTH-Aachen, Germany
berg@pool.informatik.rwth-aachen.de