Mbox format #35

@ghost

Description

Hi,

First off, thanks for these scripts; they were a huge help. This question is possibly out of scope for the project. I downloaded a group with around 90k messages with no issues, after slightly adapting the script output by wget.sh with the modification provided in #32. All the messages are now in $GROUP/mbox as individual files in RFC 822 format.

I am looking to convert these into a single, actual mbox file, but I can't get the format right. I have tried simply concatenating the individual files, but that does not produce a valid mbox file.

find $GROUP/mbox/ -type f | while read f; do cat $f >> tmp.mbox; done

I have also tried formatting it with procmail's formail.

for f in $GROUP/mbox/*; do formail -b < "$f" >> test2.mbox; done

While this command does work, it puts the current time in the "From " separator line instead of the posted date. So when you open the file in, say, mutt, it shows the wrong date.

for f in $GROUP/mbox/*; do formail -a "Date:" < "$f" >> test2.mbox; done

This command creates an invalid mbox file:

mutt -f test2.mbox
Invalid mbox format

Any ideas how I can get this to a valid mbox format?
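
For reference, the layout I am aiming for, as far as I understand it, is the plain "mboxo" style: each message preceded by a "From " separator line carrying a sender and an asctime-style date, any message line that itself starts with "From " escaped with ">", and a blank line after each message. Below is a minimal sketch of that, not something I have run against the full archive. The function names `envelope_date` and `dir_to_mbox` are my own, `MAILER-DAEMON` is just a placeholder sender, and the date conversion assumes GNU `date` (for `date -d`). It also assumes each file holds exactly one message with no existing "From " separator line.

```shell
#!/bin/sh
# Sketch only: build a single mbox file from a directory of RFC 822 files.
# Assumptions: one message per file, GNU date available, no newlines in file names.

# Print an asctime-style UTC date taken from the message's Date: header.
# Falls back to the Unix epoch when the header is missing or cannot be parsed.
envelope_date() {
    # Only look at the header block (stop at the first empty line).
    hdr=$(sed -n -e '/^$/q' -e 's/^[Dd]ate:[[:space:]]*//p' "$1" | head -n 1 | tr -d '\r')
    if [ -n "$hdr" ] && d=$(LC_ALL=C date -u -d "$hdr" '+%a %b %e %H:%M:%S %Y' 2>/dev/null); then
        printf '%s\n' "$d"
    else
        printf '%s\n' 'Thu Jan  1 00:00:00 1970'
    fi
}

# dir_to_mbox SRC_DIR OUT_FILE
# Writes every regular file under SRC_DIR into OUT_FILE as one mbox message.
dir_to_mbox() {
    src=$1
    out=$2
    : > "$out"    # start with an empty output file
    find "$src" -type f | LC_ALL=C sort | while IFS= read -r f; do
        # Separator line: "From <sender> <date>"; the sender is a placeholder.
        printf 'From MAILER-DAEMON %s\n' "$(envelope_date "$f")"
        # Escape lines starting with "From " so they are not read as separators.
        # awk also guarantees the last line ends with a newline.
        awk '{ sub(/^From /, ">From "); print }' "$f"
        # Blank line terminating the message.
        printf '\n'
    done >> "$out"
}
```

With those definitions loaded, the call would be something like `dir_to_mbox "$GROUP/mbox" "$GROUP.mbox"`, followed by `mutt -f "$GROUP.mbox"` to check the result. Messages come out in lexical file-name order, which should not matter for validity.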
