Download all messages from a Google group

后端 未结 3 2039
我寻月下人不归
我寻月下人不归 2020-12-25 12:16

I\'m in the process of developing a Google apps migration/archive system and at this point in development I\'m trying to come up with a way to download all messages in all t

相关标签:
3条回答
  • 2020-12-25 12:50

    I made a simple scrap utility by using selenium and htmlunit.. you can use it.. it is not very optimized and can help you download messages of small groups only(up-to 7000 msgs)

    https://github.com/himukr/google-grp-scraper

    0 讨论(0)
  • 2020-12-25 12:54

    If you don't mind using #bash, you may try a tool I wrote

    https://github.com/icy/google-group-crawler

    It can download all mbox files from Google Group. If you have a cookie file, you can even download all files from a private Google Group, and/or to see all original emails. It can also read rss feeds and fetch the latest posts ; and this is useful for daily mirror.

    An example result is here http://l.archlinuxvn.org/archlinuxvn/. MHonArch is used to convert mbox files into HTML format.

    0 讨论(0)
  • 2020-12-25 12:57

    Ultimately I ended up using the gdata python library to get a list of all groups along with their respective URLs. From there I used selenium to scrape the groups for messages and all replies. Probably not the best solution but it works for what I need.

    0 讨论(0)
提交回复
热议问题