Pinned

  1. ArchiveBot Public

    ArchiveBot, an IRC bot for archiving websites

    Python, 351 stars, 72 forks

  2. wpull Public

    Wget-compatible web downloader and crawler.

    HTML, 545 stars, 77 forks

  3. IA.BAK Public

    We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.

    Shell, 87 stars, 22 forks

  4. seesaw-kit Public

    A reusable toolkit for writing seesaw scripts

    Python, 69 stars, 30 forks

  5. terroroftinytown Public

    URLTeam's second generation of URL shortener archiving tools

    Python, 69 stars, 15 forks

  6. NewsGrabber Public

    Grabbing all news.

    Python, 62 stars, 32 forks

Repositories

Showing 10 of 679 repositories
  • ludios_wpull Public

    wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved

    HTML, 25 stars, GPL-3.0 license, 5 forks, 11 issues, 0 pull requests, updated Jul 7, 2024
  • grab-site Public

    The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

  • ArchiveBot Public

    ArchiveBot, an IRC bot for archiving websites

    Python, 351 stars, MIT license, 72 forks, 138 issues, 34 pull requests, updated Jul 3, 2024
  • priimage-items Public

    Managing items for priimage-grab.

    Python, 0 stars, 0 forks, 0 issues, 0 pull requests, updated Jun 26, 2024
  • urls-sources Public

    Sources for urls-grab.

    Python, 4 stars, 8 forks, 14 issues, 1 pull request, updated Jun 26, 2024
  • picdig-items Public

    Managing items for picdig-grab.

    Python, 0 stars, 0 forks, 0 issues, 0 pull requests, updated Jun 26, 2024
  • priimage-grab Public

    Archiving プリ画像 (pri-image).

    Lua, 0 stars, Unlicense license, 1 fork, 0 issues, 0 pull requests, updated Jun 26, 2024
  • picdig-grab Public

    Archiving picdig.

    Lua, 0 stars, Unlicense license, 0 forks, 0 issues, 0 pull requests, updated Jun 25, 2024
  • urls-grab Public

    Archiving URLs (outlinks) from a variety of sources.

    Lua, 15 stars, Unlicense license, 5 forks, 2 issues, 0 pull requests, updated Jun 23, 2024
  • postnews-items Public

    Managing items for postnews-grab.

    Roff, 0 stars, 0 forks, 0 issues, 0 pull requests, updated Jun 23, 2024