    Repositories list

    679 repositories

    • Archiving all metadata from YouTube (everything except videos themselves due to size)
      Lua
      The Unlicense
      22352 Updated Jul 14, 2024
    • ArchiveBot, an IRC bot for archiving websites
      Python
      MIT License
      7235213836 Updated Jul 13, 2024
    • wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved
      HTML
      GNU General Public License v3.0
      525110 Updated Jul 7, 2024
    • grab-site
      The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
      Python
      Other
      1301.3k7814 Updated Jul 7, 2024
    • Managing items for priimage-grab.
      Python
      0000 Updated Jun 26, 2024
    • Sources for urls-grab.
      Python
      84141 Updated Jun 26, 2024
    • Managing items for picdig-grab.
      Python
      0000 Updated Jun 26, 2024
    • Archiving プリ画像 (pri-image).
      Lua
      The Unlicense
      1000 Updated Jun 26, 2024
    • Archiving picdig.
      Lua
      The Unlicense
      0000 Updated Jun 25, 2024
    • urls-grab
      Archiving URLs (outlinks) from a variety of sources.
      Lua
      The Unlicense
      51520 Updated Jun 23, 2024
    • Managing items for postnews-grab.
      Roff
      0000 Updated Jun 23, 2024
    • Archiving post.news.
      Lua
      The Unlicense
      0000 Updated May 24, 2024
    • A Dockerfile for the ArchiveTeam Warrior
      Shell
      5727891 Updated May 13, 2024
    • Base Dockerfile for warrior project grab scripts
      Dockerfile
      MIT License
      3401 Updated May 13, 2024
    • Managing items for subscene-grab.
      1200 Updated May 4, 2024
    • Archiving Subscene.
      Lua
      The Unlicense
      5900 Updated May 3, 2024
    • wpull
      Wget-compatible web downloader and crawler.
      HTML
      GNU General Public License v3.0
      7754717620 Updated Apr 29, 2024
    • Archiving comments from the Roblox Marketplace
      Lua
      The Unlicense
      0100 Updated Apr 21, 2024
    • Managing items for roblox-marketplace-comments-grab.
      0000 Updated Apr 10, 2024
    • Archiving part of DeviantArt.
      Lua
      The Unlicense
      0000 Updated Apr 8, 2024
    • Archiving some .onion URLs.
      Python
      0100 Updated Apr 4, 2024
    • Managing items for deviantart-grab.
      0000 Updated Mar 22, 2024
    • Archiving taringa.net.
      Lua
      The Unlicense
      0000 Updated Mar 20, 2024
    • Managing items for taringa-grab.
      Python
      0000 Updated Mar 19, 2024
    • Archiving mediafire.com URLs.
      Lua
      The Unlicense
      5611 Updated Mar 13, 2024
    • Archiving imgur.
      Lua
      The Unlicense
      36500 Updated Mar 7, 2024
    • Archiving vbox7.
      Lua
      The Unlicense
      0000 Updated Feb 21, 2024
    • Grabbing everything from reddit.
      Lua
      The Unlicense
      136030 Updated Feb 16, 2024
    • Managing items for vbox7-grab.
      0000 Updated Feb 1, 2024
    • wget-lua
      Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
      C
      GNU General Public License v3.0
      1490111 Updated Jan 29, 2024