web.archive.org 自建开源替代品 ArchiveBox 试用

综述

web.archive.org 的开源自部署替代品。让你想记住的互联网内容不再被遗忘!

功能全面,超多种保存方式,可同步保存到 web.archive.org。保存下来的网页比较完整,未测试是否会自动滚动保存。有小概率保存不了,例如屏蔽海外的网站。

保存速度较慢。如果保存方式全选的话,单页大概 30~50 MB。

支持标签分类。

界面真的很丑,像是上世纪的产物.jpg

安装还算方便。不过镜像超大,1.7 GB。

截图

lqucrobk.png

lqucz04d.png

奇怪的小技巧……!!

用一些方式把它部署在了子目录(sub path)下。啊我其实是想说把它部署在了这种路径下:https://example.com/arc

用了 Nginx proxy_pass & sub_filter module,目前没太大的问题

location ^~ /arc {
    proxy_pass http://127.0.0.1:8000/; 
    proxy_redirect / /arc/; 
    proxy_cookie_path / /arc/; 
    proxy_set_header Host $host; 
    proxy_set_header X-Real-IP $remote_addr; 
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for; 
    proxy_set_header REMOTE-HOST $remote_addr; 
    proxy_set_header Upgrade $http_upgrade; 
    proxy_set_header Connection "upgrade"; 
    proxy_set_header X-Forwarded-Proto $scheme; 
    proxy_http_version 1.1; 
    add_header Cache-Control no-cache; 
    proxy_set_header Accept-Encoding ""; 
    sub_filter_once off; 
    sub_filter "/admin/" "/arc/admin/"; 
    sub_filter "/archive/" "/arc/archive/"; 
    sub_filter "/static/" "/arc/static/"; 
    sub_filter "<a href=\"/" "<a href=\"/arc/"; 
    sub_filter "/arc/static/admin/js/arc/admin" "/arc/static/admin/js/admin"; 
}
location ^~ /arc/arc/static/ {
    proxy_pass http://127.0.0.1:1780/static/; 
    proxy_set_header Host $host; 
    proxy_set_header X-Real-IP $remote_addr; 
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for; 
    proxy_set_header REMOTE-HOST $remote_addr; 
    proxy_set_header Upgrade $http_upgrade; 
    proxy_set_header Connection "upgrade"; 
    proxy_set_header X-Forwarded-Proto $scheme; 
    proxy_http_version 1.1; 
    add_header Cache-Control no-cache; 
}

location ^~ /arc/static/admin/js/arc/admin/RelatedObjectLookups.js {
    proxy_pass http://127.0.0.1:1780/static/admin/js/admin/RelatedObjectLookups.js; 
    proxy_set_header Host $host; 
    proxy_set_header X-Real-IP $remote_addr; 
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for; 
    proxy_set_header REMOTE-HOST $remote_addr; 
    proxy_set_header Upgrade $http_upgrade; 
    proxy_set_header Connection "upgrade"; 
    proxy_set_header X-Forwarded-Proto $scheme; 
    proxy_http_version 1.1; 
    add_header Cache-Control no-cache; 
}
评论区
头像
文章目录