Trying out ArchiveBox, a self-hosted open-source alternative to web.archive.org
Overview
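An open-source, self-hosted alternative to web.archive.org.
It offers several archiving methods and can also submit pages to web.archive.org. Saved pages come out fairly complete, though I have not tested whether it scrolls the page automatically before saving. Occasionally a page cannot be saved, for example on sites that block overseas visitors.
Saving is slow. With every archiving method enabled, a single page takes roughly 30 to 50 MB.
Supports organizing snapshots with tags.
The UI is really ugly, like a relic from the last century.jpg
Installation is reasonably easy, but the image is huge: 1.7 GB.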
Screenshots
A weird little trick...!!
I managed to deploy it under a subdirectory (sub path). What I mean is a path like this: https://example.com/arc
I used Nginx proxy_pass and the sub_filter module; so far there are no major problems.
location ^~ /arc {
    # Forward everything under /arc to ArchiveBox and map redirects and cookies back under /arc.
    proxy_pass http://127.0.0.1:8000/;
    proxy_redirect / /arc/;
    proxy_cookie_path / /arc/;
    proxy_set_header Host $host;
    proxy_set_header X-Real-IP $remote_addr;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    proxy_set_header REMOTE-HOST $remote_addr;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
    proxy_set_header X-Forwarded-Proto $scheme;
    proxy_http_version 1.1;
    add_header Cache-Control no-cache;
    # Ask the upstream for uncompressed responses so sub_filter can rewrite them.
    proxy_set_header Accept-Encoding "";
    # Rewrite every occurrence of the absolute paths in the response body, not just the first.
    sub_filter_once off;
    sub_filter "/admin/" "/arc/admin/";
    sub_filter "/archive/" "/arc/archive/";
    sub_filter "/static/" "/arc/static/";
    sub_filter "<a href=\"/" "<a href=\"/arc/";
    sub_filter "/arc/static/admin/js/arc/admin" "/arc/static/admin/js/admin";
}
location ^~ /arc/arc/static/ {
    # Catches static asset URLs that end up with a doubled /arc prefix.
    # The upstream port here is 1780, while the /arc block uses 8000;
    # point both at the port ArchiveBox actually listens on.
    proxy_pass http://127.0.0.1:1780/static/;
    proxy_set_header Host $host;
    proxy_set_header X-Real-IP $remote_addr;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    proxy_set_header REMOTE-HOST $remote_addr;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
    proxy_set_header X-Forwarded-Proto $scheme;
    proxy_http_version 1.1;
    add_header Cache-Control no-cache;
}
location ^~ /arc/static/admin/js/arc/admin/RelatedObjectLookups.js {
    # Catches the one script URL where the "/admin/" rewrite also hits the
    # inner path segment (.../js/admin/ becomes .../js/arc/admin/).
    proxy_pass http://127.0.0.1:1780/static/admin/js/admin/RelatedObjectLookups.js;
    proxy_set_header Host $host;
    proxy_set_header X-Real-IP $remote_addr;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    proxy_set_header REMOTE-HOST $remote_addr;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
    proxy_set_header X-Forwarded-Proto $scheme;
    proxy_http_version 1.1;
    add_header Cache-Control no-cache;
}
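Why the extra location for RelatedObjectLookups.js is needed can be seen with a rough Python simulation of the sub_filter rules. This is only a sketch, not nginx itself: it assumes the rules are applied in a single left-to-right pass and that replaced text is not scanned again, and the names RULES and rewrite are made up for this illustration.

```python
import re

# The first four sub_filter rules from the /arc location.
RULES = {
    "/admin/": "/arc/admin/",
    "/archive/": "/arc/archive/",
    "/static/": "/arc/static/",
    '<a href="/': '<a href="/arc/',
}

# One alternation over all patterns: the leftmost match wins at each step.
PATTERN = re.compile("|".join(re.escape(k) for k in RULES))


def rewrite(body: str) -> str:
    """Apply all rules in one pass (assumed behaviour, see the note above)."""
    return PATTERN.sub(lambda m: RULES[m.group(0)], body)


if __name__ == "__main__":
    # "/static/" is rewritten first, then "/admin/" matches again inside the path.
    print(rewrite('<script src="/static/admin/js/admin/RelatedObjectLookups.js">'))
    # A plain admin link gets only one prefix.
    print(rewrite('<a href="/admin/">'))
```

Under these assumptions the script path comes out as /arc/static/admin/js/arc/admin/RelatedObjectLookups.js, which is exactly the path the last location block catches and maps back to the real file.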
https://blog.lzc256.com/posts/archivebox-experience/