From 01d342f2b56466de7da04fd6e10e7c107d3b1d93 Mon Sep 17 00:00:00 2001
From: Nick Sweeting
Date: Wed, 8 Nov 2023 22:07:27 -0800
Subject: [PATCH] Update README.md

---
 README.md | 14 ++++++++++----
 1 file changed, 10 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index f53450d1..4ede4cf6 100644
--- a/README.md
+++ b/README.md
@@ -2,6 +2,8 @@

ArchiveBox
Open-source self-hosted web archiving.

+
+ ▶️ Quickstart | Demo | GitHub |
@@ -10,7 +12,7 @@
 Community | Roadmap
-
+
@@ -22,9 +24,9 @@
-

+
 
 
@@ -37,7 +39,11 @@ curl -sSL 'https://get.archivebox.io' | sh # (or see pip/brew/Docker instruct
 **You can feed it URLs one at a time, or schedule regular imports** from browser bookmarks or history, feeds like RSS, bookmark services like Pocket/Pinboard, and more. See input formats for a full list.
-**It saves offline-viewable snapshots of the URLs you feed it (in a wide variety of formats: HTML, PDF, PNG, WARC, etc.). It also auto-detects the content featured *inside* each webpage and lets you extract it out to easy common file formats:** `YouTube/SoundCloud/etc. -> mp3/mp4`, `news articles -> article body text`, `github/gitlab/etc. links -> cloned source code`, and more). See output formats for a full list.
+**It saves offline-viewable snapshots of the URLs you feed it in several redundant formats (HTML, PDF, PNG, WARC, etc.).** It also auto-detects the content featured *inside* each webpage extracts it out to common, easy file formats:
+- `YouTube/SoundCloud/etc. -> mp3/mp4`,
+- `news articles -> article body text`
+- `github/gitlab/etc. links -> cloned source code`
+- *[and more...](#output-formats)*
 ---