From 144590b0bf5027329102db1f0bde3491974bcec4 Mon Sep 17 00:00:00 2001
From: Nick Sweeting
Date: Wed, 8 Nov 2023 21:58:16 -0800
Subject: [PATCH] Update README.md

---
 README.md | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index b51dad71..d23e0ef4 100644
--- a/README.md
+++ b/README.md
@@ -10,7 +10,7 @@ Community | Roadmap
-
"Your own personal internet archive" (网站存档 / 爬虫)
+

 $ curl -sSL 'https://get.archivebox.io' | sh
 
@@ -35,14 +35,14 @@ $ curl -sSL 'https://get.archivebox.io' | sh
 **You can feed it URLs one at a time, or schedule regular imports** from browser bookmarks or history, feeds like RSS, bookmark services like Pocket/Pinboard, and more. See input formats for a full list.
-**It saves snapshots of the URLs you feed it in several formats:** HTML, PDF, PNG screenshots, WARC, and more out-of-the-box, with a wide variety of content extracted and preserved automatically (Photos/PDFs/MP3/MP4/ZIP, social media, article text, git repos, etc.). See output formats for a full list.
+**It saves offline-viewable snapshots of the URLs you feed it (in a wide variety of formats: HTML, PDF, PNG, WARC, etc.). It also auto-detects the content featured *inside* each webpage and lets you extract it out to easy common file formats:** `YouTube/SoundCloud/etc. -> mp3/mp4`, `news articles -> article body text`, `github/gitlab/etc. links -> cloned source code`, and more). See output formats for a full list.
 ---
 🏛️ ArchiveBox is for *[professionals](https://zulip.archivebox.io/#narrow/stream/167-enterprise/topic/welcome/near/1191102) and [hobbyists](https://zulip.archivebox.io/#narrow/stream/158-development)* who want to save content off the web, for example:
 - **Individuals:**
- `preserving bookmarks or browsing history`, `backing up photos, videos, docs, etc.`
+ `backing up browser bookmarks/history`, `saving FB/Insta/etc. content`, `shopping lists`
 - **Journalists:**
 `crawling and collecting research`, `preserving quoted material`, `fact-checking and review`
 - **Lawyers:**
@@ -121,7 +121,7 @@ ls ./archive/*/index.json # or browse directly via the filesyste
 - setup & support, team permissioning, hashing, audit logging, backups, custom archiving etc.
 - for **individuals**, **NGOs**, **academia**, **governments**, **journalism**, **law**, and more...
-*All our work is open-source and geared towards non-profits.*
+*All our work is open-source and primarily geared towards non-profits.*
 *Support/consulting pays for hosting and funds new ArchiveBox open-source development.*