[ARVADOS] updated: 1.3.0-2172-g3d4de36a2
git at public.arvados.org
Fri Feb 14 15:19:23 UTC 2020
Summary of changes:
README.md | 55 +++++++++++++++++++++++++++++++++++++++++--------------
1 file changed, 41 insertions(+), 14 deletions(-)
via 3d4de36a24221e499ed944f5472925581d4e276a (commit)
from 372378584b1d5ab45cde8e3914087d00327777fe (commit)
Those revisions listed above that are new to this repository have
not appeared on any other notification email; so we list those
revisions in full, below.
Author: Peter Amstutz <peter.amstutz at curii.com>
Date: Fri Feb 14 10:19:00 2020 -0500
16080: Align descriptive text with new website
Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz at curii.com>
diff --git a/README.md b/README.md
index 6ed29f391..2f6af250e 100644
@@ -6,24 +6,51 @@
<img align="right" src="doc/images/dax.png" height="240px">
-[Arvados](https://arvados.org) is a free software distributed computing platform
-for bioinformatics, data science, and high throughput analysis of massive data
-sets. Arvados supports a variety of cloud, cluster and HPC environments.
+[Arvados](https://arvados.org) is an open source platform for
+managing, processing, and sharing genomic and other large scientific
+and biomedical data. With Arvados, bioinformaticians run and scale
+compute-intensive workflows, developers create biomedical
+applications, and IT administrators manage large compute and storage
-Arvados consists of:
+The key components of Arvados are:
-* *Keep*: A petabyte-scale content-addressed distributed storage
- system for storing, managing and versioning collections of files.
- Like git for big data. Interoperable data access by a variety of
- methods including WebDAV, FUSE file system mount, and Arvados APIs.
-* *Crunch*: A container-based cloud and HPC workflow engine providing
- strong versioning, reproducibilty, and provenance of large-scale
- computations. Supports [Common Workflow
- Language](https://www.commonwl.org) for describing workflows.
+Keep is the Arvados storage system for managing and storing large
+collections of files. Keep combines content addressing and a
+distributed storage architecture resulting in both high reliability
+and high throughput. Every file stored in Keep can be accurately
+verified every time it is retrieved. Keep supports the creation of
+collections as a flexible way to define data sets without having to
+re-organize or needlessly copy data. Keep works on a wide range of
+underlying filesystems and object stores.
-* Related services and components including a web workbench for managing files
- and compute jobs, REST APIs, SDKs, and other tools.
+Crunch is the orchestration system for running [Common Workflow Language](https://www.commonwl.org) workflows. It is
+designed to maintain data provenance and workflow
+reproducibility. Crunch automatically tracks data inputs and outputs
+through Keep and executes workflow processes in Docker containers. In
+a cloud environment, Crunch optimizes costs by scaling compute on demand.
+The Workbench web application allows users to interactively access
+Arvados functionality. It is especially helpful for querying and
+browsing data, visualizing provenance, and tracking the progress of
+## Command Line
+The command line interface (CLI) provides convenient access to Arvados
+functionality in the Arvados platform from the command line.
+## API and SDKs
+Arvados is designed to be integrated with existing infrastructure. All
+the services in Arvados are accessed through a RESTful API. SDKs are
+available for Python, Go, R, Perl, Ruby, and Java.
# Quick start
More information about the arvados-commits