diff --git a/README.md b/README.md index 5d0b096c1de1..30505d7ca132 100644 --- a/README.md +++ b/README.md @@ -44,8 +44,9 @@ DataFusion is an extensible query engine written in [Rust] that uses [Apache Arrow] as its in-memory format. -The DataFusion libraries in this repository are used to build data-centric system software. DataFusion also provides the -following subprojects, which are packaged versions of DataFusion intended for end users. +This crate provides libraries and binaries for developers building fast and +feature rich database and analytic systems, customized to particular workloads. +See [use cases] for examples. The following related subprojects target end users: - [DataFusion Python](https://github.com/apache/datafusion-python/) offers a Python interface for SQL and DataFrame queries. @@ -54,13 +55,10 @@ following subprojects, which are packaged versions of DataFusion intended for en - [DataFusion Comet](https://github.com/apache/datafusion-comet/) is an accelerator for Apache Spark based on DataFusion. -The target audience for the DataFusion crates in this repository are -developers building fast and feature rich database and analytic systems, -customized to particular workloads. See [use cases] for examples. - -DataFusion offers [SQL] and [`Dataframe`] APIs, -excellent [performance], built-in support for CSV, Parquet, JSON, and Avro, -extensive customization, and a great community. +"Out of the box," +DataFusion offers [SQL] and [`Dataframe`] APIs, excellent [performance], +built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and +a great community. DataFusion features a full query planner, a columnar, streaming, multi-threaded, vectorized execution engine, and partitioned data sources. You can diff --git a/docs/source/index.rst b/docs/source/index.rst index f11670d259bf..27dd58cf50f4 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -36,9 +36,11 @@ Apache DataFusion DataFusion is an extensible query engine written in `Rust `_ that uses `Apache Arrow `_ as its in-memory format. -This documentation is for the `core DataFusion project `_, which contains -libraries that are used to build data-centric system software. DataFusion also offers the following subprojects, which -provide packaged versions of DataFusion intended for end users, and these have separate documentation. +The documentation on this site is for the `core DataFusion project `_, which contains +libraries and binaries for developers building fast and feature rich database and analytic systems, +customized to particular workloads. See `use cases `_ for examples. + +The following related subprojects target end users and have separate documentation. - `DataFusion Python `_ offers a Python interface for SQL and DataFrame queries. @@ -47,10 +49,6 @@ provide packaged versions of DataFusion intended for end users, and these have s - `DataFusion Comet `_ is an accelerator for Apache Spark based on DataFusion. -DataFusion's target users are -developers building fast and feature rich database and analytic systems, -customized to particular workloads. See `use cases `_ for examples. - "Out of the box," DataFusion offers `SQL `_ and `Dataframe `_ APIs, excellent `performance `_, built-in support for CSV, Parquet, JSON, and Avro,