Commit 482901a

committed
update readme
1 parent b25c663 commit 482901a

1 file changed

+0
-17
lines changed

benchmarks/README.md

Lines changed: 0 additions & 17 deletions
@@ -246,25 +246,8 @@ You can enable `mimalloc` or `snmalloc` (to use either the mimalloc or snmalloc
 cargo run --release --features "mimalloc" --bin tpch -- benchmark datafusion --iterations 3 --path ./data --format tbl --query 1 --batch-size 4096
 ```
 
-The benchmark program also supports CSV and Parquet input file formats and a utility is provided to convert from `tbl`
-(generated by the `dbgen` utility) to CSV and Parquet.
-
-```bash
-cargo run --release --bin tpch -- convert --input ./data --output /mnt/tpch-parquet --format parquet
-```
-
 Or if you want to verify and run all the queries in the benchmark, you can just run `cargo test`.
 
-#### Sorted Conversion
-
-The TPCH tables generated by the dbgen utility are sorted by their first column (their primary key for most tables, the `l_orderkey` column for the `lineitem` table.)
-
-To preserve this sorted order information during conversion (useful for benchmarking execution on pre-sorted data) include the `--sort` flag:
-
-```bash
-cargo run --release --bin tpch -- convert --input ./data --output /mnt/tpch-sorted-parquet --format parquet --sort
-```
-
 ### Comparing results between runs
 
 Any `dfbench` execution with `-o <dir>` argument will produce a
