|
2 | 2 |
|
3 | 3 | An automated protocol for finishing long-read genome assemblies using short reads. [ntEdit](https://github.com/bcgsc/ntEdit) polishes the draft assembly and flags erroneous regions, then [Sealer](https://github.com/bcgsc/abyss/tree/master/Sealer) fills assembly gaps and erroneous sequence regions flagged by ntEdit. The protocol is implemented as a Makefile pipeline. |
4 | 4 |
|
| 5 | + |
| 6 | + |
5 | 7 | ## Dependencies |
6 | 8 |
|
7 | 9 | - GNU Make |
@@ -44,27 +46,25 @@ The corrected, finished assembly can be found with the suffix `.ntedit_edited.pr |
44 | 46 |
|
45 | 47 | ## Help Page |
46 | 48 | ``` |
47 | | -ntEdit+Sealer assembly finishing protocol |
48 | | -
|
49 | 49 | Usage: ntedit-sealer finish [OPTION=VALUE] |
50 | 50 |
|
51 | 51 | General options: |
52 | | -seqs Draft assembly name [seqs]. File must have .fa extension |
53 | | -reads Read file(s). All files must have .fq.gz extension. If multiple read files, list must be separated by spaces and surrounded by quotes |
54 | | -k K-mer sizes. List must be descending, separated by spaces and surrounded by quotes |
55 | | -t Number of threads [8] |
56 | | -time If True, will log the time for each step [False] |
| 52 | +seqs Draft assembly name [seqs]. File must have .fa extension |
| 53 | +reads Read file(s). All files must have .fq.gz extension. Must be separated by spaces and surrounded by quotes |
| 54 | +k K-mer sizes. List must be descending, separated by spaces and surrounded by quotes |
| 55 | +t Number of threads [8] |
| 56 | +time If True, will log the time for each step [False] |
57 | 57 |
|
58 | 58 | ntEdit options: |
59 | | -X Ratio of number of kmers in the k subset that should be missing in order to attempt fix (higher=stringent) [0.5] |
60 | | -Y Ratio of number of kmers in the k subset that should be present to accept an edit (higher=stringent) [0.5] |
| 59 | +X Ratio of number of kmers in the k subset that should be missing in order to attempt fix (higher=stringent) [0.5] |
| 60 | +Y Ratio of number of kmers in the k subset that should be present to accept an edit (higher=stringent) [0.5] |
61 | 61 |
|
62 | 62 | ABySS-bloom options: |
63 | | -b Bloom filter size (e.g. 100M) |
| 63 | +b Bloom filter size (e.g. 100M) |
64 | 64 |
|
65 | 65 | Sealer options: |
66 | | -L Length of flanks to be used as pseudoreads [100] |
67 | | -P Maximum alternate paths to merge; use 'nolimit' for no limit [10] |
| 66 | +L Length of flanks to be used as pseudoreads [100] |
| 67 | +P Maximum alternate paths to merge; use 'nolimit' for no limit [10] |
68 | 68 |
|
69 | 69 | Notes: |
70 | 70 | - Pass all parameter list values (reads, k) as space-separated values surrounded by quotation marks, e.g. k='80 65 50' |
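As a rough illustration of the note above, the sketch below assembles a `finish` invocation with quoted, space-separated list values and checks that the k-mer sizes are descending before launching. The input names (`draft`, `readsA`, `readsB`) are hypothetical, and passing them without file extensions is an assumption, since the help text shown here does not say whether `seqs` and `reads` take base names or full file names. The command is only printed, not executed.

```shell
# Hypothetical inputs; replace with your own files.
seqs="draft"             # assumption: base name of draft.fa
reads="readsA readsB"    # assumption: base names of the .fq.gz read files
k="80 65 50"             # k-mer sizes, listed in descending order

# Verify that the k-mer sizes are descending.
# Repeated sizes are also treated as invalid here (an assumption).
prev=""
k_ok=1
for size in $k; do
  if [ -n "$prev" ] && [ "$size" -ge "$prev" ]; then
    k_ok=0
  fi
  prev="$size"
done

# List values (reads, k) are wrapped in quotes, as the Notes require.
cmd="ntedit-sealer finish seqs=$seqs reads='$reads' k='$k' b=100M t=8"

if [ "$k_ok" -eq 1 ]; then
  echo "$cmd"    # dry run: print the command instead of running it
else
  echo "k values must be descending: $k" >&2
fi
```

Printing the command first makes it easy to confirm the quoting before committing to a long run; once it looks right, the printed line can be pasted into the shell as-is.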
|