Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it normal for there to be interactions between haps? #34

Open
zoumingr opened this issue Jun 27, 2024 · 3 comments
Open

Is it normal for there to be interactions between haps? #34

zoumingr opened this issue Jun 27, 2024 · 3 comments

Comments

@zoumingr
Copy link

Q:
Haphic is very convenient for processing Hi-C data.
We combined the data of two haps together and then processed it with haphic. The resulting Hi-C heatmap shows some strong interactions between the two haps (as shown in the figure). This situation occurs with both --phasing_weight 0 and 1. I would like to ask if this is normal?
image

The karyotype of the species: 2n=100
The commands and parameters used in running HapHiC: haphic pipeline genome.fas HiC.filtered.bam ${nchrs} --quick_view --threads 20 --processes 60 --gfa "hap1.p_ctg.gfa,hap2.p_ctg.gfa" --phasing_weight 0 --correct_nrounds 2
The log files generated by HapHiC:
2024-06-27 08:50:28 <HapHiC_pipeline.py> [main] Pipeline started, HapHiC version: 1.0.3 (update: 2024.05.20)
2024-06-27 08:50:28 <HapHiC_pipeline.py> [main] Python version: 3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0]
2024-06-27 08:50:28 <HapHiC_pipeline.py> [haphic_cluster] Step1: Execute preprocessing and Markov clustering for contigs...
2024-06-27 08:50:28 <HapHiC_cluster.py> [run] Program started, HapHiC version: 1.0.3 (update: 2024.05.20)
2024-06-27 08:50:28 <HapHiC_cluster.py> [run] Python version: 3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0]
2024-06-27 08:50:28 <HapHiC_cluster.py> [detect_format] The file for Hi-C read alignments is detected as being in BAM format
2024-06-27 08:50:28 <HapHiC_cluster.py> [run] Ultra-long data are not supported now when assembly correction is enabled
2024-06-27 08:50:28 <HapHiC_cluster.py> [parse_fasta] Parsing input FASTA file...
2024-06-27 08:51:14 <HapHiC_cluster.py> [parse_gfa] Parsing input gfa file(s)...
2024-06-27 08:51:41 <HapHiC_cluster.py> [parse_bam_for_correction] Parsing input BAM file for contig correction...
2024-06-27 09:21:02 <HapHiC_cluster.py> [correct_assembly] Performing assembly correction...
2024-06-27 09:22:31 <HapHiC_cluster.py> [correct_assembly] Correction round 1, breakpoints are detected in 6 contig(s)
2024-06-27 09:22:31 <HapHiC_cluster.py> [break_and_update_ctgs] Breaking contigs and updating data...
2024-06-27 09:22:35 <HapHiC_cluster.py> [correct_assembly] Correction round 2, breakpoints are detected in 0 contig(s)
2024-06-27 09:22:35 <HapHiC_cluster.py> [correct_assembly] Generating corrected assembly file...
2024-06-27 09:22:35 <HapHiC_cluster.py> [correct_assembly] 6 contigs were broken into 12 contigs. Writing corrected assembly to corrected_asm.fa...
2024-06-27 09:22:59 <HapHiC_cluster.py> [stat_fragments] Making some statistics of fragments (contigs / bins)
2024-06-27 09:22:59 <HapHiC_cluster.py> [stat_fragments] bin_size is set to 0, no fragments will be split
2024-06-27 09:23:01 <HapHiC_cluster.py> [parse_alignments_for_ctgs] Parsing input alignments...
2024-06-27 09:41:54 <HapHiC_cluster.py> [output_pickle] Writing HT_link_dict to HT_links.pkl...
2024-06-27 09:41:55 <HapHiC_cluster.py> [run] Program finished in 3086.3505272865295s
2024-06-27 09:41:55 <HapHiC_pipeline.py> [haphic_reassign] Step2: Reassign and rescue contigs...
2024-06-27 09:41:55 <HapHiC_reassign.py> [run] Program started, HapHiC version: 1.0.3 (update: 2024.05.20)
2024-06-27 09:41:55 <HapHiC_reassign.py> [run] Python version: 3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0]
2024-06-27 09:41:55 <HapHiC_cluster.py> [parse_fasta] Parsing input FASTA file...
2024-06-27 09:42:18 <HapHiC_cluster.py> [parse_gfa] Parsing input gfa file(s)...
2024-06-27 09:42:18 <HapHiC_reassign.py> [run] Program finished in 23.16846799850464s
2024-06-27 09:42:18 <HapHiC_pipeline.py> [haphic_sort] Step3: Order and orient contigs within each group...
2024-06-27 09:42:18 <HapHiC_sort.py> [run] Program started, HapHiC version: 1.0.3 (update: 2024.05.20)
2024-06-27 09:42:18 <HapHiC_sort.py> [run] Python version: 3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0]
2024-06-27 09:42:18 <HapHiC_sort.py> [run] Checking the path of ALLHiC...
2024-06-27 09:42:18 <HapHiC_sort.py> [run] ALLHiC has been found in /mnt/01_srcs/xxx/HapHiC/scripts
2024-06-27 09:42:18 <HapHiC_sort.py> [parse_fasta] Parsing fasta file...
2024-06-27 09:42:26 <HapHiC_sort.py> [run] Loading input pickle file...
2024-06-27 09:42:26 <HapHiC_sort.py> [run] Parsing group files and clm files...
2024-06-27 09:42:28 <HapHiC_sort.py> [run] Program will be executed in multiprocessing mode (processes=2)
2024-06-27 09:42:28 <HapHiC_sort.py> [fast_sort] [group1_1461626638bp] Performing fast sorting...
2024-06-27 09:42:28 <HapHiC_sort.py> [fast_sort] [group1_1461626638bp] Checking the content of input group file...
2024-06-27 09:42:28 <HapHiC_sort.py> [fast_sort] [group1_1461626638bp] Starting fast sorting iterations...
2024-06-27 09:42:28 <HapHiC_sort.py> [fast_sort] [group2_1792589528bp] Performing fast sorting...
2024-06-27 09:42:28 <HapHiC_sort.py> [fast_sort] [group2_1792589528bp] Checking the content of input group file...
2024-06-27 09:42:28 <HapHiC_sort.py> [fast_sort] [group2_1792589528bp] Starting fast sorting iterations...
<class 'networkx.utils.decorators.argmap'> compilation 34:3: FutureWarning:

shortest_path will return an iterator that yields
(node, path) pairs instead of a dictionary when source
and target are unspecified beginning in version 3.5

To keep the current behavior, use:

    dict(nx.shortest_path(G))

<class 'networkx.utils.decorators.argmap'> compilation 34:3: FutureWarning:

shortest_path will return an iterator that yields
(node, path) pairs instead of a dictionary when source
and target are unspecified beginning in version 3.5

To keep the current behavior, use:

    dict(nx.shortest_path(G))

2024-06-27 10:00:37 <HapHiC_sort.py> [run] Program finished in 1098.1858050823212s
2024-06-27 10:00:37 <HapHiC_pipeline.py> [haphic_build] Step4: Build final scaffolds (pseudomolecules)...
2024-06-27 10:00:37 <HapHiC_build.py> [run] Program started, HapHiC version: 1.0.3 (update: 2024.05.20)
2024-06-27 10:00:37 <HapHiC_build.py> [run] Python version: 3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0]
2024-06-27 10:00:37 <HapHiC_cluster.py> [parse_fasta] Parsing input FASTA file...
2024-06-27 10:01:01 <HapHiC_build.py> [parse_tours] Parsing tour files...
2024-06-27 10:01:01 <HapHiC_build.py> [build_final_scaffolds] Building final scaffolds...
2024-06-27 10:01:43 <HapHiC_build.py> [run] Program finished in 66.46069979667664s
2024-06-27 10:01:43 <HapHiC_pipeline.py> [main] HapHiC pipeline finished in 4274.942216873169s

The methods (commands) used for Hi-C read mapping and filtering:
bwa mem -5SP -t 60 genome.fas ${read1} ${read2} | samblaster | samtools view - -@ 20 -S -h -b -F 3340 -o HiC.bam
filter_bam HiC.bam 1 --nm 3 --threads 20 | samtools view - -b -@ 20 -o HiC.filtered.bam

The method used for genome assembly (e.g., hifiasm + Hi-C) and the assembly utilized for scaffolding (e.g., p_ctg, hap.p_ctg or p_utg):*
hifiasm -t 60 --h1 s_1.fq.gz --h2 s_2.fq.gz ./ccs.fastq

Statistics for the assembly input into HapHiC (e.g., N10-N90 and L10-L90):
#hap1:
StatType ContigLength ContigNumber
N50 14292390 32
N60 12748816 42
N70 9949659 55
N80 6898132 73
N90 3149213 103
Longest 46849406 1
Total 1461626638 985
Length>=1kb 1461626638 985
Length>=2kb 1461626638 985
Length>=5kb 1461626638 985
#hap2:
StatType ContigLength ContigNumber
N50 16020022 37
N60 12619360 50
N70 9100361 66
N80 6754290 89
N90 3293374 127
Longest 43036230 1
Total 1792589528 727
Length>=1kb 1792589528 727
Length>=2kb 1792589528 727
Length>=5kb 1792589528 727

@zengxiaofei
Copy link
Owner

zengxiaofei commented Jun 28, 2024

Hi @zoumingr,

Thank you so much for providing such detailed information. This could save us a lot of time.

In short, it's abnormal.

The resulting Hi-C heatmap shows some strong interactions between the two haps

I have observed that the mapping and filtering processes were performed in accordance with our instructions. Most multiple mapping reads should have been filtered out. Furthermore, the Hi-C signals between allelic contig pairs often exhibit a diagonal distribution, which should not be as strong as the contact map you provided.

This situation occurs with both --phasing_weight 0 and 1.

This parameter does not alter the Hi-C signals on the heatmap. Instead, it's just a weight to control the confidence degree of the phasing result obtained from hifiasm.

I suspect that the heterozygosity level of the species may be too low to resolve the haplotypes using hifiasm + Hi-C. Hifiasm might separate a chromosome into two parts in different hap*.gfa files. You may only be able to assemble 50 chromosomes.

To verify my assumption, could you please tell me the expected genome size (2n) of your species? Additionally, please also provide the log files generated by hifiasm. I intend to check the kmer histograms within the logs to confirm the heterozygosity level and the estimated genome size.

Best wishes,
Xiaofei

@zoumingr
Copy link
Author

Hi, Xiaofei,
Thank you very much for your quick response. Actually, this is the genome of an F1 hybrid between two subspecies, so initially, I did not expect them to have a lower heterozygosity.

However, this reminds me that when I used 3ddna to scaffold the concatenated genome, it could only generate 50 chromosomes (2n=100), but the total sequence length was the sum of the two haploid lengths. This might be because 3ddna merged the two homologous chromosomes together.

If the similarity between the two parents is high, can we still obtain all 100 chromosomes? For example, not providing the phase information of hifiasm to haphic (--gfa "hap1.p_ctg.gfa,hap2.p_ctg.gfa").

Below is the log of the hifiasm assembly.

[M::ha_analyze_count] lowest: count[7] = 951718
[M::ha_analyze_count] highest: count[29] = 63858718
[M::ha_hist_line] 2: *************************************** 24831037
[M::ha_hist_line] 3: ******** 5267100
[M::ha_hist_line] 4: **** 2346618
[M::ha_hist_line] 5: ** 1386582
[M::ha_hist_line] 6: ** 1054289
[M::ha_hist_line] 7: * 951718
[M::ha_hist_line] 8: ** 1018917
[M::ha_hist_line] 9: ** 1223521
[M::ha_hist_line] 10: ** 1579179
[M::ha_hist_line] 11: *** 2207524
[M::ha_hist_line] 12: ***** 2996676
[M::ha_hist_line] 13: ******* 4204834
[M::ha_hist_line] 14: ********* 5813774
[M::ha_hist_line] 15: ************ 7841775
[M::ha_hist_line] 16: **************** 10375542
[M::ha_hist_line] 17: ********************* 13543432
[M::ha_hist_line] 18: *************************** 17328105
[M::ha_hist_line] 19: ********************************** 21742646
[M::ha_hist_line] 20: ****************************************** 26673224
[M::ha_hist_line] 21: ************************************************** 31967269
[M::ha_hist_line] 22: *********************************************************** 37601524
[M::ha_hist_line] 23: ******************************************************************** 43324617
[M::ha_hist_line] 24: ***************************************************************************** 48865423
[M::ha_hist_line] 25: ************************************************************************************ 53845522
[M::ha_hist_line] 26: ******************************************************************************************* 57876983
[M::ha_hist_line] 27: ************************************************************************************************ 61129606
[M::ha_hist_line] 28: *************************************************************************************************** 63304473
[M::ha_hist_line] 29: **************************************************************************************************** 63858718
[M::ha_hist_line] 30: *************************************************************************************************** 63111642
[M::ha_hist_line] 31: ************************************************************************************************ 61236113
[M::ha_hist_line] 32: ******************************************************************************************* 58314259
[M::ha_hist_line] 33: ************************************************************************************* 54364812
[M::ha_hist_line] 34: ****************************************************************************** 49678746
[M::ha_hist_line] 35: ********************************************************************** 44762422
[M::ha_hist_line] 36: ************************************************************** 39556219
[M::ha_hist_line] 37: ****************************************************** 34501673
[M::ha_hist_line] 38: *********************************************** 29763259
[M::ha_hist_line] 39: **************************************** 25457839
[M::ha_hist_line] 40: ********************************** 21764165
[M::ha_hist_line] 41: ***************************** 18591168
[M::ha_hist_line] 42: ************************* 16078998
[M::ha_hist_line] 43: ********************** 14047970
[M::ha_hist_line] 44: ******************** 12539119
[M::ha_hist_line] 45: ****************** 11321542
[M::ha_hist_line] 46: ***************** 10556100
[M::ha_hist_line] 47: **************** 10117716
[M::ha_hist_line] 48: **************** 9989512
[M::ha_hist_line] 49: **************** 10070105
[M::ha_hist_line] 50: **************** 10289069
[M::ha_hist_line] 51: ***************** 10616436
[M::ha_hist_line] 52: ***************** 10980265
[M::ha_hist_line] 53: ****************** 11384721
[M::ha_hist_line] 54: ****************** 11769471
[M::ha_hist_line] 55: ******************* 12105216
[M::ha_hist_line] 56: ******************* 12378075
[M::ha_hist_line] 57: ******************** 12598111
[M::ha_hist_line] 58: ******************** 12715691
[M::ha_hist_line] 59: ******************** 12753934
[M::ha_hist_line] 60: ******************** 12704520
[M::ha_hist_line] 61: ******************* 12441027
[M::ha_hist_line] 62: ******************* 12141941
[M::ha_hist_line] 63: ****************** 11791358
[M::ha_hist_line] 64: ****************** 11315818
[M::ha_hist_line] 65: ***************** 10766789
[M::ha_hist_line] 66: **************** 10182943
[M::ha_hist_line] 67: *************** 9604583
[M::ha_hist_line] 68: ************** 8974050
[M::ha_hist_line] 69: ************* 8364606
[M::ha_hist_line] 70: ************ 7675504
[M::ha_hist_line] 71: *********** 6996430
[M::ha_hist_line] 72: ********** 6336097
[M::ha_hist_line] 73: ********* 5672561
[M::ha_hist_line] 74: ******** 5090053
[M::ha_hist_line] 75: ******* 4518960
[M::ha_hist_line] 76: ****** 3989742
[M::ha_hist_line] 77: ****** 3516783
[M::ha_hist_line] 78: ***** 3078397
[M::ha_hist_line] 79: **** 2649361
[M::ha_hist_line] 80: **** 2282682
[M::ha_hist_line] 81: *** 1976631
[M::ha_hist_line] 82: *** 1695545
[M::ha_hist_line] 83: ** 1466362
[M::ha_hist_line] 84: ** 1286906
[M::ha_hist_line] 85: ** 1137728
[M::ha_hist_line] 86: ** 1008200
[M::ha_hist_line] 87: * 879782
[M::ha_hist_line] 88: * 782881
[M::ha_hist_line] 89: * 703807
[M::ha_hist_line] 90: * 637265
[M::ha_hist_line] 91: * 586153
[M::ha_hist_line] 92: * 538235
[M::ha_hist_line] 93: * 496246
[M::ha_hist_line] 94: * 468550
[M::ha_hist_line] 95: * 443132
[M::ha_hist_line] 96: * 418844
[M::ha_hist_line] 97: * 400617
[M::ha_hist_line] 98: * 383379
[M::ha_hist_line] 99: * 375583
[M::ha_hist_line] 100: * 359506
[M::ha_hist_line] 101: * 345178
[M::ha_hist_line] 102: * 328101
[M::ha_hist_line] rest: *********************************** 22515181
[M::ha_analyze_count] left: none
[M::ha_analyze_count] right: count[59] = 12753934
[M::ha_ft_gen] peak_hom: 59; peak_het: 29
[M::ha_ct_shrink::3416.26510.00] ==> counted 5712595 distinct minimizer k-mers
[M::ha_ft_gen::3430.600
[email protected]] ==> filtered out 5712595 k-mers occurring 295 or more times
[M::ha_opt_update_cov] updated max_n_chain to 295
[M::yak_count] collected 2597123614 minimizers
[M::ha_pt_gen::4050.537*11.57] ==> counted 122118414 distinct minimizer k-mers
[M::ha_pt_gen] count[4095] = 0 (for sanity check)
[M::ha_analyze_count] lowest: count[8] = 74939
[M::ha_analyze_count] highest: count[29] = 2814313
[M::ha_hist_line] 1: ************************************************************************************************> 53232729
[M::ha_hist_line] 2: ********************************************************************************* 2283589
[M::ha_hist_line] 3: ******************** 549180
[M::ha_hist_line] 4: ******** 236948
[M::ha_hist_line] 5: ***** 135251
[M::ha_hist_line] 6: *** 94750
[M::ha_hist_line] 7: *** 77437
[M::ha_hist_line] 8: *** 74939
[M::ha_hist_line] 9: *** 81107
[M::ha_hist_line] 10: *** 96008
[M::ha_hist_line] 11: **** 124380
[M::ha_hist_line] 12: ****** 161347
[M::ha_hist_line] 13: ******** 219131
[M::ha_hist_line] 14: ********** 295523
[M::ha_hist_line] 15: ************** 391784
[M::ha_hist_line] 16: ****************** 509135
[M::ha_hist_line] 17: *********************** 655872
[M::ha_hist_line] 18: ***************************** 829315
[M::ha_hist_line] 19: ************************************* 1032340
[M::ha_hist_line] 20: ********************************************* 1253566
[M::ha_hist_line] 21: ***************************************************** 1491342
[M::ha_hist_line] 22: ************************************************************** 1738335
[M::ha_hist_line] 23: *********************************************************************** 1990420
[M::ha_hist_line] 24: ******************************************************************************* 2229502
[M::ha_hist_line] 25: *************************************************************************************** 2436252
[M::ha_hist_line] 26: ******************************************************************************************** 2602352
[M::ha_hist_line] 27: ************************************************************************************************* 2727692
[M::ha_hist_line] 28: **************************************************************************************************** 2805110
[M::ha_hist_line] 29: **************************************************************************************************** 2814313
[M::ha_hist_line] 30: ************************************************************************************************** 2766794
[M::ha_hist_line] 31: *********************************************************************************************** 2668504
[M::ha_hist_line] 32: ****************************************************************************************** 2529529
[M::ha_hist_line] 33: *********************************************************************************** 2340354
[M::ha_hist_line] 34: **************************************************************************** 2131668
[M::ha_hist_line] 35: ******************************************************************** 1912611
[M::ha_hist_line] 36: ************************************************************ 1684607
[M::ha_hist_line] 37: **************************************************** 1463062
[M::ha_hist_line] 38: ********************************************* 1258505
[M::ha_hist_line] 39: ************************************** 1075572
[M::ha_hist_line] 40: ********************************* 918251
[M::ha_hist_line] 41: **************************** 784941
[M::ha_hist_line] 42: ************************ 680550
[M::ha_hist_line] 43: ********************* 593380
[M::ha_hist_line] 44: ******************* 530702
[M::ha_hist_line] 45: ***************** 480095
[M::ha_hist_line] 46: **************** 448307
[M::ha_hist_line] 47: *************** 429420
[M::ha_hist_line] 48: *************** 424295
[M::ha_hist_line] 49: *************** 426116
[M::ha_hist_line] 50: *************** 433287
[M::ha_hist_line] 51: **************** 444256
[M::ha_hist_line] 52: **************** 458318
[M::ha_hist_line] 53: ***************** 471926
[M::ha_hist_line] 54: ***************** 483809
[M::ha_hist_line] 55: ****************** 494563
[M::ha_hist_line] 56: ****************** 503810
[M::ha_hist_line] 57: ****************** 509707
[M::ha_hist_line] 58: ****************** 511946
[M::ha_hist_line] 59: ****************** 509273
[M::ha_hist_line] 60: ****************** 505158
[M::ha_hist_line] 61: ***************** 491977
[M::ha_hist_line] 62: ***************** 478547
[M::ha_hist_line] 63: **************** 462441
[M::ha_hist_line] 64: **************** 442516
[M::ha_hist_line] 65: *************** 418731
[M::ha_hist_line] 66: ************** 396375
[M::ha_hist_line] 67: ************* 374188
[M::ha_hist_line] 68: ************ 349831
[M::ha_hist_line] 69: ************ 323957
[M::ha_hist_line] 70: *********** 297214
[M::ha_hist_line] 71: ********** 271927
[M::ha_hist_line] 72: ********* 247117
[M::ha_hist_line] 73: ******** 222250
[M::ha_hist_line] 74: ******* 201036
[M::ha_hist_line] 75: ****** 179364
[M::ha_hist_line] 76: ****** 160424
[M::ha_hist_line] 77: ***** 143349
[M::ha_hist_line] 78: ***** 127014
[M::ha_hist_line] 79: **** 111425
[M::ha_hist_line] 80: *** 97749
[M::ha_hist_line] 81: *** 86481
[M::ha_hist_line] 82: *** 77200
[M::ha_hist_line] 83: ** 68855
[M::ha_hist_line] 84: ** 62976
[M::ha_hist_line] 85: ** 57000
[M::ha_hist_line] 86: ** 52288
[M::ha_hist_line] 87: ** 47634
[M::ha_hist_line] 88: ** 44067
[M::ha_hist_line] 89: * 40780
[M::ha_hist_line] 90: * 38423
[M::ha_hist_line] 91: * 36800
[M::ha_hist_line] 92: * 34092
[M::ha_hist_line] 93: * 32716
[M::ha_hist_line] 94: * 31011
[M::ha_hist_line] 95: * 29967
[M::ha_hist_line] 96: * 28744
[M::ha_hist_line] 97: * 27787
[M::ha_hist_line] 98: * 26724
[M::ha_hist_line] 99: * 26216
[M::ha_hist_line] 100: * 25409
[M::ha_hist_line] 101: * 24718
[M::ha_hist_line] 102: * 23761
[M::ha_hist_line] 103: * 23289
[M::ha_hist_line] 104: * 22626
[M::ha_hist_line] 105: * 22270
[M::ha_hist_line] 106: * 21740
[M::ha_hist_line] 107: * 21404
[M::ha_hist_line] 108: * 20977
[M::ha_hist_line] 109: * 20596
[M::ha_hist_line] 110: * 20231
[M::ha_hist_line] 111: * 20177
[M::ha_hist_line] 112: * 19529
[M::ha_hist_line] 113: * 19276
[M::ha_hist_line] 114: * 18611
[M::ha_hist_line] 115: * 18295
[M::ha_hist_line] 116: * 17832
[M::ha_hist_line] 117: * 17752
[M::ha_hist_line] 118: * 17220
[M::ha_hist_line] 119: * 17119
[M::ha_hist_line] 120: * 16521
[M::ha_hist_line] 121: * 16451
[M::ha_hist_line] 122: * 15954
[M::ha_hist_line] 123: * 15989
[M::ha_hist_line] 124: * 15358
[M::ha_hist_line] 125: * 15556
[M::ha_hist_line] 126: * 15023
[M::ha_hist_line] 127: * 15027
[M::ha_hist_line] 128: * 14425
[M::ha_hist_line] rest: ******************************* 875120
[M::ha_analyze_count] left: none
[M::ha_analyze_count] right: count[58] = 511946
[M::ha_pt_gen] peak_hom: 58; peak_het: 29
[M::ha_ct_shrink::4050.759
11.57] ==> counted 68885685 distinct minimizer k-mers
[M::ha_pt_gen::] counting in normal mode
[M::yak_count] collected 2597123614 minimizers
[M::ha_pt_gen::4582.280
13.33] ==> indexed 2543890885 positions, counted 68885685 distinct minimizer k-mers
[M::ha_assemble::13429.190
[email protected]] ==> corrected reads for round 1
[M::ha_assemble] # bases: 92586136902; # corrected bases: 226684780; # recorrected bases: 196056
[M::ha_assemble] size of buffer: 6.176GB
[M::yak_count] collected 2590666568 minimizers
[M::ha_pt_gen::13800.964
43.27] ==> counted 70076266 distinct minimizer k-mers
[M::ha_pt_gen] count[4095] = 0 (for sanity check)
[M::ha_analyze_count] lowest: count[5] = 30406
[M::ha_analyze_count] highest: count[29] = 2787333
[M::ha_hist_line] 1: *****************************************************************************************> 4322128
[M::ha_hist_line] 2: ****** 159385
[M::ha_hist_line] 3: ** 55776
[M::ha_hist_line] 4: * 37119
[M::ha_hist_line] 5: * 30406
[M::ha_hist_line] 6: * 32663
[M::ha_hist_line] 7: * 35814
[M::ha_hist_line] 8: ** 43669
[M::ha_hist_line] 9: ** 54876
[M::ha_hist_line] 10: *** 72950
[M::ha_hist_line] 11: **** 101001
[M::ha_hist_line] 12: ***** 131344
[M::ha_hist_line] 13: ******* 184085
[M::ha_hist_line] 14: ********* 251261
[M::ha_hist_line] 15: ************ 337066
[M::ha_hist_line] 16: **************** 443638
[M::ha_hist_line] 17: ******************** 570390
[M::ha_hist_line] 18: ************************** 730098
[M::ha_hist_line] 19: ********************************* 913719
[M::ha_hist_line] 20: **************************************** 1119998
[M::ha_hist_line] 21: ************************************************ 1344265
[M::ha_hist_line] 22: ********************************************************* 1584661
[M::ha_hist_line] 23: ****************************************************************** 1830220
[M::ha_hist_line] 24: ************************************************************************** 2071730
[M::ha_hist_line] 25: ********************************************************************************** 2296815
[M::ha_hist_line] 26: ***************************************************************************************** 2474309
[M::ha_hist_line] 27: ********************************************************************************************** 2626471
[M::ha_hist_line] 28: ************************************************************************************************** 2741124
[M::ha_hist_line] 29: **************************************************************************************************** 2787333
[M::ha_hist_line] 30: *************************************************************************************************** 2772112
[M::ha_hist_line] 31: ************************************************************************************************* 2705959
[M::ha_hist_line] 32: ********************************************************************************************** 2607176
[M::ha_hist_line] 33: **************************************************************************************** 2439643
[M::ha_hist_line] 34: ********************************************************************************* 2247600
[M::ha_hist_line] 35: ************************************************************************* 2046029
[M::ha_hist_line] 36: ***************************************************************** 1820185
[M::ha_hist_line] 37: ********************************************************* 1596545
[M::ha_hist_line] 38: ************************************************** 1387526
[M::ha_hist_line] 39: ******************************************* 1187280
[M::ha_hist_line] 40: ************************************ 1016943
[M::ha_hist_line] 41: ******************************* 865825
[M::ha_hist_line] 42: *************************** 741106
[M::ha_hist_line] 43: *********************** 644562
[M::ha_hist_line] 44: ******************** 569209
[M::ha_hist_line] 45: ****************** 504719
[M::ha_hist_line] 46: ***************** 460066
[M::ha_hist_line] 47: *************** 430573
[M::ha_hist_line] 48: *************** 416261
[M::ha_hist_line] 49: *************** 409039
[M::ha_hist_line] 50: *************** 415179
[M::ha_hist_line] 51: *************** 424192
[M::ha_hist_line] 52: **************** 434780
[M::ha_hist_line] 53: **************** 447988
[M::ha_hist_line] 54: ***************** 465290
[M::ha_hist_line] 55: ***************** 474441
[M::ha_hist_line] 56: ****************** 488762
[M::ha_hist_line] 57: ****************** 495549
[M::ha_hist_line] 58: ****************** 503119
[M::ha_hist_line] 59: ****************** 502346
[M::ha_hist_line] 60: ****************** 505553
[M::ha_hist_line] 61: ****************** 500965
[M::ha_hist_line] 62: ***************** 487708
[M::ha_hist_line] 63: ***************** 477336
[M::ha_hist_line] 64: ***************** 463681
[M::ha_hist_line] 65: **************** 441428
[M::ha_hist_line] 66: *************** 420095
[M::ha_hist_line] 67: ************** 398016
[M::ha_hist_line] 68: ************* 375416
[M::ha_hist_line] 69: ************* 352560
[M::ha_hist_line] 70: ************ 327385
[M::ha_hist_line] 71: *********** 300961
[M::ha_hist_line] 72: ********** 277638
[M::ha_hist_line] 73: ********* 251297
[M::ha_hist_line] 74: ******** 226585
[M::ha_hist_line] 75: ******* 205968
[M::ha_hist_line] 76: ******* 183880
[M::ha_hist_line] 77: ****** 164705
[M::ha_hist_line] 78: ***** 148781
[M::ha_hist_line] 79: ***** 132423
[M::ha_hist_line] 80: **** 114679
[M::ha_hist_line] 81: **** 101504
[M::ha_hist_line] 82: *** 90222
[M::ha_hist_line] 83: *** 79645
[M::ha_hist_line] 84: *** 71509
[M::ha_hist_line] 85: ** 63788
[M::ha_hist_line] 86: ** 59253
[M::ha_hist_line] 87: ** 52956
[M::ha_hist_line] 88: ** 48936
[M::ha_hist_line] 89: ** 44683
[M::ha_hist_line] 90: * 41577
[M::ha_hist_line] 91: * 39317
[M::ha_hist_line] 92: * 37010
[M::ha_hist_line] 93: * 34986
[M::ha_hist_line] 94: * 33263
[M::ha_hist_line] 95: * 31767
[M::ha_hist_line] 96: * 29880
[M::ha_hist_line] 97: * 29084
[M::ha_hist_line] 98: * 27680
[M::ha_hist_line] 99: * 26979
[M::ha_hist_line] 100: * 26306
[M::ha_hist_line] 101: * 25312
[M::ha_hist_line] 102: * 25102
[M::ha_hist_line] 103: * 23727
[M::ha_hist_line] 104: * 23192
[M::ha_hist_line] 105: * 22134
[M::ha_hist_line] 106: * 21898
[M::ha_hist_line] 107: * 21500
[M::ha_hist_line] 108: * 21415
[M::ha_hist_line] 109: * 21035
[M::ha_hist_line] 110: * 20250
[M::ha_hist_line] 111: * 20677
[M::ha_hist_line] 112: * 19818
[M::ha_hist_line] 113: * 19303
[M::ha_hist_line] 114: * 19105
[M::ha_hist_line] 115: * 18732
[M::ha_hist_line] 116: * 18594
[M::ha_hist_line] 117: * 17826
[M::ha_hist_line] 118: * 17848
[M::ha_hist_line] 119: * 17050
[M::ha_hist_line] 120: * 17301
[M::ha_hist_line] 121: * 16622
[M::ha_hist_line] 122: * 16514
[M::ha_hist_line] 123: * 15846
[M::ha_hist_line] 124: * 15698
[M::ha_hist_line] 125: * 15749
[M::ha_hist_line] 126: * 15549
[M::ha_hist_line] 127: * 14933
[M::ha_hist_line] 128: * 14731
[M::ha_hist_line] 129: * 14517
[M::ha_hist_line] 130: * 14081
[M::ha_hist_line] rest: ******************************** 878484
[M::ha_analyze_count] left: none
[M::ha_analyze_count] right: count[60] = 505553
[M::ha_pt_gen] peak_hom: 60; peak_het: 29
[M::ha_ct_shrink::13801.179
43.27] ==> counted 65754138 distinct minimizer k-mers
[M::ha_pt_gen::] counting in normal mode
[M::yak_count] collected 2590666568 minimizers
[M::ha_pt_gen::14367.624
42.54] ==> indexed 2586344440 positions, counted 65754138 distinct minimizer k-mers
[M::ha_assemble::22455.995
[email protected]] ==> corrected reads for round 2
[M::ha_assemble] # bases: 92607545671; # corrected bases: 11846820; # recorrected bases: 92192
[M::ha_assemble] size of buffer: 5.453GB
[M::yak_count] collected 2590443443 minimizers
[M::ha_pt_gen::22822.381
48.38] ==> counted 66959383 distinct minimizer k-mers
[M::ha_pt_gen] count[4095] = 0 (for sanity check)
[M::ha_analyze_count] lowest: count[5] = 25526
[M::ha_analyze_count] highest: count[29] = 2784445
[M::ha_hist_line] 1: ************************************************ 1330755
[M::ha_hist_line] 2: *** 77361
[M::ha_hist_line] 3: * 34449
[M::ha_hist_line] 4: * 28237
[M::ha_hist_line] 5: * 25526
[M::ha_hist_line] 6: * 29663
[M::ha_hist_line] 7: * 34577
[M::ha_hist_line] 8: ** 42025
[M::ha_hist_line] 9: ** 53260
[M::ha_hist_line] 10: *** 71314
[M::ha_hist_line] 11: **** 99490
[M::ha_hist_line] 12: ***** 129374
[M::ha_hist_line] 13: ******* 182244
[M::ha_hist_line] 14: ********* 248670
[M::ha_hist_line] 15: ************ 334172
[M::ha_hist_line] 16: **************** 440659
[M::ha_hist_line] 17: ******************** 566128
[M::ha_hist_line] 18: ************************** 725131
[M::ha_hist_line] 19: ********************************* 907959
[M::ha_hist_line] 20: **************************************** 1115160
[M::ha_hist_line] 21: ************************************************ 1336805
[M::ha_hist_line] 22: ********************************************************* 1576006
[M::ha_hist_line] 23: ***************************************************************** 1821573
[M::ha_hist_line] 24: ************************************************************************** 2064235
[M::ha_hist_line] 25: ********************************************************************************** 2288405
[M::ha_hist_line] 26: ***************************************************************************************** 2468673
[M::ha_hist_line] 27: ********************************************************************************************** 2619433
[M::ha_hist_line] 28: ************************************************************************************************** 2737621
[M::ha_hist_line] 29: **************************************************************************************************** 2784445
[M::ha_hist_line] 30: **************************************************************************************************** 2771596
[M::ha_hist_line] 31: ************************************************************************************************* 2706157
[M::ha_hist_line] 32: ********************************************************************************************** 2609468
[M::ha_hist_line] 33: **************************************************************************************** 2444097
[M::ha_hist_line] 34: ********************************************************************************* 2252517
[M::ha_hist_line] 35: ************************************************************************** 2052752
[M::ha_hist_line] 36: ****************************************************************** 1827398
[M::ha_hist_line] 37: ********************************************************** 1603771
[M::ha_hist_line] 38: ************************************************** 1395512
[M::ha_hist_line] 39: ******************************************* 1191858
[M::ha_hist_line] 40: ************************************* 1023277
[M::ha_hist_line] 41: ******************************* 871426
[M::ha_hist_line] 42: *************************** 744686
[M::ha_hist_line] 43: *********************** 647522
[M::ha_hist_line] 44: ********************* 571153
[M::ha_hist_line] 45: ****************** 507052
[M::ha_hist_line] 46: ***************** 461572
[M::ha_hist_line] 47: *************** 430919
[M::ha_hist_line] 48: *************** 417259
[M::ha_hist_line] 49: *************** 408123
[M::ha_hist_line] 50: *************** 414078
[M::ha_hist_line] 51: *************** 423955
[M::ha_hist_line] 52: **************** 433149
[M::ha_hist_line] 53: **************** 446281
[M::ha_hist_line] 54: ***************** 464226
[M::ha_hist_line] 55: ***************** 473087
[M::ha_hist_line] 56: ****************** 487583
[M::ha_hist_line] 57: ****************** 494391
[M::ha_hist_line] 58: ****************** 503066
[M::ha_hist_line] 59: ****************** 502191
[M::ha_hist_line] 60: ****************** 504779
[M::ha_hist_line] 61: ****************** 501489
[M::ha_hist_line] 62: ****************** 487812
[M::ha_hist_line] 63: ***************** 477223
[M::ha_hist_line] 64: ***************** 464529
[M::ha_hist_line] 65: **************** 443021
[M::ha_hist_line] 66: *************** 420735
[M::ha_hist_line] 67: ************** 399251
[M::ha_hist_line] 68: ************** 376375
[M::ha_hist_line] 69: ************* 354535
[M::ha_hist_line] 70: ************ 328768
[M::ha_hist_line] 71: *********** 302147
[M::ha_hist_line] 72: ********** 279870
[M::ha_hist_line] 73: ********* 253309
[M::ha_hist_line] 74: ******** 227968
[M::ha_hist_line] 75: ******* 207652
[M::ha_hist_line] 76: ******* 185499
[M::ha_hist_line] 77: ****** 166113
[M::ha_hist_line] 78: ***** 150174
[M::ha_hist_line] 79: ***** 133437
[M::ha_hist_line] 80: **** 115594
[M::ha_hist_line] 81: **** 102836
[M::ha_hist_line] 82: *** 90878
[M::ha_hist_line] 83: *** 80804
[M::ha_hist_line] 84: *** 72148
[M::ha_hist_line] 85: ** 64066
[M::ha_hist_line] 86: ** 59442
[M::ha_hist_line] 87: ** 53544
[M::ha_hist_line] 88: ** 49254
[M::ha_hist_line] 89: ** 45078
[M::ha_hist_line] 90: ** 42110
[M::ha_hist_line] 91: * 39338
[M::ha_hist_line] 92: * 37311
[M::ha_hist_line] 93: * 35202
[M::ha_hist_line] 94: * 33468
[M::ha_hist_line] 95: * 31900
[M::ha_hist_line] 96: * 29976
[M::ha_hist_line] 97: * 29397
[M::ha_hist_line] 98: * 27688
[M::ha_hist_line] 99: * 26970
[M::ha_hist_line] 100: * 26373
[M::ha_hist_line] 101: * 25277
[M::ha_hist_line] 102: * 25099
[M::ha_hist_line] 103: * 23417
[M::ha_hist_line] 104: * 23398
[M::ha_hist_line] 105: * 22435
[M::ha_hist_line] 106: * 21843
[M::ha_hist_line] 107: * 21521
[M::ha_hist_line] 108: * 21445
[M::ha_hist_line] 109: * 21098
[M::ha_hist_line] 110: * 20326
[M::ha_hist_line] 111: * 20605
[M::ha_hist_line] 112: * 19771
[M::ha_hist_line] 113: * 19498
[M::ha_hist_line] 114: * 19044
[M::ha_hist_line] 115: * 18750
[M::ha_hist_line] 116: * 18474
[M::ha_hist_line] 117: * 17927
[M::ha_hist_line] 118: * 17798
[M::ha_hist_line] 119: * 17293
[M::ha_hist_line] 120: * 17197
[M::ha_hist_line] 121: * 16427
[M::ha_hist_line] 122: * 16597
[M::ha_hist_line] 123: * 16003
[M::ha_hist_line] 124: * 15786
[M::ha_hist_line] 125: * 15619
[M::ha_hist_line] 126: * 15443
[M::ha_hist_line] 127: * 15023
[M::ha_hist_line] 128: * 14904
[M::ha_hist_line] 129: * 14550
[M::ha_hist_line] 130: * 14341
[M::ha_hist_line] rest: ******************************** 880939
[M::ha_analyze_count] left: none
[M::ha_analyze_count] right: count[60] = 504779
[M::ha_pt_gen] peak_hom: 60; peak_het: 29
[M::ha_ct_shrink::22822.589
48.38] ==> counted 65628628 distinct minimizer k-mers
[M::ha_pt_gen::] counting in normal mode
[M::yak_count] collected 2590443443 minimizers
[M::ha_pt_gen::23324.905
47.92] ==> indexed 2589112688 positions, counted 65628628 distinct minimizer k-mers
[M::ha_assemble::31468.880
[email protected]] ==> corrected reads for round 3
[M::ha_assemble] # bases: 92609336805; # corrected bases: 1508800; # recorrected bases: 100060
[M::ha_assemble] size of buffer: 5.075GB
[M::yak_count] collected 2590445829 minimizers
[M::ha_pt_gen::31847.932
50.50] ==> counted 66754646 distinct minimizer k-mers
[M::ha_pt_gen] count[4095] = 0 (for sanity check)
[M::ha_analyze_count] lowest: count[5] = 23926
[M::ha_analyze_count] highest: count[29] = 2784481
[M::ha_hist_line] 1: ***************************************** 1153914
[M::ha_hist_line] 2: ** 61415
[M::ha_hist_line] 3: * 29333
[M::ha_hist_line] 4: * 25408
[M::ha_hist_line] 5: * 23926
[M::ha_hist_line] 6: * 28571
[M::ha_hist_line] 7: * 34059
[M::ha_hist_line] 8: * 41511
[M::ha_hist_line] 9: ** 52798
[M::ha_hist_line] 10: *** 71240
[M::ha_hist_line] 11: **** 99170
[M::ha_hist_line] 12: ***** 129043
[M::ha_hist_line] 13: ******* 181849
[M::ha_hist_line] 14: ********* 248393
[M::ha_hist_line] 15: ************ 334081
[M::ha_hist_line] 16: **************** 440105
[M::ha_hist_line] 17: ******************** 565670
[M::ha_hist_line] 18: ************************** 724599
[M::ha_hist_line] 19: ********************************* 907384
[M::ha_hist_line] 20: **************************************** 1114667
[M::ha_hist_line] 21: ************************************************ 1336427
[M::ha_hist_line] 22: ********************************************************* 1575550
[M::ha_hist_line] 23: ***************************************************************** 1820896
[M::ha_hist_line] 24: ************************************************************************** 2063707
[M::ha_hist_line] 25: ********************************************************************************** 2287941
[M::ha_hist_line] 26: ***************************************************************************************** 2468168
[M::ha_hist_line] 27: ********************************************************************************************** 2619429
[M::ha_hist_line] 28: ************************************************************************************************** 2737504
[M::ha_hist_line] 29: **************************************************************************************************** 2784481
[M::ha_hist_line] 30: **************************************************************************************************** 2771497
[M::ha_hist_line] 31: ************************************************************************************************* 2706596
[M::ha_hist_line] 32: ********************************************************************************************** 2609554
[M::ha_hist_line] 33: **************************************************************************************** 2444916
[M::ha_hist_line] 34: ********************************************************************************* 2252782
[M::ha_hist_line] 35: ************************************************************************** 2053227
[M::ha_hist_line] 36: ****************************************************************** 1828061
[M::ha_hist_line] 37: ********************************************************** 1604294
[M::ha_hist_line] 38: ************************************************** 1396078
[M::ha_hist_line] 39: ******************************************* 1192488
[M::ha_hist_line] 40: ************************************* 1023610
[M::ha_hist_line] 41: ******************************* 871847
[M::ha_hist_line] 42: *************************** 744885
[M::ha_hist_line] 43: *********************** 647514
[M::ha_hist_line] 44: ********************* 571199
[M::ha_hist_line] 45: ****************** 507198
[M::ha_hist_line] 46: ***************** 461799
[M::ha_hist_line] 47: *************** 430753
[M::ha_hist_line] 48: *************** 417301
[M::ha_hist_line] 49: *************** 408063
[M::ha_hist_line] 50: *************** 414068
[M::ha_hist_line] 51: *************** 423848
[M::ha_hist_line] 52: **************** 432921
[M::ha_hist_line] 53: **************** 446217
[M::ha_hist_line] 54: ***************** 464269
[M::ha_hist_line] 55: ***************** 472948
[M::ha_hist_line] 56: ****************** 487634
[M::ha_hist_line] 57: ****************** 494526
[M::ha_hist_line] 58: ****************** 502857
[M::ha_hist_line] 59: ****************** 502401
[M::ha_hist_line] 60: ****************** 504737
[M::ha_hist_line] 61: ****************** 501478
[M::ha_hist_line] 62: ****************** 487763
[M::ha_hist_line] 63: ***************** 477362
[M::ha_hist_line] 64: ***************** 464690
[M::ha_hist_line] 65: **************** 443058
[M::ha_hist_line] 66: *************** 420821
[M::ha_hist_line] 67: ************** 399273
[M::ha_hist_line] 68: ************** 376569
[M::ha_hist_line] 69: ************* 354447
[M::ha_hist_line] 70: ************ 329044
[M::ha_hist_line] 71: *********** 302122
[M::ha_hist_line] 72: ********** 279946
[M::ha_hist_line] 73: ********* 253474
[M::ha_hist_line] 74: ******** 228053
[M::ha_hist_line] 75: ******* 207660
[M::ha_hist_line] 76: ******* 185684
[M::ha_hist_line] 77: ****** 166173
[M::ha_hist_line] 78: ***** 150182
[M::ha_hist_line] 79: ***** 133436
[M::ha_hist_line] 80: **** 115764
[M::ha_hist_line] 81: **** 102840
[M::ha_hist_line] 82: *** 90857
[M::ha_hist_line] 83: *** 80800
[M::ha_hist_line] 84: *** 72198
[M::ha_hist_line] 85: ** 64270
[M::ha_hist_line] 86: ** 59333
[M::ha_hist_line] 87: ** 53532
[M::ha_hist_line] 88: ** 49227
[M::ha_hist_line] 89: ** 45123
[M::ha_hist_line] 90: ** 42101
[M::ha_hist_line] 91: * 39375
[M::ha_hist_line] 92: * 37293
[M::ha_hist_line] 93: * 35209
[M::ha_hist_line] 94: * 33523
[M::ha_hist_line] 95: * 31898
[M::ha_hist_line] 96: * 29869
[M::ha_hist_line] 97: * 29385
[M::ha_hist_line] 98: * 27803
[M::ha_hist_line] 99: * 26982
[M::ha_hist_line] 100: * 26452
[M::ha_hist_line] 101: * 25338
[M::ha_hist_line] 102: * 25081
[M::ha_hist_line] 103: * 23383
[M::ha_hist_line] 104: * 23419
[M::ha_hist_line] 105: * 22399
[M::ha_hist_line] 106: * 21830
[M::ha_hist_line] 107: * 21523
[M::ha_hist_line] 108: * 21541
[M::ha_hist_line] 109: * 21052
[M::ha_hist_line] 110: * 20318
[M::ha_hist_line] 111: * 20586
[M::ha_hist_line] 112: * 19763
[M::ha_hist_line] 113: * 19529
[M::ha_hist_line] 114: * 19071
[M::ha_hist_line] 115: * 18771
[M::ha_hist_line] 116: * 18428
[M::ha_hist_line] 117: * 17864
[M::ha_hist_line] 118: * 17875
[M::ha_hist_line] 119: * 17280
[M::ha_hist_line] 120: * 17199
[M::ha_hist_line] 121: * 16401
[M::ha_hist_line] 122: * 16553
[M::ha_hist_line] 123: * 16071
[M::ha_hist_line] 124: * 15798
[M::ha_hist_line] 125: * 15641
[M::ha_hist_line] 126: * 15471
[M::ha_hist_line] 127: * 14997
[M::ha_hist_line] 128: * 14934
[M::ha_hist_line] 129: * 14542
[M::ha_hist_line] 130: * 14269
[M::ha_hist_line] rest: ******************************** 881323
[M::ha_analyze_count] left: none
[M::ha_analyze_count] right: count[60] = 504737
[M::ha_pt_gen] peak_hom: 60; peak_het: 29
[M::ha_ct_shrink::31848.146
50.50] ==> counted 65600732 distinct minimizer k-mers
[M::ha_pt_gen::] counting in normal mode
[M::yak_count] collected 2590445829 minimizers
[M::ha_pt_gen::32386.529
50.06] ==> indexed 2589291915 positions, counted 65600732 distinct minimizer k-mers
[M::ha_assemble::34487.608
[email protected]] ==> found overlaps for the final round
[M::ha_print_ovlp_stat] # overlaps: 285135927
[M::ha_print_ovlp_stat] # strong overlaps: 89378043
[M::ha_print_ovlp_stat] # weak overlaps: 195757884
[M::ha_print_ovlp_stat] # exact overlaps: 253819676
[M::ha_print_ovlp_stat] # inexact overlaps: 31316251
[M::ha_print_ovlp_stat] # overlaps without large indels: 284173173
[M::ha_print_ovlp_stat] # reverse overlaps: 48110116
[M::ha_opt_update_cov_min] updated max_n_chain to 300
Writing reads to disk...
Reads has been written.
Writing ma_hit_ts to disk...
ma_hit_ts has been written.
Writing ma_hit_ts to disk...
ma_hit_ts has been written.
bin files have been written.
[M::purge_dups] homozygous read coverage threshold: 60
[M::purge_dups] purge duplication coverage threshold: 75
[M::ug_ext_gfa::] # tips::104
Writing raw unitig GFA to disk...
[M::ug_ext_gfa::] # tips::1
Writing processed unitig GFA to disk...
[M::purge_dups] homozygous read coverage threshold: 60
[M::purge_dups] purge duplication coverage threshold: 75
[M::mc_solve:: # edges: 1342]
[M::mc_solve_core_adv::0.182] ==> Partition
[M::adjust_utg_by_primary] primary contig coverage range: [51, infinity]
Writing hifiasm.asm.hic.p_ctg.gfa to disk...
[M::ha_opt_update_cov] updated max_n_chain to 300
[M::gen_trans_base_count_comp::2535.322] ==> Qualification
[M::build_unitig_index::186.978] ==> Counting
[M::build_unitig_index::53.057] ==> Memory allocating
[M::build_unitig_index::250.511] ==> Filling pos
[M::build_unitig_index::1.665] ==> Sorting pos
[M::build_unitig_index::492.223] ==> HiC index has been built
[M::write_hc_pt_index] Index has been written.
[M::alignment_worker_pipeline::4165.646] ==> Qualification
[M::dedup_hits::40.699] ==> Dedup
[M::dedup_hits::17.599] ==> Dedup
[M::stat] # misjoined unitigs: 73 (N50: 1541473); # corrected unitigs: 146 (N50: 926380)
[M::adjust_weight_kv_u_trans_advance::163.684]
[M::mc_solve:: # edges: 11691150]
[M::mb_solve_core::283.904] ==> Partition
[M::mc_solve_core_adv::252.898] ==> Partition
[M::adjust_weight_kv_u_trans_advance::1789.880]
[M::mc_solve:: # edges: 11773154]
[M::mb_solve_core::196.932] ==> Partition
[M::mc_solve_core_adv::199.769] ==> Partition
[M::adjust_weight_kv_u_trans_advance::1771.436]
[M::mc_solve:: # edges: 11773174]
[M::mb_solve_core::199.917] ==> Partition
[M::mc_solve_core_adv::244.024] ==> Partition
[M::stat] # heterozygous bases: 3554185693; # homozygous bases: 118744190
[M::reduce_hamming_error_adv::2.966] # inserted edges: 6374, # fixed bubbles: 151
[M::adjust_utg_by_trio] primary contig coverage range: [51, infinity]
[M::recall_arcs] # transitive arcs::1600
[M::recall_arcs] # new arcs::86288, # old arcs::47858
[M::clean_trio_untig_graph] # adjusted arcs::0
[M::adjust_utg_by_trio] primary contig coverage range: [51, infinity]
[M::recall_arcs] # transitive arcs::2318
[M::recall_arcs] # new arcs::86796, # old arcs::50116
[M::clean_trio_untig_graph] # adjusted arcs::0
[M::output_trio_graph_joint] dedup_base::2253157, miss_base::0
Writing hifiasm.asm.hic.hap1.p_ctg.gfa to disk...
Writing hifiasm.asm.hic.hap2.p_ctg.gfa to disk...
Inconsistency threshold for low-quality regions in BED files: 70%
[M::main] Version: 0.19.8-r603
[M::main] CMD: hifiasm -t 60 --h1 WR1_1.fq.gz --h2 WR1_2.fq.gz ./ccs.fastq
[M::main] Real time: 53097.322 sec; CPU: 1958013.253 sec; Peak RSS: 376.421 GB
done

@zengxiaofei
Copy link
Owner

zengxiaofei commented Jun 30, 2024

[M::ha_assemble] # bases: 92586136902; # corrected bases: 226684780; # recorrected bases: 196056

You have generated ~92.6 Gb of HiFi data. Based on the histograms, a main peak appears at ~29X depth, with no noticeable peak lower than this depth. The estimated genome size should be ~3.2 Gb, indicating that my initial assumption was incorrect. Both haplotypes have already been assembled.

I‘m curious about the strong Hi-C signals observed between homologous chromosomes. Several aspects should be verified:

  1. Confirm that the Hi-C data and HiFi data were both obtained from the same individual.
  2. The contact map displays low resolution, with a color range cutoff set too low. Could you provide an example of a pair of homologous chromosomes using a higher color range?
  3. While the methods for read mapping and filtering you provided are correct, please double-check to ensure that the BAM file was correctly used for haphic pipeline and Juicebox visualization .

You may also consider:

  1. Using a higher MAPQ filtering threshold, such as 10.
  2. Using unitigs (p_utg) instread.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants