-
Notifications
You must be signed in to change notification settings - Fork 1
/
CITATION.cff
executable file
·62 lines (59 loc) · 3.05 KB
/
CITATION.cff
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
# Copyright (C) 2024 Roberto Rossini <[email protected]>
#
# SPDX-License-Identifier: MIT
cff-version: 1.2.0
message: 'If you use this software, please cite it using the metadata from this file.'
authors:
- given-names: Roberto
family-names: Rossini
orcid: 'https://orcid.org/0000-0003-3096-1470'
email: [email protected]
affiliation: 'Department of Biosciences, University of Oslo'
title: hictk
abstract: 'Blazing fast toolkit to work with .hic and .cool files.'
doi: '10.5281/zenodo.8214220'
url: 'https://github.com/paulsengroup/hictk'
repository-code: 'https://github.com/paulsengroup/hictk'
repository-artifact: 'https://github.com/paulsengroup/hictk/pkgs/container/hictk'
type: software
license: MIT
keywords:
- bioinformatics
- cxx
- conversion
- cooler
- cli-application
- hic
- cxx17
- cxx-library
- hictk
preferred-citation:
type: article
authors:
- given-names: Roberto
family-names: Rossini
orcid: 'https://orcid.org/0000-0003-3096-1470'
email: [email protected]
affiliation: 'Department of Biosciences, University of Oslo'
- given-names: Jonas
family-names: Paulsen
orcid: 'https://orcid.org/0000-0002-7918-5495'
email: [email protected]
affiliation: 'Department of Biosciences, University of Oslo'
doi: '10.1093/bioinformatics/btae408'
url: 'https://academic.oup.com/bioinformatics/article/40/7/btae408/7698028'
journal: 'Bioinformatics'
year: 2024
month: 06
title: 'hictk: blazing fast toolkit to work with .hic and .cool files'
abstract: >
Hi-C is gaining prominence as a method for mapping genome organization.
With declining sequencing costs and a growing demand for higher-resolution data, efficient tools for processing Hi-C datasets at different resolutions are crucial.
Over the past decade, the .hic and Cooler file formats have become the de-facto standard to store interaction matrices produced by Hi-C experiments in binary format.
Interoperability issues make it unnecessarily difficult to convert between the two formats and to develop applications that can process each format natively.
We developed hictk, a toolkit that can transparently operate on .hic and .cool files with excellent performance.
The toolkit is written in C++ and consists of a C++ library with Python and R bindings as well as CLI tools to perform common operations directly from the shell, including converting between .hic and .mcool formats. We benchmark the performance of hictk and compare it with other popular tools and libraries.
We conclude that hictk significantly outperforms existing tools while providing the flexibility of natively working with both file formats without code duplication.
The hictk library, Python bindings and CLI tools are released under the MIT license as a multi-platform application available at github.com/paulsengroup/hictk.
Pre-built binaries for Linux and macOS are available on bioconda.
Python bindings for hictk are available on GitHub at github.com/paulsengroup/hictkpy, while R bindings are available on GitHub at github.com/paulsengroup/hictkR.