The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. The utilities are largely based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM. Using BEDTools, one can develop sophisticated pipelines that answer complicated research questions by "streaming" several BEDTools together. The following are examples of common questions that one can address with BEDTools.
Intersecting two BED files in search of overlapping features.
Culling/refining/computing coverage for BAM alignments based on genome features.
Merging overlapping features.
Screening for paired-end (PE) overlaps between PE sequences and existing genomic features.
Calculating the depth and breadth of sequence coverage across defined "windows" in a genome.
Screening for overlaps between "split" alignments and genomic features.
Version on CSC's Servers
BEDTools can also be used through the Chipster graphical user interface
To use in Taito
module load biokit
BEDtools can be very picky about the formatting of the BED files. If you have problems running the tools, try running these commands on your BED file:
dos2unix <filename> sed -i -e '$a\' <filename>
The first command will convert the line endings in the file to be Unix-compliant and the second command will add a new line character to the end of the file if it is missing.
When you use BEDTools, please cite:
Quinlan AR and Hall IM, 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 26, 6, pp. 841–842.
See BEDTools homepage for details.