- Bioinformatics with Python Cookbook
- Tiago Antao
- 187字
- 2021-06-10 19:01:49
Processing NGS data with HTSeq
HTSeq (https://htseq.readthedocs.io) is an alternative library that's used for processing NGS data. Most of the functionality made available by HTSeq is actually available in other libraries covered in this book, but you should be aware of it as an alternative way of processing NGS data. HTSeq supports, among others, FASTA, FASTQ, SAM (via pysam), VCF, GFF, and Browser Extensible Data (BED) file formats. It also includes a set of abstractions for processing (mapped) genomic data, encompassing concepts like genomic positions and intervals or alignments. A complete examination of the features of this library is beyond our scope, so we will concentrate on a small subset of features. We will take this opportunity to also introduce the BED file format.
The BED format allows for the specification of features for annotations tracks. It has many uses, but it's common to load BED files into genome browsers to visualize features. Each line includes information about at least the position (chromosome, start and end) and also optional fields such as name or strand. Full details about the format can be found at https://genome.ucsc.edu/FAQ/FAQformat.html#format1.
- VMware View Security Essentials
- Microsoft Dynamics 365 Extensions Cookbook
- 區(qū)塊鏈:以太坊DApp開發(fā)實戰(zhàn)
- Internet of Things with Intel Galileo
- Learning ArcGIS Pro
- Oracle Database 12c Security Cookbook
- 單片機應用與調(diào)試項目教程(C語言版)
- ANSYS Fluent 二次開發(fā)指南
- SQL經(jīng)典實例(第2版)
- 動手學數(shù)據(jù)結(jié)構(gòu)與算法
- Java并發(fā)編程:核心方法與框架
- Node.js從入門到精通
- Java服務端研發(fā)知識圖譜
- 區(qū)塊鏈原理與技術(shù)應用
- Python快速編程入門