- Bioinformatics with Python Cookbook
- Tiago Antao
- 2021-06-10 19:01:47
Getting ready
As discussed in the previous recipe, we will use data from the 1000 Genomes Project: the exome alignment for chromosome 20 of female NA18489. This file is just 312 MB. By comparison, the whole exome alignment for this individual is 14.2 GB, and the whole genome alignment (at a low coverage of 4x) is 40.1 GB. The data is paired-end, with reads of 76 bp. Paired-end data is common nowadays but slightly more complex to process, and the recipe takes this into account. If your data is not paired, simplify the following recipe accordingly.
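To make the paired-end point concrete, here is a minimal sketch of how pairing information is encoded in the integer flag that every SAM/BAM record carries. The bit values come from the SAM specification; the function names `describe_pairing` and `count_paired` are hypothetical helpers introduced for illustration and are not part of the recipe.

```python
# SAM flag bits relevant to paired-end data (values from the SAM specification).
PAIRED = 0x1         # read is part of a pair
PROPER_PAIR = 0x2    # both mates aligned as the aligner expected
MATE_UNMAPPED = 0x8  # the other read of the pair did not align
READ1 = 0x40         # first read of the pair
READ2 = 0x80         # second read of the pair


def describe_pairing(flag):
    """Return a dict describing the pairing bits of an integer SAM flag."""
    return {
        "paired": bool(flag & PAIRED),
        "proper_pair": bool(flag & PROPER_PAIR),
        "mate_unmapped": bool(flag & MATE_UNMAPPED),
        "read1": bool(flag & READ1),
        "read2": bool(flag & READ2),
    }


def count_paired(flags):
    """Count how many flags in an iterable have the paired bit set."""
    return sum(1 for flag in flags if flag & PAIRED)


if __name__ == "__main__":
    # 99 = 1 + 2 + 32 + 64: paired, proper pair, mate on reverse strand, first in pair
    print(describe_pairing(99))
    # 0 = an unpaired read mapped to the forward strand
    print(describe_pairing(0))
```

With single-end data, the paired bit is never set, so the mate-related checks can simply be dropped.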
As usual, if you use the Jupyter Notebook, the cell at the top of Chapter02/Working_with_BAM.ipynb will download the data for you. If you don't use notebooks, get the data from our dataset list at https://github.com/PacktPublishing/Bioinformatics-with-Python-Cookbook-Second-Edition/blob/master/Datasets.ipynb. The files you will want are NA18490_20_exome.bam and NA18490_20_exome.bam.bai.
We will use pysam, a Python wrapper around the SAMtools C API. It was installed in Chapter 1, Python and the Surrounding Software Ecology.
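Before starting, it can help to confirm that both input files are in place and that pysam can open the alignment. The following is a sketch under stated assumptions: the file names are the ones given in the text, the files are assumed to sit in the current directory, and `missing_inputs` and `open_bam` are hypothetical helper names. Only `pysam.AlignmentFile` and its `references` attribute are actual pysam API.

```python
from pathlib import Path

# File name as given in the text; adjust it if your copy is named differently.
BAM_NAME = "NA18490_20_exome.bam"


def missing_inputs(data_dir=".", bam_name=BAM_NAME):
    """Return the expected input files (BAM and its .bai index) absent from data_dir."""
    expected = [bam_name, bam_name + ".bai"]
    return [name for name in expected if not (Path(data_dir) / name).is_file()]


def open_bam(path):
    """Open a BAM file for reading.

    pysam is imported here, not at module level, so that the file check
    above still works on a machine where pysam is not installed.
    """
    import pysam
    return pysam.AlignmentFile(str(path), "rb")


if __name__ == "__main__":
    missing = missing_inputs()
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        with open_bam(BAM_NAME) as bam:
            # references lists the sequence names declared in the BAM header
            print(bam.references)
```

The index file matters because region-based access to a BAM file relies on it. Checking for it up front avoids a less obvious failure later in the recipe.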