- Bioinformatics with Python Cookbook
- Tiago Antao
- 2021-06-10 19:01:51
Getting ready
Genome sizes vary enormously across organisms. The HIV genome is about 9.7 kbp. Bacteria such as E. coli have genomes of a few Mbp (about 4.6 Mbp for E. coli). The protozoan Plasmodium falciparum has a genome of about 22 Mbp spread across 14 chromosomes, a mitochondrion, and an apicoplast. The fruit fly has three autosomes, a mitochondrion, and X/Y sex chromosomes. The human genome is about 3 Gbp, spread across 22 autosomes, X/Y sex chromosomes, and a mitochondrion. At the far end is Paris japonica, a plant with a genome of about 150 Gbp. Along the way, you will find different ploidy levels and different sex chromosome organizations.
To keep this recipe manageable, we will use a small eukaryotic genome, that of P. falciparum. It still has many features typical of larger genomes (for example, multiple chromosomes), so it is a good compromise between complexity and size. A genome the size of P. falciparum's is small enough that many operations could be performed by loading the whole genome into memory. However, we opted for a programming style that also works with bigger genomes (for example, mammalian ones), so that you can use this recipe more generally. Feel free to use more memory-intensive approaches with small genomes like this one.
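To illustrate the difference between loading a whole genome into memory and processing it record by record, here is a minimal sketch of the streaming style. It is written in plain Python rather than Biopython so that it runs without any install; it is not the recipe's own code. The function name iter_fasta_lengths and the toy records chr_A and chr_B are hypothetical and exist only for this illustration. Biopython's SeqIO.parse follows the same idea, returning an iterator that yields one record at a time.

```python
import io
from typing import Iterator, TextIO, Tuple


def iter_fasta_lengths(handle: TextIO) -> Iterator[Tuple[str, int]]:
    """Yield (record_id, sequence_length) for each FASTA record.

    Hypothetical helper: only a running counter is kept per record,
    so memory use does not grow with chromosome size.
    """
    record_id = None
    length = 0
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one
            if record_id is not None:
                yield record_id, length
            fields = line[1:].split()
            # The ID is the first whitespace-delimited token after '>'
            record_id = fields[0] if fields else ""
            length = 0
        elif record_id is not None:
            # Sequence line: count its bases without storing them
            length += len(line)
    # Emit the final record once the input is exhausted
    if record_id is not None:
        yield record_id, length


# Toy in-memory FASTA standing in for a real genome file (made-up data)
toy_fasta = io.StringIO(
    ">chr_A hypothetical chromosome\n"
    "ACGTACGT\n"
    "ACGT\n"
    ">chr_B hypothetical chromosome\n"
    "GGCC\n"
)

lengths = list(iter_fasta_lengths(toy_fasta))
print(lengths)
```

With a real genome you would pass an open file handle instead of the StringIO object and consume the generator in a loop, so that only one line is held in memory at any time.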
We will use Biopython, which you installed in Chapter 1, Python and the Surrounding Software Ecology. As usual, this recipe is available as a Jupyter Notebook at Chapter03/Reference_Genome.ipynb in the code bundle of this book.