- Scala Machine Learning Projects
- Md. Rezaul Karim
- 2021-06-30 19:05:43
Population scale clustering and geographic ethnicity
Next-generation genome sequencing (NGS) reduces the overhead and time required for genomic sequencing, producing data at an unprecedented scale. Analyzing this large-scale data, however, is computationally expensive and is increasingly the key bottleneck. The growth of NGS data, both in the overall number of samples and in the number of features per sample, demands massively parallel data processing, which poses extraordinary challenges for machine learning solutions and bioinformatics approaches. Using genomic information in medical practice requires efficient analytical methods that can cope with data from thousands of individuals and millions of their variants.
One of the most important tasks is the analysis of genomic profiles to attribute individuals to specific ethnic populations, or the analysis of nucleotide haplotypes for disease susceptibility. Data from the 1000 Genomes Project serves as the prime source for analyzing genome-wide single nucleotide polymorphisms (SNPs) at scale to predict an individual's ancestry with regard to continental and regional origins.