期刊论文详细信息
BMC Bioinformatics
ERStruct: a fast Python package for inferring the number of top principal components from whole genome sequencing data
Software
Zhonghua Liu1  Gao Wang2  Yuyang Xu3  Minhao Yao3  Jinghan Yang3 
[1] Department of Biostatistics, Columbia University, New York, NY, USA;Department of Neurology, Gertrude. H. Sergievsky Center, Columbia University, New York, NY, USA;Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam, Hong Kong SAR, China;
关键词: Population structure;    Principal component;    Random matrix theory;    Sequencing data;    Spectral analysis;   
DOI  :  10.1186/s12859-023-05305-0
 received in 2022-11-07, accepted in 2023-04-25,  发布年份 2023
来源: Springer
PDF
【 摘 要 】

BackgroundLarge-scale multi-ethnic DNA sequencing data is increasingly available owing to decreasing cost of modern sequencing technologies. Inference of the population structure with such sequencing data is fundamentally important. However, the ultra-dimensionality and complicated linkage disequilibrium patterns across the whole genome make it challenging to infer population structure using traditional principal component analysis based methods and software.ResultsWe present the ERStruct Python Package, which enables the inference of population structure using whole-genome sequencing data. By leveraging parallel computing and GPU acceleration, our package achieves significant improvements in the speed of matrix operations for large-scale data. Additionally, our package features adaptive data splitting capabilities to facilitate computation on GPUs with limited memory.ConclusionOur Python package ERStruct is an efficient and user-friendly tool for estimating the number of top informative principal components that capture population structure from whole genome sequencing data.

【 授权许可】

CC BY   
© The Author(s) 2023

【 预 览 】
附件列表
Files Size Format View
RO202308152778762ZK.pdf 2180KB PDF download
41116_2023_36_Article_IEq471.gif 1KB Image download
MediaObjects/12888_2023_4840_MOESM2_ESM.pdf 221KB PDF download
MediaObjects/12888_2023_4756_MOESM3_ESM.docx 223KB Other download
41116_2023_36_Article_IEq627.gif 1KB Image download
MediaObjects/12888_2023_4756_MOESM4_ESM.docx 372KB Other download
41116_2023_36_Article_IEq631.gif 1KB Image download
41116_2023_36_Article_IEq633.gif 1KB Image download
MediaObjects/12888_2023_4756_MOESM5_ESM.docx 301KB Other download
41116_2023_36_Article_IEq639.gif 1KB Image download
MediaObjects/12888_2023_4866_MOESM1_ESM.docx 279KB Other download
Fig. 2 243KB Image download
41116_2023_36_Article_IEq680.gif 1KB Image download
41116_2023_36_Article_IEq684.gif 1KB Image download
41116_2023_36_Article_IEq689.gif 1KB Image download
41116_2023_36_Article_IEq692.gif 1KB Image download
40517_2023_258_Article_IEq34.gif 1KB Image download
MediaObjects/12888_2023_4753_MOESM2_ESM.pdf 487KB PDF download
41116_2023_36_Article_IEq696.gif 1KB Image download
41116_2023_36_Article_IEq698.gif 1KB Image download
41116_2023_36_Article_IEq699.gif 1KB Image download
41116_2023_36_Article_IEq700.gif 1KB Image download
40517_2023_258_Article_IEq40.gif 1KB Image download
41116_2023_36_Article_IEq701.gif 1KB Image download
41116_2023_36_Article_IEq702.gif 1KB Image download
41116_2023_36_Article_IEq706.gif 1KB Image download
41116_2023_36_Article_IEq708.gif 1KB Image download
41116_2023_36_Article_IEq710.gif 1KB Image download
40517_2023_258_Article_IEq46.gif 1KB Image download
Fig. 2 1327KB Image download
41116_2023_36_Article_IEq714.gif 1KB Image download
41116_2023_36_Article_IEq716.gif 1KB Image download
41116_2023_36_Article_IEq718.gif 1KB Image download
40517_2023_258_Article_IEq51.gif 1KB Image download
41116_2023_36_Article_IEq720.gif 1KB Image download
41116_2023_36_Article_IEq722.gif 1KB Image download
Fig. 6 931KB Image download
41116_2023_36_Article_IEq724.gif 1KB Image download
41116_2023_36_Article_IEq726.gif 1KB Image download
Fig. 1 169KB Image download
Fig. 3 965KB Image download
40517_2023_258_Article_IEq58.gif 1KB Image download
41116_2023_36_Article_IEq731.gif 1KB Image download
41116_2023_36_Article_IEq734.gif 1KB Image download
Fig. 1 262KB Image download
41116_2023_36_Article_IEq797.gif 1KB Image download
41116_2023_36_Article_IEq799.gif 1KB Image download
41116_2023_36_Article_IEq801.gif 1KB Image download
41116_2023_36_Article_IEq803.gif 1KB Image download
41116_2023_36_Article_IEq806.gif 1KB Image download
41116_2023_36_Article_IEq809.gif 1KB Image download
41116_2023_36_Article_IEq811.gif 1KB Image download
41116_2023_36_Article_IEq813.gif 1KB Image download
41116_2023_36_Article_IEq815.gif 1KB Image download
41116_2023_36_Article_IEq816.gif 1KB Image download
41116_2023_36_Article_IEq817.gif 1KB Image download
41116_2023_36_Article_IEq818.gif 1KB Image download
41116_2023_36_Article_IEq115.gif 1KB Image download
41116_2023_36_Article_IEq116.gif 1KB Image download
41116_2023_36_Article_IEq117.gif 1KB Image download
40517_2023_256_Article_IEq4.gif 1KB Image download
40517_2023_256_Article_IEq5.gif 1KB Image download
Fig. 5 204KB Image download
40517_2023_256_Article_IEq11.gif 1KB Image download
Fig. 7 383KB Image download
40517_2023_258_Article_IEq121.gif 1KB Image download
Fig. 1 256KB Image download
Fig. 1 584KB Image download
Fig. 2 1027KB Image download
40517_2023_258_Article_IEq126.gif 1KB Image download
40517_2023_258_Article_IEq127.gif 1KB Image download
40517_2023_258_Article_IEq128.gif 1KB Image download
40517_2023_258_Article_IEq129.gif 1KB Image download
40517_2023_258_Article_IEq130.gif 1KB Image download
40517_2023_258_Article_IEq131.gif 1KB Image download
40517_2023_258_Article_IEq132.gif 1KB Image download
40517_2023_258_Article_IEq133.gif 1KB Image download
Fig. 1 592KB Image download
40517_2023_258_Article_IEq135.gif 1KB Image download
Fig. 8 517KB Image download
40517_2023_258_Article_IEq137.gif 1KB Image download
40517_2023_258_Article_IEq138.gif 1KB Image download
【 图 表 】

40517_2023_258_Article_IEq138.gif

40517_2023_258_Article_IEq137.gif

Fig. 8

40517_2023_258_Article_IEq135.gif

Fig. 1

40517_2023_258_Article_IEq133.gif

40517_2023_258_Article_IEq132.gif

40517_2023_258_Article_IEq131.gif

40517_2023_258_Article_IEq130.gif

40517_2023_258_Article_IEq129.gif

40517_2023_258_Article_IEq128.gif

40517_2023_258_Article_IEq127.gif

40517_2023_258_Article_IEq126.gif

Fig. 2

Fig. 1

Fig. 1

40517_2023_258_Article_IEq121.gif

Fig. 7

40517_2023_256_Article_IEq11.gif

Fig. 5

40517_2023_256_Article_IEq5.gif

40517_2023_256_Article_IEq4.gif

41116_2023_36_Article_IEq117.gif

41116_2023_36_Article_IEq116.gif

41116_2023_36_Article_IEq115.gif

41116_2023_36_Article_IEq818.gif

41116_2023_36_Article_IEq817.gif

41116_2023_36_Article_IEq816.gif

41116_2023_36_Article_IEq815.gif

41116_2023_36_Article_IEq813.gif

41116_2023_36_Article_IEq811.gif

41116_2023_36_Article_IEq809.gif

41116_2023_36_Article_IEq806.gif

41116_2023_36_Article_IEq803.gif

41116_2023_36_Article_IEq801.gif

41116_2023_36_Article_IEq799.gif

41116_2023_36_Article_IEq797.gif

Fig. 1

41116_2023_36_Article_IEq734.gif

41116_2023_36_Article_IEq731.gif

40517_2023_258_Article_IEq58.gif

Fig. 3

Fig. 1

41116_2023_36_Article_IEq726.gif

41116_2023_36_Article_IEq724.gif

Fig. 6

41116_2023_36_Article_IEq722.gif

41116_2023_36_Article_IEq720.gif

40517_2023_258_Article_IEq51.gif

41116_2023_36_Article_IEq718.gif

41116_2023_36_Article_IEq716.gif

41116_2023_36_Article_IEq714.gif

Fig. 2

40517_2023_258_Article_IEq46.gif

41116_2023_36_Article_IEq710.gif

41116_2023_36_Article_IEq708.gif

41116_2023_36_Article_IEq706.gif

41116_2023_36_Article_IEq702.gif

41116_2023_36_Article_IEq701.gif

40517_2023_258_Article_IEq40.gif

41116_2023_36_Article_IEq700.gif

41116_2023_36_Article_IEq699.gif

41116_2023_36_Article_IEq698.gif

41116_2023_36_Article_IEq696.gif

40517_2023_258_Article_IEq34.gif

41116_2023_36_Article_IEq692.gif

41116_2023_36_Article_IEq689.gif

41116_2023_36_Article_IEq684.gif

41116_2023_36_Article_IEq680.gif

Fig. 2

41116_2023_36_Article_IEq639.gif

41116_2023_36_Article_IEq633.gif

41116_2023_36_Article_IEq631.gif

41116_2023_36_Article_IEq627.gif

41116_2023_36_Article_IEq471.gif

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  文献评价指标  
  下载次数:10次 浏览次数:5次