PARALLEL COMPUTING ALGORITHMS FOR TANDEM
dc.contributor.advisor | Wu, Fangxiang | en_US |
dc.contributor.committeeMember | Zhang, Chris | en_US |
dc.contributor.committeeMember | Kusalik, Tony | en_US |
dc.contributor.committeeMember | Teng, Daniel | en_US |
dc.creator | Sun, Jian | en_US |
dc.date.accessioned | 2013-07-31T12:00:11Z | |
dc.date.available | 2013-07-31T12:00:11Z | |
dc.date.created | 2013-04 | en_US |
dc.date.issued | 2013-07-30 | en_US |
dc.date.submitted | April 2013 | en_US |
dc.description.abstract | Tandem mass spectrometry, also known as MS/MS, is an analytical technique to measure the mass-to-charge ratio of charged ions and widely used in genomics, proteomics and metabolomics areas. There are two types of automatic ways to interpret tandem mass spectra: de novo methods and database searching methods. Both of them need to use massive computational resources and complicated comparison algorithms. The real-time peptide-spectrum matching (RT-PSM) algorithm is a database searching method to interpret tandem mass spectra with strict time constraints. Restricted by the hardware and architecture of an individual workstation the RT-PSM algorithm has to sacrifice the level of accuracy in order to provide prerequisite processing speed. The peptide-spectrum similarity scoring module is the most time-consuming part out of four modules in the RT-PSM algorithm, which is also the core of the algorithm. In this study, a multi-core computing algorithm is developed for individual workstations. Moreover, a distributed computing algorithm is designed for a cluster. The improved algorithms can achieve the speed requirement of RT-PSM without sacrificing the accuracy. With some expansion, this distributed computing algorithm can also support different PSM algorithms. Simulation results show that compared with the original RT-PSM, the parallelization version achieves 25 to 34 times speed-up based on different individual workstations. A cluster with 240 CPU cores could accelerate the similarity score module 210 times compare with the single-thread similarity score module and the whole peptide identification process 85 times compare with the single-thread peptide identification process. | en_US |
dc.identifier.uri | http://hdl.handle.net/10388/ETD-2013-04-1115 | en_US |
dc.language.iso | eng | en_US |
dc.subject | real-time peptide-spectrum matching (RT-PSM) algorithm | en_US |
dc.subject | tandem mass spectrum | en_US |
dc.subject | parallel computing algorithm | en_US |
dc.subject | multi-core computing algorithm | en_US |
dc.subject | distributed computing algorithm | en_US |
dc.subject | peptide identification | en_US |
dc.title | PARALLEL COMPUTING ALGORITHMS FOR TANDEM | en_US |
dc.type.genre | Thesis | en_US |
dc.type.material | text | en_US |
thesis.degree.department | Biomedical Engineering | en_US |
thesis.degree.discipline | Biomedical Engineering | en_US |
thesis.degree.grantor | University of Saskatchewan | en_US |
thesis.degree.level | Masters | en_US |
thesis.degree.name | Master of Science (M.Sc.) | en_US |