Date of Award


Document Type

Thesis (Undergraduate)


Department of Computer Science

First Advisor

Metin Akay


The abundance of genomic data now available has prompted a search for new algorithms and analysis techniques to interpret it. In this two-part study, we explore techniques for locating critical amino acid residues in protein sequences and for estimating the similarity between proteins. We demonstrate the use of the Short-Time Fourier Transform and the Continuous Wavelet Transform, together with amino acid hydrophobicity, in locating important amino acid domains in proteins, and we show that the Kolmogorov-Smirnov statistic can be used as a metric of protein similarity.


Originally posted in the Dartmouth College Computer Science Technical Report Series, number TR2002-431.