Date of Award

5-1-2002

Document Type

Thesis (Undergraduate)

Department or Program

Department of Computer Science

First Advisor

Metin Akay

Abstract

The plethora of genomic data currently available has resulted in a search for new algorithms and analysis techniques to interpret genomic data. In this two-fold study we explore techniques for locating critical amino acid residues in protein sequences and for estimating the similarity between proteins. We demonstrate the use of the Short-Time Fourier Transform and the Continuous Wavelet Transform together with amino acid hydrophobicity in locating important amino acid domains in proteins and also show that the Kolmogorov-Smirnov statistic can be used as a metric of protein similarity.

Comments

Originally posted in the Dartmouth College Computer Science Technical Report Series, number TR2002-431.

COinS