Dartmouth College Undergraduate Theses

The Next Generation of EMPRESS: A Metadata Management System For Accelerated Scientific Discovery at Exascale

Margaret R. Lawson, Dartmouth College

Date of Award

6-1-2018

Document Type

Thesis (Undergraduate)

Department or Program

Department of Computer Science

First Advisor

Charles Palmer

Abstract

Scientific data sets have grown rapidly in recent years, outpacing the growth in memory and network bandwidths. This I/O bottleneck has made it increasingly difficult for scientists to read and search outputted datasets in an attempt to find features of interest. In this paper, we will present the next generation of EMPRESS, a scalable metadata management service that offers the following solution: users can "tag" features of interest and search these tags without having to read in the associated datasets. EMPRESS provides, in essence, a digital scientific notebook where scientists can write down observations and highlight interesting results, and an efficient way to search these annotations. EMPRESS also provides storage-system independent physical metadata, providing a portable way for users to read both metadata and the associated data. EMPRESS offers scalability through two different deployment modes: "local", which runs on the compute nodes and "dedicated," which uses a set of dedicated, shared-nothing servers. EMPRESS also provides robust fault tolerance and transaction management, which is crucial to supporting workflows.

Comments

Originally posted in the Dartmouth College Computer Science Technical Report Series, number TR2018-846.

Recommended Citation

Lawson, Margaret R., "The Next Generation of EMPRESS: A Metadata Management System For Accelerated Scientific Discovery at Exascale" (2018). Dartmouth College Undergraduate Theses. 129.
https://digitalcommons.dartmouth.edu/senior_theses/129

Download

Included in

Computer Sciences Commons

COinS

Dartmouth College Undergraduate Theses

The Next Generation of EMPRESS: A Metadata Management System For Accelerated Scientific Discovery at Exascale

Date of Award

Document Type

Department or Program

First Advisor

Abstract

Comments

Recommended Citation

Included in

Browse

Search

Contribute

Questions?

Dartmouth College Undergraduate Theses

The Next Generation of EMPRESS: A Metadata Management System For Accelerated Scientific Discovery at Exascale

Author

Date of Award

Document Type

Department or Program

First Advisor

Abstract

Comments

Recommended Citation

Included in

Share

Browse

Search

Contribute

Questions?