Date of Award

2026

Document Type

Thesis (Master's)

Department or Program

Computer Science

First Advisor

SouYoung Jin

Second Advisor

Nikhil Singh

Third Advisor

Adam Breuer

Abstract

Large-scale image datasets frequently contain identifiable or sensitive content, raising privacy risks when training models that may memorize and leak such information. We present Unsafe2Safe, a fully automated pipeline that detects privacy-prone images and rewrites only their sensitive regions using multimodally guided diffusion editing. Unsafe2Safe operates in two stages. Stage 1 uses a vision-language model to (i) inspect images for privacy risks, (ii) generate paired private and public captions that respectively include and omit sensitive attributes, and (iii) prompt a large language model to produce structured, identity-neutral edit instructions conditioned on the public caption. Stage 2 employs instruction-driven diffusion editors to apply these dual textual prompts, producing privacy-safe images that preserve global structure and task-relevant semantics while neutralizing private content. To measure anonymization quality, we introduce a unified evaluation suite covering Quality, Cheating, and Utility dimensions. Across Caltech101 and MIT Indoor67, Unsafe2Safe reduces face similarity, text similarity, and demographic predictability by large margins, while maintaining downstream model accuracy comparable to training on raw data. Fine-tuning diffusion editors on our automatically generated triplets (private caption, public caption, edit instruction) further improves both privacy protection and semantic fidelity. Unsafe2Safe provides a scalable, principled solution for constructing large, privacy-safe datasets without sacrificing visual consistency or downstream utility.
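To make the two-stage flow described above concrete, the sketch below shows one way such a pipeline could be orchestrated. It is a minimal illustration only: the function and class names (StageOneOutput, stage_one, stage_two, unsafe2safe) and the stubbed captions are hypothetical placeholders, not the thesis's actual interfaces, and the model calls are stubbed out.

```python
# Illustrative sketch of a two-stage anonymization flow (assumed names, not the thesis API).
from dataclasses import dataclass


@dataclass
class StageOneOutput:
    is_private: bool        # whether the image was flagged as privacy-prone
    private_caption: str    # caption that includes sensitive attributes
    public_caption: str     # caption that omits sensitive attributes
    edit_instruction: str   # identity-neutral edit instruction for the editor


def stage_one(image) -> StageOneOutput:
    """Stage 1 (sketch): a vision-language model inspects the image for privacy
    risks and produces paired private/public captions; a language model then
    turns the public caption into a structured edit instruction.
    Stubbed here with fixed strings for illustration."""
    return StageOneOutput(
        is_private=True,
        private_caption="A person with a readable name badge stands in an office",
        public_caption="A person stands in an office",
        edit_instruction="Replace the face and badge text with generic, "
                         "non-identifying content; keep pose and background",
    )


def stage_two(image, out: StageOneOutput):
    """Stage 2 (sketch): an instruction-driven diffusion editor rewrites only
    the sensitive regions, conditioned on the dual prompts. Stubbed as a no-op;
    a real implementation would invoke a diffusion editing model here."""
    if not out.is_private:
        return image  # nothing sensitive detected; keep the original image
    # e.g. edited = diffusion_editor(image, prompt=out.public_caption,
    #                                instruction=out.edit_instruction)
    return image


def unsafe2safe(image):
    """End-to-end pipeline: detect and describe, then edit."""
    return stage_two(image, stage_one(image))
```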
