Online Archive of University of Virginia Scholarship
Evaluating the Importance of Demographic and Technical Factors in Creating Authentic-Sounding AI-Generated Human Voice Clones; Deepfakes and the Social Construction of Trust: A SCOT Analysis of Stakeholder Responses81 views
Author
Kaur, Baani, School of Engineering and Applied Science, University of Virginia
Advisors
Gerling, Gregory, EN-SIE, University of Virginia
Elliott, Travis, University of Virginia
Abstract
This project pairs engineering tests of AI voice-cloning tools with a social-science look at how deepfakes chip away at trust. My team and I built a library of authentic and cloned voices, then asked listeners to judge each clip’s realism; the same clips were also scored by a machine evaluation model (NISQA). Most clones still sounded less genuine than the authentic voices, but clones trained on short samples from young male speakers and mixed with background noise fooled listeners as often as the originals. NISQA scores did not track these perceptions, highlighting a gap between technical ratings and what people hear. Using the Social Construction of Technology framework, I compared how platforms, banks, courts, and the public frame the deepfake problem. Banks, facing direct losses, quickly adopted multi-factor security, while media firms, regulators, and courts still disagree on standards. The analysis suggests four steps toward closure: cryptographic watermarks tied to platform checks, early courtroom reliability hearings, annual audits of detection accuracy, and broad media-literacy programs. Technical insights guide where safeguards are most needed, and social insights show how they might gain acceptance.
Degree
BS (Bachelor of Science)
Keywords
Deepfake; Artificial Intelligence; AI Voice Cloning; Banking; Fraud; Authenticity perception; NISQA evaluation
Notes
School of Engineering and Applied Science
Bachelor of Science in Systems Engineering
Technical Advisor: Gregory Gerling
STS Advisor: Travis Elliott
Technical Team Members: Rhea Agarwal, Drake Ferri, Vishnu Lakshmanan, Padma Lim, Fahima Mysha
Language
English
Rights
All rights reserved (no additional license for public reuse)
Kaur, Baani. Evaluating the Importance of Demographic and Technical Factors in Creating Authentic-Sounding AI-Generated Human Voice Clones; Deepfakes and the Social Construction of Trust: A SCOT Analysis of Stakeholder Responses. University of Virginia, School of Engineering and Applied Science, BS (Bachelor of Science), 2025-05-09, https://doi.org/10.18130/vcdb-4617.