Synthetic Data Generation: Generating High-Utility Synthetic Parcel Data; Applications and Implications of Synthetic Data

Jiang, Janessa

Synthetic Data Generation: Generating High-Utility Synthetic Parcel Data; Applications and Implications of Synthetic Data 346 views

Author

Jiang, Janessa, School of Engineering and Applied Science, University of Virginia 0000-0003-0184-2755

Advisors

Elliott, Travis , EN-Engineering and Society , University of Virginia
Baritaud, Catherine , EN-Engineering and Society , University of Virginia
Graham, Daniel , EN-Comp Science Dept , University of Virginia

Abstract

As technological advancements occur at an exponential rate, the demand for data also increases. The spread of information through online platforms has also raised concerns about data privacy. To address scarcity and privacy concerns, synthetic data has been gaining popularity and acceptance in various fields. Synthetic data is algorithmically-created and serves as an alternative to real-world data. Due to the applicability of this technology, the impact of synthetic data will continue to grow as the field of artificial intelligence flourishes. An example of data that requires anonymity is customer personal information. At Amazon Web Services, customer billing data and usage statistics are bundled into packages called parcels. Data scarcity and risk of re-identification of anonymized data were issues related to using real parcel data for testing. The technical report follows the research, process, and challenges faced while generating high-utility synthetic data to fit business needs. The STS research paper further explores applications of synthetic data in addition to the social, ethical, and environmental implications of this technology. Synthetic data usage comes with significant risks related to bias, lack of consistent legislation, and environmental impacts.

Degree

BS (Bachelor of Science)

Keywords

artificial intelligence; synthetic data

Language

English

Rights

Issued Date

2022-05-06

Suggested Citation

Jiang, Janessa. Synthetic Data Generation: Generating High-Utility Synthetic Parcel Data; Applications and Implications of Synthetic Data. University of Virginia, School of Engineering and Applied Science, BS (Bachelor of Science), 2022-05-06, https://doi.org/10.18130/4hse-s572.

Files

Jiang_Janessa_Prospectus.pdf

Downloads: 469

Download

Jiang_Janessa_STS_Research_Paper.pdf

Downloads: 716

Download

Jiang_Janessa_Sociotechnical_Synthesis.pdf

Downloads: 126

Download

Jiang_Janessa_Technical_Report.pdf

Downloads: 145

Download

Synthetic Data Generation: Generating High-Utility Synthetic Parcel Data; Applications and Implications of Synthetic Data 346 views

Author

Advisors

Abstract

Degree

Keywords

Language

Rights

Issued Date

Persistent Link

Suggested Citation

Files