Release Date: March 26, 2024
This release covers a range of features that allow you to more easily get started with SDV and customize it for your needs.
Clean your data to create referential integrity. If your real multi-table data contains missing or unknown references, you can now use SDV’s utility functions to clean it up. SDV expects and guarantees referential integrity in the data – real and synthetic.
Update metadata in bulk. High quality metadata makes for high quality synthetic data … but what if it’s taking too long to update your metadata? Use our new bulk update features to make changes faster, and get your synthetic data sooner.
Even more anonymization options. Control the amount of anonymized PII data you want to create in synthetic data, and whether it should repeat. Supply a cardinality rule to let SDV know whether to fake unique values or repeated anonymized data.
Additional updates
- We’ve improved our error messaging around invalid foreign keys, column relationships, and constraints to better help you understand the issues and debug them.
- We’ve consolidated the way you can retrieve parameters from your synthesizers.