SDV Enterprise Version 0.4.0

Release Date: August 15, 2023

This release improves the quality of your synthetic data and provides you with additional options for modeling your datasets.

:star: A new, faster multi-table synthesizer: The IndependentSynthesizer is our fastest synthesizer yet! Use it for modeling an unlimited number of tables in complex configurations. (For a full list of features and tradeoffs, see the SDV Synthesizer Guide.)

:globe_showing_americas: Contextual anonymization for phone numbers and emails. Understand the deeper meaning behind phone number and email data to anonymize the PII in a hyper-realistic way. For eg., match general geographical regions while obfuscating the precise, sensitive information.

:bar_chart: Evaluate outliers in your synthetic data. The previous release allowed you to model rare events. In this release, you can apply metrics to quantify the results and guard against common failure modes. See metrics for OutlierCoverage and SmoothnessSimilarity.

Additional Updates

  • Support for Python 3.11. You can now use the SDV Enterprise with any of the currently active versions of Python (3.8-3.11).
  • Create and apply custom constraints for ID columns (such as primary keys) as well as PII columns (such as phone numbers).
  • Improved performance and progress tracking during quality evaluation. This release fixed bugs that led to incorrect scores, crashes and repeated warning messages.