SDV Enterprise Version 0.3.0

Release Date: July 18, 2023

In this release, we’re adding features that improve synthetic data quality along with bug fixes and security recommendations.

:backhand_index_pointing_right: Improve synthetic data cardinality. The HSASynthesizer will now create synthetic data that conforms to the real multi-table patterns. For example, you can synthesize data with an exact 1-1 relationship between entities. Another common case is when a parent entity must have at least 1 child.

:shooting_star: Model and recreate rare events. You now have more options to synthesize rare events that closely resemble the real data. Use the UniformEncoder to capture imbalanced categories and the OutlierEncoder to identify outlier data points.

:performing_arts: Anonymize PII more realistically. In your real data, some records may have empty or missing PII attributes. You synthetic data will now include missing values in the correct proportions, leading to more realistic anonymization.

Additional Updates

  • Dropping support for Python 3.7. This version has officially reached its end-of-life and the Python organization will no longer update its security. For your safety, we recommend upgrading Python to 3.8 or above.
  • Bug fixes for datetime columns. We fixed issues that you may have encountered if you stored datetime information as integers.