Combining two columns, include code table for reference only and synthesize subset of table

Which software are you using? SDV Enterprise

Software Details SDV 0.44, Python 3.13

Description

Hi SDV team, I have a couple of questions.

  1. How can I combine two or more columns to generate the value of another column when data is synthesized?
    For example, the Employee table has Name, EmployeeID and SalaryGrid columns. Another column, EmployeeKey (the primary key), is a combination of these three columns in the format ‘Name EmployeeID ! SalaryGrid’, for example ‘John 151 ! 5’. How do I define this relationship in the metadata?
  2. How can I include a code table for reference only and define foreign keys? For example, the Address table has a Country column, which is a foreign key to the Countries table. The Countries table has two columns, CountryID and CountryName. How can I avoid synthesizing the Countries table, include it only as a reference for the Address table, and keep the CountryID values in the Countries table intact?
  3. How can I synthesize only a subset of a table's columns? For example, the Employee table has the columns EmployeeID, FirstName, Surname, Address and StartDate. How can I synthesize only FirstName and Surname and merge the synthetic data back into the table?
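
To make questions 1 and 3 concrete, here is a minimal sketch in plain Python of the behaviour I am after. It makes no SDV calls: the row values, `build_employee_key` and `merge_synthetic_columns` are hypothetical names made up for illustration, and matching real and synthetic rows by position is an assumption.

```python
# Hypothetical rows standing in for the Employee table from question 1.
employees = [
    {"Name": "John", "EmployeeID": 151, "SalaryGrid": 5},
    {"Name": "Asha", "EmployeeID": 152, "SalaryGrid": 7},
]


def build_employee_key(row):
    """Derive EmployeeKey in the format 'Name EmployeeID ! SalaryGrid'."""
    return f"{row['Name']} {row['EmployeeID']} ! {row['SalaryGrid']}"


def merge_synthetic_columns(real_rows, synthetic_rows, columns):
    """Return copies of real_rows with only `columns` replaced by synthetic values.

    Rows are paired by position, which is an assumption of this sketch.
    """
    if len(real_rows) != len(synthetic_rows):
        raise ValueError("real and synthetic tables must have the same number of rows")
    merged = []
    for real_row, synthetic_row in zip(real_rows, synthetic_rows):
        new_row = dict(real_row)  # copy so the original table is left untouched
        for column in columns:
            new_row[column] = synthetic_row[column]
        merged.append(new_row)
    return merged


# Question 1: recompute the key after the other columns have been produced.
keyed_employees = [dict(row, EmployeeKey=build_employee_key(row)) for row in employees]

# Question 3: hypothetical real rows and hypothetical synthetic name columns.
real_staff = [
    {"EmployeeID": 151, "FirstName": "John", "Surname": "Smith", "StartDate": "2020-01-06"},
    {"EmployeeID": 152, "FirstName": "Asha", "Surname": "Patel", "StartDate": "2021-03-15"},
]
synthetic_names = [
    {"FirstName": "Maria", "Surname": "Lopez"},
    {"FirstName": "Chen", "Surname": "Wu"},
]
merged_staff = merge_synthetic_columns(real_staff, synthetic_names, ["FirstName", "Surname"])
```

The key is rebuilt from its source columns rather than generated on its own, so it can never disagree with them. Ideally the metadata or a constraint would express the same rule so that this post-processing step is not needed.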

Thanks a lot!

For additional resources see the API Docs.

I see the Constraint API has a ReferenceTable constraint that is currently in Beta. When is it expected to be released? Is there a workaround?
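
The kind of workaround I have in mind is a post-processing check like the sketch below. It is plain Python and does not use the ReferenceTable constraint or any other SDV call: `split_by_reference` is a hypothetical helper, and the rows are made-up examples. The Countries table is only read, never modified.

```python
# Hypothetical reference (code) table; it is kept as-is and never synthesized.
countries = [
    {"CountryID": 1, "CountryName": "Canada"},
    {"CountryID": 2, "CountryName": "Norway"},
]

# Hypothetical synthetic Address rows; Country should point at a CountryID.
synthetic_addresses = [
    {"AddressID": 10, "Country": 1},
    {"AddressID": 11, "Country": 9},  # 9 does not exist in countries
    {"AddressID": 12, "Country": 2},
]


def split_by_reference(child_rows, fk_column, reference_rows, pk_column):
    """Split child rows into those whose foreign key exists in the reference table
    and those whose foreign key does not."""
    valid_ids = {row[pk_column] for row in reference_rows}
    valid, invalid = [], []
    for row in child_rows:
        if row[fk_column] in valid_ids:
            valid.append(row)
        else:
            invalid.append(row)
    return valid, invalid


valid_addresses, invalid_addresses = split_by_reference(
    synthetic_addresses, "Country", countries, "CountryID"
)
```

This only detects synthetic rows that break the foreign key after the fact; it does not stop them from being generated, which is what I would expect the constraint to handle.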

Hi @rizwan, I have started a separate thread for each of these questions. This will allow us to better organize the responses around each question/theme and ensure we don’t accidentally skip answering any of them. Please see the separated questions (and my responses) below:

You can reply in each of the threads independently, as I expect there may be some back-and-forth for a few of these questions. Thanks!