[Resolved] Num_row_per_table=1000 DayZSynthesizer Multitable

epicvu · March 18, 2024, 9:22am

Hi!
I have a little issue while trying to set my number of rows per table when using the MultiTable DayZSynthesizer. I get a little message whenever I try to force a number (the default settings works just fine) but not when I want a specific number of rows for all the tables.

{
“name”: “AttributeError”,
“message”: “‘int’ object has no attribute ‘get’”,
“stack”: "---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
Cell In[27], line 1
----> 1 synthetic_data = synthesizer.sample(num_rows_per_table=1000)

File packaging\\sdv_enterprise\\sdv\\multi_table\\dayz\\day_zero.pyx:75, in sdv_enterprise.sdv.multi_table.dayz.day_zero.expirable.wrapper()

File packaging\\sdv_enterprise\\sdv\\multi_table\\dayz\\day_zero.pyx:223, in sdv_enterprise.sdv.multi_table.dayz.day_zero.DayZSynthesizer.sample()

AttributeError: ‘int’ object has no attribute ‘get’"
}
I get this message I wanted to know if I did something wrong.
For the context I wrote this piece of code

synthetic_data = synthesizer.sample(num_rows_per_table=1000)

You can try on any metadata.
Thanks!

Srini · March 18, 2024, 2:54pm

Hey Charles, it looks like you stumbled into a bug! That function and parameter combo is supposed to work an integer to drive the sampling across all of your tables. I opened an issue internally to have the team investigate and fix.

For now as a workaround, I recommend using num_rows instead:

from sdv.multi_table import DayZSynthesizer
from sdv.datasets.demo import download_demo

data, metadata = download_demo(
    modality='multi_table',
    dataset_name='fake_hotels'
)

guests_table = data['guests']
hotels_table = data['hotels']

synthesizer = DayZSynthesizer(metadata)
synthetic_data = synthesizer.sample(num_rows=1000)

This will synthesize 1000 rows from each table and conform to the metadata.

epicvu · March 18, 2024, 3:11pm

Thanks a lot, I’ll use the recommended parameters !

Srini · March 18, 2024, 5:36pm

After some more digging, it looks like our documentation here is a bit confusing and the error message that’s returned isn’t that helpful!

The num_rows_per_table parameter expects you to pass in a dictionary of values, while num_rows will accept an integer value.

So the library is working as intended but I thought I’d clarify some things further to help you better understand the mental model here.

So there are 2 ways of specify how much data you want synthesizes using DayZSynthesizer.sample() and you can use these parameters together if you want:

the num_rows parameter lets you define the number of rows (as an integer) you want as the default amount for all tables
the num_rows_per_table parameter lets you specify the number of rows you want specifically at the table level

So there are a few different ways to use these:

Specify a uniform # of rows for all tables

# 1000 rows from every table
synthetic_data = synthesizer.sample(num_rows=1000)

Synthesize a default # of rows, but provide specific guidance for some tables

The following will generate 1000 rows like the hotels table but only 10 rows like the guests table. If you had more tables than these 2, then 1000 rows would be synthesized like those ones too since it’s the default value!

# 1000 rows from every table (as a default)
sampling_dict = {'guests': 10}
synthetic_data = synthesizer.sample(num_rows=1000, num_rows_per_table=sampling_dict)

Choose specific row counts for every table

This will synthesize 15 rows that resemble the guests table and 15,000 rows that resemble the hotels table.

sampling_dict = {'guests': 15, 'hotels': 15_000 }
synthetic_data = synthesizer.sample(num_rows_per_table=sampling_dict)

epicvu · March 19, 2024, 8:36am

Hi, thanks for the clarifications! I was following the documentation and you could see at the top the sample with num_rows_per_table=1000. Probably a typo
Once again thanks for the help !

Srini · March 19, 2024, 2:18pm

Updated first code snippet in the documentation to reflect this (may need to clear cache). Thanks for the feedback!

singhe · April 9, 2026, 3:40pm

Hi Srini,
Do this num_row_per_table and num_rows parameters work with DayZSynthesizer only?

neha · April 21, 2026, 7:08pm

Hi @singhe, assuming this is in reference to your other question. There is no way to selective scale up specific tables during sampling like this. We recommending using the ReferenceTable constraint.

(FYI this particular thread is over 2 years old. The DayZSynthesizer now uses the same scale parameter as all other multi-table synthesizers.)

Topic		Replies	Views
Scaling specific tables in a multi-table Synthetic Data Vault (SDV) model Synthetic Data Creation	5	53	May 12, 2026
DayZSynthesizer get_parameters() Synthetic Data Creation feature-request	3	15	March 20, 2024
Synthesizing a subset of a table Synthetic Data Creation	3	21	April 14, 2026
Multi-table models should support sequential data Synthetic Data Creation multi-table , sequential , feature-request	0	15	June 6, 2024
Generating complex relationships in multi-table data Synthetic Data Creation multi-table	1	28	June 21, 2024

[Resolved] Num_row_per_table=1000 DayZSynthesizer Multitable

Related topics