Open question. The instinct is that static synthetic datasets are mostly local objects — they fit the fact pattern they were generated against and strip a lot of their value when handed to someone else's problem. The useful artifact may be the generation contract, not the dataset. But I haven't watched enough hand-offs go right or wrong to be sure where the line is.
questionMay 9, 2026
Are static synthetic datasets actually useful to share?
If a synthetic dataset is tuned to a specific fact pattern, what survives when someone else picks it up for their own?