What's the chance that these limericks are now in the training set? As others mention, it'd be interesting to come up with a way to synthesize something sufficiently novel that it always evades training fit.
One could also programmatically modify the dataset (e.g., with nltk or spaCy, replacing nouns, named entities, etc.), even to the point that every test run is unique.
You could also throw in vector similarity if you wanted the replacement words to stay semantically close (synonyms) or deliberately distant (antonyms).
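A minimal sketch of the seeded-perturbation idea. To keep it self-contained it uses a toy hardcoded substitution table instead of real POS tagging or NER, which you'd get from spaCy or nltk in practice; the word lists and function name are illustrative, not from any library.

```python
import random
import re

# Toy substitution table. In a real pipeline you'd use spaCy's POS
# tagger or NER to find nouns/entities rather than a fixed list,
# and vector similarity to pick near-synonyms as replacements.
NOUN_SWAPS = {
    "cat": ["dog", "parrot", "ferret"],
    "Nantucket": ["Omaha", "Tacoma", "Reno"],
    "bucket": ["kettle", "wagon", "basket"],
}

def perturb(text: str, seed: int) -> str:
    """Replace known nouns with seeded-random alternatives, so each
    seed yields a distinct but reproducible variant of the text."""
    rng = random.Random(seed)
    pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in NOUN_SWAPS) + r")\b")
    return pattern.sub(lambda m: rng.choice(NOUN_SWAPS[m.group(0)]), text)

limerick = "There once was a cat from Nantucket, who kept all its fish in a bucket."
print(perturb(limerick, seed=1))
print(perturb(limerick, seed=2))
```

Seeding keeps runs reproducible for debugging while still letting every fresh seed generate a variant the model has never seen verbatim.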