Incenjucar
Legend
This is incorrect. They can return exact sequences of tokens which are highly represented in the training data. You can get "tomorrow, and tomorrow, and tomorrow" out, because it is one of the most famous passages in English literature. You can't get The House on the Borderland, because it is more obscure.
It just takes more work to filter out. It's a matter of the ratio of needles to hay. The relationships between tokens are the record of what was written. In my puzzle analogy, training just mixes lots of puzzles together, which makes the less common puzzles harder to put back together.