There is no such thing as a tokenizer-free lunch Article • catherinearnett • Sep 25, 2025 • 98
Detecting Pretraining Data from Large Language Models Paper • 2310.16789 • Published Oct 25, 2023 • 11