From Text to Token: How Tokenization Pipelines Work

[!NOTE]
When you type a sentence into a search box, it’s easy to imagine the search engine seeing the same thing you do. In reality, search engines (or search databases) don’t store blobs of text, and they don’t store sentences. They don’t even store words in the way we think of them.

[!TIP] Source link: From Text to Token: How Tokenization Pipelines Work