Computational Linguist
Congratulations on making it to the last post in making a tokenizer. Let’s jump right in and bring in our...
Grammar So as mentioned in the first post, the grammar is where rules are specified. These grammatical rules will check...
Exceptions So let’s get started making the exceptions.py file. As you can see, this is pretty basic. It’s just a Python...
Introduction So in this post, we’ll start making a simple English tokenizer in pure Python. Simple in the sense that...
Introduction Tokenization is an important step in the NLP pipeline. It is often part of the text normalization process. Many...
Internationalization & Localization in Python In this post, we’ll look briefly into Internationalization and Localization in Python using the GNU...
Normalizing Text with Regex Groups in Python In this post we’re going to look at how regex groups can help...
Plotting Vowel Formants In this tutorial, I’ll demonstrate how to create a basic vowel plot using the ggplot2 library in...