In last month’s blog, I talked about sentiment analysis for social media analysis in the field of computational linguistics, which takes human language and translates it so… Read more “Natural Language Processing: N-gram Extraction for World Cup”
Four-Year Anniversary Blog: Sentiment Analysis
Today I celebrate the four-year anniversary of my blog that tries to explain key data science ideas in plain language and socialize what continues to make its… Read more “Four-Year Anniversary Blog: Sentiment Analysis”
Artificial Neural Networks for Predicting Coffee Rust Case Study
After looking at the nuts and bolts of natural language processing in my last blog, today I want to look at how artificial neural networks (ANNs) can… Read more “Artificial Neural Networks for Predicting Coffee Rust Case Study”
NLP Punctuation, Lower-Case and StopWords Pre-Processing
In my March blog, I explained how to use the stemming technique in Natural Language Processing (NLP) to predict whether a particular Tweet could be geolocated to… Read more “NLP Punctuation, Lower-Case and StopWords Pre-Processing”
NLP Stemming
In my February blog, I explained how to use the tokenization technique in Natural Language Processing (NLP) to predict whether a particular Tweet could be geolocated to… Read more “NLP Stemming”
NLP Tokenization
Image from aio-tv Today I want to continue looking at machine learning case studies for beginners and in particular, the use of tokenization in natural language processing.… Read more “NLP Tokenization”
Missing Data with k-Nearest Neighbor Imputation
In today’s blog, I want to give a case study of using k-Nearest Neighbor (kNN) imputation to fill in missing data. About a year ago, I talked… Read more “Missing Data with k-Nearest Neighbor Imputation”
Machine Learning Algorithm Case Study 4: Spearman’s Dimensionality Reduction
In a September 2018 blog, I talked about a K-means clustering case study of cyber profiling in Indonesia. Today I want to continue that discussion by giving… Read more “Machine Learning Algorithm Case Study 4: Spearman’s Dimensionality Reduction”
The Data Scientist Clarifies the Question – Dengue Data Search
Originally posted on The Data Lass:
One of the first steps in the Data Science process is identifying what data you need to answer the question. In…
The Data Scientist Clarifies the Question
One of the first steps in the Data Science process is identifying what data you need to answer the question. In March 2017, I featured a series… Read more “The Data Scientist Clarifies the Question”