Need better organic traffic? Find the most important keyword topics with cluster analysis.
Let’s say someone is sending you a file with 500,000 keywords to create an SEO content strategy. Understanding the status quo clustering techniques, help you better understand what the main searches are.
Want a head start? N-Gram Analysis of your keywords will be perfect for your next project. N-Gram analysis of your keywords are also a great starting point.
Grouping your keywords with cluster algorithms will give you the main ideas of keyword topics. When you put these keyword clusters in hierarchical order, you are able to build an SEO content strategy with big main topics and linked subtopics. Thanks to cluster algorithms, you won’t spend much time and effort. Besides, you’ll get quick results compared to other keyword research approaches like powerful keyword tagging.
First, you should bring your keywords to lowercase and remove special characters with some RegEx operations. Split your full keyword into single words and use stemming functions like Porter or Snowball Stemmer. It’ll reduce every word to their root form and makes it possible to group similar words.
Cluster algorithms run on numeric data. For that reason, we have to transform all keywords into a vector space. Scikit has modules that will do most of the job. We recommend using TfidfVectorizer for the feature extraction from your keywords. It’s using Tf-Idf numbers as weightings for each word within your vector.
You can choose between a lot of different clustering algorithms. K-means clustering is one of the most powerful methods. How can you figure out the number of clusters you need to create? No worries, the “K” stands for it. Sometimes estimating the number of clusters can be difficult. To avoid it, you can use approaches like DBSCAN and tweak some parameters here.
In the end, it’s all about using the results of the cluster analysis for actions. For some use cases, exporting the full keywords to a CSV file is fine. Put the cluster name in one column, and concatenate all assigned keywords with a delimiter together. Then show them in the second column. If you like visualizations more, you can create a word cloud per cluster and arrange to show them next to each other.
Probably, you think Google Autosuggest is the first step to create various keywords. It’s just one method you can try. Feel free to have a look at our free Google Autosuggest tool to scrape keywords. For sure, finding relevant keywords to Autocomplete will positively impact your performance. Otherwise, the user experience would be bad within the search engine. We always wonder why many queries never appear when you look at the search volume of a keyword. Time to think the significance of Google Autocomplete.
It’s a great source if you use the existing queries within your Google Ads account. Especially for SEO people, it’s difficult to quantify a keyword, based on business value. Don’t worry. We can achieve it by using conversion metrics in paid search. You’ll realize that the focus on search volume isn’t the best way. There will be many hidden long-tail keywords when it comes to a transaction-based perspective. Optimize for these keywords to increase the value of your traffic.
The Google Keyword Planner is another good resource for getting new keywords. You can access it with ease from your Google Ads account. It will help you find new keywords to spend money on. Of course, you can also try it for your SEO projects. A nice recent add-on was the “refine” keywords option. Google is clustering the suggested keywords by entities they are recognized. Besides, they are in a generic way that fits all businesses out there.
Google Trends is another great source for free keyword ideas. There is no official API available, but with the Python module Pytrends, you can query Google Trends in an automated way. It allows you to identify trending search queries for your business. If you know these trends, you can be the first person who provides the perfect content pages to match those queries. The competition is probably lower, and easier to get some organic rankings.
Cluster your SEO or PPC Keywords with this Python script with less than 50 lines of code. The basic concepts are used:
Cluster your keywords semantically by making use of this simple Python script.