The practical summary of Latent Dirichlet Allocation for SEO
What’s the hottest trend in search engine optimization that you’ve never heard of (yet)? The folks over at SEOmoz have been doing a great series on Latent Dirichlet Allocation (LDA), which is a context-based algorithm for determining search relevance. Their research has shown strong correlation between LDA and search rankings. However, it’s little things like this:

Photo credit: SEOMoz
…that make people flee in terror from LDA, and who can blame them?
So here’s what you need to know about LDA as it relates to search engine optimization:
Your content has to be about something and worth reading.
Huge surprise, huh? Google has said for years that its stated aim is to get search engine rankings in alignment with “human rankings” – that is, if the content is valuable to a human being, it should rank well. If the content isn’t valuable to a human being, then it should rank poorly. For years, Google has used PageRank and inbound links as proxies for judging the value of content, but now there’s a theory in the SEO community, supported by the SEOmoz data, that on-page content may play more of a role in your rankings than previously thought.
What makes this different from the early years of SEO is that it’s somewhat harder to game. Instead of simple on-page optimization tricks that Google can devalue quickly (bold text, H1 titles, etc), the LDA algorithm looks at the total picture of the content and its context. Does a web page talking about World of Warcraft mention paladins, death knights, and fish feasts, or is it just badly repurposed, valueless content surrounded by gold spam ads?
So how do you make use of this knowledge? Here are three immediate to-do tasks:
1. Make sure you are using the rel=canonical tag.
Use this tag in your web site, blog, and any place where you have ownership of your content. As more and more algorithms are tuned to contextual content, the reward of ripping off someone else’s content will be much greater, so using this tag will help at least assign some level of ownership to stuff you write. If you’re using WordPress, the All In One SEO plugin will do this for you automagically. Want to learn more about this tag? Read what Matt Cutts of Google has to say about it.
2. Make your web site about something.
A personal blog is fine to be all over the place, one day talking about cooking, the next day talking about Twitter, etc. A professional blog and/or your corporate web site has to be about something and needs to have lots of original, high quality, on-topic content using semantically related words in the copy that correlate to the search terms you’re going after. For example, Blue Sky Factory’s new web site (shiny!) has a TON of new content that talks about email marketing in all of its various aspects, using all of the different ways people talk about it. You can’t get away with two sentence pages and minimally valuable content any more – you have to do the hard work of creating good stuff in order to leverage this algorithm effectively. That’s why we’re seeing strong correlations between the LDA algorithm and Google’s results – Google wants to continue rewarding valuable content and making life harder for lazy SEO folks.
3. Stop feeding the social media machine all your stuff.
This one will be controversial but true. It’s perfectly okay to have conversations, to engage, to be interesting on Twitter, Facebook, LinkedIn, etc. but I want you to stop putting your best stuff there in full. Why? Because this algorithm is all about quality AND quantity of content. If your blog or website is gathering dust while your Facebook page is bursting at the seams, you’re doubly harming yourself. Not only are you making yourself dependent on an entity that doesn’t give a rat’s ass about you, but you’re penalizing your own web site/blog by not having context- sensitive information on it. Keep sharing, keep linking, keep conversing, but don’t give the keys to your kingdom – your content – to the social media sites. Excerpts? Fine. Full blog posts? Not so fine. Teasers from eBooks? Fine. Large chunks of copy? Not so fine.
Is LDA a game-changer as many say? Maybe it is, maybe it isn’t, but if you follow the practice of creating lots of original, great stuff on properties you own, you’ll never go wrong.
If you enjoyed this, please click here and share it with your network!
Want to read more like this from Christopher Penn? If so, please subscribe right now!
Click here to read my blog on Google Currents on your mobile!
Marketing White Belt |
Watch me speak:
Attend virtually! |
I recommend:![]() for Twitter audience building. |
Optimization demands exploration
Optimize, optimize, optimize. The creed of the day. Search engine optimization. Email marketing optimization. Social media optimization. With all this optimization, you’d think that organizations would be sales and marketing machines, banging out the profits faster than ever.
Strangely, most of the folks promoting their optimization services barely have two nickels to rub together. The companies who are unlucky enough to hire these folks end up out a lot of money and become bitter, disenchanted with the idea of optimizing anything.
Why does most optimization fail so hard?
Think about it this way. Let’s say you want to do the most basic optimization possible – you want to optimize your commute home. You want to shave a minute or a mile off that daily drive, that way you’ve always done it.
Now let’s say that you only know the way you’ve already been going for the last day/week/month/year/life. How successful will your optimization be?
Exactly. You will achieve nothing, no significant gains at all.
So how would you optimize that commute? Before you can find the best route home, you have to know more than one route. Explore. Learn. Listen. Drive on all the back roads and side roads in and around your commute. Talk to other people who drive that route or who live in the area, gas station attendants, waiters and waitresses. Learn everything there is to learn about all of the ways between your house and your office, and then test them. One day you take a southern road. One day you take the light before your usual light. Run all the variations that you practically run, learn, explore, and get to know all the places between home and the office.
The time it takes you to learn and explore is absolutely vital. There’s no substitute for that research. There’s no pre-drawn road map that will tell you in perfect, precise details how to get from your house to your office in exactly the right way. There’s no mentor you can seek who will tell you exactly how to get to your house from your office – though there are plenty of fellow travelers who can share tips about how they get home. In the end, only exploring and learning all the routes available will let you “optimize” and choose the best way home.
Now expand this analogy to everything you’re trying to do in your business. How much time, energy, and resources are you putting into research and exploration? How many questions are you asking each week, the equivalent of taking a different turn, knowing that a huge number of ideas will be dead ends? How often do you listen carefully to customers, prospects, and other fellow travelers to hear what they’re finding in their own exploration?
Most important: how much are you spending on “optimization” that’s ultimately going to be fruitless because you don’t know any different ways or worse, because your corporate culture is mired in “that’s the way we’ve always done it”?
Explore first. Optimize only after you’ve learned new ways to get home, or you’ll only repeat the mistakes of the past.
Did you enjoy this blog post? If so, please subscribe right now!
Get this and other great articles from the source at www.ChristopherSPenn.com! Want to take your conference or event to the next level? Book me to speak and get the same quality information on stage as you do on this blog.
Why Google Buzz is brilliant and deadly to social media 1.0
From the moment it launched, Google Buzz generated buzz:
- OMG another social network to manage
- OMG there’s too much noise
- OMG this is so redundant
And for the early adopters, it’s exactly that and more. It’s noise. It’s clutter.
It’s brilliant.
Here’s why. Google wants the best of the best data. Remember this. They are a data company. They are a data quality company. They are algorithmic in their approaches to solving problems.
For a lot of the social media crowd, the moment Buzz turned on, our valued inboxes became insanely cluttered as we linked up all our social media sites, networks, and properties. We discovered that frankly, we didn’t want the firehose of social media in our inboxes.
We realized quickly, if we didn’t already know, that most of our “friends” are in fact valueless robots spewing garbage at us all day. On services like Twitter and Facebook, we don’t really notice because it’s bite size garbage that passed by quickly. When it piles up in the inbox, we notice. Fast.
So for the early adopters, those who keep Buzz on, we’re pruning back hard. We’re not following back. We’re dropping auto-follows. We’re down to just a handful of people, close friends, that we REALLY want in our inboxes. How many of the self-proclaimed social media gurus are you actually allowing inside your inbox, in Buzz? Exactly.
Buzz is working as intended. Google wants data quality. We immediately filter out completely all the noisemakers who bring no value to the table.
Buzz also incentivizes us in a couple of ways. It tells us to prune back our own spewage lest our friends, the ones we care about truly, unfollow us and eliminate us. It tells us that redundancy of information is of no value to anyone using Buzz, since you can get blog posts and status updates already from FriendFaceTwitterFeedBookSquareWallReader service (now with more blatant self-promotion from social media experts!). So we share and discuss only the stuff that’s either super high quality that we just can’t afford to miss, even if it’s redundant, because of the quality, or we share stuff that’s not being shared elsewhere.
Google figures out from our activity in Buzz that either there’s new stuff to be examined (remember in the initial presentation that Buzzed stuff gets indexed the moment it’s shared, and Google wants to find EVERYTHING to index) or there’s stuff that’s so important and so good that you’ll let it into your inbox even if you can get it elsewhere.
By placing Buzz so close to the incredibly precious, valuable territory that is our inbox, Google is forcing users to reveal what we truly value, what we’re willing to let into a very private space. It’s the perfect walled garden, because instead of enforcing the walls on us, Google simply lets us build the walls for them.
The lesson for marketers and content creators is this: social media 1.0 is drawing to a close. Social Media 2.0 is about relevance, value, and authentic connection, because you will never, as a marketer, get through the gates of the walled garden with a boring-as-crap press release or product announcement. No one cares about you. All of the services, but especially the big ones, are giving users more tools to screen out anything they don’t care about, anything that doesn’t engage them, anything that isn’t actually great quality.
Buzz is just a very visible demonstration of how much crap our “friends” spew out that’s of no value, and why we were so annoyed by it. Now that it’s under control, now that we’re isolating actual friends from “friends” and our networks are getting trimmed, we’re starting to get more value out of it.
And you can bet Google is paying VERY close attention to us and what we do with our Buzz.
Did you enjoy this blog post? If so, please subscribe right now!
Get this and other great articles from the source at www.ChristopherSPenn.com! Want to take your conference or event to the next level? Book me to speak and get the same quality information on stage as you do on this blog.
Ultimate Search Engine Optimization
What’s the ultimate search engine optimization?
The same thing that everyone has been saying for years – content. Good content rules all.
One of my Student Loan Network coworkers came back from an SES (Search Engine Strategies) conference yesterday with an interesting tidbit:
Search engine algorithms are getting so sophisticated now that they’re starting to mimic human behavior.
Think about that for a second. That means an eventual end to stupidity like doorway pages, keyword bait, and all the other tricks that the SEO industry has promoted over the years. An end to pointless linkbait, Digg articles that are misleading at best, and best of all, the endless flow of emails from folks saying, “Let’s exchange links between my crappy PPC (pills/porn/casino) site and your reputable little blog”.
Good content. That means the skillsets for future SEO professionals will likely include:
- Excellent writing
- Audio engineering – because great video starts with great audio
- Video creation and editing
- Web design and development
- Graphic arts
- Marketing and sales skills
Funny enough, that looks like a list of skills at any major media outlet. The evolution of “new media” and “social media” to just media continues.
Did you enjoy this blog post? If so, please subscribe right now!
Get this and other great articles from the source at www.ChristopherSPenn.com
Would you buy .sex?
CNN is reporting that .sex domains may become available
I wonder what you would expect to find at studentloan.sex and financialaid.sex… still, probably should buy them when they become available. Or MarketingOverCoffee.sex? Or PodCamp.sex? (eww)
For that matter, what would you expect to see at CNN.sex?
Did you enjoy this blog post? If so, please subscribe right now!
Get this and other great articles from the source at www.ChristopherSPenn.com











