Archive

Author Archive

Smarter Searchers Use Longer Queries

December 16th, 2009

Everyday I preach the benefits of using natural text within your site’s pages. Aside from the fact that most shop owners will inevitably spam if they dwell to deeply on keywords, it really works. I can give you tons of examples why, but we will cover just a few metrics today.

First and foremost, using natural text helps to grab customers more and high school report researchers less. For example, I was speaking with a customer today who sells engraved silver and pewter gifts. We were making a plan of attack for next year. The primary concern for this plan is to better target his Jefferson cups. While he is on page one for Jefferson cups, he is number 10.

So if you search Google for Jefferson cups, you will see that these very historical cups have many informational pages indexed and presented for this query. My point is, that the search rank for Jefferson cups matters little as these are not shoppers…. So if I was looking to buy one of these I would and most other searchers would use longer and more targeted queries.

  • Buy Jefferson Cup
  • Engraved Jefferson Cup
  • Jefferson cups for gifts
  • Pewter Jefferson cup

These are just a few examples of how one may search when the target is a purchase and not just information searching.

These longtail searches are more easily converted and easily targeted by using the naturally related words associated with the products and services in the product’s page copy. Not spam, just high usability and proper search-ability.

Traffic is traffic you may say…. But it doesn’t make it true. Fact is that ecommerce stores are designed to sell, therefore attracting customers is clearly the first order of business. Most information seekers will not be converted in to a sale… They just found the wrong site for the information they were seeking… Additionally, the likelihood that they will return someday to purchase because they remembered the site is store address is slim to none.

Why do you suppose users are searching with longer queries than say 5 years ago?

It’s really not that complicated. The many millions of unsuccessful searches have trained searchers to use longer queries to better target their search results. This isn’t the only thing searchers have been trained to do either. Another huge change is that searchers will read scan the text at the top of the page’s main content container for about 15 seconds upon arriving on a site. Not images… Not videos…. TEXT.

Searchers do this for a very clear and understandable reason… They are trying to very quickly determine if this page is relative to their search. Makes perfect sense… who wants to spend 10 minutes on a page that isn’t what we were searching for?

If I told you that less than 48% of all Google US searches were 2 words or less, would you believe me?

It’s true, fact is that the majority of Google searchers use 3 words or more in their query. In fact I would bet anyone that repeat searches are double and about 80% or better.

Longer Search Query User Data

Longer Search Query User Data

Source: Experian Hitwise

The need for helpful, descriptive and natural text on your store’s page cannot be understated. Your shoppers need this text to help them find you, identify your page as a good result and better navigate your site.

Melanie E-Commerce SEO

Free Zen Cart Report Purely Poultry

December 16th, 2009

Our November 2009 free site report winner is Purely Poultry. We welcome you to read their report below and visit their site as well. Please show some respect and do not create a bunch of test accounts and havoc on their website while poking around. We do welcome you to provide constructive criticism and suggestions for them to improve their store and business online.

Purely Poultry presents quite a unique online product, as a matter of fact this is one product line we have never worked with before. This fairly unique product line affords them an edge for rank, as the pool of fishes are few to compete with. Fixing some of the issues we found with the cart could clearly have very positive and fast results.

Below is a mildly edited copy of the Zen Cart site report we already provided to Purely Poultry. They have graciously agreed to allow us to publish this and share their issues. We thanks them for providing this medium to help other shop owners who may have similar issues and problems.

Load Speed Main Page: 237.53 seconds at 28.8kbs. We would like to see this well under 50, but the ideal standard is 30 seconds @ 28.8kbs.

On the main page only 5 HTML errors were noted, this is not bad at all… But repairing these will increase the ability for their site to be effectively crawled by the search engines and additionally increase the usability of the website.


Line Description
37 Warning: <img> lacks “alt” attribute
134 Warning: <table> lacks “summary” attribute
150 Warning: <script> escaping malformed URI reference
206 Warning: <img> escaping malformed URI reference
266 Warning: <img> escaping malformed URI reference

Title: Main page title is 14 characters long and the suggested search engine friendly limit is 65. Your title is 100% relevant to your main page content, but lacks good information relevant to the page. This tag should contain a short (65 characters or less including spaces) description of what this site is about.

Current Title: Purely Poultry

Description: Main page description contains 113 characters and is 100% relevant to the main page content.

This description should contain text under 250 characters in natural language to prompt searchers to click through to your website. The Meta description is NOT used for the purpose of ranking your site’s pages, but rather as a means to elicit a click through from searchers. Note that your description is somewhat spammy as it is essentially a keyword list. Additionally, this tag should be a minimum of 100 characters for Google not to give a short description error in webmasters tools.

Current Description: Purely Poultry : – Pheasants Chickens Bantams Guinea Keets Books Turkey Poults Ducklings Goslings Quail Peafowl

Canonical Domain url Check:
Your site is returning a proper canonical 301 redirect from www to non-www or vice verse. Google currently has cached versions of both.

This means that sitewide canonical duplication exists as both the non-www and the www versions of the urls work. Duplication of this nature can cause many rank issues and even split PageRank between the 2 different canonical versions.

Google indicates that pages have been omitted from regular search results for being to similar or duplicate in nature.

Total indexed in Google.com: 624

Pages omitted from Google’s Main Index: 129

Pages within the Google’s regular search index: 495

*Your site has 718 pages indexed in Yahoo search.

*Your site has 15 indexed pages in MSN/Bing.

*You have 1 page indexed in Ask.com’s search index

Sitemap and Robots.txt:

A valid search engine sitemap.xml for your site was not found. This document, which is a specific sitemap for search engines, is designed to help search engine spiders locate and crawl your content better and therefore should NOT be styled for human visitors.

A valid, but miss-configured robots.txt for your site was found. A robots.txt is a search engine universal markup to prevent well behaved bots from crawling pages which have content of no value to the searchable index, are duplicate or blocked from indexing.

This has been partially corrected.

Backlinks: Your entire domain currently has 647 total back link(s) from domains other than your own.

This isn’t bad at all, keep up the good work!

PageRank: Your main page has a toolbar Page Rank 4 and there is no issue with your PageRank dispersion with regard to canonical duplication.

Sites on IP: Your site’s IP is 000.000.000 and there are a total of 8 domains hosted on that IP, none of which appear to be flagged as “bad neighbors” or hosting questionable content.

Spam and Hidden Text: Upon scanning your site hidden text was found, that appears to be related to your CSS menu across the top of your page.

Invisible text found. Method(s): CSS ‘display’ property set to ‘none’.
Invisibility purpose: Impossible to say.
Text: New Products

Miscellaneous:

  • Your site appears to have what we call the “nofollow bug”. While this is a Zen Cart
    bug, a fix has been available for quite some time. Essentially every page on your
    site currently has a <meta name=”robots” content=”noindex, nofollow” /> tag
    with in the head element that tells the search engines not to follow or index links
    found within that page.
  • You should nofollow ancillary page links, or advertising links within your site
    (once the bug fix has been applied) to stop the flow of PageRank and prevent the
    crawlers from crawling these pages unnecessarily wasting precious crawl time.
  • We would advise you to place a phone number in a prominent location through out
    your site. eg: in the header. To give troubled customers a sense of security, and a
    means of contacting some one should an issue arise. Without it shoppers tend to
    abandon the cart sales when they encounter any issues what so ever. They will not
    go looking for a contact number! Remember, not every search visitors lands on your
    main page, most land on interior pages.
  • The use of mailto links or valid email addresses even in a textual format, like on
    your “Privacy Notice” page will promote spam, as these are crawled and read by
    spiders for the purpose of creating email list for sale to spammers.
  • Your “Conditions of Use” is completely blank.
  • On your privacy page under “How do we protect your information?” the last line
    of the second paragraph reads “and are required to�keep the information
    confidential.”
  • You have unsecured elements on your secured SSL pages causing a broken lock.
    This is the number 1 cause of checkout abandonment.
  • Your main page and category pages have very little textual content in order to rank
    with. You will find it very difficult to rank well in the major search engines with out
    it. Additionally pages like http://www.purelypoultry.com/bantams-c-155.html with
    very little text and an overwhelming amount of links lacking textual support may be
    viewed negatively by Google.
  • It is a very large liability to have “Subscribe to Our Newsletter” checked by default
    on account creation, as shoppers will inadvertently get your newsletter and report
    you as spam. This is a FTC spam violation and has financial and business related
    liability in the US and many other countries.
  • Your shopping cart pages still shows part of the default Zen Cart default text
    (defined in includes/languages/english/shopping_cart.php).
  • Your “Checkout Success” page contains the following text “This file is located in
    /languages/english/html_includes/classic/
    NOTE: Always backup the files in /languages/english/html_includes/
    your_template”
  • You would greatly benefit by reducing the steps involved in the checkout process.
  • Removal of the side bars during the checkout process will reduce customer
    distraction as well as checkout abandonment. Once the customer has committed to
    the checkout process you should avoid giving them opportunities to click away.
  • Your checkout pages are no secured. This is a PCI compliance violation and your
    company can be fined by the credit card companies.

I want to personally thank Purely Poultry for allowing us to audit and blog their site report results. I invite you all to comment any additional factors and suggestions to help them improve their Zen cart and business.

Melanie Public Site Reports

Zen Cart robots.txt Tutorial

December 14th, 2009

robots.txt for Zen Cart

Zen Cart robots.txt

Is there a robots.txt with Zen Cart?

Do you have a default robots.txt?

The answer is no… But here is why.  Every cart is different, the very nature of Zen Cart’s flexibility prevents the ability to create a robots.txt that can be applied to even most carts. Things like easy pages and additional pages and installation paths make the process basically a custom procedure for each and every shop. The robots protocol isn’t really difficult, as a matter fact it’s quite logical once you get to know it a bit.

First, lets cover the basic function and ability of a robots.txt.

A robots.txt is a text document called robots.txt (exactly), which resides in the root of the domain. You can actually hide your robots.txt from basic Internet users by using a server side application such as Apache to deliver only qualified user agents access to the location and name of the robots directive on your site. This however, is really more work than necessary as password protect directories, such as login admin areas cannot be indexed unless there is a crawl able link to them somewhere.

robots.txt is an exclusion protocol for well behaved web spider or robots to access in order to receive directives regarding where they may and may not crawl your site’s pages. Blocking a page(s) from crawl in robots.txt WILL NOT prevent them from being indexed or referenced in the search engines results…. It only blocks them from being crawled.

I will cover the most common uses of robots.txt and their directives below for your Zen Cart.

First to cover is that there is NO Allow directive in robots.txt, this is in fact the default behavior… So only Disallow is valid. So if I have a page foo.html which I wish not to be crawled by any behaved crawlers, then the syntax will be as follows:

User-agent: *
Disallow: /foo.html

If I wish that a specific crawler, such as GoogleBot not crawl this page, then the syntax is as follows:

User-agent: Googlebot
Disallow: /foo.html

You can view a  list of crawlers in the robots database here. But for reference her are your most common ones.

So we know how to block a single page from crawl, how about a page in a directory…  But not the whole directory.

User-agent: *
Disallow: /folder/foo.html

The index page within a directory only.

User-agent: *
Disallow: /folder/

All pages in the directory blocked from crawl.

User-agent: *
Disallow: /folder

All pages within a directory can also be accomplished with a wildcard. However, not all crawlers support wildcards. Google does, Yahoo does… Ask.com does not and although Bing claims to have partial support for wildcards…. They don’t. So we must first name a support crawler to apply the directive with a wildcard.

User-agent: Googlebot
Disallow: /folder/*

The wildcard give us some greater control and ability with regard to robots.txt directives… Especially on our dynamically generated Zen Cart pages. The example below blocks a duplicate page from crawling in your Zen Cart shop.

User-agent: Googlebot
Disallow: /*&alpha_filter_id=*

This is to block a URI created by the alpha sorter in your product index pages. Essentially, every URL containing the parameter &alpha_filter_id= is blocked from crawl for Googlebot.

Lastly, the Universal Sitemap Protocol is supported by all major search engines and should be included. This is simply the correct way to let the search engines know where your search engine sitemap resides.

Sitemap: http://pro-webs.net/sitemap.xml

Against my better judgment I am going to post what some may consider a generic/default robots.txt for standard Zen Carts installed in to the root of your domain. I strongly suggest you read and learn to properly use the robots protocol and directives instead. I will take no responsibility for this, as it is a suggestion and you have been warned that every Zen Cart is different. Additionally, this WILL NOT work at all if you have rewritten your page’s urls.

# Robots.txt file for http://www.domain.com/

Sitemap: http://www.domain.com/sitemap.xml

User-agent: Googlebot
Disallow: /*&action=notify$
Disallow: /*&number_of_uploads=0&action=notify
Disallow: /index.php?main_page=discount_coupon
Disallow: /index.php?main_page=checkout_shipping
Disallow: /index.php?main_page=shippinginfo
Disallow: /index.php?main_page=privacy
Disallow: /index.php?main_page=conditions
Disallow: /index.php?main_page=contact_us
Disallow: /index.php?main_page=advanced_search
Disallow: /index.php?main_page=login
Disallow: /index.php?main_page=unsubscribe
Disallow: /index.php?main_page=shopping_cart
Disallow: /index.php?main_page=product_reviews_write&cPath=*
Disallow: /index.php?main_page=tell_a_friend&products_id=*
Disallow: /index.php?main_page=product_reviews_write&products_id=*
Disallow: /index.php?main_page=popup_shipping_estimator
Disallow: /index.php?main_page=account
Disallow: /index.php?main_page=password_forgotten
Disallow: /index.php?main_page=checkout_shipping_address
Disallow: /index.php?main_page=logoff
Disallow: /index.php?main_page=gv_faq
Disallow: /gv_faq.html?faq_item=*
Disallow: /*&sort=*
Disallow: /*alpha_filter_id=*
Disallow: /*&disp_order=*

User-agent: Slurp
Disallow: /*&action=notify$
Disallow: /*&number_of_uploads=0&action=notify
Disallow: /index.php?main_page=discount_coupon
Disallow: /index.php?main_page=checkout_shipping
Disallow: /index.php?main_page=shippinginfo
Disallow: /index.php?main_page=privacy
Disallow: /index.php?main_page=conditions
Disallow: /index.php?main_page=contact_us
Disallow: /index.php?main_page=advanced_search
Disallow: /index.php?main_page=login
Disallow: /index.php?main_page=unsubscribe
Disallow: /index.php?main_page=shopping_cart
Disallow: /index.php?main_page=product_reviews_write&cPath=*
Disallow: /index.php?main_page=tell_a_friend&products_id=*
Disallow: /index.php?main_page=product_reviews_write&products_id=*
Disallow: /index.php?main_page=popup_shipping_estimator
Disallow: /index.php?main_page=account
Disallow: /index.php?main_page=password_forgotten
Disallow: /index.php?main_page=checkout_shipping_address
Disallow: /index.php?main_page=logoff
Disallow: /index.php?main_page=gv_faq
Disallow: /gv_faq.html?faq_item=*
Disallow: /*&sort=*
Disallow: /*alpha_filter_id=*
Disallow: /*&disp_order=*

User-agent: *
Disallow: /index.php?main_page=faqs_new
Disallow: /index.php?main_page=discount_coupon
Disallow: /index.php?main_page=checkout_shipping
Disallow: /index.php?main_page=shippinginfo
Disallow: /index.php?main_page=privacy
Disallow: /index.php?main_page=conditions
Disallow: /index.php?main_page=contact_us
Disallow: /index.php?main_page=advanced_search
Disallow: /index.php?main_page=login
Disallow: /index.php?main_page=unsubscribe
Disallow: /index.php?main_page=shopping_cart
Disallow: /index.php?main_page=popup_shipping_estimator
Disallow: /index.php?main_page=account
Disallow: /index.php?main_page=password_forgotten
Disallow: /index.php?main_page=checkout_shipping_address
Disallow: /index.php?main_page=logoff
Disallow: /index.php?main_page=gv_faq
Disallow: /gv_faq.html?faq_item=1
Disallow: /gv_faq.html?faq_item=2
Disallow: /gv_faq.html?faq_item=3
Disallow: /gv_faq.html?faq_item=4
Disallow: /gv_faq.html?faq_item=5

Once again, every Zen Cart has other pages that should be blocked from crawl, and many have pages within these blocked pages that should be allowed. This is best used as a guide, once you have learned to use the Robots Exclusion Protocol correctly.

Melanie E-Commerce SEO

Google Caffeine – What to Expect

December 8th, 2009
Google Caffeine

Google Caffeine

Google Caffeine is essentially Google’s new, upgraded search engine. While Google has said the official launch of the new search will not be made until after the holidays, it’s actually live in a few data centers already.

Google announced the new search platform development back in August of this year (2009) and has provided us a cool sandbox to test results in. While the sandbox is no longer available, we can tell you that for *most the ranking changes will be mild. I would like to, however, discuss the expected ranking metrics that will likely be changed or tweaked when the the Caffeine update goes live.

Back in August Google had the following to say regarding the needs of a quality search engine.

To build a great web search engine, you need to:

  1. Crawl a large chunk of the web.
  2. Index the resulting pages and compute how reputable those pages are.
  • Rank and return the most relevant pages for users’ queries as quickly as possible.
  • So what can you expect?

    We think you can expect Google to really go after website quality metrics such as, speed, broken links, navigation and content. This will likely have a far greater impact on rank that in the past.

    We also believe Google will add even more rank consideration to the signs of regularly maintained websites. We tell customers that Google likes to know the lights are on AND someone is home. The freshness and quality of content has been hugely important this year and we believe this will not only continue… But become even more important with Caffeine.

    You can expect Google to continue to show strong distaste for paid and reciprocal links and can add linking out to bad neighborhood and spammy websites to boot. Certainly you have little or no control over who links to you, but you *should have complete control over who you link to. This is just another responsibility factor…. Many of the noted items that we feel are being weighted better center around professionalism, maintenance and effort. These are all things you should already be doing, this is your business… remember?

    Spam. Google has consistently demonstrated that spam will not be tolerated. Things like hidden text, unnatural text, pumped up keyword density and comma separated lists are not going to be tolerated. We have already seen Google moving quickly and efficiently on these spam and quality issues over the last 4 months… Do not make this mistake, the cost can be more than your business can afford.

    I think you can expect to see some very long time huge ranking factors fall away from the added weight they currently carry. Metrics like the age of the website will still be important, but not enough to maintain rank as they have been in the past. Those old, unmaintained authority type websites will begin to topple, making way for fresh and maintained content from any age website.

    Shoot me now… But I think you will see Google taking a much deeper look at the type and quality of backlinks, as opposed to a number of backlinks or inventory type attitude. So links from Social media sources, which are considered highly fresh and indicate activity will carry more weight that they currently do. I think additionally, links from seed sites, which are ranking well and relevant will carry even more weight than ever before.

    The way i see this, is that Google is doing its job and delivering the best search experience to their users they can. As a company they have consistently made advances and changes to better their products and services… I think you should understand, they expect the same from you. Remember, being ranked and indexed by Google is NOT your right… You have no rights, it is rather a privilege by contrast.

    Melanie E-Commerce SEO