Archive

Archive for November, 2009

What is the robots.txt file?

November 25th, 2009

Web robots – often referred to as crawlers, bots, or spiders – are software programs that constantly travel the web, indexing the information found on millions and millions of websites every single day. Some sites, however, don’t wish to be indexed in search engines or accessed by these Web Robots. Now that you know what a Web Robot is and what it does, it’s important you know what can be done to limit their access to your site if you so desire. There may be a number of reasons for wanting to prevent bot access to a website page or specific directory. The most common reasons are related to security, privacy and duplicate content.

The Robots Exclusion Protocol, more commonly referred to as a /robots.txt file, provides webmasters with the ability to provide instructions on indexing the site to bots. The file, which must reside in the domain’s root directory, serves to limit the bots’ access to files within that domain’s root directory. There are often a large number of pages that make up an entire site, but many of those pages – like registration, login, 404 error, privacy policy and order confirmation pages – should not be indexed by search engines. The /robots.txt file also comes in particularly handy for webmasters with a wide network of sites with identical privacy policies, terms and conditions or e-commerce sites that have checkout pages, shopping carts, etc.

 

Addressing Duplicate Content with /robots.txt

The /robots.txt file can also help to eliminate duplicate content issues that arise with blogging software, such as WordPress. With WordPress – and all blogging software, for that matter – content from blog posts is published on the post URL itself, but copies of that content are also published on category pages, as well as tag and author archives. This inadvertently creates several pages of duplicate content. Since duplicate content can have a negative impact on a site’s ranking in the organic search results, the /robots.txt file can help to reduce the potential for duplicate content that can adversely affect the site’s search marketing strategy.

 

Understanding How to Use /robots.txt

In order to function properly, the /robots.txt file should be accessible at http://www.domain.com/robots.txt and reside in the domain’s root directory. The file itself should be created as a plain text document. Do NOT use Microsoft Word or another word processing program – the standard Notepad program that is installed with Windows or SimpleText/TextEdit with the Mac OS work best. The file name must be robots.txt and uploaded directly to the domain’s root directory. The commands within the file itself can be as simple or complex as your needs demand.

The standard, generic /robots.txt file – one that does not limit access to any of the information in your domain’s root directory – would be formatted like this:

User-agent: *
Disallow:

In order to block bot access to the domain’s root directory completely requires adding only one character to the standard or generic /robots.txt file and would look like this:

User-agent: *
Disallow: /

What if you want to limit bot access only to certain subdirectories or specific pages of the site? Not a problem. You would simply add each individual subdirectory or URL to the /robots.txt files as follows:

User-agent: *
Disallow: /checkout.asp
Disallow: /add_cart.asp
Disallow: /view_cart.asp
Disallow: /error.asp
Disallow: /shipquote.asp

 

The Robots.txt File Is Not Fool Proof

While the /robots.txt file does a good job of blocking a bot’s access to the domain’s root directory, it isn’t fool proof. Each individual page you do not want bots to index should also incorporate a properly formatted robots META tag. The standard robots META tag is configured like this:

<meta name=”robots” content=”index, follow” />

To help to prevent the bots from accessing individual URLs, the robots META tag in the header of the page should look like this:

<meta name=”robots” content=”noindex, nofollow” />

or

<meta name=”robots” content=”noindex, follow” />

 

The Bottom Line

A /robots.txt is a very useful tool and, unfortunately, an often overlooked and neglected aspect of web development. Now that you have a better understanding of what it is, what it does and how to use it, take some time to consider how your site may benefit from having a properly configured /robots.txt file. In the meantime, start checking out the /robots.txt files of the sites you visit to familiarize yourself with different configurations and uses for it.

Sam Mc e-Marketing Information , , , , , , , , , , , , , , , , , , , , , , , , , ,

More Features with Google Image Search

November 18th, 2009

Google has added up various features with options in its search engine. This feature has now been added into its Image Search property feature also. This is a very simple feature but could save on a lot of time by searching the most accurate results. This image search options makes it essential for users to combine search options to get the best search results faster with provision of introducing more new features in the future as well.

Google Image Search Features

Google Image Search Features
This new feature will give one the same layout as that of Google search with various options listed on left side of image search results page. These image search options would include color, size, and type.

Color option would let one specify the different colors of image which one is searching for and includes the various colors which include red, pink, green along with black and white features.

The size option which lets you specify different sizes aside from usual image sized that users are searching for.

The type option which lets one specify image types whether one wants to have clipart, photo, face or line drawing within the image search results.

Sam Mc e-Marketing Information , , , , , , , ,

You Can Track RSS Feed Clicks by Using Google Analytics

November 16th, 2009

Do you want to know number of people who are visiting your site via RSS feeds? If you have been using FeedBurner to syndicate the RSS feeds of your blog then now you can easily track your incoming traffic from feed clicks through Google Analytics.

For that you just need to open your FeedBurner dashboard, then click the title of RSS feed and select “Configure Stats” under the Analyze Tab. Now tick the options that say “Items views”, “Item click” and “Track clicks as a traffic source in Google Analytics” and save it.

Besides the click counts, you will also know the exact source from where the click may have originated. This is an important data because it will help you to know if people who subscribe your feed in Google Reader are more engaged than you “My Yahoo” subscribers or vice-versa.

Traffic from the clicks inside your RSS feed will come under “All Traffic Sources” and “Campaigns” in the reports of Google Analytics reports. Select “Ad Content” from the category drop down in the traffic source data table and it will allow you to see the incoming traffic from your RSS feed segmented by the specific feed readers or the email clients.

Sam Mc e-Marketing Information , , , , ,

Search Engine Optimization (SEO) Strategies for Yahoo

November 13th, 2009

Use the following strategies to get good traffic and rankings in Yahoo search engine. 



1. Use keyword-rich titles, about 20 words long and unique for each page of your site. These titles are the first things that Yahoo! sees and are the single most important aspect of your site regardless of which search engine you are optimizing for.
 

2. Use the meta description tag on each page too, with a description that fits your site well. Yahoo! still uses meta tags for its descriptive purposes but not necessarily word for word. Make sure that you have a quality description so that you don’t run into situations where you don’t get listed because of lack thereof. Basically, your description tag is an evasive action rather than an attempt at attaining a high number of hits or a higher rating.  

3. A meta keywords tag should be included on every page of your website, and it should list words and phrases related to the site. Make sure that the key words describe the specific page rather than the site itself except for the index page. You don’t want a bunch of people to be going to the wrong front page. They may see the page, not find their answer and leave. If they find your home page, they’ll probably see that it’s a home page, look for a relevant link, and find the answer or material that they are looking for.  

4. Well-written, keyword-rich content will help you a lot with Yahoo. Well-written content is important regardless of search engines. This content is the entire reason for your site’s existence. Make sure that the content that you deliver is something that you want your name on. Something that you can stand behind.
 
5. Easy-to-follow internal links in plain text will help Yahoo’s spider – the bottom of the page is a good place for them.

Sam Mc e-Marketing Information , , , , , , , , , , , , ,

Google Analytics Developer Documents

November 9th, 2009

The Google Analytics developer documentation contains everything you need to know to implement Google Analytics technology for:

  • Website tracking
  • Adobe Flash rich media tracking
  • Gadget Tracking
  • Exporting Analytics data 

The Analytics developer documentation is divided into the following sections:

  • Overview

    This section covers important technical concepts in Google Analytics that will help you correctly modify or use the APIs for Google Analytics. Here, you’ll learn:

    • how the tracking code works
    • how Analytics accounts differ from Google Accounts
    • which cookies are set and for what purpose
    • how and when data is sampled
  • Tracking Code

    The Google Analytics Tracking Code, ga.js, works with any web page or web application that can execute JavaScript. You can install the tracking code for your site with a simple copy-and-paste operation to enable hundreds of reports for your website traffic.

  • Data Export API

    Use the Google Analytics Data Export API to download Analytics data in the form of Google Data API feeds. With the Data Export API, you can embed a customized dashboard in your website to display the most commonly viewed or requested page statistics for all your reports users. Or, you can integrate your Google Analytics Data with other website data that you might be tracking in a separate system.

 

Content Taken from Google Analytics Info

 

Sam Mc e-Marketing Information , , , , , , , , , ,

Advantages to e-Retailers with the Search Engine Optimization

November 1st, 2009

Internet retailers do a remarkable job with their paid search marketing hard work in most of the times; however the search engine optimization techniques appear to be missing according a study based upon Google search results in the 2010 edition of the Internet Retailer Search Marketing Guide. Creating a solid paid search strategy is something that major internet retailers appreciate and excel at because it is computable and can be fine tuned to meet back-end goals.

 

Working to stay at the top of the organic search results for their top product categories involves a very different approach, which is not as easy to succeed at without the right approach and plan. Since organic search results are a reflection of Google’s assessment of site content and link popularity, it means that internet retailers need to continually create content and establish links to their online store from other sites. This is not as easy as it sounds because retailers need the right type of content and links if they want to be first for their desired product category keywords. 

 

The study used over 300 keywords to determine where top internet retailers ranked on Google’s search engine for a number of product categories. It found that they only owned the top three spots in a little more than 50% of the 20 merchandising categories, placing in 35 of 60 possible spots.  

 

Results were slightly improved over the prior year, but still indicated that Google completely passes over internet retailers for the top positions in the organic results. For example, when searching for "refrigerator" 3 out of 5 of the top results did not even sell the product and were informational sites like wikipedia.org and energystar.org. Profit margins on organic search traffic are typically significantly higher than that of paid search and if internet retailers want to improve their overall profitability then they will need to focus on a strong search engine optimization strategy.

 

Sam Mc e-Commerce Information , , , , , , , , , , , , ,

e