Data Intelligence, Business Analytics
When you do a search for "career objectives" on Google India (www.google.in), the first result showing up is from a US-based job board specializing in data mining and analytical jobs. The Google link in question redirects to a page that does not even contain the string "career objective". In short, Google is pushing a US web site that has nothing to do with "career objectives" as the #1 web site for "career objectives" in India. In addition, Google totally failed to recognize that the web site in question is about analytics and data mining.
So here's an idea to improve search engine indexes, and to develop better search engine technology:
This feature could be implemented by having webmasters using special blocking meta tags in web pages, recognized by the search engines willing to implement them.
Comment
Comment by Vincent Granville on October 21, 2011 at 10:53am Regarding the idea to build a website that provides search result pages not just for keywords, but also for related links, I've found one that provides high quality search results when someone is searching for related links. Its name is similarsites.com, and you can check the results, if you search for websites similar to Analyticbridge, by clicking on www.similarsites.com/site/analyticbridge.com.
Clearly its strengths is to show related websites (which link to the target domain, in this case Analyticbridge), by ordering the results (related domains) using a combination of outgoing links and website traffic.
You can create a search engine like Similarsites by building a table with the top 1 million websites (available for download at www.quantcast.com/top-sites-1), and for each of these 1 million websites, have up to 100 related websites (also from the same list of 1 million domains). So you could have a great search engine with a database containing less than 100 x 1 million pair of (related) domains: that's a data set fairly easy to manage (not too big).
Comment by Jozo Kovac on September 28, 2011 at 4:27pm To protect your webpage from unwanted traffic you may just disable Alexa, Quantcast, etc. code for bad visits.
So visitor can see his content and measurement tools aren't affected (display measure code only for good visits).
If you block a crawler you may loose you pagerank and many good visitors with it. And GoogleBot is probably the same in India and in US too.
Comment by Vincent Granville on September 28, 2011 at 11:20am Good point Jozo. Not sure where you would block the traffic, I've been thinking to block google.in via robots.txt, as this would
Blocking can also be made via .htaccess. Here's an example of .htaccess file which blocks lots of undesirable traffic: http://multifinanceit.com/htaccess.txt.
If I add "career objective" in the block list, users visiting the website, following a search query with this keyword, would be redirected to an "access denied" page.
Comment by Jozo Kovac on September 28, 2011 at 3:04am Vincent, can't you write set of rules what would handle a traffic from unwanted sources?
e.g. IF HTTP_REFERRER like "%google.in%q=%career%' THEN dont_count_a_visit
Comment by Vincent Granville on September 26, 2011 at 12:02pm
Comment by Vincent Granville on September 15, 2011 at 5:37pm
Comment by Vincent Granville on September 14, 2011 at 2:59pm
Comment by Roberto Danny Salinas on September 14, 2011 at 2:45pm
Comment by Amy on September 14, 2011 at 9:59am
© 2013 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC
You need to be a member of AnalyticBridge to add comments!
Join AnalyticBridge