Irrelevant Results in Google Search – Contd..

Pin It

This is in continuation of the previous blog article Most irrelevant Results in Google Search – A Case Study ,  I have done a more detailed research into what’s happening with the Google search results. The following aspects have been considered in this study:

  1. How exactly the third party site (say site B) has been linking to the hacked site (site A, we are using the term hacked site, though it has been off-site)
  2. What are the Title and Description of the attacker’s site(site b)  and the page that it is forwarding the visitor when he clicks on the search result
  3. Whether site B, which was considered to be the attacker’s site is itself a victim? It is very much possible that the hacker used site B to redirect traffic to site C (the ultimate beneficiary?)

Now, going to point #1, it was observed that the following code had been used to link to victim site (backlink URL):

http://www.sunnydaysandlovelyways.com/?htm=LabSim/network-lab-simulator.htm
In the above backlink, if we replace the domain name and the “?htm”, it exactly corresponds to the victim site URL, which is

http://www.site-A.com/LabSim/network-lab-simulator.htm

Note: Domain name has been changed to site-A.com.

It was observed by going through the set of backlinks, almost entire site had been reconstructed by the spammer with a different domain name, but with same link structure. Point #2. The meta tags like Title and Description appear to have been duplicated. For example, the search result for key word “ccna exam” is given below:

In the above search result, the title is “CCNA 200-125 Practice Exam” and the description is “200-125 ccna practice exam consists of 425+ questions with flashcard explanation”. The  site A webpage corresponding to this result has the same exact Title and Description. On clicking on this search result, you will find that the destination page has nothing to do with the search term.

Result: Effectively, another irrelevant website (site B) has taken the place of Site A without having to hack site A. It was also observed that the webpage that matches the Title and Description shown above, had been delisted and doesn’t show in indexed list of web pages.

As per point#3, it was observed that when clicked on the site B’s link, it had been redirected to another site (site C) intermittently, resulting in suspecting that site B is an intermediary to final beneficiary website.

The results, though for a sample site, have far reaching ramifications. Just assume that this hacking has been done on a broader scale (hopefully, it is not so as of now), the whole search results become more or less irrelevant for the user.

It is very surprising that the search algo could not detect this kind of site hacking which is external to the victim’s website. It also points out to the fact that the search engine “memory” is not deep enough to remember the history of the web page, as new pages (weeks or months old) which are duplicates of the original pages (several years old pages, in this case 10+ years for the URL and the core  content of the page has not changed) are showing up in the results.

Will it continue? It appears so. The only solution is to have deep memory combined with processing power for the search algo which may not be possible due to reasons like huge processing overhead, update schedules, and delivery of the search results. Even if the algo is modified to fix one hack, another form of hack may surface due to above limitations.

Though in this case, one particular search engine results were taken, it is possible that similar hacks might happen with other search engines like bing as the mechanisms of hacking are the same. It necessitates that webmasters assess the web metrics such as keywords, backlinks, ranking, etc. continually and do the maintenance on a continual basis!!

So, now what happens to what some major search engine FAQs that say: “Just concentrate on your web pages and create value to your website visitors.” It is partly true, as the webmasters now have to really work through web metrics like keywords, ranking, backlinks, analysis of backlinks, various types of possible off-site hacks, removal of backlinks, reporting of spam sites to respective search engines, etc. And this is a specialized work, and one needs a professional to do this work, and not many individuals, and small businesses could afford it. As a result, it is possible that most of these sites are going to vanish from major search engine results over a period of time, unless there is more heuristic approach to search mechanisms.

Disclaimer: This is in the opinion of the author and does not represent Anand Software and Training’s view.

Author: Vijay Anand

Irrelevant Results in Google Search – A Case Study

Pin It

This is a case study, where in, the Google search results were analyzed for any inconstancy and possible susceptibility to hacking. The study is confined to a few search terms that we have been working with, and it is suggested to enlarge the scope of the study for wider ramifications.

The following search terms were considered for this purpose:

  1. simulation exams
  2. ccna exam
  3. ccna details

The results for all the three search terms were analyzed and discussed with relevant snap shots.

  1. simulation exams: The website at no.2 place is totally not related to the term as can be seen by going through the result.

simulation exams
The second result in the search results, with title CCNA 200-125 Practice Exam is totally irrelevant. Don’t get misled by the title (though Google did). The link sunnydaysandlovelyways is leading to a website that is totally in another category not nothing to do with exams or simulations. A screenshot of this site as you click on the link is given below:

sunnydays
It was also observed, at times (randomly) the click is getting forwarded to the website topdum  .  com, dump website as shown in the screenshot below:

topdump
It appears to be an illegal dump site. The redirect is not consistent, it forwards to topdump . com at times, but not always.

Now, lets move to another search term that shows up in Google search results.

2. CCNA exam:

This is a popular search term for CCNA exam, CCNA stands for Cisco Certified Network Associate, one of the most popular networking cert offered by Cisco.

ccna exam
Heree again, the website ranks third for this keyword. As may be seen, the website has nothing to do with exams, let alone CCNA exam. A pure misplacement by Google search.

3. CCNA detail:

Now, we move on to the third search term, ccna detail. A screenshot of the same is shown below:

ccna detail
As can be seen, the website sunnydaysandlovelyways has ranked 5th in the Google results. however, as mentioned before, it has nothing to do with CCNA or exam. The website some times (but not always) forwarding to topdump . com.

There are a few possibilities for the above results:

  1. The website sunnydaysandlovelyways has been changed. or
  2. The website sunnydaysandlovelyways has been hacked

The first possibility may be ruled out by looking at the history. The second possibility means that the site has been hacked and ranking high in the google (or black hat seo). However, it is surprising that google ranking some random site so high in search results. The hacker appears to have beaten google search algo at least for a few weeks 9particularly, the Christmas weeks). This is likely to break some businesses, because the slot actually belongs to a credible web site, but taken over by an irrelevant and probably spammy or hacked site.

This is really some thing that one can’t expect, as even bing and yahoo are not showing this particular site for the given search terms in the first 100 pages!