Updated December 1st, 2021 — version 307

The UnFair Advantage Book
Winning the Search Engine Wars

Chapter Three, The Search Engine Order of Importance and the Web page Discovery Process

Chapter Three
The Search Engine Order of Importance and the Web Page Discovery Process

Search Engines Market Share

The pie chart shows that Google attracts 93% of the English reading eyeballs that search the Internet. This makes Google really the only relevant search engine in North America.

For the English speaking market, only two search engines count: Google and (in a small way) Bing.

Globally, Google has over 93% of the worldwide mobile market search market. The Chinese search engine, Baidu, is 2nd with a market share of 4%. Yahoo has 1% (boosted by Yahoo Japan) and Bing is 2nd with 2% of the worldwide market.

It's very important to note the above figures reflect the mobile market share and that Google is the 800 pound gorilla because...

Mobile users have out numbered their desktop counter parts in most industries since 2016. Today there we're seeing more of a 70/30 split in regards to mobile users.

...which effectively gives Google a monopoly on Mobile Search!

The Web Page Discovery & Update Process

Earlier we mentioned spiders, crawlers, and bots. These are actually sophisticated computer programs that are designed to "crawl" the web and discover new and updated web pages to save to their index. Search engine results would get stale very quickly if they didn't continually look for new pages and update old ones. And, of course, this discovery and update process must start somewhere. These starting points are known as Seed Sets — sometimes also referred to as the Crawl Frontier.

Seed Sets

As the graphic indicates, sites that are part of a core seed set are likely to have a clearly identifiable authority like .gov, .edu, or .org. In addition they could otherwise be a well known commercial entity like NYTimes.com, CNN.com, , etc.

A site that is part of a seed set is likely to have the highest Trust and Authority values as the spiders begin their crawl.

Trust is determined by a combination of factors such as website quality, popularity, and incoming links from authoritative sites. For example, a site with a link from NASA.gov would gain trust. But trust can also be lost by linking out to disreputable sites. Trust & Authority

Authority sites would include the likes of National Cancer Institute, NASA.org, Wikipedia.org, Electronic Frontier Foundation, National Geographic, PC Magazine, Wall Street Journal, and so forth.

Hubs are sites of any size that link to authoritative web pages. It is good to be a Hub and it is good to be linked-to by a Hub. Any page that links to other websites with known authority can be a good page from which to get a link.