Spider蜘蛛爬行遇到的六大“陷阱”

网站相关 SEO优化(图片点击可放大 可拖动)

这张图形象说明了蜘蛛爬行过程中会遇到的不利因素,下面简单对其剖析。

第一个“陷阱”:Orphan Pages
     Orphan译为“孤儿”,Orphan Pages可以理解为孤立的网页,也就是没有链接的网页。对于spider来说,一个没有链接的网页是难以抓取的,故而这使spider陷入第一个“陷阱”。

第二个“陷阱”:Unfriendly SEO-CMS System
     SEO是Search Engine Optimization缩写,大家都明白什么意思,CMS是Content Management System的缩写,意为”内容管理系统”,一个对SEO不友好的CMS系统会不利于蜘蛛抓取,这是spider的第二个“陷阱”。

第三个“陷阱”:Bad server Configuration
     糟糕的服务配置结构。例如robots文件配置不当阻止搜索引擎的访问,302 重定向所造成的网址URL 劫持,服务器的不稳定及访问权限,都可使spider掉入这第三个“陷阱”。

第四个“陷阱”:Cloaking
     作弊手段。 在Web服务器上使用一定的手段,对搜索引擎中的巡回机器人显示出与普通阅览者不同内容的网页,或有些站长为提高搜索引擎的名次,堆砌与博文内容不相干的关键词等,搜索引擎视其为不正当手段而采取从目录中排除,或是大幅度地降低排名等处置。这是spider的“第四个陷阱”。

第五个“陷阱”:Session Based Coding
     cookie随机变换、html代码中大量的字符等session造成的祸根,对于一个基于SEO的网页来说可是致命的,这就是第5个“陷阱”。

第六个“陷阱”:No Error Handling
     没有错误处理页面。如404页面,wordpress有默认的404页面,我们可以美化该页面。这个问题对wordpress用户倒是不大,在这里蜘蛛不容易栽跟头。

对这张图感较为感兴趣,此博文为伪原创,分享注明来源:http://www.glwzu.com/spider-traps.html

您可能会喜欢:

53 Comments.

Leave a comment
  1. I truely acknowledge with what you are claiming here on your blog. Dispite the fact some of this material may be correct, I just have trouble with it.

    [回复]

  2. I easily contradict with stuff you are reporting here on your site. Nevertheless, some of this advise may be correct, I easily feel a problem with it.

    [回复]

  3. Kudos to you! I hadn’t tghouht of that!

    [回复]

  4. I think that this does not matter if choose custom dissertation services or thesis writing services! Because only the quality of the masters thesis relating with this topic is the most important.

    [回复]

  5. Your article connected with this good post is obviously great and a lot of students would take this for their english dissertation. And some of scholars still take the support of the thesis service.

    [回复]

  6. Was about to give up tonight on researching this topic but then I found your site containing all the information I need.

    [回复]

Leave a Reply

:laugh: :cool: :ding: :blaugh: :evil: :close: :han: :rp: more »
click to change 请输入验证码

( Ctrl + Enter )