UK SEO and Internet Marketing Forums
Robots.txt for SMF forum with pretty urls

Webnauts
« Reply #15 on: August 12, 2008, 12:52:19 PM »

We still need to disallow

/BB?topic       - with no forward slash after BB

I saw some print pages indexed as well. "Disallow: /BB/*?*" is in the robots.txt, which I think should solve it, unless they're old caches.
Have you seen these issues being logged in? If not, can you share some example URLs?

I've seen the print pages cached in Google, but that was about a month ago. It seemed that those (and profiles) were all that was being indexed at the time.

/BB?topic was coming up through the Stumble buttons. In essence, Stumble was giving 3 different URLs for the same post, depending on where you clicked the Stumble button from.
So what did the URL look like?

Like this: http://www.davidcastle.org/BB?topic or like this: http://www.davidcastle.org/BB/BB?topic ?
« Last Edit: August 12, 2008, 02:56:00 PM by Webnauts »

daniboy
« Reply #16 on: August 12, 2008, 01:09:26 PM »

The first one: http://www.davidcastle.org/BB?topic
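
For anyone following along, here is a rough sketch of why the rule already in the robots.txt misses that URL. It assumes Google-style wildcard matching (`*` matches any run of characters, a trailing `$` anchors the end, and a rule is matched from the start of the path). The `rule_matches` helper, the `/BB?` rule and the print-page path are made up for illustration and are not taken from any robots.txt library or from the actual file.

```python
import re

def rule_matches(rule: str, path: str) -> bool:
    """Approximate Google-style robots.txt matching (an assumption):
    '*' matches any run of characters, a trailing '$' anchors the end,
    and the rule has to match from the start of the path."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape every literal piece, then join the pieces with '.*' for each '*'.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    if anchored:
        pattern += "$"
    return re.match(pattern, path) is not None

# The existing rule requires a slash after BB, so /BB?topic slips through.
print(rule_matches("/BB/*?*", "/BB?topic"))                       # False
# A made-up URL under /BB/ with a query string is caught.
print(rule_matches("/BB/*?*", "/BB/index.php?action=printpage"))  # True
# A hypothetical extra rule without the slash would cover /BB?topic.
print(rule_matches("/BB?", "/BB?topic"))                          # True
```

So a separate line for the no-slash form looks necessary on top of "Disallow: /BB/*?*", at least for crawlers that treat wildcards this way.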

Webnauts
« Reply #17 on: August 12, 2008, 02:59:02 PM »

We still need to disallow

/BB?topic       - with no forward slash after BB

I saw some print pages indexed as well. "Disallow: /BB/*?*" is in the robots.txt, which I think should solve it, unless they're old caches.
Have you seen these issues being logged in? If not, can you share some example URLs?

I've seen the print pages cached in Google, but that was about a month ago. It seemed that those (and profiles) were all that was being indexed at the time.

/BB?topic was coming up through the Stumble buttons. In essence, Stumble was giving 3 different URLs for the same post, depending on where you clicked the Stumble button from.

Thanks buddy. I updated the robots.txt in my post above accordingly.

Webnauts
« Reply #18 on: August 12, 2008, 04:22:03 PM »

I am sure you have already "killed" the install files
and reverted any temporary 777 perms back to normal?
Can't remember whether SMF does all that automatically or
nags you.

Just double checking... not giving lessons on sucking eggs.
Is this an official invitation for hackers?
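
For the record, the double check suggested in the quote could be scripted along these lines. This is only a minimal sketch: the installer file names in `INSTALL_FILES` are an assumption about what an SMF install leaves behind, and `audit_forum_dir` is a made-up helper, so adjust both to your own setup.

```python
import os
import stat

# Assumed leftover installer names; check what your SMF version actually ships.
INSTALL_FILES = {"install.php", "webinstall.php"}

def audit_forum_dir(root: str) -> tuple[list[str], list[str]]:
    """Return (leftover installer files, world-writable paths) found under root."""
    leftovers: list[str] = []
    world_writable: list[str] = []
    for dirpath, dirnames, filenames in os.walk(root):
        for name in filenames:
            if name in INSTALL_FILES:
                leftovers.append(os.path.join(dirpath, name))
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue  # symlink modes are not meaningful here
            mode = stat.S_IMODE(os.stat(path).st_mode)
            if mode & stat.S_IWOTH:  # writable by "other", as with 777 or 666
                world_writable.append(path)
    return leftovers, world_writable

# Example call (the path is hypothetical):
# leftovers, writable = audit_forum_dir("/var/www/BB")
```

Anything the function reports is only a candidate for a closer look; some forum directories may need looser permissions depending on the host.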

Webnauts
« Reply #19 on: August 21, 2008, 01:02:27 PM »

A while ago, back when we still had the old board, I created a robots.txt and reduced the supplemental pages down to 1%!
I created a new one above, but I see it is not implemented. Here is the bad news, according to http://www.mapelli.info/tools/supplemental-index-ratio-calculator?domain=davidcastle.org%2FBB%2F

Supplemental Ratio for davidcastle.org/BB/: 32.7%

    * Google has a total of 4710 pages indexed from davidcastle.org/BB/
    * 3170 are in the main index
    * 1540 are in the supplemental index
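
The percentage is simply the supplemental count divided by the total. A quick check of the arithmetic with the three figures quoted above (the function name is only for illustration):

```python
def supplemental_ratio(main_index: int, supplemental: int) -> float:
    """Share of indexed pages that sit in the supplemental index, in percent."""
    total = main_index + supplemental
    return 100.0 * supplemental / total

# Figures quoted above: 3170 main + 1540 supplemental = 4710 total.
ratio = supplemental_ratio(main_index=3170, supplemental=1540)
print(f"{ratio:.1f}%")  # 32.7%
```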

To be honest I am very disappointed...
« Last Edit: August 21, 2008, 03:37:57 PM by ash »

MuNKy
« Reply #20 on: August 21, 2008, 01:15:38 PM »

Doh! That's not good, you spent ages sorting all that crap out.

Webnauts
« Reply #21 on: August 21, 2008, 01:47:55 PM »

Doh! That's not good, you spent ages sorting all that crap out.
That is what really pissed me off, Darren. One thing is sure: I will never make that mistake again.

ash
« Reply #22 on: August 21, 2008, 02:30:59 PM »

Webnauts

I've implemented the robots.txt now.

By the way, I've edited your link above, as the site you were linking to was flagged by my firewall / antivirus software as carrying a trojan and kept crashing my browser.

MuNKy
« Reply #23 on: August 21, 2008, 03:22:07 PM »

Strange, I use that site all the time and I've not had any problems with it. Are you sure it's not something that's on your PC, ash?

ash
« Reply #24 on: August 21, 2008, 03:37:34 PM »

Strange, I use that site all the time and I've not had any problems with it. Are you sure it's not something that's on your PC, ash?

Just tested it on another computer and I'm still getting the same warning. Strange. I'll reinstate the link and see if anyone else is having issues.
