[mb-users] Why is robots.txt disallowing all of this?
Philip Jägenstedt
philip at foolip.org
Sun May 4 08:50:49 UTC 2008
Personally, I often try to find something I know I read in an
annotation by searching Google, but that obviously never works. Also,
when looking for Chinese releases, Google automatically converts
between simplified and traditional Chinese, which is great if you
don't know which script the release was entered in.
I would suggest appending the following to the current robots.txt:
User-Agent: Googlebot
Allow: /artist/
Allow: /release/
Allow: /track/
Allow: /show/    # for seeing full AR listings
For some reason labels aren't blocked in robots.txt, but that could be
an oversight.
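
As a rough check of how the proposed group would behave, here is a minimal sketch using Python's standard urllib.robotparser. The "existing" rules below are an assumption made up for illustration (including the /hypothetical-private/ path), not a copy of the real robots.txt. It also shows a side effect worth keeping in mind: a crawler that finds a group naming it uses only that group, so a Googlebot group containing nothing but Allow lines leaves Googlebot unrestricted everywhere else too.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: stand-in for the current rules; the real file is not quoted
# in this thread, and /hypothetical-private/ is an invented path.
ASSUMED_EXISTING = """\
User-agent: *
Disallow: /artist/
Disallow: /release/
Disallow: /track/
Disallow: /show/
Disallow: /hypothetical-private/
"""

# The group proposed in this message.
PROPOSED = """\
User-agent: Googlebot
Allow: /artist/
Allow: /release/
Allow: /track/
Allow: /show/    # for seeing full AR listings
"""


def build_parser(text):
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def allowed(parser, agent, path):
    """Return True if `agent` may fetch `path` on the site."""
    return parser.can_fetch(agent, "http://musicbrainz.org" + path)


parser = build_parser(ASSUMED_EXISTING + "\n" + PROPOSED)

for agent in ("Googlebot", "SomeOtherBot"):
    for path in ("/artist/abc", "/show/abc", "/hypothetical-private/x"):
        verdict = "allowed" if allowed(parser, agent, path) else "blocked"
        print(f"{agent:13} {path:26} {verdict}")
```

If some paths should stay off limits to Googlebot as well, the matching Disallow lines would have to be repeated inside the Googlebot group.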
If the traffic gets to be unreasonably high, just set the crawl rate in
http://www.google.com/webmasters/tools/
Philip
On 5/4/08, Chad Wilson <chad.wilson at gmx.net> wrote:
> Steve Wyles wrote:
> > On Sat, 3 May 2008, Philip Jägenstedt wrote:
> >
> >> Wondering why Google doesn't index MusicBrainz very well I turned to
> >> http://musicbrainz.org/robots.txt
> >>
> >> It seems most of everything is off limits to the search engines. Why?
> >> Wouldn't search traffic generate more traffic to MusicBrainz, giving
> >> us more new users?
> >
> > The load generated by search engines indexing the whole site could
> > make it unusable for users.
> >
> > Steve (inhouseuk)
> >
>
> I personally think we should be trying to find the ability to
> accommodate this kind of hit. One doesn't take over the world by
> shutting one's doors. If we want MB data to become a serious reference
> point for people, it at least needs to be searchable and available.
> Several sites that take feeds of our data manage to be Google indexed;
> so it'd seem embarrassing for us not to be able to handle it.
>
> Could we provide mirror hardware specially for search engines for the
> initial index? Or is the architecture/code fundamentally unscalable to
> high volumes of traffic?
>
> Chad / voice
>
> _______________________________________________
> MusicBrainz-users mailing list
> MusicBrainz-users at lists.musicbrainz.org
> http://lists.musicbrainz.org/mailman/listinfo/musicbrainz-users
>