[mb-devel] Collaborative Filtering: Artist - Artist Relationships (Summer of Code Proposal)

sharon myrtle sharon.myrtle at gmail.com
Tue Mar 27 07:12:57 UTC 2007


On 3/26/07, david scotson <david.scotson at gmail.com> wrote:
>
> On 3/24/07, sharon myrtle <sharon.myrtle at gmail.com> wrote:
>
> > I've been reading the discussion on the custom tagging system. I think
> that
> > Collaborative Filtering algorithm would benefit from it as this would
> help
> > in analysis. However, for the proposal, I agree that implementing a
> rating
> > system will be conducive to accumulate valuable data to feed into the CF
> > algorithm (apart from the Artist Albums, Search Logs and Artist
> > Subscriptions).
>
> This sounds like a great project.
>
> Can I suggest artist - recording label connections (as present on the
> test server) as a potential similarity vector? This won't always be
> the case but for some (e.g Motown, Stax, Blue Note, SubPop, Philles) I
> think it's a very important factor. And as well as intra-label
> connections you could then make a 2nd order connection via labels e.g.
> people signed to Motown are 'similar' to people signed to Stax or
> Philles.
>
> Also, the Wikipedia has a bunch of relation data in the page link
> (e.g. the Lennon article will link to Beatles and the other members)
> and category link data (e.g. the Beatles are categorised as an
> English, Liverpudlan, 1960s, Parlophone signed, Beat group) which can
> be downloaded seperately and analysed. Not only would this be useful
> to examine in it's own right, but by comparing Wikipedia clusters with
> Musicbrainz clusters you can be more sure that MB links to the correct
> page in Wikipedia (e.g. Pavement (band) vs. Pavement).
>
> On the tags and ratings issue: I have some artist/album/song tags in
> last.fm, and some track ratings both in iTunes/iPod (at home) and the
> Linux player of the week (at the office) but I can't really be
> bothered investing time in building up these data stores until I can
> share them between apps, back them up properly and be sure to be able
> to take them with me as I change operating system and music player
> software. Having them hosted permanently and remotely by Musicbrainz
> would be very nice for me, especially if done in an open format so I
> could optionally also host them on my own server if I really wanted.
>
> Also, though slightly off-topic, PicardQt is rocking my world right now.
>
> regards,
>
> dave
>
> _______________________________________________
> MusicBrainz-devel mailing list
> MusicBrainz-devel at lists.musicbrainz.org
> http://lists.musicbrainz.org/mailman/listinfo/musicbrainz-devel
>

Hi,

The artist-recording label similarity attribute can be used (with suitable
weights assigned). However, isn't it possible that taking this similarity
via users might not result in accurate results, as people who like one
artist belonging to a recording label, might not necessarily like other
artists under the same recording label?

By the way, this is the proposal I've submitted -
http://www.sharonmyrtle.com/Projects/Google
Summer of Code/MusicBrainz
proposal.html<http://www.sharonmyrtle.com/Projects/Google%20Summer%20of%20Code/MusicBrainz%20proposal.html>

Information about the Indian Movie Recommender System (my Final Year Project
which uses CF algorithm) is put up here -
http://www.sharonmyrtle.com/Projects/IMRS/imrs.html
--
Regards,
Sharon.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.musicbrainz.org/pipermail/musicbrainz-devel/attachments/20070327/031b8af1/attachment.htm


More information about the MusicBrainz-devel mailing list