I am starting development with R and I am still having "beginner problems" with the language. I would like to do the following:
- I have a matrix (stored as a data frame called user) with ~900 columns, each named after a band (Nirvana, Green Day, Daft-Punk, etc.).
- Each row is a user, and the values are that user's music taste (Nirvana = 10, Green Day = 5, Daft-Punk = 0).
- I would like to look up each band in another data frame (called artists, which holds each artist's music tags) and replace the band name with its genre tag (Nirvana --> Rock, Green Day --> Rock, Daft-Punk --> Techno). There are only ~120 tags for music taste, far fewer than the ~900 bands.
- Finally, I would like to aggregate the values across all columns that share a tag, so that no column is duplicated. For the example row above, with SUM as the aggregation function, the row would have only 2 entries instead of 3: (Rock = 15, Techno = 0).
Any clues on how to do that with R? Thanks in advance for any help!
Data:
- user: pastebin.com/4gVe004T
- artists: pastebin.com/dm7weLMG
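
To make the goal concrete, here is a minimal sketch of the transformation on toy data, using only base R. The toy values and the column names band and tag in artists are assumptions for illustration; the real data frames in the pastebins may be laid out differently. It maps each column of user to its tag with match() and then sums the columns that share a tag with rowsum().

```r
# Toy stand-in for `user`: one row per user, one column per band.
# check.names = FALSE keeps names such as "Green Day" unchanged.
user <- data.frame(
  Nirvana     = c(10, 0),
  `Green Day` = c(5, 3),
  `Daft-Punk` = c(0, 7),
  check.names = FALSE
)

# Toy stand-in for `artists`; the column names `band` and `tag` are assumed.
artists <- data.frame(
  band = c("Nirvana", "Green Day", "Daft-Punk"),
  tag  = c("Rock", "Rock", "Techno"),
  stringsAsFactors = FALSE
)

# Look up the tag for every column of `user`, in column order.
tags <- artists$tag[match(colnames(user), artists$band)]

# rowsum() sums the rows of a matrix by group, so transpose first,
# sum the band rows that share a tag, then transpose back.
by_tag <- t(rowsum(t(as.matrix(user)), group = tags))

by_tag  # first row: Rock = 15, Techno = 0
```

A band that has no entry in artists gets NA as its tag, so it would probably be worth checking anyNA(tags) before aggregating. A different aggregation function than the sum would need another approach, for example sapply() over split(colnames(user), tags).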