Project Dataset

Original location of the dataset (ideally a URL to the original data):

The original dataset is available here: http://pokemondb.net/pokedex/all
This page lists Pokémon by generation: http://bulbapedia.bulbagarden.net/wiki/List_of_Pok%C3%A9mon_by_National_Pok%C3%A9dex_number

License information (e.g. whether this dataset can be shared publicly):

This data is published openly on many websites. According to pokemon.com, the data may be used for personal purposes.

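Type of data in the dataset (e.g. numerical, categorical, unstructured, and so on):

Mostly numerical: the six base stats (Health Points, Attack, Defense, Special Attack, Special Defense, Speed) and their Total are integers. Type and Generation are categorical, and Name is a text label.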

Why you chose this dataset (3 to 5 sentences):

      I chose this dataset because it is well suited to a visualization that lets you partition the data in many different ways. The creatures fall into many different categories and are described by many different metrics. Furthermore, I have already spent many hours getting familiar with a portion of this dataset.
      One of my main questions is how the overall design philosophy behind the creatures changed as the games progressed. Game designers need to maintain a general level of fairness: if one race or creature is too powerful, the game loses replay value, so developers try to keep the creatures as balanced as possible. It will be very interesting to see how that idea holds up across the creatures of different generations.

Link to the dataset in your own repository:

Cleaned Dataset: pokemon.csv.
To clean the dataset, I added quotes around the string fields and converted the Type(s) column from upper case to proper case. I also added a Generation column recording which generation each Pokémon came from, since I thought it would be an interesting field to group by in the visualization.
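
The cleaning steps described above could be sketched roughly as follows. This is not the script that was actually used. The raw input layout (unquoted fields, upper-case types, no generation column) is an assumption, and the names `GENERATION_ENDS`, `generation_for`, and `clean_line` are hypothetical. The generation boundaries are the usual National Pokédex ranges for generations 1 through 6.

```python
# Last National Pokedex number in each generation, 1 through 6.
GENERATION_ENDS = [151, 251, 386, 493, 649, 721]


def generation_for(dex_number: int) -> int:
    """Return the generation (1-6) that a National Pokedex number falls in."""
    for generation, last_id in enumerate(GENERATION_ENDS, start=1):
        if dex_number <= last_id:
            return generation
    raise ValueError(f"no generation known for id {dex_number}")


def clean_line(raw_line: str) -> str:
    """Quote the string fields, proper-case the type(s), append the generation.

    Assumes a raw row shaped like: id,Name,TYPE(S),Total,<six stats>
    with no commas or double quotes inside the name.
    """
    dex_id, name, types, *stats = raw_line.strip().split(",")
    fields = [
        dex_id,                      # kept as text so leading zeros survive
        f'"{name}"',
        f'"{types.title()}"',        # "FIRE DRAGON" -> "Fire Dragon"
        *stats,
        str(generation_for(int(dex_id))),
    ]
    return ",".join(fields)


print(clean_line("006,Mega Charizard X,FIRE DRAGON,634,78,130,111,130,85,100"))
```

Building the line by hand, rather than going through a CSV writer, keeps the id column unquoted with its leading zeros, which matches the rows shown below.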

$ head pokemon.csv
"id","Name","Type","Total","Health Points","Attack","Defense","Special Attack","Special Defense","Speed","Generation"
001,"Bulbasaur","Grass Poison",318,45,49,49,65,65,45,1
002,"Ivysaur","Grass Poison",405,60,62,63,80,80,60,1
003,"Venusaur","Grass Poison",525,80,82,83,100,100,80,1
003,"Mega Venusaur","Grass Poison",625,80,100,123,122,120,80,1
004,"Charmander","Fire",309,39,52,43,60,50,65,1
005,"Charmeleon","Fire",405,58,64,58,80,65,80,1
006,"Charizard","Fire Flying",534,78,84,78,109,85,100,1
006,"Mega Charizard X","Fire Dragon",634,78,130,111,130,85,100,1
006,"Mega Charizard Y","Fire Flying",634,78,104,78,159,115,100,1

(In case you're curious, the Pokédex is up to 721 Pokémon now.)
$ tail pokemon.csv
712,"Bergmite","Ice",304,55,69,85,32,35,28,6
713,"Avalugg","Ice",514,95,117,184,44,46,28,6
714,"Noibat","Flying Dragon",245,40,30,35,45,40,55,6
715,"Noivern","Flying Dragon",535,85,70,80,97,80,123,6
716,"Xerneas","Fairy",680,126,131,95,131,98,99,6
717,"Yveltal","Dark Flying",680,126,131,95,131,98,99,6
718,"Zygarde","Dragon Ground",600,108,100,121,81,95,95,6
719,"Diancie","Rock Fairy",600,50,100,150,100,150,50,6
720,"Hoopa","Psychic Ghost",600,80,110,60,150,130,70,6
721,"Volcanion","Fire Water",600,80,110,120,130,90,70,6
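
As a first look at the balance question raised earlier, one option is to average the Total column per generation. The sketch below uses only Python's standard `csv` module and a handful of rows copied from the listings above. The function name `mean_total_by_generation` is hypothetical, and the mean of Total is just one possible measure of overall strength.

```python
import csv
import io
from collections import defaultdict

# A few rows copied from pokemon.csv so the example runs without the file.
SAMPLE = '''"id","Name","Type","Total","Health Points","Attack","Defense","Special Attack","Special Defense","Speed","Generation"
001,"Bulbasaur","Grass Poison",318,45,49,49,65,65,45,1
003,"Mega Venusaur","Grass Poison",625,80,100,123,122,120,80,1
712,"Bergmite","Ice",304,55,69,85,32,35,28,6
713,"Avalugg","Ice",514,95,117,184,44,46,28,6
'''


def mean_total_by_generation(csv_file) -> dict[int, float]:
    """Group rows by the Generation column and average their Total."""
    totals = defaultdict(list)
    for row in csv.DictReader(csv_file):
        totals[int(row["Generation"])].append(int(row["Total"]))
    return {gen: sum(vals) / len(vals) for gen, vals in sorted(totals.items())}


print(mean_total_by_generation(io.StringIO(SAMPLE)))
```

To run it on the full file, pass an open file instead of the sample, for example `with open("pokemon.csv", newline="") as f: mean_total_by_generation(f)`. Note that the id column is not unique: Mega forms share the id of their base form (both Venusaur and Mega Venusaur are 003), so a fairness comparison may want to treat those rows separately.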


Steven Rea - MSAN 622