Popularity data set, methodology or algorithm
Can someone please comment from what data the popularity statistics are computed?
Is it for instance based on google ngram data, something else, or what?As an example, If the name Vincenzo has a percentage popularity of 0.00123 in 1970 USA, what is the source of the 1970 data that produced the percentages? There would
be quite a difference if the data set was derived from USA birth records for 1970
versus frequency of "Vincenzo" in available scanned text from 1970.I hope I am being clear enough.Thank You
vote up1vote down

Replies

For the US data, you can find some info here:
http://www.ssa.gov/oact/babynames/background.html
vote up1vote down