American Women's History Initiative Symposium Project first names
Dataset posted on 22.10.2020, 01:39 by Rebecca Dikow
This dataset contains the top 10 traditionally "men's" and "women's" first names extracted from each of four Smithsonian Annual Reports (1920, 1950, 1970, 1990). Names were extracted with named entity recognition (NER) in spaCy, using the lg English model, and then curated by hand.
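The extraction step described above might look roughly like the following Python sketch. It is an illustration under stated assumptions, not the script used to build this dataset: the model name `en_core_web_lg` is a guess at what "the lg English model" refers to, and the helper names `extract_person_names` and `top_first_names` are hypothetical. The hand curation step, including deciding which names are traditionally "men's" or "women's", is not automated here.

```python
from collections import Counter


def extract_person_names(text, model_name="en_core_web_lg"):
    """Return the text of every PERSON entity spaCy finds in `text`.

    Assumption: `en_core_web_lg` is the "lg English model" mentioned in
    the description; the model must be downloaded separately.
    """
    import spacy  # imported here so the counting helper works without spaCy

    nlp = spacy.load(model_name)
    doc = nlp(text)
    return [ent.text for ent in doc.ents if ent.label_ == "PERSON"]


def top_first_names(person_names, n=10):
    """Count the first token of each name and return the n most common.

    Single-letter tokens such as initials ("J.") are skipped. This is a
    simplification; titles like "Dr." would still need manual cleanup.
    """
    counts = Counter()
    for name in person_names:
        tokens = name.split()
        if not tokens:
            continue
        first = tokens[0].strip(".,")
        if len(first) > 1:
            counts[first] += 1
    return counts.most_common(n)
```

For one report, a call such as `top_first_names(extract_person_names(report_text))` would give a candidate list of first names with counts, which would then still need the manual review described above.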