{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Gender disparities in computing\n", "\n", "\n", "## Goal\n", "* We want to redo the analysis in the Nature article \n", "* Data from DBLP\n", "* **Publish in C ACM**\n", "\n", "\n", "## Nature article features\n", "* coauthorship (and prominent place in the list(first/last/sole author)\n", "* citations\n", "* affiliation (per country)\n", " * I could ask Easychair for just the countrynames and the urls\n", "* gender\n", "\n", "\n", "### Other relevant feautures\n", "* Impact factors per journal\n", "* \"Status\" of conferences\n", " * See (easy to parse, linked up with DBLP, but old (from 2003)\n", " * http://www.springer.com/gp/impact-factor-2014/if-computer-science\n", " * http://www.guide2research.com/journals/\n", "\n", "### Google Scholar\n", "* Contains a wealth of information on the author\n", " * Citations per article\n", " * h-index\n", " * affiliation\n", " * ...\n", "* except gender of course ;-)\n", "* For parsing you could use \n", "* I asked Theo Gevers on software for gender identification based on the photo\n", "\n", "#### Scraping authors\n", "* In staat een script dat Google scholar afgraast. \n", "\n", "### Todo\n", "\n", "#### Before starting:\n", "* Order the entities from DBLP.xml, and start collecting data based on how often the entities occur. Starting with the most often occurring.\n", "\n", "\n", "1. Collect impact factor for each venue in DBLP.\n", " * Like but based on recent data.\n", "2. Collect Google scholar author page of all authors in DBLP\n", " * parse use and retrieve main features\n", " \n", "#### Aside: Women in Conference Committees\n", "* I downloaded DBworld archive\n", " * `nohup wget -r -np -q http://database-world.1314.n7.nabble.com/&`\n", " * On mashup2: /home/mmarx3/marx/GenderComputing/database-world.1314.n7.nabble.com" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] } ], "metadata": { "kernelspec": { "display_name": "Python 2", "language": "python", "name": "python2" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 2 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython2", "version": "2.7.11" } }, "nbformat": 4, "nbformat_minor": 0 }