
Accessing personal data

As of Mar 2016, the GHTorrent project does not offer personal data (namely, emails and real names) for download. For research purposes, you can request access to a file containing a mapping between logins and personal data.

To access the file containing personal data, you will need to edit this page to include the following details. When your pull request has been accepted, we will mail you the link to the data.

<dl>
  <dt>Researcher</dt>
  <dd>Aditi Rawat, Master's student in Computer Science, Web Information Systems research group, EEMCS Faculty, Delft University of Technology, A.rawat@student.tudelft.nl</dd>

  <dt>Date of request</dt>
  <dd>23 February, 2017</dd>

  <dt>Intended use</dt>
  <dd>I am doing my master's thesis on the topic "personalised expertise recommendation in community question answering systems using data from multiple web collaborative platforms". For my analysis, I need to combine GitHub data with Stack Overflow and Twitter data, so I require the email hash and other login details of all users. Kindly share the personal data with me.</dd>

</dl>

People with access to personal data

Georgios Gousios

Researcher
Georgios Gousios, Assistant Prof., Radboud University Nijmegen, g.gousios@cs.ru.nl
Date of request
Mar 14, 2016
Intended use
Maintenance of the GHTorrent internal databases.
Researcher
Diomidis Spinellis, Professor, Athens University of Economics and Business, Greece, dds@aueb.gr
Date of request
July 1, 2016
Intended use
Research regarding commit practices of company employees. Correlate projects with commits through git blame.
Researcher
Tong WANG, Lecturer, University of Edinburgh, tong.wang@ed.ac.uk
Date of request
Aug. 30, 2016
Intended use
Research on the open source software network, with a particular focus on the interaction between programming habitants and company employees.
Researcher
Chris Chabot, Semmle.com, chabotc@semmle.com
Date of request
Dec. 11, 2016
Intended use
Normalizing and de-duplicating author contribution data for our lgtm.com project, which is free for open source. The project provides source code analysis and fault detection, and shows coding velocity and quality per author and organization.

Disclaimer

The data is provided as is, with no guarantees of data quality or legal compliance. Redistribution is strictly prohibited. The GHTorrent project is not responsible for any illegal use of the provided data.