This repository contains codes and files used during the subject course "Reproducibility in Research" at Unicamp. The main objective was to create an Executable Paper by providing all the necessary sources to reproduce the research.
The files are organized as follow:
- /data - Dataset Facebook metrics (original and processed)
- /deliver - The executable paper and experiment's codes
- /dev - Codes and notebooks used during the experiment
- /figures - Figures used on research
- /utils - Some recommendations about the process of research in a Best Practices file
Before to start the experiment, make sure your setup is according to what is describe here.
For running the experiments in this research the following languages and libraries were selected:
- Environment: Anaconda3 4.3.1
- Programming Language: Python 3.3
- Dataframe Library: Panda 0.19.2.
- Neo4j-Python Driver: Py2neo v3
- Install the Neo4j 3.2.0 Community Edition
- Run the Neo4j
- In the Neo4j's Interface select an empty directory for your Graph Database (The Server has a default, check out)
- Start the Server
- Open it in your Browser (http://127.0.0.1:7474/browser/)
- In the first access, the
user
andpassword
areneo4j
, but you have to change thepassword
(For the experiment used in this repository, the password used isneo4jresearch
, but you can create yours) - Go to /deliver to run the codes according "Workflow".
- Install the Docker toolbox
- Create your account in Docker Hub
- In Docker Hub the Neo4j's image is available on: Neo4j in Docker Hub
- Open your Docker Toolbox
- Make the pull of the image:
docker pull neo4j
- Start an instance of Neo4j:
docker run \
--publish=7474:7474 --publish=7687:7687 \
--volume=$HOME/neo4j/data:/data \
neo4j
- Open in your Browser (http://localhost:7474) If you are using Windows, check out in the benning of your Docker shell the IP Address that it configures, and replace the localhost to the IP Address
- In the first access, the
user
andpassword
areneo4j
, but you have to change thepassword
(For the experiment used in this repository, the password used isneo4jresearch
, but you can create yours) Note: The Graph Database will be create in the directory:$HOME/Neo4j/Data
The workflow with the inputs and outputs of this research is organized as follows:
To execute the experiment, follow the steps:
- Make sure you have did all installation steps (Development Environment).
- Start your Neo4j Server (Local or Docker).
- Verifiy the code
connection_neo4j.ipynb
. Thehost
,user
andpassword
need to be corrects. - Follow the workflow image: first, execute the Prepreprocessing (
preprocessing_data.ipynb
). - Secondly, execute the
indexing_data.ipynb
. - Open the Neo4j Server in Browser.
- The graph generated can be visualized by running the Cypher query:
MATCH (n) RETURN n
The Graph Database will return and display the complete graph.
The experiment results and paper of research is available in /deliver
directory, or you can click here: Paper