These are the old instructions we used for Dataverse 4 and 5. They should still work but these days we favor running Dataverse in Docker as described in Development Environment.
These instructions are purposefully opinionated and terse to help you get your development environment up and running as quickly as possible! Please note that familiarity with running commands from the terminal is assumed.
The quickest way to get Dataverse running is in Docker as explained in Development Usage section of the Container Guide.
Since before Docker existed, we have encouraged installing Dataverse and all its dependencies directly on your development machine, as described below. This can be thought of as the “classic” development environment for Dataverse.
However, in 2023 we decided that we’d like to encourage all developers to start using Docker instead and opened https://github.com/IQSS/dataverse/issues/9616 to indicate that we plan to rewrite this page to recommend the use of Docker.
There’s nothing wrong with the classic instructions below and we don’t plan to simply delete them. They are a valid alternative to running Dataverse in Docker. We will likely move them to another page.
Mac OS X or Linux is required because the setup scripts assume the presence of standard Unix utilities.
Windows is gaining support through Docker as described in the Windows Development section.
The Dataverse Software requires Java 11.
We suggest downloading OpenJDK from https://adoptopenjdk.net
On Linux, you are welcome to use the OpenJDK available from package managers.
NetBeans IDE is recommended, and can be downloaded from https://netbeans.org . Developers may use any editor or IDE. We recommend NetBeans because it is free, works cross platform, has good support for Jakarta EE projects, and includes a required build tool, Maven.
Below we describe how to build the Dataverse Software war file with Netbeans but if you prefer to use only Maven, you can find installation instructions in the Tools section.
On Mac, install Homebrew to simplify the steps below: https://brew.sh
Fork https://github.com/IQSS/dataverse and then clone your fork like this:
git clone email@example.com:[YOUR GITHUB USERNAME]/dataverse.git
If you installed Netbeans, follow these steps:
Launch Netbeans and click “File” and then “Open Project”. Navigate to where you put the Dataverse Software code and double-click “Dataverse” to open the project.
If you see “resolve project problems,” go ahead and let Netbeans try to resolve them. This will probably including downloading dependencies, which can take a while.
Allow Netbeans to install nb-javac (required for Java 8 and below).
Select “Dataverse” under Projects and click “Run” in the menu and then “Build Project (Dataverse)”. Check back for “BUILD SUCCESS” at the end.
If you installed Maven instead of Netbeans, run
mvn package. Check for “BUILD SUCCESS” at the end.
NOTE: Do you use a locale different than
en_US.UTF-8 on your development machine? Are you in a different timezone
than Harvard (Eastern Time)? You might experience issues while running tests that were written with these settings
in mind. The Maven
pom.xml tries to handle this for you by setting the locale to
en_US.UTF-8 and timezone
UTC, but more, not yet discovered building or testing problems might lurk in the shadows.
On Mac, run this command:
brew install jq
On Linux, install
jq from your package manager or download a binary from https://stedolan.github.io/jq/
Payara 6.2023.8 or higher is required.
To install Payara, run the following commands:
sudo curl -O -L https://nexus.payara.fish/repository/payara-community/fish/payara/distributions/payara/6.2023.8/payara-6.2023.8.zip
sudo unzip payara-6.2023.8.zip
sudo chown -R $USER /usr/local/payara6
If nexus.payara.fish is ever down for maintenance, Payara distributions are also available from https://repo1.maven.org/maven2/fish/payara/distributions/payara/
The Dataverse Software has been tested with PostgreSQL versions up to 13. PostgreSQL version 10+ is required.
On Mac, go to https://www.postgresql.org/download/macosx/ and choose “Interactive installer by EDB” option. Note that version 13.5 is used in the command line examples below, but the process should be similar for other versions. When prompted to set a password for the “database superuser (postgres)” just enter “password”.
After installation is complete, make a backup of the
pg_hba.conf file like this:
sudo cp /Library/PostgreSQL/13/data/pg_hba.conf /Library/PostgreSQL/13/data/pg_hba.conf.orig
pg_hba.conf with an editor such as vi:
sudo vi /Library/PostgreSQL/13/data/pg_hba.conf
In the “METHOD” column, change all instances of “scram-sha-256” (or whatever is in that column) to “trust”. This will make it so PostgreSQL doesn’t require a password.
In the Finder, click “Applications” then “PostgreSQL 13” and launch the “Reload Configuration” app. Click “OK” after you see “server signaled”.
Next, to confirm the edit worked, launch the “pgAdmin” application from the same folder. Under “Browser”, expand “Servers” and double click “PostgreSQL 13”. When you are prompted for a password, leave it blank and click “OK”. If you have successfully edited “pg_hba.conf”, you can get in without a password.
On Linux, you should just install PostgreSQL using your favorite package manager, such as
yum. (Consult the PostgreSQL section of Prerequisites in the main Installation guide for more info and command line examples). Find
pg_hba.conf and set the authentication method to “trust” and restart PostgreSQL.
Solr 9.3.0 is required.
To install Solr, execute the following commands:
sudo mkdir /usr/local/solr
sudo chown $USER /usr/local/solr
curl -O https://archive.apache.org/dist/solr/solr/9.3.0/solr-9.3.0.tgz
tar xvfz solr-9.3.0.tgz
cp -r configsets/_default collection1
curl -O https://raw.githubusercontent.com/IQSS/dataverse/develop/conf/solr/9.3.0/schema.xml
curl -O https://raw.githubusercontent.com/IQSS/dataverse/develop/conf/solr/9.3.0/schema_dv_mdb_fields.xml
mv schema*.xml collection1/conf
curl -O https://raw.githubusercontent.com/IQSS/dataverse/develop/conf/solr/9.3.0/solrconfig.xml
mv solrconfig.xml collection1/conf/solrconfig.xml
(Please note that the extra jetty argument below is a security measure to limit connections to Solr to only your computer. For extra security, run a firewall.)
bin/solr start -j "-Djetty.host=127.0.0.1"
bin/solr create_core -c collection1 -d server/solr/collection1/conf
To avoid having to install service dependencies like PostgreSQL or Solr directly on your localhost, there is the alternative of using the
docker-compose-dev.yml file available in the repository root. For this option you need to have Docker and Docker Compose installed on your machine.
docker-compose-dev.yml can be configured to only run the service dependencies necessary to support a Dataverse installation running directly on localhost. In addition to PostgreSQL and Solr, it also runs a SMTP server.
Before running the Docker Compose file, you need to update the value of the
DATAVERSE_DB_USER environment variable to
postgres. The variable can be found inside the
.env file in the repository root. This step is required as the Dataverse installation script expects that database user.
To run the Docker Compose file, go to the Dataverse repository root, then run:
docker-compose -f docker-compose-dev.yml up -d --scale dev_dataverse=0
Note that this command omits the Dataverse container defined in the Docker Compose file, since Dataverse is going to be installed directly on localhost in the next section.
The command runs the containers in detached mode, but if you want to run them attached and thus view container logs in real time, remove the
-d option from the command.
Data volumes of each dependency will be persisted inside the
docker-dev-volumes folder, inside the repository root.
If you want to stop the containers, then run (for detached mode only, otherwise use
Ctrl + C):
docker-compose -f docker-compose-dev.yml stop
If you want to remove the containers, then run:
docker-compose -f docker-compose-dev.yml down
If you want to run a single container (the mail server, for example) then run:
docker-compose -f docker-compose-dev.yml up dev_smtp
For a fresh installation, and before running the Software Installer Script, it is recommended to delete the docker-dev-env folder to avoid installation problems due to existing data in the containers.
Navigate to the directory where you cloned the Dataverse Software git repo change directories to the
scripts/installer directory like this:
Create a Python virtual environment, activate it, then install dependencies:
python3 -m venv venv
pip install psycopg2-binary
The installer will try to connect to the SMTP server you tell it to use. If you haven’t used the Docker Compose option for setting up the dependencies, or you don’t have a mail server handy, you can run
nc -l 25 in another terminal and choose “localhost” (the default) to get past this check.
Finally, run the installer (see also
README_python.txt if necessary):
After the script has finished, you should be able to log into your Dataverse installation with the following credentials:
Run the following command:
curl http://localhost:8080/api/admin/settings/:DoiProvider -X PUT -d FAKE
This will disable DOI registration by using a fake (in-code) DOI provider. Please note that this feature is only available in Dataverse Software 4.10+ and that at present, the UI will give no indication that the DOIs thus minted are fake.
Developers may also wish to consider using PermaLinks
Out of the box, a JSF setting is configured for production use and prevents edits to the GUI (xhtml files) from being visible unless you do a full deployment.
It is recommended that you run the following command so that simply saving the xhtml file in Netbeans is enough for the change to show up.
asadmin create-system-properties "dataverse.jsf.refresh-period=1"
For more on JSF settings like this, see Java Server Faces (JSF) Configuration Options.
You’re almost ready to start hacking on code. Now that the installer script has you up and running, you need to continue on to the Tips section to get set up to deploy code from your IDE or the command line.