Features

This page highlights features for administrators and power-users of a Dataverse installation.

See What is Dataverse? to learn about its Core Capabilities for researchers if you’re new to Dataverse.

Artifical Intelligence

AI Tools

A number of AI tools integrate with Dataverse.

Model Context Protocol

Model Context Protocol (MCP) is a standard for AI Agents to communicate with tools and services.

Access and Download

Faceted Search

Facets are data driven and customizable per collection.

File Previews

A preview is available for text, tabular, image, audio, video, and geospatial files.

Preview URL

Create a URL for reviewers to view an unpublished (and optionally anonymized) dataset.

Guestbook

Optionally collect data about who is downloading the files from your datasets.

Download in Open Tabular Formats

Proprietary tabular formats are converted into TSV and RData for download.

Administration

User Management

Dashboard for common user-related tasks.

Quotas

For number of files, amount of storage, etc.

Usage Statistics and Metrics

Download counters, support for Make Data Count.

Configurable Notifications

In-app and email notifications for access requests, requests for review, etc. can be muted.

Authentication

Login via Shibboleth

Single Sign On (SSO) using your institution’s credentials.

Login via ORCID, Google, GitHub, or Microsoft

Log in using popular OAuth2 providers.

Login via OpenID Connect (OIDC)

Log in using your institution’s identity provider or a third party.

Customization

Branding

Your installation can be branded with a custom homepage, header, footer, CSS, etc.

Internationalization

The Dataverse software has been translated into multiple languages.

Customization of Collections

Each personal or organizational collection can be customized and branded.

Widgets

Embed listings of data in external websites.

FAIR Data Publication

Support for FAIR Data Principles

Findable, Accessible, Interoperable, Reusable.

Versioning

History of changes to datasets and files are preserved.

Prepublication Review Support

Datasets start as drafts and can be submitted for review before publication where curators can mark datasets with curation status labels.

Labels for Traditional Knowledge

Integrate with the Local Contexts platform, enabling the use of Traditional Knowledge and Biocultural Labels, and Notices.

File Management

File Hierarchy

Users are able to control dataset file hierarchy and directory structure.

Restricted Files

Control who can download files and choose whether or not to enable a “Request Access” button.

Embargo

Make files inaccessible until an embargo end date.

Retention Periods

Make files inaccessible once the retention period set has passed.

Metadata Extraction from Files

Populate dataset metadata fields from tabular, NetCDF, HDF5, and FITS files.

Configurable Storage

Choose between filesystem or object storage, configurable per collection and per dataset.

Direct Upload and Download for S3

After a permission check, files can pass freely and directly between a client computer and S3.

Fixity Checks for Files

MD5, SHA-1, SHA-256, SHA-512, UNF.

Auxiliary Files for Data Files

Each data file can have any number of auxiliary files for documentation or other purposes (experimental).

Geospatial Data Support

Geospatial Metadata Fields

There is a dedicated geospatial metadata block.

Geospatial File Preview

GeoJSON, GeoTIFF, and Shapefiles can be previewed as a map.

Geospatial Search API

Pass geo_point and geo_radius to find datasets based on their bounding box.

Integrations

DataCite

DOIs are reserved, and when datasets are published, their metadata is published to DataCite.

Handle

Handles are a Persistent ID (PID) that are an alternative to DOIs.

Globus

Upload from and download to Dataverse using Globus endpoints.

RSpace

Exchange data and metadata with RSpace (e.g. IGSN ID). For example, a Data Management Plan (DMP) can be uploaded to RSpace and updated with the DOI of a Dataverse dataset.

GitHub

A GitHub Action is available to upload files from GitHub to a dataset.

iRODS

Pull data from an iRODS instance to a Dataverse dataset.

Dropbox

Upload files stored on Dropbox.

Jupyter Notebooks

Datasets can be opened in Binder to run code in Jupyter notebooks, RStudio, and other computation environments. They can also be previewed in Dataverse itself.

Galaxy

Import files directly from Dataverse into Galaxy as well as publish datasets containing artifacts (Histories, datasets, etc.) from Galaxy to Dataverse.

External Tools

Enable additional features not built in to the Dataverse software.

Additional Integrations

Dataverse integrates with a wide variety of third party systems, some of which are highlighted above.

Interoperability

APIs

Search API, Data Deposit API, Data Access API, Metrics API, Migration API, etc. and client libraries in various languages.

OAI-PMH Metadata Harvesting

Serve and harvest metadata to and from other systems (e.g. DataCite, other Dataverse installations, etc.) using standardized metadata formats.

Schema.org JSON-LD

Used by Google Dataset Search and other services for discoverability.

Croissant

Export metadata as linked data following the Croissant ontology.

Signposting

Enable easier machine access to datasets by adding linkset in a Dataverse header.

External Vocabulary

Let users pick from external vocabularies (provided via API/SKOSMOS) when filling in metadata.

BagIt Export

For preservation, bags can be sent to the local filesystem, Duracloud, and Google Cloud.

RO-Crate

Export dataset metadata as an ro-crate.json.

Reusability

Multiple License Support

Users can select from multiple standard and provided custom licenses.

Custom Terms of Use

Users can write custom terms of use in place of a predefined license.

Data Citation Formats

EndNote XML, RIS, BibTeX, or 1000+ CSL formats at the dataset or file level.

Provenance

At the file level, upload standard W3C provenance files or enter free text instead.

Post-Publication Workflows

Allow publication of a dataset to trigger external processes and integrations.