
ETH Board Open Research Data (ORD) Program Project Metadata and Report Data


Installation

You can install the development version of ethord from GitHub with:

devtools::install_github("Global-Health-Engineering/ethord", dependencies = TRUE)

Download as CSV Files

If you prefer to work with the data outside of R, you can download individual datasets as CSV files.

  1. Right-click on the “Download CSV” link for the dataset you want.
  2. Select “Save Link As” or “Download Linked File”.
  3. Choose where you’d like to save the file on your computer.

| dataset | CSV |
|---|---|
| docs_detail | Download CSV |
| docs_proposal | Download CSV |
| docs_report | Download CSV |
| portal | Download CSV |
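
Once a file is saved, it can be read back into R or any other tool. The following is a minimal sketch rather than part of the package: the file name and location are assumptions, and the example writes a tiny stand-in file to a temporary directory so that it runs on its own.

```r
# Stand-in for a downloaded file. The path and file name are assumptions;
# point csv_path at wherever you saved the real CSV.
csv_path <- file.path(tempdir(), "docs_report.csv")
writeLines(
  c("project_id,item_id,item_name_short",
    "1,1,website",
    "1,2,repository"),
  csv_path
)

# Read the CSV with base R, keeping text columns as character.
docs_report_csv <- read.csv(csv_path, stringsAsFactors = FALSE)
str(docs_report_csv)
```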

Data


docs_detail

The dataset docs_detail has 7 observations and 20 variables.

get("docs_detail") |> 
  head(3) |> 
  gt::gt() |> 
  gt::as_raw_html()

| project_id | call_category | applicant_id | main_applicant_institution_name | applicant_type | applicant_title | applicant_first_name | applicant_last_name | applicant_gender | applicant_institution | applicant_department_name | applicant_lab_name | applicant_orcid_id | title | acronym | abstract | keywords | project_start_dd_mm_yyyy | project_duration_months | funding_requested |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Explore | ORD2000111 | ETH Zürich | Primary | Prof. Dr. | Elizabeth | Tilley | Female | ETH Zürich | NA | Global Health Engineering | NA | Open WASH data by building Open Science Compete… | openwashdata | Poor data management practices hold back progre… | open data, reproducible research, reproducibili… | 01.10.2022 | 18 | 149990 |
| 1 | Explore | ORD2000111 | ETH Zürich | Secondary | Mr. | Lars | Schöbitz | Male | ETH Zürich | NA | NA | NA | Open WASH data by building Open Science Compete… | openwashdata | Poor data management practices hold back progre… | open data, reproducible research, reproducibili… | 01.10.2022 | 18 | 149990 |
| 1 | Explore | ORD2000111 | ETH Zürich | Secondary | Dr. | Matthias | Bannert | Male | ETH Zürich | NA | NA | NA | Open WASH data by building Open Science Compete… | openwashdata | Poor data management practices hold back progre… | open data, reproducible research, reproducibili… | 01.10.2022 | 18 | 149990 |

For an overview of the variable names, see the following table.

| variable_name | variable_type | description |
|---|---|---|
| project_id | numeric | A unique identifier for each project, represented as a numerical value. |
| call_category | character | The category of the project call, with possible values including Explore and Contribute. |
| applicant_id | character | A unique identifier for the applicant, typically in the format ORD followed by a series of numbers. |
| main_applicant_institution_name | character | The name of the main institution associated with the applicant. |
| applicant_type | character | The type of applicant, categorized as either Primary or Secondary. |
| applicant_title | character | The title of the applicant, such as Prof. Dr. |
| applicant_first_name | character | The first name of the applicant. |
| applicant_last_name | character | The last name of the applicant. |
| applicant_gender | character | The gender of the applicant, categorized as either Female or Male. |
| applicant_institution | character | The institution affiliated with the applicant. |
| applicant_department_name | logical | The name of the department within the applicant’s institution. |
| applicant_lab_name | character | The name of the laboratory associated with the applicant. |
| applicant_orcid_id | logical | The ORCID identifier for the applicant. |
| title | character | The title of the project. |
| acronym | character | A shortened acronym representing the project title. |
| abstract | character | A brief summary or abstract of the project. |
| keywords | character | Relevant keywords associated with the project. |
| project_start_dd_mm_yyyy | character | The start date of the project in the format DD.MM.YYYY. |
| project_duration_months | numeric | The duration of the project in months, represented as a numerical value. |
| funding_requested | numeric | The amount of funding requested for the project, represented as a numerical value. |
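
Because project_start_dd_mm_yyyy is stored as text and each applicant has a separate row, two common first steps are parsing the date and reducing the data to one row per project. The sketch below uses a small stand-in data frame built from the preview rows, not the packaged dataset, and it assumes each project has exactly one Primary applicant.

```r
# Illustrative stand-in with a few docs_detail columns; the values follow
# the preview rows above and are not the packaged dataset itself.
detail <- data.frame(
  project_id = c(1, 1, 1),
  applicant_type = c("Primary", "Secondary", "Secondary"),
  applicant_first_name = c("Elizabeth", "Lars", "Matthias"),
  project_start_dd_mm_yyyy = c("01.10.2022", "01.10.2022", "01.10.2022"),
  project_duration_months = c(18, 18, 18),
  stringsAsFactors = FALSE
)

# Parse the DD.MM.YYYY text into a Date column.
detail$project_start <- as.Date(detail$project_start_dd_mm_yyyy,
                                format = "%d.%m.%Y")

# Keep one row per project by selecting the primary applicant
# (assumption: exactly one Primary applicant per project).
projects <- detail[detail$applicant_type == "Primary", ]
projects[, c("project_id", "applicant_first_name", "project_start")]
```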

docs_proposal

The dataset docs_proposal has 5 observations and 11 variables.

get("docs_proposal") |> 
  head(3) |> 
  gt::gt() |> 
  gt::as_raw_html()

| project_id | cost_personnel_senior_staff_fr | cost_personnel_postdocs_fr | cost_personnel_other_fr | cost_personnel_students_fr | cost_travel_fr | cost_equipment_fr | cost_publication_fr | cost_social_fr | cost_other_fr | cost_subcontracting_fr |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| 2 | NA | NA | 13500 | NA | 5600 | NA | NA | 4900 | NA | 6000 |
| 3 | NA | NA | 26905 | NA | 3000 | NA | NA | NA | NA | NA |

For an overview of the variable names, see the following table.

| variable_name | variable_type | description |
|---|---|---|
| project_id | numeric | A unique identifier for each project, represented as a numerical value. |
| cost_personnel_senior_staff_fr | logical | The cost of personnel for senior staff in Swiss Francs (CHF), expected to be a numerical value. |
| cost_personnel_postdocs_fr | numeric | The cost of personnel for postdoctoral researchers in Swiss Francs (CHF), represented as a numerical value. |
| cost_personnel_other_fr | numeric | The cost of personnel for other staff members in Swiss Francs (CHF), represented as a numerical value. |
| cost_personnel_students_fr | logical | The cost of personnel for students in Swiss Francs (CHF), expected to be a numerical value. |
| cost_travel_fr | numeric | The cost of travel expenses in Swiss Francs (CHF), represented as a numerical value. |
| cost_equipment_fr | numeric | The cost of equipment in Swiss Francs (CHF), represented as a numerical value. |
| cost_publication_fr | logical | The cost of publication expenses in Swiss Francs (CHF), expected to be a numerical value. |
| cost_social_fr | numeric | The cost of social expenses in Swiss Francs (CHF), represented as a numerical value. |
| cost_other_fr | numeric | The cost of other expenses in Swiss Francs (CHF), represented as a numerical value. |
| cost_subcontracting_fr | numeric | The cost of subcontracting in Swiss Francs (CHF), represented as a numerical value. |
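
Many cost columns are NA, and columns that are entirely NA are typed as logical, so totals need some care. The sketch below sums the cost columns per project on a stand-in data frame that mirrors part of the preview rows. The column cost_total_fr is a name introduced here for illustration, and treating a project with no reported costs as NA rather than 0 is a design choice, not something the package prescribes.

```r
# Illustrative stand-in with a subset of docs_proposal columns; values
# follow the preview rows above.
proposal <- data.frame(
  project_id = c(1, 2, 3),
  cost_personnel_senior_staff_fr = c(NA, NA, NA),  # all NA, so logical
  cost_personnel_other_fr = c(NA, 13500, 26905),
  cost_travel_fr = c(NA, 5600, 3000),
  cost_social_fr = c(NA, 4900, NA),
  cost_subcontracting_fr = c(NA, 6000, NA)
)

# Select every cost column by its shared prefix.
cost_cols <- grep("^cost_", names(proposal), value = TRUE)

# Sum across cost columns, ignoring missing entries.
proposal$cost_total_fr <- rowSums(proposal[, cost_cols], na.rm = TRUE)

# A project with no reported costs at all gets NA instead of a misleading 0.
no_costs <- rowSums(!is.na(proposal[, cost_cols])) == 0
proposal$cost_total_fr[no_costs] <- NA

proposal[, c("project_id", "cost_total_fr")]
```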

docs_report

The dataset docs_report has 60 observations and 7 variables.

get("docs_report") |> 
  head(3) |> 
  gt::gt() |> 
  gt::as_raw_html()

| project_id | item_id | item_name_long | item_name_short | quantity | description | link |
|---|---|---|---|---|---|---|
| 1 | 1 | New or enhanced website(s), web interface, plat… | website | 1 | new website created: openwashdata community web… | https://openwashdata.org/ |
| 1 | 1 | New or enhanced website(s), web interface, plat… | website | 1 | new website created: data science for openwashd… | https://ds4owd-001.github.io/website/ |
| 1 | 2 | New or enhanced repositories and/or catalogues | repository | 1 | Zenodo catalogue / community: | https://zenodo.org/communities/openwashdata?q=&;… |

For an overview of the variable names, see the following table.

| variable_name | variable_type | description |
|---|---|---|
| project_id | numeric | A unique identifier for each project, represented as a numerical value. |
| item_id | numeric | A unique identifier for each item within a project, represented as a numerical value with decimal places. |
| item_name_long | character | A detailed name describing the item, such as ‘New or enhanced repositories and/or catalogues’. |
| item_name_short | character | A brief name summarizing the item, such as ‘website’, ‘repository’, or ‘dataset’. |
| quantity | character | The number of items reported. The value is stored as text, so convert it to numeric before doing arithmetic. |
| description | character | A text description providing additional context about the item. |
| link | character | A URL linking to the item or its associated resource, such as a website or repository. |
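
Since quantity is a character column, it has to be converted before it can be summed. The sketch below totals reported items per project and item type on a stand-in data frame based on the preview rows. The column quantity_num is a name introduced here, and the assumption that non-numeric entries should become NA and be dropped is ours.

```r
# Illustrative stand-in with a few docs_report columns; values follow the
# preview rows above.
report <- data.frame(
  project_id = c(1, 1, 1),
  item_name_short = c("website", "website", "repository"),
  quantity = c("1", "1", "1"),
  stringsAsFactors = FALSE
)

# Convert text to numbers; entries that are not numeric become NA.
report$quantity_num <- suppressWarnings(as.numeric(report$quantity))

# Total quantity per project and item type; rows with NA are dropped by
# the formula interface's default na.action.
totals <- aggregate(quantity_num ~ project_id + item_name_short,
                    data = report, FUN = sum)
totals
```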

portal

The dataset portal has 96 observations and 8 variables.

get("portal") |> 
  head(3) |> 
  gt::gt() |> 
  gt::as_raw_html()

| url | title | category | institutions | data_type | field | researchers | abstract |
|---|---|---|---|---|---|---|---|
| https://open-research-data-portal.ch/projects/m | MMS (Masonry MicroStructures database) - A 3D m… | Contribute | EPFL | Microstructure database | Materials Science | Shah, Mati Ullah | Stone masonry is an eco-friendly construction m… |
| https://open-research-data-portal.ch/projects/a | Application Programming Interface for the River… | Contribute | ETH Zurich | Environnement | Earth sciences | Paradis, Sarah | In order to advance our understanding of the ca… |
| https://open-research-data-portal.ch/projects/d | Development of standardized Respiratory Open Ac… | Contribute | EPFL | Medical data | Life sciences | Dan, Jonathan | Chronic cough is a common condition globally. W… |

For an overview of the variable names, see the following table.

| variable_name | variable_type | description |
|---|---|---|
| url | character | The URL linking to the specific research project’s details or webpage. |
| title | character | The title of the research project. |
| category | character | The category or classification of the research project, such as Contribute, Explore, or Establish. |
| institutions | character | The institutions involved in the research project, such as EPFL, ETH Zurich, or Empa. |
| data_type | character | The type of data associated with the research project, with values such as Microstructure database, Environnement, or Medical data. |
| field | character | The field of study or discipline of the research project, such as Materials Science, Earth sciences, or Life sciences. |
| researchers | character | The names of the researchers involved in the project. |
| abstract | character | A brief summary or abstract of the research project. |
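
A quick way to get an overview of the portal data is to count projects per institution or category. The sketch below does this on a stand-in data frame built from the preview rows. It treats each institutions value as a single label, which is an assumption: if a project lists several institutions in one string, the string would need splitting first.

```r
# Illustrative stand-in with a few portal columns; values follow the
# preview rows above.
portal_demo <- data.frame(
  category = c("Contribute", "Contribute", "Contribute"),
  institutions = c("EPFL", "ETH Zurich", "EPFL"),
  field = c("Materials Science", "Earth sciences", "Life sciences"),
  stringsAsFactors = FALSE
)

# Count projects per institution, most frequent first.
by_institution <- sort(table(portal_demo$institutions), decreasing = TRUE)
by_institution

# Cross-tabulate category against institution.
table(portal_demo$category, portal_demo$institutions)
```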

License

Data are available as CC-BY.

Citation

#> To cite package 'ethord' in publications use:
#> 
#>   Massari N, Schöbitz L, Tilley E (2025). "ethord: ETH Board Open
#>   Research Data (ORD) Program Project Metadata and Report Data."
#>   doi:10.5281/zenodo.15554776
#>   <https://doi.org/10.5281/zenodo.15554776>,
#>   <https://github.com/Global-Health-Engineering/ethord>.
#> 
#> A BibTeX entry for LaTeX users is
#> 
#>   @Misc{massari_etall:2025,
#>     title = {ethord: ETH Board Open Research Data (ORD) Program Project Metadata and Report Data},
#>     author = {Nicolo Massari and Lars Schöbitz and Elizabeth Tilley},
#>     year = {2025},
#>     doi = {10.5281/zenodo.15554776},
#>     url = {https://github.com/Global-Health-Engineering/ethord},
#>     abstract = {This data package contains metadata and report data from 96 research projects funded by the joint Open Research Data (ORD) program of ETH Zurich, EPFL, and the four research institutes of the ETH Domain. The package includes project-level metadata, such as project titles, descriptions, and categorizations (Establish, Explore, Contribute), as well as data extracted from reports submitted by each project, providing insights into the projects' objectives, methods, outcomes, and impacts.},
#>     version = {0.0.2},
#>   }