Presence-Absence Points for Tree Species Distribution Modelling for Europe




Carmelo Bonannella; Tomislav Hengl; Johannes Heisig; Leandro Parente; Marvin N. Wright; Martin Herold; Sytze de Bruin


Creative Commons Attribution 4.0 International

Publication date

January 4, 2022

The dataset is a collection of presence and absence points for forest tree species for Europe. Each unique combination of longitude, latitude and year was considered as an independent sample. Presence data was obtained from the harmonized tree species occurrence dataset by Heising and Hengl (2020) and absence data from the LUCAS (in-situ source) dataset.

The final dataset contains 4,359,999 observations for and a total of 630 columns. 

The first 8 columns of the dataset contain metadata information used to uniquely identify the points:

  • id: unique point identifier,
  • year: year of observation,
  • postprocess: quality flag to identify if the temporal reference of an observation comes from the original dataset or is the result of spatiotemporal overlay with forest masks,
  • Tile_ID: contains the tile id from the eu_tiling_system (30 km grid),
  • easting: longitude coordinates in Coordinate Reference System ETRS89 / LAEA Europe (= EPSG code 3035),
  • northing: latitude coordinates in Coordinate Reference System ETRS89 / LAEA Europe (= EPSG code 3035),
  • Atlas_class: name of the tree species according to the European Atlas of Forest Tree Species or NULL in case of absence point,
  • lc1: contains original LUCAS land cover class or NULL if it's a presence point.

The remaining columns contain the extracted values of a series of predictor variables (temperature, precipitation, elevation, topographical information, spectral reflectance) useful for species distribution modeling applications.

To suggest any improvement/fix use:

Code on structure and applications of the dataset is available on Gitlab at:

Content text

European coverage, 30m resolution, Cloud Optimized Geotiff, https download


Bonannella, Carmelo, Hengl, Tomislav, Heisig, Johannes, Leal Parente, Leandro, Wright, Marvin, Herold, Martin, & de Bruin, Sytze. (2022). Presence-Absence Points for Tree Species Distribution Modelling for Europe (0.2) [Data set]. Zenodo.

The dataset is available on Zenodo

					#### Create folder for the project, download files from Zenodo ####

dir.create(paste0(getwd(), "/veg_mapping/"))
setwd(paste0(getwd(), "/veg_mapping/"))

## The whole regression matrix occupies 16 GB on RAM, make sure your workstation can handle it
## Source our custom functions from the script "vegetation_mapping_functions", change the path accordingly


## We use custom functions to read and write RDS files in parallel to speed up processes using pigz
## Functions are customized for Linux systems, you can replace them with the basic, single-core R function
## saveRDS.gz -> saveRDS
## readRDS.gz -> readRDS

a.rgr <- curl::curl_download("", 
                    tempfile()) %>%
                    readRDS.gz() %>% 

Spread the love