--- title: "Introduction to rosmium: R bindings for Osmium Tool" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Introduction to rosmium: R bindings for Osmium Tool} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>", eval = identical(tolower(Sys.getenv("NOT_CRAN")), "true") ) ``` **rosmium** is a package that allows one to use [Osmium Tool](https://osmcode.org/osmium-tool/) from R. Osmium is a multipurpose command line tool that enables one to manipulate and analyze OpenStreetMap (OSM) files through several different commands. Currently, the package does not aim to cover the entirety of Osmium's API, instead making available functions that wrap only a very limited set of Osmium's features. This vignette briefly overviews the package with a few reproducible examples. # Installation Before using **rosmium** please make sure that you have it installed in your computer. You can download either the most stable version from CRAN... ```{r, eval = FALSE} install.packages("rosmium") ``` ...or the development version from GitHub. ```{r, eval = FALSE} # install.packages("remotes") remotes::install_github("ipeaGIT/rosmium") ``` Please note that **rosmium** requires Osmium to be installed and added to the PATH environment variable of your local system. For instructions on how to install Osmium, please check [its official website](https://osmcode.org/osmium-tool/#download). # Usage The package currently includes only three entrypoints to Osmium's API. To demonstrate them, we will use some sample data bundled with the package. ```{r, eval = requireNamespace("ggplot2", quietly = TRUE) && identical(tolower(Sys.getenv("NOT_CRAN")), "true"), fig.width = 6, fig.height = 5.5} library(rosmium) library(ggplot2) cur_pbf <- system.file("extdata/cur.osm.pbf", package = "rosmium") cur_pbf_lines <- sf::st_read(cur_pbf, layer = "lines", quiet = TRUE) ggplot(cur_pbf_lines) + geom_sf() ``` ### `extract()` `extract()` creates geographical extracts from OSM files. In its most basic form, the function takes the path to the OSM file whose geographical extent should be extracted from, the extent, either as a bounding box or as a (multi)polygon, and the path to the file where the output should be written to. Additionally, the function can also take the strategy to be used when creating the extract, which defaults to `"complete_ways"`. Please check the function documentation for details on the available strategies and their behavior. ```{r, eval = requireNamespace("ggplot2", quietly = TRUE) && identical(tolower(Sys.getenv("NOT_CRAN")), "true"), fig.width = 6, fig.height = 4.5} # buffering the pbf bounding box 4000 meters inward and using the result # extent to extract the osm data inside it. transforming the crs because # inward buffers only work with projected crs bbox <- sf::st_bbox(cur_pbf_lines) bbox_polygon <- sf::st_as_sf(sf::st_as_sfc(bbox)) smaller_bbox_poly <- sf::st_buffer(sf::st_transform(bbox_polygon, 5880), -4000) smaller_bbox_poly <- sf::st_transform(smaller_bbox_poly, 4326) output_path <- extract( cur_pbf, smaller_bbox_poly, tempfile(fileext = ".osm.pbf"), spinner = FALSE ) extracted_pbf_lines <- sf::st_read(output_path, layer = "lines", quiet = TRUE) ggplot() + geom_sf(data = extracted_pbf_lines) + geom_sf(data = smaller_bbox_poly, color = "red", fill = NA) ``` ### `tags_filter()` `tags_filter()` filters OSM files, keeping objects matching at least one of the specified expressions from the input. In its most basic form, the function takes the path to the OSM file to which the filters should be applied, the filter expressions that should be applied and the path to the file where the output should be written to. Please check the function documentation for a description of the filter expression format. By default, not only the objects matching the expressions will be kept in the output, but also the objects referenced by them. This behavior can be changed with the `omit_referenced` parameter. The function also includes the `invert_match` parameter, that inverts the sense of matching (excluding objects that match the filters), and the `remove_tags` parameter, used to remove tags from objects that are referenced by objects matching the filters, but which do not match the filter themselves. Both arguments default to `FALSE`. ```{r} # get all amenity nodes output <- tags_filter(cur_pbf, "n/amenity", tempfile(fileext = ".osm.pbf")) nodes <- sf::st_read(output, layer = "points", quiet = TRUE) head(nodes$other_tags) # get all objects (nodes, ways or relations) with an addr:* tag output <- tags_filter( cur_pbf, "addr:*", tempfile(fileext = ".osm.pbf"), omit_referenced = TRUE, spinner = FALSE ) nodes <- sf::st_read(output, layer = "points", quiet = TRUE) head(nodes$other_tags) ``` ### `show_content()` Finally, `show_content()` displays the content of an OSM file either in `.html`, `.xml` or `.opl` format. The function takes as input the path to the OSM file whose content should be shown, the output format in which the content should be displayed (defaulting to `.html`, the most human readable format, although also the slowest to open and inspect, depending on the size of the input file) and the type of objects that should be included in the output (defaulting to all existing objects in the input). The function returns the path to the temporary file in which the OSM file content was saved and opens the content in the web browser or the most appropriate application, depending on the output format. ```{r} # displays the content of the previous tags_filter() output in html format show_content(output, spinner = FALSE) ``` ```{r, echo = FALSE} knitr::include_graphics("../man/figures/filtered_file_content.png") ``` ## Acknowledgement **rosmium** is developed by a team at the Institute for Applied Economic Research (Ipea), Brazil. We would like to thank the authors and contributors of Osmium for the development of [Osmium Tool](https://github.com/osmcode/osmium-tool).